- Latency devices(CPU cores)
- Throughput devices(GPU cores)
- Use the best match for the job (heterogeneity in mobile SOC


- CPU: Latency Oriented Design
- Powerful ALU
- Reduced operation latency
- Large caches
- convert long latency memory accesses to short latency cache accesses
- Sophisticated control
- Branch prediciton for reduced branch latency
- Data forwarding for reduced data latency
GPU: Throughput Oriented Design
- Small caches
- To boost memory throughput
- Simple control
- No branch prediction
- No data forwarding
- Energy efficient ALUs
- Many long latency but heavily pipelined for high throughput
Scalability
Portability
SPMD – Single Program, Multiple Data Threads within a block cooperate via shared memory, atomic operation, barrier synchronization
相关文章:
-
2021-12-10
-
2021-11-28
-
2021-08-14
-
2021-06-15
-
2021-12-21
-
2021-04-09
-
2022-01-21
-
2021-11-19
猜你喜欢
-
2021-11-17
-
2021-11-17
-
2021-11-17
-
2021-11-17
-
2022-12-23
-
2022-12-23
-
2022-12-23
相关资源
-
下载
2022-12-11
-
下载
2021-06-24
-
下载
2022-12-30
-
下载
2022-12-27