sjtujoe
  1. Latency devices (CPU cores)
  2. Throughput devices (GPU cores)
  3. Use the best match for the job (heterogeneity in mobile SoCs)
  4. CPU: Latency Oriented Design
  • Powerful ALU
    • Reduced operation latency
  • Large caches
    • Convert long-latency memory accesses to short-latency cache accesses
  • Sophisticated control
    • Branch prediction for reduced branch latency
    • Data forwarding for reduced data latency
  • GPU: Throughput Oriented Design
    • Small caches
      • To boost memory throughput
    • Simple control
      • No branch prediction
      • No data forwarding
    • Energy-efficient ALUs
      • Many ALUs, each with long latency but heavily pipelined for high throughput
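The point about long-latency but heavily pipelined ALUs can be checked with a little arithmetic. The sketch below assumes an idealized pipeline that accepts one independent operation per cycle; the function name `pipelined_cycles` and the 20-cycle latency are hypothetical values chosen only for illustration, not figures from the notes.

```python
def pipelined_cycles(num_ops: int, latency: int) -> int:
    """Cycles for an ideal pipeline to finish num_ops independent operations.

    Assumes one new operation is issued every cycle and each operation
    takes `latency` cycles from issue to completion.
    """
    if num_ops <= 0:
        return 0
    # The first result appears after `latency` cycles; each further
    # operation completes one cycle after the previous one.
    return latency + num_ops - 1


LATENCY = 20    # hypothetical long-latency ALU
NUM_OPS = 1000  # hypothetical amount of independent work

single_op = pipelined_cycles(1, LATENCY)       # latency view: one operation
many_ops = pipelined_cycles(NUM_OPS, LATENCY)  # throughput view: many operations
cycles_per_op = many_ops / NUM_OPS             # amortized cost per operation

print(single_op, many_ops, cycles_per_op)
```

With enough independent work, the amortized cost per operation approaches one cycle regardless of the latency of a single operation. That is why a throughput-oriented design can tolerate slow individual operations.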
  • Scalability
  • Portability
  • SPMD – Single Program, Multiple Data
  • Threads within a block cooperate via shared memory, atomic operations, and barrier synchronization
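The SPMD idea and the three cooperation mechanisms can be modeled on CPU threads. This is a rough sketch, not GPU code: `run_block` and `kernel` are hypothetical names, a Python list stands in for per-block shared memory, a lock stands in for an atomic add, and `threading.Barrier` plays the role of the block-wide barrier.

```python
import threading


def run_block(data):
    """Run one 'block' of threads; every thread executes the same kernel."""
    block_size = len(data)
    shared = [0] * block_size               # stand-in for per-block shared memory
    barrier = threading.Barrier(block_size) # stand-in for barrier synchronization
    total = [0]                             # accumulator shared by all threads
    total_lock = threading.Lock()           # stand-in for an atomic add
    out = [0] * block_size

    def kernel(tid):
        # Single program: all threads run this body.
        # Multiple data: tid selects the element this thread owns.
        shared[tid] = data[tid] * data[tid]

        # Wait until every thread has written its slot of shared memory.
        barrier.wait()

        # Now it is safe to read a value written by a neighbouring thread.
        out[tid] = shared[tid] + shared[(tid + 1) % block_size]

        # Serialize the read-modify-write so no update is lost.
        with total_lock:
            total[0] += shared[tid]

    threads = [threading.Thread(target=kernel, args=(tid,))
               for tid in range(block_size)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return out, total[0]


out, total = run_block(list(range(8)))
print(out, total)
```

Without the barrier, a thread could read its neighbour's slot before the neighbour has written it. Without the lock, concurrent updates to the accumulator could be lost.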