Exploiting Nested Parallelism on Heterogeneous Processors
Contribuinte(s) |
Yeung, Donald Digital Repository at the University of Maryland University of Maryland (College Park, Md.) Electrical Engineering |
---|---|
Data(s) |
22/06/2016
22/06/2016
2016
|
Resumo |
Heterogeneous computing systems have become common in modern processor architectures. These systems, such as those released by AMD, Intel, and Nvidia, include both CPU and GPU cores on a single die available with reduced communication overhead compared to their discrete predecessors. Currently, discrete CPU/GPU systems are limited, requiring larger, regular, highly-parallel workloads to overcome the communication costs of the system. Without the traditional communication delay assumed between GPUs and CPUs, we believe non-traditional workloads could be targeted for GPU execution. Specifically, this thesis focuses on the execution model of nested parallel workloads on heterogeneous systems. We have designed a simulation flow which utilizes widely used CPU and GPU simulators to model heterogeneous computing architectures. We then applied this simulator to non-traditional GPU workloads using different execution models. We also have proposed a new execution model for nested parallelism allowing users to exploit these heterogeneous systems to reduce execution time. |
Identificador |
doi:10.13016/M28B6V |
Idioma(s) |
en |
Palavras-Chave | #Computer engineering #Heterogeneous Execution Models #Heterogeneous Processors #Multigrain Parallelism #Nested Parallelism |
Tipo |
Thesis |