Co-Exploration of NLA Kernels and Specification of Compute Elements in Distributed Memory CGRAs
Data(s) |
2014
|
---|---|
Resumo |
Coarse Grained Reconfigurable Architectures (CGRA) are emerging as embedded application processing units in computing platforms for Exascale computing. Such CGRAs are distributed memory multi- core compute elements on a chip that communicate over a Network-on-chip (NoC). Numerical Linear Algebra (NLA) kernels are key to several high performance computing applications. In this paper we propose a systematic methodology to obtain the specification of Compute Elements (CE) for such CGRAs. We analyze block Matrix Multiplication and block LU Decomposition algorithms in the context of a CGRA, and obtain theoretical bounds on communication requirements, and memory sizes for a CE. Support for high performance custom computations common to NLA kernels are met through custom function units (CFUs) in the CEs. We present results to justify the merits of such CFUs. |
Formato |
application/pdf |
Identificador |
http://eprints.iisc.ernet.in/52531/1/2014-Int_Con_on_Emb_Com_Sys_225_2014.pdf Mahadurkar, Mahesh and Merchant, Farhad and Maity, Arka and Vatwani, Kapil and Munje, Ishan and Gopalan, Nandhini and Nandy, SK and Narayan, Ranjani (2014) Co-Exploration of NLA Kernels and Specification of Compute Elements in Distributed Memory CGRAs. In: International Conference on Embedded Computer Systems - Architectures, Modeling, and Simulation (SAMOS), JUL 14-17, 2014, Samos, GREECE, pp. 225-232. |
Publicador |
IEEE |
Relação |
http://ieeexplore.ieee.org/xpl/articleDetails.jsp?arnumber=6893215 http://eprints.iisc.ernet.in/52531/ |
Palavras-Chave | #Supercomputer Education & Research Centre |
Tipo |
Conference Proceedings NonPeerReviewed |