Biblioteca Digital

Co-Exploration of NLA Kernels and Specification of Compute Elements in Distributed Memory CGRAs

**Autoria(s):** Mahadurkar, Mahesh; Merchant, Farhad; Maity, Arka; Vatwani, Kapil; Munje, Ishan; Gopalan, Nandhini; Nandy, SK; Narayan, Ranjani
Data(s)	2014
Resumo	Coarse Grained Reconfigurable Architectures (CGRA) are emerging as embedded application processing units in computing platforms for Exascale computing. Such CGRAs are distributed memory multi- core compute elements on a chip that communicate over a Network-on-chip (NoC). Numerical Linear Algebra (NLA) kernels are key to several high performance computing applications. In this paper we propose a systematic methodology to obtain the specification of Compute Elements (CE) for such CGRAs. We analyze block Matrix Multiplication and block LU Decomposition algorithms in the context of a CGRA, and obtain theoretical bounds on communication requirements, and memory sizes for a CE. Support for high performance custom computations common to NLA kernels are met through custom function units (CFUs) in the CEs. We present results to justify the merits of such CFUs.
Formato	application/pdf
Identificador	http://eprints.iisc.ernet.in/52531/1/2014-Int_Con_on_Emb_Com_Sys_225_2014.pdf Mahadurkar, Mahesh and Merchant, Farhad and Maity, Arka and Vatwani, Kapil and Munje, Ishan and Gopalan, Nandhini and Nandy, SK and Narayan, Ranjani (2014) Co-Exploration of NLA Kernels and Specification of Compute Elements in Distributed Memory CGRAs. In: International Conference on Embedded Computer Systems - Architectures, Modeling, and Simulation (SAMOS), JUL 14-17, 2014, Samos, GREECE, pp. 225-232.
Publicador	IEEE
Relação	http://ieeexplore.ieee.org/xpl/articleDetails.jsp?arnumber=6893215 http://eprints.iisc.ernet.in/52531/
Palavras-Chave	#Supercomputer Education & Research Centre
Tipo	Conference Proceedings NonPeerReviewed

Acesso ao item digital