The Loop-of-Stencil-Reduce Paradigm


Autoria(s): Aldinucci, Marco; Danelutto, Marco; Drocco, Maurizio; Kilpatrick, Peter; Pezzi, Guilherme Peretti; Torquati, Massimo
Data(s)

02/12/2015

Resumo

<p>In this paper we advocate the Loop-of-stencil-reduce pattern as a way to simplify the parallel programming of heterogeneous platforms (multicore+GPUs). Loop-of-Stencil-reduce is general enough to subsume map, reduce, map-reduce, stencil, stencil-reduce, and, crucially, their usage in a loop. It transparently targets (by using OpenCL) combinations of CPU cores and GPUs, and it makes it possible to simplify the deployment of a single stencil computation kernel on different GPUs. The paper discusses the implementation of Loop-of-stencil-reduce within the FastFlow parallel framework, considering a simple iterative data-parallel application as running example (Game of Life) and a highly effective parallel filter for visual data restoration to assess performance. Thanks to the high-level design of the Loop-of-stencil-reduce, it was possible to run the filter seamlessly on a multicore machine, on multi-GPUs, and on both.</p>

Identificador

http://pure.qub.ac.uk/portal/en/publications/the-loopofstencilreduce-paradigm(b5439089-5d5c-4127-98f3-c983a2c14de4).html

http://dx.doi.org/10.1109/Trustcom.2015.628

http://www.scopus.com/inward/record.url?scp=84969262578&partnerID=8YFLogxK

Idioma(s)

eng

Publicador

Institute of Electrical and Electronics Engineers Inc.

Direitos

info:eu-repo/semantics/restrictedAccess

Fonte

Aldinucci , M , Danelutto , M , Drocco , M , Kilpatrick , P , Pezzi , G P & Torquati , M 2015 , The Loop-of-Stencil-Reduce Paradigm . in Proceedings - 14th IEEE International Conference on Trust, Security and Privacy in Computing and Communications, TrustCom 2015 . vol. 3 , 7345644 , Institute of Electrical and Electronics Engineers Inc. , pp. 172-177 , 14th IEEE International Conference on Trust, Security and Privacy in Computing and Communications, TrustCom 2015 , Helsinki , Finland , 20-22 August . DOI: 10.1109/Trustcom.2015.628

Palavras-Chave #fastflow #GPUs #heterogeneous platforms #multi-core #OpenCL #parallel patterns #skeletons #/dk/atira/pure/subjectarea/asjc/1700/1705 #Computer Networks and Communications
Tipo

contributionToPeriodical