Biblioteca Digital

246 resultados para parallel architectures

A language-independent parallel refactoring framework

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Recent trends towards increasingly parallel computers mean that there needs to be a seismic shift in programming practice. The time is rapidly approaching when most programming will be for parallel systems. However, most programming techniques in use today are geared towards sequential, or occasionally small-scale parallel, programming. While refactoring has so far mainly been applied to sequential programs, it is our contention that refactoring can play a key role in significantly improving the programmability of parallel systems, by allowing the programmer to apply a set of well-defined transformations in order to parallelise their programs. In this paper, we describe a new language-independent refactoring approach that helps introduce and tune parallelism through high-level design patterns targeting a set of well-specified parallel skeletons. We believe this new refactoring process is the key to allowing programmers to truly start thinking in parallel. © 2012 ACM.

Parallel Programming

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Recent trends in computing systems, such as multi-core processors and cloud computing, expose tens to thousands of processors to the software. Software developers must respond by introducing parallelism in their software. To obtain highest performance, it is not only necessary to identify parallelism, but also to reason about synchronization between threads and the communication of data from one thread to another. This entry gives an overview on some of the most common abstractions that are used in parallel programming, namely explicit vs. implicit expression of parallelism and shared and distributed memory. Several parallel programming models are reviewed and categorized by means of these abstractions. The pros and cons of parallel programming models from the perspective of performance and programmability are discussed.

Achieving Multiprogramming Scalability of Parallel Programs on Intel SMP Platforms: Nanothreading in the Linux Kernel

Relevância:

20.00% 20.00%

Publicador:

A Tool to Schedule Parallel Applications on Multiprocessors: The NANOS CPU Manager

Relevância:

20.00% 20.00%

Publicador:

Deployment on GPUs of an application in computational atomic physics

Relevância:

20.00% 20.00%

Publicador:

Resumo:

This paper describes the deployment on GPUs of PROP, a program of the 2DRMP suite which models electron collisions with H-like atoms and ions. Because performance on GPUs is better in single precision than in double precision, the numerical stability of the PROP program in single precision has been studied. The numerical quality of PROP results computed in single precision and their impact on the next program of the 2DRMP suite has been analyzed. Successive versions of the PROP program on GPUs have been developed in order to improve its performance. Particular attention has been paid to the optimization of data transfers and of linear algebra operations. Performance obtained on several architectures (including NVIDIA Fermi) are presented.

Scaling Irregular Parallel Codes with Minimal Programming Effort

Relevância:

20.00% 20.00%

Publicador:

Exploiting Simultaneous Multithreading for Parallel Mesh Generation: A Multigrain Approach on Deep Multiprocessors:13th International Meshing Roundtable (IMR)

Relevância:

20.00% 20.00%

Publicador:

Application Awareness in Adaptation Middleware: Balancing Transparency with Performance and Adaptivity:SIAM Conference on Parallel Processing for Scientific Computing (SIAM PP), Miniworkshop on Adaptivity in Parallel and Distributed Computing through Interoperating Systems and Applications

Relevância:

20.00% 20.00%

Publicador:

Factory: An Object-Oriented Parallel Programming Substrate for Deep Multiprocessors

Relevância:

20.00% 20.00%

Publicador:

2-D Parallel Constrained Delaunay Mesh Generation: A Multigrain Approach on Deep Multiprocessors:Workshop in Programming Models for HPCS Ultra-Scale Applications

Relevância:

20.00% 20.00%

Publicador:

Synthesizing Parallel Programming Models for Asymmetric Multi-Core Systems

Relevância:

20.00% 20.00%

Publicador:

RAxML-CELL: Parallel Phylogenetic Tree Construction on the Cell Broadband Engine

Relevância:

20.00% 20.00%

Publicador:

Unified Scheduling of Polymorphic Parallelism on the Cell Processor:Abstracts of the 2008 SIAM Conference on Parallel Processing for Scientific Computing, Miniworkshop on the Cell Processor (SIAM PP)

Relevância:

20.00% 20.00%

Publicador:

Formic: Cost-Efficient and Scalable Prototyping of Manycore Architectures

Relevância:

20.00% 20.00%

Publicador:

Reconciling Explicit with Implicit Parallelism:2012 SIAM Conference on Parallel Processing for Scientific Computing (SIAM PP), Savannah, GA, USA

Relevância:

20.00% 20.00%

Publicador:

«
1
2
...
9
10
11
12
13
14
15
16
17
»