111 resultados para GPU acceleration

em QUB Research Portal - Research Directory and Institutional Repository for Queen's University Belfast


Relevância:

100.00% 100.00%

Publicador:

Resumo:

Differential equations are often directly solvable by analytical means only in their one dimensional version. Partial differential equations are generally not solvable by analytical means in two and three dimensions, with the exception of few special cases. In all other cases, numerical approximation methods need to be utilized. One of the most popular methods is the finite element method. The main areas of focus, here, are the Poisson heat equation and the plate bending equation. The purpose of this paper is to provide a quick walkthrough of the various approaches that the authors followed in pursuit of creating optimal solvers, accelerated with the use of graphical processing units, and comparing them in terms of accuracy and time efficiency with existing or self-made non-accelerated solvers.

Relevância:

70.00% 70.00%

Publicador:

Resumo:

How can GPU acceleration be obtained as a service in a cluster? This question has become increasingly significant due to the inefficiency of installing GPUs on all nodes of a cluster. The research reported in this paper is motivated to address the above question by employing rCUDA (remote CUDA), a framework that facilitates Acceleration-as-a-Service (AaaS), such that the nodes of a cluster can request the acceleration of a set of remote GPUs on demand. The rCUDA framework exploits virtualisation and ensures that multiple nodes can share the same GPU. In this paper we test the feasibility of the rCUDA framework on a real-world application employed in the financial risk industry that can benefit from AaaS in the production setting. The results confirm the feasibility of rCUDA and highlight that rCUDA achieves similar performance compared to CUDA, provides consistent results, and more importantly, allows for a single application to benefit from all the GPUs available in the cluster without loosing efficiency.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The design cycle for complex special-purpose computing systems is extremely costly and time-consuming. It involves a multiparametric design space exploration for optimization, followed by design verification. Designers of special purpose VLSI implementations often need to explore parameters, such as optimal bitwidth and data representation, through time-consuming Monte Carlo simulations. A prominent example of this simulation-based exploration process is the design of decoders for error correcting systems, such as the Low-Density Parity-Check (LDPC) codes adopted by modern communication standards, which involves thousands of Monte Carlo runs for each design point. Currently, high-performance computing offers a wide set of acceleration options that range from multicore CPUs to Graphics Processing Units (GPUs) and Field Programmable Gate Arrays (FPGAs). The exploitation of diverse target architectures is typically associated with developing multiple code versions, often using distinct programming paradigms. In this context, we evaluate the concept of retargeting a single OpenCL program to multiple platforms, thereby significantly reducing design time. A single OpenCL-based parallel kernel is used without modifications or code tuning on multicore CPUs, GPUs, and FPGAs. We use SOpenCL (Silicon to OpenCL), a tool that automatically converts OpenCL kernels to RTL in order to introduce FPGAs as a potential platform to efficiently execute simulations coded in OpenCL. We use LDPC decoding simulations as a case study. Experimental results were obtained by testing a variety of regular and irregular LDPC codes that range from short/medium (e.g., 8,000 bit) to long length (e.g., 64,800 bit) DVB-S2 codes. We observe that, depending on the design parameters to be simulated, on the dimension and phase of the design, the GPU or FPGA may suit different purposes more conveniently, thus providing different acceleration factors over conventional multicore CPUs.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Measurements of energetic proton production resulting from the interaction of high-intensity laser pulses with foil targets are described. Through the use of layered foil targets and heating of the target material we are able to distinguish three distinct populations of protons. One high energy population is associated with a proton source near the front surface of the target and is observed to be emitted with a characteristic ring structure. A source of typically lower energy, lower divergence protons originates from the rear surface of the target. Finally, a qualitatively separate source of even lower energy protons and ions is observed with a large divergence. Acceleration mechanisms for these separate sources are discussed.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The acceleration of multi-MeV protons from the rear surface of thin solid foils irradiated by an intense (similar to 10(18) W/cm(2)) and short (similar to 1.5 ps) laser pulse has been investigated using transverse proton probing. The structure of the electric field driving the expansion of the proton beam has been resolved with high spatial and temporal resolution. The main features of the experimental observations, namely, an initial intense sheath field and a late time field peaking at the beam front, are consistent with the results from particle-in-cell and fluid simulations of thin plasma expansion into a vacuum.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

This paper reviews recent experimental activity in the area of optimization, control, and application of laser accelerated proton beams, carried out at the Rutherford Appleton Laboratory and the Laboratoire pour l’Utilisation des Lasers Intenses 100 TW facility in France. In particular, experiments have investigated the role of the scale length at the rear of the plasma in reducing target-normal-sheath-acceleration acceleration efficiency. Results match with recent theoretical predictions and provide information in view of the feasibility of proton fast-ignition applications. Experiments aiming to control the divergence of the proton beams have investigated the use of a laser-triggered microlens, which employs laser-driven transient electric fields in cylindrical geometry, enabling to focus the emitted
protons and select monochromatic beam lets out of the broad spectrum beam. This approach could be advantageous in view
of a variety of applications. The use of laser-driven protons as a particle probe for transient field detection has been developed and
applied to a number of experimental conditions. Recent work in this area has focused on the detection of large-scale self-generated magnetic fields in laser-produced plasmas and the investigation of fields associated to the propagation of relativistic electron both on the surface and in the bulk of targets irradiated by high-power laser pulses.