Digital Library

873 results for Parallel Computations

Multi-physics modelling of materials processes on high performance parallel computers

Relevance:

20.00%

Abstract:

Abstract not available

PORBS: A parallel observation-based slicer

Relevance:

20.00%

Abstract:

This paper presents PORBS, a parallelised observation-based slicing tool. The tool is written in Java, which makes it platform independent, and it leverages the build chain of the system being sliced to avoid replicating complex compiler analysis. The target audience of PORBS is software engineers and researchers working with and on tools and techniques for software comprehension, debugging, re-engineering, and maintenance.

On T-Semisimplicity of Iwasawa Modules and Some Computations with Z3-Extensions

Relevance:

20.00%

Abstract:

Thesis (Ph.D.)--University of Washington, 2016-08

Parallel Unstructured Mesh Partitioning

Relevance:

20.00%

A parallel method for solving pentadiagonal systems of linear equations

Relevance:

20.00%

Abstract:

A new parallel approach for solving a pentadiagonal linear system is presented. The parallel partition method for this system and the TW parallel partition method on a chain of P processors are introduced and discussed. This algorithm yields a reduced pentadiagonal linear system of order P - 2, compared with a system of order 2P - 2 for the parallel partition method. More importantly, the new method involves only half as many communication startups as the parallel partition method (and other standard parallel methods) and hence is a far more efficient parallel algorithm.
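
For orientation, the sketch below shows what a pentadiagonal system looks like and how it can be solved serially with banded Gaussian elimination. It is not the parallel partition method or the TW method described in the abstract; it is only a serial baseline. The names `build_pentadiagonal` and `solve_pentadiagonal`, the test coefficients, and the no-pivoting assumption (valid here because the example matrix is diagonally dominant) are all illustrative assumptions.

```python
def build_pentadiagonal(n, diag=10.0, off1=2.0, off2=1.0):
    """Build a dense n x n pentadiagonal matrix (hypothetical test coefficients)."""
    a = [[0.0] * n for _ in range(n)]
    for i in range(n):
        a[i][i] = diag
        if i + 1 < n:
            a[i][i + 1] = a[i + 1][i] = off1
        if i + 2 < n:
            a[i][i + 2] = a[i + 2][i] = off2
    return a


def solve_pentadiagonal(a, b):
    """Solve a x = b for a pentadiagonal matrix a.

    Assumption: a is diagonally dominant, so no pivoting is performed.
    Only entries inside the band (two sub- and two super-diagonals) are touched.
    """
    n = len(b)
    a = [row[:] for row in a]  # work on copies
    x = list(b)
    # Forward elimination: pivot row k only affects rows k+1 and k+2.
    for k in range(n):
        for i in range(k + 1, min(k + 3, n)):
            m = a[i][k] / a[k][k]
            for j in range(k, min(k + 3, n)):
                a[i][j] -= m * a[k][j]
            x[i] -= m * x[k]
    # Back substitution: each row has at most two entries right of the diagonal.
    for i in range(n - 1, -1, -1):
        s = x[i]
        for j in range(i + 1, min(i + 3, n)):
            s -= a[i][j] * x[j]
        x[i] = s / a[i][i]
    return x
```

A parallel method of the kind the abstract describes would split the rows among P processors and leave a small reduced system coupling the subdomains; the serial solver above is the sort of kernel that could then handle that reduced system.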

Parallel optimisation algorithms for multilevel mesh partitioning

Relevance:

20.00%

Abstract:

Abstract not available

A fast parallel hyperspectral coded aperture algorithm for compressive sensing using OpenCL

Relevance:

20.00%

Abstract:

In this paper, we develop a fast implementation of a hyperspectral coded aperture (HYCA) algorithm on different platforms using OpenCL, an open standard for parallel programming on heterogeneous systems, which include a wide variety of devices, from dense multicore systems from major manufacturers such as Intel or ARM to newer accelerators such as graphics processing units (GPUs), field programmable gate arrays (FPGAs), the Intel Xeon Phi and other custom devices. Our proposed implementation of HYCA significantly reduces its computational cost. Our experiments were conducted using simulated data and reveal considerable acceleration factors. Implementations of this kind, which use the same descriptive language on different architectures, are important for assessing whether heterogeneous platforms can deliver efficient hyperspectral image processing in real remote sensing missions.

A single-code software model for multiphysics analysis-engine on parallel and distributed computers with the PHYSICA toolkit

Relevance:

20.00%

Abstract:

Abstract not available

Automatic generation of portable multi-dimensionally decomposed parallel structured mesh software

Relevance:

20.00%

Abstract:

Abstract not available

Communication latency hiding in a parallel conjugate gradient method

Relevance:

20.00%

Abstract:

Abstract not available

Large scale parallel simulations for the assembly of a flip-chip component to a printed circuit

Relevance:

20.00%

Abstract:

Abstract not available

Load-balancing for parallel adaptive unstructured grids

Relevance:

20.00%

Abstract:

A parallel method for the dynamic partitioning of unstructured meshes is outlined. The method includes diffusive load-balancing techniques and an iterative optimisation technique known as relative gain optimisation, which both balances the workload and attempts to minimise the interprocessor communications overhead. It can also optionally include a multilevel strategy. Experiments on a series of adaptively refined meshes indicate that the algorithm provides partitions of equivalent or higher quality than static partitioners (which do not reuse the existing partition), and does so much more rapidly. Perhaps more importantly, the algorithm results in only a small fraction of the amount of data migration compared to the static partitioners.
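
To make the idea of diffusive load balancing concrete, here is a minimal sketch of a generic first-order diffusion scheme on a processor graph. It is not the algorithm from the abstract (which also covers relative gain optimisation and an optional multilevel strategy); the function name `diffuse_load`, the chain topology, and the value of `alpha` are assumptions chosen for illustration.

```python
def diffuse_load(loads, neighbours, alpha, steps):
    """Generic first-order diffusion: each processor repeatedly moves a
    fraction `alpha` of the load difference to or from each neighbour.

    loads      -- list of per-processor workloads
    neighbours -- dict mapping processor index to a list of adjacent indices
                  (assumed symmetric, so total load is conserved)
    alpha      -- diffusion coefficient; assumed smaller than 1 / (max degree)
    """
    loads = list(loads)
    for _ in range(steps):
        flow = [0.0] * len(loads)
        for i, nbrs in neighbours.items():
            for j in nbrs:
                # Positive when neighbour j is more loaded than i.
                flow[i] += alpha * (loads[j] - loads[i])
        loads = [w + f for w, f in zip(loads, flow)]
    return loads


# Hypothetical example: a chain of four processors, all work on processor 0.
chain = {0: [1], 1: [0, 2], 2: [1, 3], 3: [2]}
balanced = diffuse_load([100.0, 0.0, 0.0, 0.0], chain, alpha=0.25, steps=200)
```

In a real partitioner the quantities in `flow` would indicate how much work to migrate across each processor boundary; deciding which mesh elements to move is a separate step that this sketch does not attempt.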

Modelling continuum mechanics phenomena using three dimensional unstructured meshes on massively parallel processors

Relevance:

20.00%

Abstract:

The difficulties encountered in implementing large scale continuum mechanics (CM) codes on multiprocessor systems are now fairly well understood. Despite the claims of shared memory architecture manufacturers to provide effective parallelizing compilers, these have not proved adequate for large or complex programs. Significant programmer effort is usually required to achieve reasonable parallel efficiencies on significant numbers of processors. The paradigm of Single Program Multiple Data (SPMD) domain decomposition with message passing, where each processor runs the same code on a subdomain of the problem and communicates through the exchange of messages, has for some time been demonstrated to provide the required level of efficiency, scalability, and portability across both shared and distributed memory systems, without the need to re-author the code into a new language or even to support differing message passing implementations. Extension of the methods into three dimensions has been enabled through the engineering of PHYSICA, a framework for supporting 3D, unstructured mesh and continuum mechanics modeling. In PHYSICA, six inspectors are used. Part of the challenge for automation of parallelization is being able to prove the equivalence of inspectors so that they can be merged into as few as possible.
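
The SPMD domain decomposition pattern described above can be sketched in a few lines. The example below is a single-process simulation under stated assumptions: message passing is imitated by copying boundary values between Python lists (no MPI or other message passing library is used), the problem is a 1D Jacobi smoothing sweep rather than a 3D unstructured mesh, and all function names are hypothetical. It has no connection to the PHYSICA code itself.

```python
def jacobi_serial(u, sweeps):
    """Reference: Jacobi averaging on the whole 1D domain, end values fixed."""
    u = list(u)
    for _ in range(sweeps):
        u = [u[0]] + [0.5 * (u[i - 1] + u[i + 1])
                      for i in range(1, len(u) - 1)] + [u[-1]]
    return u


def split_with_halos(u, parts):
    """Give each simulated processor an equal block of interior points,
    plus one halo cell on each side."""
    interior = len(u) - 2
    assert interior % parts == 0, "sketch assumes an even split"
    size = interior // parts
    return [u[p * size: p * size + size + 2] for p in range(parts)]


def exchange_halos(subs):
    """Stand-in for message passing: neighbours swap their edge values."""
    for p in range(len(subs) - 1):
        left, right = subs[p], subs[p + 1]
        left[-1] = right[1]   # right's first owned value fills left's halo
        right[0] = left[-2]   # left's last owned value fills right's halo


def jacobi_spmd(u, parts, sweeps):
    subs = split_with_halos(list(u), parts)
    for _ in range(sweeps):
        exchange_halos(subs)
        # Every simulated processor runs the same update on its own subdomain.
        subs = [[s[0]] + [0.5 * (s[i - 1] + s[i + 1])
                          for i in range(1, len(s) - 1)] + [s[-1]]
                for s in subs]
    # Gather the owned (non-halo) values back into one global array.
    result = [u[0]]
    for s in subs:
        result.extend(s[1:-1])
    result.append(u[-1])
    return result
```

The decomposed run reproduces the serial result because each subdomain refreshes its halo cells before every sweep; in a distributed setting that refresh is where the interprocessor messages would occur.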

Multi-physics modelling for design optimization and manufacturing on high-performance parallel systems

Relevance:

20.00%

Abstract:

Abstract not available

Parallel dynamic load-balancing for adaptive unstructured meshes

Relevance:

20.00%

Abstract:

This chapter describes a parallel optimization technique that incorporates a distributed load-balancing algorithm and provides an extremely fast solution to the problem of load-balancing adaptive unstructured meshes. Moreover, a parallel graph contraction technique can be employed to enhance the partition quality, and the resulting strategy outperforms or matches results from existing state-of-the-art static mesh partitioning algorithms. The strategy can also be applied to static partitioning problems. Dynamic procedures have been found to be much faster than static techniques, to provide partitions of similar or higher quality and, in comparison, to involve the migration of only a fraction of the data. The method employs a new iterative optimization technique that balances the workload and attempts to minimize the interprocessor communications overhead. Experiments on a series of adaptively refined meshes indicate that the algorithm provides partitions of equivalent or higher quality than static partitioners (which do not reuse the existing partition), and does so much more quickly. The dynamic evolution of load has three major influences on possible partitioning techniques: cost, reuse, and parallelism. The unstructured mesh may be modified every few time-steps, so the load-balancing must have a low cost relative to that of the solution algorithm in between remeshing.
