7 results for synchronous

in Greenwich Academic Literature Archive - UK


Relevance:

20.00%

Publisher:

Abstract:

The parallelization of existing/industrial electromagnetic software using the bulk synchronous parallel (BSP) computation model is presented. The software employs the finite element method with a preconditioned conjugate gradient-type solution for the resulting linear systems of equations. A geometric mesh-partitioning approach is applied within the BSP framework for the assembly and solution phases of the finite element computation. This is combined with a nongeometric, data-driven parallel quadrature procedure for the evaluation of right-hand-side terms in applications involving coil fields. A similar parallel decomposition is applied to the parallel calculation of electron beam trajectories required for the design of tube devices. The BSP parallelization approach adopted is fully portable, conceptually simple, and cost-effective, and it can be applied to a wide range of finite element applications not necessarily related to electromagnetics.

Relevance:

10.00%

Publisher:

Abstract:

This paper addresses the exploitation of overlapping communication with calculation within parallel FORTRAN 77 codes for computational fluid dynamics (CFD) and computational structural dynamics (CSD). The obvious objective is to overlap interprocessor communication with calculation on each processor in a distributed memory parallel system and so improve the efficiency of the parallel implementation. A general strategy for converting synchronous to overlapped communication is presented together with tools to enable its automatic implementation in FORTRAN 77 codes. This strategy is then implemented within the parallelisation toolkit, CAPTools, to facilitate the automatic generation of parallel code with overlapped communications. The success of these tools is demonstrated on two codes from the NAS-PAR and PERFECT benchmark suites. In each case, the tools produce parallel code with overlapped communications which is as good as that which could be generated manually. The parallel performance of the codes also improves in line with expectation.
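
The conversion from synchronous to overlapped communication described in the abstract above can be illustrated with a small sketch. It is written in Python rather than FORTRAN 77 and is not the code CAPTools generates. The function names, the halo values, and the three-point smoothing kernel are all assumptions chosen for illustration, and a background thread stands in for a non-blocking message-passing call.

```python
from concurrent.futures import ThreadPoolExecutor


def fetch_halo():
    """Hypothetical stand-in for receiving boundary values from neighbours."""
    return 0.0, 10.0  # (left halo, right halo), assumed values


def smooth_point(left, centre, right):
    """Three-point average used as a placeholder computation."""
    return (left + centre + right) / 3.0


def step_synchronous(u):
    # Communication completes before any calculation starts.
    left_halo, right_halo = fetch_halo()
    padded = [left_halo] + u + [right_halo]
    return [smooth_point(padded[i - 1], padded[i], padded[i + 1])
            for i in range(1, len(padded) - 1)]


def step_overlapped(u):
    # Requires at least two local points.
    with ThreadPoolExecutor(max_workers=1) as pool:
        pending = pool.submit(fetch_halo)  # start communication
        # Interior points need no halo data, so they are computed
        # while the communication is in flight.
        interior = [smooth_point(u[i - 1], u[i], u[i + 1])
                    for i in range(1, len(u) - 1)]
        left_halo, right_halo = pending.result()  # wait for completion
    # Only the two edge points depend on the received halo values.
    first = smooth_point(left_halo, u[0], u[1])
    last = smooth_point(u[-2], u[-1], right_halo)
    return [first] + interior + [last]
```

Both functions return the same values. The overlapped version only reorders the work so that the calculation that does not depend on incoming data runs before the wait.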

Relevance:

10.00%

Publisher:

Abstract:

Realizing scalable performance on high performance computing systems is not straightforward for single-phenomenon codes (such as computational fluid dynamics [CFD]). This task is magnified considerably when the target software involves the interactions of a range of phenomena that have distinctive solution procedures involving different discretization methods. The problems of addressing the key issues of retaining data integrity and the ordering of the calculation procedures are significant. A strategy for parallelizing this multiphysics family of codes is described for software exploiting finite-volume discretization methods on unstructured meshes using iterative solution procedures. A mesh partitioning-based SPMD approach is used. However, since different variables use distinct discretization schemes, distinct partitions are required; techniques for addressing this issue are described using the mesh-partitioning tool, JOSTLE. In this contribution, the strategy is tested for a variety of test cases under a wide range of conditions (e.g., problem size, number of processors, asynchronous/synchronous communications) using a variety of strategies for mapping the mesh partition onto the processor topology.

Relevance:

10.00%

Publisher:

Abstract:

We report on practical experience using the Oxford BSP Library to parallelize a large electromagnetic code, the British Aerospace finite-difference time-domain code EMMA T:FD3D. The Oxford BSP Library is one of the first realizations of the Bulk Synchronous Parallel computational model to be targeted at numerically intensive scientific (typically Fortran) computing. The BAe EMMA code is one of the first large-scale applications to be parallelized using this library, and it is an important demonstration of the cost effectiveness of the BSP approach. We illustrate how BSP cost-modelling techniques can be used to predict and optimize performance for single-source programs across different parallel platforms. We provide predicted and observed performance figures for an industrial-strength, single-source parallel code for a variety of real parallel architectures: shared memory multiprocessors, workstation clusters and massively parallel platforms.
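
The cost modelling mentioned in the abstract above can be illustrated with the standard BSP cost formula, in which one superstep costs w + h·g + l: w is the largest local computation on any processor, h is the largest number of words any processor sends or receives, g is the machine's communication cost per word, and l is its barrier synchronization cost. The sketch below is a minimal Python illustration under that formula only. It is not the Oxford BSP Library's interface, and the class names, function names, and numeric values are hypothetical.

```python
from dataclasses import dataclass
from typing import Iterable


@dataclass(frozen=True)
class Machine:
    g: float  # communication cost per word (in units of one operation)
    l: float  # barrier synchronization cost (same units)


@dataclass(frozen=True)
class Superstep:
    w: float  # maximum local computation over all processors
    h: float  # maximum words sent or received by any processor


def superstep_cost(step: Superstep, machine: Machine) -> float:
    """Standard BSP cost of one superstep: w + h*g + l."""
    return step.w + step.h * machine.g + machine.l


def program_cost(steps: Iterable[Superstep], machine: Machine) -> float:
    """Predicted cost of a program as the sum of its superstep costs."""
    return sum(superstep_cost(s, machine) for s in steps)
```

Because the program is described once (the list of supersteps) and each platform is described only by its g and l values, the same single-source description can be costed for several machines by swapping the `Machine` instance.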

Relevance:

10.00%

Publisher:

Abstract:

In this paper results obtained from the parallelisation of existing 3D electromagnetic Finite Element codes within the ESPRIT HPCN project PARTEL are presented. The parallelisation procedure, based on the Bulk Synchronous Parallel approach, is outlined and the encouraging results obtained in terms of speed-up on some industrially significant test cases are described and discussed.

Relevance:

10.00%

Publisher:

Abstract:

In this paper the results obtained from the parallelisation of some 3D industrial electromagnetic Finite Element codes within the ESPRIT Europort 2 project PARTEL are presented. The basic guidelines for the parallelisation procedure, based on the Bulk Synchronous Parallel approach, are presented and the encouraging results obtained in terms of speed-up on some selected test cases of practical design significance are outlined and discussed.

Relevance:

10.00%

Publisher:

Abstract:

This study investigated the effect of crank configuration on muscle activity and torque production during submaximal arm crank ergometry. Thirteen non-specifically trained male participants volunteered. During the research trials they completed a warm-up at 15 W before two 3-min exercise stages were completed at 50 and 100 W; subjects used either a synchronous or asynchronous pattern of cranking. During the final 30 s of each submaximal exercise stage, electromyographic and torque production data were collected. After the data had been processed, each parameter was analysed using separate 2-way ANOVA tests with repeated measures. The activity of all muscles increased in line with external workload, although a shift in the temporal pattern of muscle activity was noted between crank configurations. Patterns of torque production during asynchronous and synchronous cranking were distinct. Furthermore, peak, minimum and delta (peak-minimum) torque values were different (P < 0.05) between crank configurations at both workloads. For example, at 100 W, peak torque using synchronous [19.6 (4.3) Nm] cranking was higher (P < 0.05) compared to asynchronous [16.8 (1.6) Nm] cranking. In contrast, minimum torque was lower (P < 0.05) at 100 W using synchronous [4.8 (1.7) Nm] compared to asynchronous [7.3 (1.2) Nm] cranking. There was a distinct bilateral asymmetry in torque production during asynchronous cranking, with the dominant arm transmitting significantly more force to the crank arm. Taken together, these preliminary data demonstrate the complex nature of muscle activity during arm crank ergometry performed with an asynchronous or synchronous crank set-up. Further work is required to determine how muscle activity (EMG activity) and associated patterns of torque production influence physiological responses and functional capacity during arm crank ergometry.