342 resultados para MPI


Relevância:

20.00% 20.00%

Publicador:

Resumo:

This paper presents a decentralized/peer-to-peer architecture-based parallel version of the vector evaluated particle swarm optimization (VEPSO) algorithm for multi-objective design optimization of laminated composite plates using message passing interface (MPI). The design optimization of laminated composite plates being a combinatorially explosive constrained non-linear optimization problem (CNOP), with many design variables and a vast solution space, warrants the use of non-parametric and heuristic optimization algorithms like PSO. Optimization requires minimizing both the weight and cost of these composite plates, simultaneously, which renders the problem multi-objective. Hence VEPSO, a multi-objective variant of the PSO algorithm, is used. Despite the use of such a heuristic, the application problem, being computationally intensive, suffers from long execution times due to sequential computation. Hence, a parallel version of the PSO algorithm for the problem has been developed to run on several nodes of an IBM P720 cluster. The proposed parallel algorithm, using MPI's collective communication directives, establishes a peer-to-peer relationship between the constituent parallel processes, deviating from the more common master-slave approach, in achieving reduction of computation time by factor of up to 10. Finally we show the effectiveness of the proposed parallel algorithm by comparing it with a serial implementation of VEPSO and a parallel implementation of the vector evaluated genetic algorithm (VEGA) for the same design problem. (c) 2012 Elsevier Ltd. All rights reserved.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

We discuss the computational bottlenecks in molecular dynamics (MD) and describe the challenges in parallelizing the computation-intensive tasks. We present a hybrid algorithm using MPI (Message Passing Interface) with OpenMP threads for parallelizing a generalized MD computation scheme for systems with short range interatomic interactions. The algorithm is discussed in the context of nano-indentation of Chromium films with carbon indenters using the Embedded Atom Method potential for Cr-Cr interaction and the Morse potential for Cr-C interactions. We study the performance of our algorithm for a range of MPI-thread combinations and find the performance to depend strongly on the computational task and load sharing in the multi-core processor. The algorithm scaled poorly with MPI and our hybrid schemes were observed to outperform the pure message passing scheme, despite utilizing the same number of processors or cores in the cluster. Speed-up achieved by our algorithm compared favorably with that achieved by standard MD packages. (C) 2013 Elsevier Inc. All rights reserved.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The aim of this technical report is to present some detailed explanations in order to help to understand and use the Message Passing Interface (MPI) parallel programming for solving several mixed integer optimization problems. We have developed a C++ experimental code that uses the IBM ILOG CPLEX optimizer within the COmputational INfrastructure for Operations Research (COIN-OR) and MPI parallel computing for solving the optimization models under UNIX-like systems. The computational experience illustrates how can we solve 44 optimization problems which are asymmetric with respect to the number of integer and continuous variables and the number of constraints. We also report a comparative with the speedup and efficiency of several strategies implemented for some available number of threads.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

O protótipo de uma interface gráfica para análise de imagens SAR (radar de abertura sintética - da sigla em inglês (Synthetic Aperture Radar) vem sendo desenvolvido e aperfeiçoado. Em seu estágio atual, a ferramenta é capaz de extrair o espectro direcional de ondas e o campo de ventos de imagens ERS 1&2. Dentre as diversas abordagens empregadas no desenvolvimento do protótipo, pode-se citar a avaliação da metodologia atualmente utilizada na interpretação das imagens. Neste sentido, é apresentada uma proposta de geração de imagens SAR sintéticas, de forma a viabilizar a avaliação do método de extração de parâmetros meteo-oceanográficos a partir do acoplamento do simulador de imagens com o interpretador utilizado no protótipo. Uma comparação detalhada entre os valores empregados para simulação e os resultados sacados da interface torna possível identificar possíveis deficiências e indicar aperfeiçoamentos ao método. Desta forma, o presente trabalho, além de discutir o protótipo para análise de imagens SAR, apresenta o desenvolvimento do acoplamento do simulador de imagens. Os resultados obtidos indicam que o acoplamento atingiu parcialmente seu objetivo apresentando bons resultados na interpretação das direções. Porém, ainda há algumas questões a serem avaliadas para a obtenção de resultados mais consistentes.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

介绍了一种MPI程序死锁检测的静态方法以及该方法所处理的程序模型.为实现该方法,提出了比例方程组(一种特殊线性方程组)的概念并设计了求解方程组最简解的线性时空复杂度的高效算法.算法由一个四遍扫描过程与一个主控程序构成.主控程序用来处理并行计算节点计算机构成的划分.四遍扫描过程采用深度优先搜索方法确定方程组中各变元之间的比例关系.通过该算法所获得的最简解,任意多个变元之间的比例关系能在常数时间内获得.证明了该算法的正确性,并采用Java语言实现了该算法的标准程序库.该程序库目前已运行于MPI同步通信静态死锁检测的软件框架中.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

本文提出了MPI程序的同步通信模型及三个基本简化模型,给出了判定这些基本模型是否死锁的方法和定理并予以了严格证明.简化模型的死锁检测理论和方法是真实MPI程序死锁检测的必要基础.这些方法基于程序静态分析,必要时进行运行时检测,它们对两种简化模型可以在程序编译前确定是否死锁,对另外一种模型,在编译前可静态确定部分死锁,运行中可确定其他死锁.我们的理论可以证明MPI程序死锁检测主流算法的正确性,其方法可以减少它们对客户源代码或MPI profiling接口的修改量,从而大大降低死锁检测开销,并可在运行前判定死锁.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

静态检测MPI程序同步通信死锁比较困难,通常需要建立程序模型.顺序模型是其他所有复杂模型的基础.通过一种映射方法将顺序模型转化为字符串集合,将死锁检测问题转化为等价的多队列字符申匹配问题,从而设计并实现了一种MPI同步通信顺序模型的静态死锁检测算法.该算法的性能优于通常的环检测方法,并能适应动态消息流.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

针对全节点空间上MPI同步通信程序死锁检测的现有穷举方法的弱点,提出了该场景中MPI程序的两种参数化模型:一次参数化模型和二次参数化模型。通过分析一次参数化模型的一类不变式的性质,给出了该模型的死锁检测算法。提出了同构节点空间的一系列概念和性质并由此设计出该算法的几种优化方法,以加快某些特定程序模型死锁检测的过程。仿真实验表明该算法及其优化方法比现有方法的性能提高若干数量级。

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The mass resolved multiphoton ionization (MPI) spectra of methyl iodide were obtained in the 430-490 nm region using a time-of-flight (TOF) mass spectrometer. They have the same vibrational structure, which testifies that the fragment species, in the wavelength region under study, are from the photodissociation of multiphoton ionized molecular parent ions. Some features in the spectra are identified as three-photon excitations to 6p and 7s Rydberg states of methyl iodide. Two new vibrational structures of some Rydberg states are observed. The mechanism of ionization and dissociation is also discussed. (C) 2001 Elsevier Science B.V. All rights reserved.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Most parallel computing applications in highperformance computing use the Message Passing Interface (MPI) API. Given the fundamental importance of parallel computing to science and engineering research, application correctness is paramount. MPI was originally developed around 1993 by the MPI Forum, a group of vendors, parallel programming researchers, and computational scientists. However, the document defining the standard is not issued by an official standards organization but has become a de facto standard © 2011 ACM.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

We propose a dynamic verification approach for large-scale message passing programs to locate correctness bugs caused by unforeseen nondeterministic interactions. This approach hinges on an efficient protocol to track the causality between nondeterministic message receive operations and potentially matching send operations. We show that causality tracking protocols that rely solely on logical clocks fail to capture all nuances of MPI program behavior, including the variety of ways in which nonblocking calls can complete. Our approach is hinged on formally defining the matches-before relation underlying the MPI standard, and devising lazy update logical clock based algorithms that can correctly discover all potential outcomes of nondeterministic receives in practice. can achieve the same coverage as a vector clock based algorithm while maintaining good scalability. LLCP allows us to analyze realistic MPI programs involving a thousand MPI processes, incurring only modest overheads in terms of communication bandwidth, latency, and memory consumption. © 2011 IEEE.