140 resultados para Loops parallelization
Resumo:
An efficient parallelization algorithm for the Fast Multipole Method which aims to alleviate the parallelization bottleneck arising from lower job-count closer to root levels is presented. An electrostatic problem of 12 million non-uniformly distributed mesh elements is solved with 80-85% parallel efficiency in matrix setup and matrix-vector product using 60GB and 16 threads on shared memory architecture.
Resumo:
Task-parallel languages are increasingly popular. Many of them provide expressive mechanisms for intertask synchronization. For example, OpenMP 4.0 will integrate data-driven execution semantics derived from the StarSs research language. Compared to the more restrictive data-parallel and fork-join concurrency models, the advanced features being introduced into task-parallelmodels in turn enable improved scalability through load balancing, memory latency hiding, mitigation of the pressure on memory bandwidth, and, as a side effect, reduced power consumption. In this article, we develop a systematic approach to compile loop nests into concurrent, dynamically constructed graphs of dependent tasks. We propose a simple and effective heuristic that selects the most profitable parallelization idiom for every dependence type and communication pattern. This heuristic enables the extraction of interband parallelism (cross-barrier parallelism) in a number of numerical computations that range from linear algebra to structured grids and image processing. The proposed static analysis and code generation alleviates the burden of a full-blown dependence resolver to track the readiness of tasks at runtime. We evaluate our approach and algorithms in the PPCG compiler, targeting OpenStream, a representative dataflow task-parallel language with explicit intertask dependences and a lightweight runtime. Experimental results demonstrate the effectiveness of the approach.
Resumo:
We formulate a natural model of loops and isolated vertices for arbitrary planar graphs, which we call the monopole-dimer model. We show that the partition function of this model can be expressed as a determinant. We then extend the method of Kasteleyn and Temperley-Fisher to calculate the partition function exactly in the case of rectangular grids. This partition function turns out to be a square of a polynomial with positive integer coefficients when the grid lengths are even. Finally, we analyse this formula in the infinite volume limit and show that the local monopole density, free energy and entropy can be expressed in terms of well-known elliptic functions. Our technique is a novel determinantal formula for the partition function of a model of isolated vertices and loops for arbitrary graphs.
Resumo:
In a classic study, Kacser & Burns (1981, Genetics 97, 639-666) demonstrated that given certain plausible assumptions, the flux in a metabolic pathway was more or less indifferent to the activity of any of the enzymes in the pathway taken singly. It was inferred from this that the observed dominance of most wild-type alleles with respect to loss-of-function mutations did not require an adaptive, meaning selectionist, explanation. Cornish-Bowden (1987, J. theor. Biol. 125, 333-338) showed that the Kacser-Burns inference was not valid when substrate concentrations were large relative to the relevant Michaelis constants. We find that in a randomly constructed functional pathway, even when substrate levels are small, one can expect high values of control coefficients for metabolic flux in the presence of significant nonlinearities as exemplified by enzymes with Hill coefficients ranging from two to six, or by the existence of oscillatory loops. Under these conditions the flux can be quite sensitive to changes in enzyme activity as might be caused by inactivating one of the two alleles in a diploid. Therefore, the phenomenon of dominance cannot be a trivial ''default'' consequence of physiology but must be intimately linked to the manner in which metabolic networks have been moulded by natural selection.
Resumo:
Barium lanthanum bismuth titanate (Ba1−(3/2)xLaxBi4Ti4O15, x = 0–0.4) ceramics were fabricated using the powders synthesized via the solid-state reaction route. X-ray powder diffraction analysis confirmed the above compositions to be monophasic and belonged to the m = 4 member of the Aurivillius family of oxides. The effect of the partial presence of La3+ on Ba2+ sites on the microstructure, dielectric and relaxor behaviour of BaBi4Ti4O15 (BBT) ceramics was investigated. For the compositions pertaining to x ≤ 0.1, the dielectric constant at both room temperature and in the vicinity of the temperature of the dielectric maximum (Tm) of the parent phase (BBT) increased significantly with an increase in x while Tm remained almost constant. Tm shifted towards lower temperatures accompanied by a decrease in the magnitude of the dielectric maximum (εm) with an increase in the lanthanum content (0.1 < x ≤ 0.4). The dielectric relaxation was modelled using the Vogel–Fulcher relation and a decrease in the activation energy for frequency dispersion with increasing x was observed. The frequency dispersion of Tm was found to decrease with an increase in lanthanum doping, and for compositions corresponding to x ≥ 0.3, Tm was frequency independent. Well-developed P(polarization)–E(electric field) hysteresis loops were observed at 150 °C for all the samples and the remanent polarization (2Pr) was improved from 6.3 µC cm−2 for pure BBT to 13.4 µC cm−2 for Ba0.7La0.2Bi4Ti4O15 ceramics. Dc conductivities and associated activation energies were evaluated using impedance spectroscopy.
Resumo:
The research in software science has so far been concentrated on three measures of program complexity: (a) software effort; (b) cyclomatic complexity; and (c) program knots. In this paper we propose a measure of the logical complexity of programs in terms of the variable dependency of sequence of computations, inductive effort in writing loops and complexity of data structures. The proposed complexity mensure is described with the aid of a graph which exhibits diagrammatically the dependence of a computation at a node upon the computation of other (earlier) nodes. Complexity measures of several example programs have been computed and the related issues have been discussed. The paper also describes the role played by data structures in deciding the program complexity.
Resumo:
A constitutive modeling approach for shape memory alloy (SMA) wire by taking into account the microstructural phase inhomogeneity and the associated solid-solid phase transformation kinetics is reported in this paper. The approach is applicable to general thermomechanical loading. Characterization of various scales in the non-local rate sensitive kinetics is the main focus of this paper. Design of SMA materials and actuators not only involve an optimal exploitation of the hysteresis loops during loading-unloading, but also accounts for fatigue and training cycle identifications. For a successful design of SMA integrated actuator systems, it is essential to include the microstructural inhomogeneity effects and the loading rate dependence of the martensitic evolution, since these factors play predominant role in fatigue. In the proposed formulation, the evolution of new phase is assumed according to Weibull distribution. Fourier transformation and finite difference methods are applied to arrive at the analytical form of two important scaling parameters. The ratio of these scaling parameters is of the order of 10(6) for stress-free temperature-induced transformation and 10(4) for stress-induced transformation. These scaling parameters are used in order to study the effect of microstructural variation on the thermo-mechanical force and interface driving force. It is observed that the interface driving force is significant during the evolution. Increase in the slopes of the transformation start and end regions in the stress-strain hysteresis loop is observed for mechanical loading with higher rates.
Resumo:
Although incidence matrix representation has been used to analyze the Petri net based models of a system, it has the limitation that it does not preserve reflexive properties (i.e., the presence of selfloops) of Petri nets. But in many practical applications self-loops play very important roles. This paper proposes a new representation scheme for general Petri nets. This scheme defines a matrix called "reflexive incidence matrix (RIM) c which is a combination of two matrices, a "base matrix Cb,,, and a "power matrix CP." This scheme preserves the reflexive and other properties of the Petri nets. Through a detailed analysis it is shown that the proposed scheme requires less memory space and less processing time for answering commonly encountered net queries compared to other schemes. Algorithms to generate the RIM from the given net description and to decompose RIM into input and output function matrices are also given. The proposed Petri net representation scheme is very useful to model and analyze the systems having shared resources, chemical processes, network protocols, etc., and to evaluate the performance of asynchronous concurrent systems.
Resumo:
A simple algorithm has been developed to detect β-bends and 'loops'-chain reversals containing five amino acid residues, using only coordinates of Cα-atoms from crystal structure data of globular proteins using the above algorithm. Analysis of bends have showed that the total number of bends in each protein (TB) is linearly related to total number of non-hydrophobic residues in that protein which in turn is related linearly to total number of amino acid residues. Secondly, we found that a large number of consecutive bends occur in each protein which give rise to on an average only three independent residues per turn. Positional preference of amino acid residues in chain reversals is stressed. Consideration of pairs of amino acid residues in positions (i + 1) and (i + 2) of bends seems to provide a more reliable basis for predicting chain reversals in proteins.
Resumo:
The recently developed single network adaptive critic (SNAC) design has been used in this study to design a power system stabiliser (PSS) for enhancing the small-signal stability of power systems over a wide range of operating conditions. PSS design is formulated as a discrete non-linear quadratic regulator problem. SNAC is then used to solve the resulting discrete-time optimal control problem. SNAC uses only a single critic neural network instead of the action-critic dual network architecture of typical adaptive critic designs. SNAC eliminates the iterative training loops between the action and critic networks and greatly simplifies the training procedure. The performance of the proposed PSS has been tested on a single machine infinite bus test system for various system and loading conditions. The proposed stabiliser, which is relatively easier to synthesise, consistently outperformed stabilisers based on conventional lead-lag and linear quadratic regulator designs.
Resumo:
This paper discusses the consistent regularization property of the generalized α method when applied as an integrator to an initial value high index and singular differential-algebraic equation model of a multibody system. The regularization comes from within the discretization itself and the discretization remains consistent over the range of values the regularization parameter may take. The regularization involves increase of the smallest singular values of the ill-conditioned Jacobian of the discretization and is different from Baumgarte and similar techniques which tend to be inconsistent for poor choice of regularization parameter. This regularization also helps where pre-conditioning the Jacobian by scaling is of limited effect, for example, when the scleronomic constraints contain multiple closed loops or singular configuration or when high index path constraints are present. The feed-forward control in Kane's equation models is additionally considered in the numerical examples to illustrate the effect of regularization. The discretization presented in this work is adopted to the first order DAE system (unlike the original method which is intended for second order systems) for its A-stability and same order of accuracy for positions and velocities.
Resumo:
. Measurement of the relation between polarisation P and electric field E for lithium potassium sulphate (LiKSO4) was made in the low temperature range below room temperature. The P-E hysteresis loops along the c axis of LiKSO4 were observed in the low-temperature phase below the lower transition point Ttl of about -70 degrees C, and in the intermediate phase below the upper transition point Ttu of about -25 degrees C. These phases were found to be ferroelectric. The temperature dependence of the spontaneous polarisation Ps and the electric coercive field Ec were obtained.
Resumo:
An important limitation of the existing IGC algorithms, is that they do not explicitly exploit the inherent time scale separation that exist in aerospace vehicles between rotational and translational motions and hence can be ineffective. To address this issue, a two-loop partial integrated guidance and control (PIGC) scheme has been proposed in this paper. In this design, the outer loop uses a recently developed, computationally efficient, optimal control formulation named as model predictive static programming. It gives the commanded pitch and yaw rates whereas necessary roll-rate command is generated from a roll-stabilization loop. The inner loop tracks the outer loop commands using the Dynamic inversion philosophy. Uncommonly, Six-Degree of freedom (Six-DOF) model is used directly in both the loops. This intelligent manipulation preserves the inherent time scale separation property between the translational and rotational dynamics, and hence overcomes the deficiency of current IGC designs, while preserving its benefits. Comparative studies of PIGC with one loop IGC and conventional three loop design were carried out for engaging incoming high speed target. Simulation studies demonstrate the usefulness of this method.
Resumo:
We obtained the images of the eastern part of the solar corona in the Fe xiv 530.3 nm (green) and Fe x 637.4 nm (red) coronal emission lines during the total solar eclipse of 29 March 2006 at Manavgat, Antalya, Turkey. The images were obtained using a 35 cm Meade telescope equipped with a Peltier-cooled 2k x 2k CCD and 0.3 nm pass-band interference filters at the rates of 2.95 s (exposure times of 100 ms) and 2.0 s (exposure times of 300 ms) in the Fe xiv and Fe x emission lines,respectively. The analysis of the data indicates intensity variations at some locations with period of strongest power around 27 s for the green line and 20 s for the red line. These results confirm earlier findings of variations in the continuum intensity with periods in the range of 5 to 56 s by Singh et al. (Solar Phys. 170, 235, 1997). The wavelet analysis has been used to identify significant intensity oscillations at all pixels within our field of view. Significant oscillations with high probability estimates were detected for some locations only. These locations seem to follow the boundary of an active region and in the neighborhood, rather than within the loops themselves. These intensity oscillations may be caused by fast magneto-sonic waves in the solar corona and partly account for heating of the plasma in the corona.
Resumo:
Gadolinium strontium manganite single crystals of the composition Gd0.5Sr0.5MnO3 were grown using the optical float zone method. We report here the magnetic and magnetotransport properties of these crystals. A large magnetoresistance similar to 10(9)% was observed at 45 K under the application of a 110 kOe field. We have observed notable thermomagnetic anomalies such as open hysteresis loops across the broadened first-order transition between the charge order insulator and the ferromagnetic metallic phase while traversing the magnetic field-temperature (H-T) plane isothermally or isomagnetically. In order to discern the cause of these observed anomalies, the H-T phase diagram for Gd0.5Sr0.5MnO3 is formulated using the magnetization-field (M-H), magnetization-temperature (M-T) and resistance-temperature (R-T) measurements. The temperature dependence of the critical field (i.e. H-up, the field required for transformation to the ferromagnetic metallic phase) is non-monotonic. We note that the non-monotonic variation of the supercooling limit is anomalous according to the classical concepts of the first-order phase transition. Accordingly, H-up values below similar to 20 K are unsuitable to represent the supercooling limit. It is possible that the nature of the metastable states responsible for the observed open hysteresis loops is different from that of the supercooled ones.