969 resultados para Massive Parallelization
Resumo:
Diffuse optical tomographic image reconstruction uses advanced numerical models that are computationally costly to be implemented in the real time. The graphics processing units (GPUs) offer desktop massive parallelization that can accelerate these computations. An open-source GPU-accelerated linear algebra library package is used to compute the most intensive matrix-matrix calculations and matrix decompositions that are used in solving the system of linear equations. These open-source functions were integrated into the existing frequency-domain diffuse optical image reconstruction algorithms to evaluate the acceleration capability of the GPUs (NVIDIA Tesla C 1060) with increasing reconstruction problem sizes. These studies indicate that single precision computations are sufficient for diffuse optical tomographic image reconstruction. The acceleration per iteration can be up to 40, using GPUs compared to traditional CPUs in case of three-dimensional reconstruction, where the reconstruction problem is more underdetermined, making the GPUs more attractive in the clinical settings. The current limitation of these GPUs in the available onboard memory (4 GB) that restricts the reconstruction of a large set of optical parameters, more than 13, 377. (C) 2010 Society of Photo-Optical Instrumentation Engineers. DOI: 10.1117/1.3506216]
Resumo:
The modern GPUs are well suited for intensive computational tasks and massive parallel computation. Sparse matrix multiplication and linear triangular solver are the most important and heavily used kernels in scientific computation, and several challenges in developing a high performance kernel with the two modules is investigated. The main interest it to solve linear systems derived from the elliptic equations with triangular elements. The resulting linear system has a symmetric positive definite matrix. The sparse matrix is stored in the compressed sparse row (CSR) format. It is proposed a CUDA algorithm to execute the matrix vector multiplication using directly the CSR format. A dependence tree algorithm is used to determine which variables the linear triangular solver can determine in parallel. To increase the number of the parallel threads, a coloring graph algorithm is implemented to reorder the mesh numbering in a pre-processing phase. The proposed method is compared with parallel and serial available libraries. The results show that the proposed method improves the computation cost of the matrix vector multiplication. The pre-processing associated with the triangular solver needs to be executed just once in the proposed method. The conjugate gradient method was implemented and showed similar convergence rate for all the compared methods. The proposed method showed significant smaller execution time.
Resumo:
Understanding the natural variability of the Earth's climate system and accurately identifying potential anthropogenic influences requires long term, geographically distributed records of key climate indicators, such as temperature and precipitation that extend prior to the last 400. years of the Holocene. Reef corals provide an excellent source of high resolution climate records, and importantly represent the tropical marine environment where palaeoclimate data are urgently required. Recent decades have seen significant improvement in our understanding of coral biomineralisation, the associated uptake of geochemical proxies and methods of identifying and understanding the effects of both early and late, post depositional diagenetic alteration. These processes all have significant implications for interpreting geochemical proxies relevant to palaeoclimatic reconstructions. This paper reviews the current 'state of the art' in terms of coral based palaeoclimate reconstructions and highlights a key remaining problem. The majority of coral based palaeoclimate research has been derived from massive colonies of Porites. However, massive Porites are not globally abundant and may not provide material of a particular age of interest in those regions where they are present. Therefore, there is great potential for alternate coral genera to act as complimentary climate archives. While it remains critical to consider five key factors - vital effects, differential growth morphologies, geochemical heterogeneity in the skeletal ultrastructure, transfer equation selection and diagenetic screening of skeletal material - in order to allow the highest level of accuracy in coral palaeoclimate reconstructions, it is also important to develop alternate taxa for palaeoclimate studies in regions where Porites colonies are absent or rare. Currently as many as nine genera other than Porites have proven at least limited utility in palaeothermometry, most of which are found in the Atlantic/Caribbean region where massive Porites do not exist. Even branching taxa such as Acropora have significant potential to preserve environmental archives. Increasing this capability will greatly expand the number of potential geochemical archives available for longer term temporal records of palaeoclimate.
Resumo:
This thesis presents a novel program parallelization technique incorporating with dynamic and static scheduling. It utilizes a problem specific pattern developed from the prior knowledge of the targeted problem abstraction. Suitable for solving complex parallelization problems such as data intensive all-to-all comparison constrained by memory, the technique delivers more robust and faster task scheduling compared to the state-of-the art techniques. Good performance is achieved from the technique in data intensive bioinformatics applications.
Resumo:
The Archean Hollandaire volcanogenic massive sulfide deposit is a felsic–siliciclastic VMS deposit located in the Murchison Domain of the Youanmi Terrane, Yilgarn Craton, Western Australia. It is hosted in a succession of turbidites, mudstones and coherent rhyodacite sills and has been metamorphosed to upper greenschist/lower amphibolite facies and includes a pervasive S1 deformational fabric. The coherent rhyodacitic sills are interpreted as syndepositional based on geochemical similarities with well-known VMS-associated felsic rocks and similar foliations to the metasediments. We offer several explanations for the absence of textural evidence (e.g. breccias) for syn-depositional origins: 1) the subaqueous sediments were dehydrated by long-lived magmatism such that no pore-water remained to drive quench fragmentation; 2) pore-space occlusion by burial and/or, 3) alteration overprinting and obscuring of primary breccias at contact margins. Mineralisation occurs by sub-seafloor replacement of original host rocks in two ore bodies, Hollandaire Main (~125 x >500 m and ~8 m thick) and Hollandaire West (~100 x 470 m and ~5 m thick), and occurs in three main textural styles, massive sulfides, which are exclusively hosted in turbidites and mudstones, and stringer and disseminated sulfides, which are also hosted in coherent rhyodacite. Most sulfides have textures consistent with remobilisation and recrystallisation. Hydrothermal metamorphism has altered the hangingwall and footwall to similar degrees, with significant gains in Mg, Mn and K and losses in Na, Ca and Sr. Garnet and staurolite porphyryoblasts also exhibit a footprint around mineralisation, extending up to 30 m both above and below the ore zone. High precision thermal ionisation mass spectrometry of zircons extracted from the coherent rhyodacite yield an age of 2759.5 ± 0.9 Ma, which along with geochemical comparisons, places the succession within the 2760–2735 Ma Greensleeves Formation of the Polelle Group of the Murchison Supergroup. Geochemical and geochronological evidence link the coherent rhyodacite sills to the Peter Well Granodiorite pluton ~2 km to the W, which acted as the heat engine driving hydrothermal circulation during VMS mineralisation. This study highlights the importance of both: detailed physical volcanological studies from which an accurate assessment of timing relationships, particularly the possibility of intrusions dismembering ore horizons, can be made; and identifying synvolcanic plutons and other similar suites, for VMS exploration targets in the Youanmi Terrane and worldwide.
Resumo:
We report sensitive high mass resolution ion microprobe, stable isotopes (SHRIMP SI) multiple sulfur isotope analyses (32S, 33S, 34S) to constrain the sources of sulfur in three Archean VMS deposits—Teutonic Bore, Bentley, and Jaguar—from the Teutonic Bore volcanic complex of the Yilgarn Craton, Western Australia, together with sedimentary pyrites from associated black shales and interpillow pyrites. The pyrites from VMS mineralization are dominated by mantle sulfur but include a small amount of slightly negative mass-independent fractionation (MIF) anomalies, whereas sulfur from the pyrites in the sedimentary rocks has pronounced positive MIF, with ∆33S values that lie between 0.19 and 6.20‰ (with one outlier at −1.62‰). The wall rocks to the mineralization include sedimentary rocks that have contributed no detectable positive MIF sulfur to the VMS deposits, which is difficult to reconcile with the leaching model for the formation of these deposits. The sulfur isotope data are best explained by mixing between sulfur derived from a magmatic-hydrothermal fluid and seawater sulfur as represented by the interpillow pyrites. The massive sulfide lens pyrites have a weighted mean ∆33S value of −0.27 ± 0.05‰ (MSWD = 1.6) nearly identical with −0.31 ± 0.08‰ (MSWD = 2.4) for pyrites from the stringer zone, which requires mixing to have occurred below the sea floor. We employed a two-component mixing model to estimate the contribution of seawater sulfur to the total sulfur budget of the two Teutonic Bore volcanic complex VMS deposits. The results are 15 to 18% for both Teutonic Bore and Bentley, much higher than the 3% obtained by Jamieson et al. (2013) for the giant Kidd Creek deposit. Similar calculations, carried out for other Neoarchean VMS deposits give value between 2% and 30%, which are similar to modern hydrothermal VMS deposits. We suggest that multiple sulfur isotope analyses may be used to predict the size of Archean VMS deposits and to provide a vector to ore deposit but further studies are needed to test these suggestions.
Resumo:
The LISA Parameter Estimation Taskforce was formed in September 2007 to provide the LISA Project with vetted codes, source distribution models and results related to parameter estimation. The Taskforce's goal is to be able to quickly calculate the impact of any mission design changes on LISA's science capabilities, based on reasonable estimates of the distribution of astrophysical sources in the universe. This paper describes our Taskforce's work on massive black-hole binaries (MBHBs). Given present uncertainties in the formation history of MBHBs, we adopt four different population models, based on (i) whether the initial black-hole seeds are small or large and (ii) whether accretion is efficient or inefficient at spinning up the holes. We compare four largely independent codes for calculating LISA's parameter-estimation capabilities. All codes are based on the Fisher-matrix approximation, but in the past they used somewhat different signal models, source parametrizations and noise curves. We show that once these differences are removed, the four codes give results in extremely close agreement with each other. Using a code that includes both spin precession and higher harmonics in the gravitational-wave signal, we carry out Monte Carlo simulations and determine the number of events that can be detected and accurately localized in our four population models.
Resumo:
The effect of massive blowing rates on the steady laminar compressible boundary-layer flow with variable gas properties at a 3-dim. stagnation point (which includes both nodal and saddle points of attachment) has been studied. The equations governing the flow have been solved numerically using an implicit finite-difference scheme in combination with the quasilinearization technique for nodal points of attachment but employing a parametric differentiation technique instead of quasilinearization for saddle points of attachment. It is found that the effect of massive blowing rates is to move the viscous layer away from the surface. The effect of the variation of the density- viscosity product across the boundary layer is found to be negligible for massive blowing rates but significant for moderate blowing rates. The velocity profiles in the transverse direction for saddle points of attachment in the presence of massive blowing show both the reverse flow as well as velocity overshoot.
Resumo:
A semi-similar solution of an unsteady laminar compressible three-dimensional stagnation point boundary layer flow with massive blowing has been obtained when the free stream velocity varies arbitrarily with time. The resulting partial differential equations governing the flow have been solved numerically using an implicit finite-difference scheme with a quasi-linearization technique in the nodal point region and an implicit finite-difference scheme with a parametric differentiation technique in the saddle point region. The results have been obtained for two particular unsteady free stream velocity distributions: (i) an accelerating stream and (ii) a fluctuating stream. Results show that the skin-friction and heat-transfer parameters respond significantly to the time dependent arbitrary free stream velocity. Velocity and enthalpy profiles approach their free stream values faster as time increases. There is a reverse flow in the y-wise velocity profile, and overshoot in the x-wise velocity and enthalpy profiles in the saddle point region, which increase as injection and wall temperature increase. Location of the dividing streamline increases as injection increases, but as the wall temperature and time increase, it decreases.
Resumo:
he notion of the gravity-induced electric field has been applied to an entire self-gravitating massive body. The resulting electric polarization of the otherwise neutral body, when taken in conjunction with the latter's rotation, is shown to generate an axial-magnetic field of the right type and order of magnitude for certain astrophysical objects. In the present treatment the electric polarization is calculated in the ion-continuum Thomas-Fermi approximation while the electrodynamics of the continuous medium is treated in the nonrelativistic approximation.
Resumo:
A numerical procedure, based on the parametric differentiation and implicit finite difference scheme, has been developed for a class of problems in the boundary-layer theory for saddle-point regions. Here, the results are presented for the case of a three-dimensional stagnation-point flow with massive blowing. The method compares very well with other methods for particular cases (zero or small mass blowing). Results emphasize that the present numerical procedure is well suited for the solution of saddle-point flows with massive blowing, which could not be solved by other methods.