202 results for GPU acceleration


Relevance:

10.00% 10.00%

Publisher:

Abstract:

Graph algorithms have been shown to possess enough parallelism to keep several computing resources busy, even hundreds of cores on a GPU. Unfortunately, tuning their implementation for efficient execution on a particular hardware configuration of heterogeneous systems consisting of multicore CPUs and GPUs is challenging, time-consuming, and error-prone. To address these issues, we propose a domain-specific language (DSL), Falcon, for implementing graph algorithms that (i) abstracts the hardware, (ii) provides constructs to write explicitly parallel programs at a higher level, and (iii) can work with general algorithms that may change the graph structure (morph algorithms). We illustrate the usage of our DSL to implement local computation algorithms (that do not change the graph structure) and morph algorithms such as Delaunay mesh refinement, survey propagation, and dynamic SSSP on GPU and multicore CPUs. Using a set of benchmark graphs, we illustrate that the generated code performs close to the state-of-the-art hand-tuned implementations.
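As a point of reference for the local computation algorithms mentioned above, here is a minimal Python sketch of single-source shortest paths (SSSP) by repeated edge relaxation. It is not Falcon syntax and not the authors' generated code. The function name `sssp_relax`, the edge-list representation, and the assumption of non-negative edge weights are illustrative choices only.

```python
import math

def sssp_relax(num_nodes, edges, source):
    """Bellman-Ford style SSSP: relax every edge until no distance changes.

    edges is a list of (u, v, weight) tuples; weights are assumed non-negative.
    """
    dist = [math.inf] * num_nodes
    dist[source] = 0.0
    # At most num_nodes - 1 passes are needed when there are no negative cycles.
    for _ in range(num_nodes - 1):
        changed = False
        for u, v, w in edges:
            # Relaxation step: each edge can be checked independently, which is
            # the kind of parallelism a GPU thread-per-edge scheme would exploit.
            if dist[u] + w < dist[v]:
                dist[v] = dist[u] + w
                changed = True
        if not changed:
            break
    return dist
```

The graph structure is never modified here, which is what distinguishes this kind of algorithm from the morph algorithms the abstract lists.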

Relevance:

10.00% 10.00%

Publisher:

Abstract:

A fully real-time coherent dedispersion system has been developed for the pulsar back-end at the Giant Metrewave Radio Telescope (GMRT). The dedispersion pipeline uses the single phased array voltage beam produced by the existing GMRT software back-end (GSB) to produce coherently dedispersed intensity output in real time, for the currently operational bandwidths of 16 MHz and 32 MHz. Provision has also been made to coherently dedisperse voltage beam data from observations recorded on disk. We discuss the design and implementation of the real-time coherent dedispersion system, describing the steps carried out to optimise the performance of the pipeline. Presently functioning on an Intel Xeon X5550 CPU equipped with an NVIDIA Tesla C2075 GPU, the pipeline allows dispersion-free, high-time-resolution data to be obtained in real time. We illustrate the significant improvements over the existing incoherent dedispersion system at the GMRT, and present some preliminary results obtained from studies of pulsars using this system, demonstrating its potential as a useful tool for low-frequency pulsar observations. We describe the salient features of our implementation, comparing it with other recently developed real-time coherent dedispersion systems. This implementation of a real-time coherent dedispersion pipeline for a large, low-frequency array instrument like the GMRT will enable long-term observing programs using coherent dedispersion to be carried out routinely at the observatory. We also outline the possible improvements for such a pipeline, including prospects for the upgraded GMRT, which will have bandwidths about ten times larger than at present.
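To give a sense of the effect that dedispersion removes, the sketch below evaluates the standard cold-plasma dispersion delay across a band. It does not reproduce the GMRT pipeline, which operates on voltage data. The function name `dispersion_delay`, the dispersion measure of 30 pc cm^-3, and the 325 MHz centre frequency are assumptions chosen for illustration; only the 16 MHz and 32 MHz bandwidths come from the abstract.

```python
# Dispersion constant in s MHz^2 pc^-1 cm^3 (standard cold-plasma value).
K_DM = 4.148808e3

def dispersion_delay(dm, f_lo_mhz, f_hi_mhz):
    """Delay in seconds of the low band edge relative to the high band edge.

    dm is the dispersion measure in pc cm^-3; frequencies are in MHz.
    """
    return K_DM * dm * (f_lo_mhz ** -2 - f_hi_mhz ** -2)

# Hypothetical example: DM = 30 pc cm^-3, band centred on an assumed 325 MHz.
for bandwidth in (16.0, 32.0):
    lo = 325.0 - bandwidth / 2
    hi = 325.0 + bandwidth / 2
    delay_ms = dispersion_delay(30.0, lo, hi) * 1e3
    print(f"{bandwidth:.0f} MHz band: {delay_ms:.1f} ms sweep")
```

Because the delay scales as the inverse square of frequency, the sweep grows quickly at low observing frequencies, which is why dedispersion matters for an instrument like the GMRT.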

Relevance:

10.00% 10.00%

Publisher:

Abstract:

A new method for selecting the time-to-go (t(go)) for the Generalized Vector Explicit Guidance (GENEX) law is proposed in this paper. t(go) is known to be an important parameter in the control and cost function of the GENEX guidance law. In this paper, an optimal value of t(go) that minimizes the performance cost is formulated. Mechanization of GENEX with this optimal t(go) reduces the lateral acceleration demand and consequently increases the range of the interceptor. The new formulation for computing t(go) comes in closed form and can therefore be implemented onboard. It is applied in the terminal phase of a surface-to-air interceptor for an angle-constrained engagement. Results generated by simulation justify the use of the optimal t(go).

Relevance:

10.00% 10.00%

Publisher:

Abstract:

Seismic design of landfills requires an understanding of the dynamic properties of municipal solid waste (MSW) and the dynamic site response of landfill waste during seismic events. The dynamic response of the Mavallipura landfill situated in Bangalore, India, is investigated using field measurements, laboratory studies and recorded ground motions from the intraplate region. The dynamic shear modulus values for the MSW were established on the basis of field measurements of shear wave velocities. Cyclic triaxial testing was performed on reconstituted MSW samples, and the shear modulus reduction and damping characteristics of MSW were studied. Ten ground motions were selected based on regional seismicity, and site response parameters were obtained from one-dimensional non-linear analysis in the DEEPSOIL program. The surface spectral response varied from 0.6g to 2g and persisted only for a period of 1 s for most of the ground motions. The maximum peak ground acceleration (PGA) obtained was 0.5g, and the minimum and maximum amplifications were 1.35 and 4.05, respectively. Amplification of the base acceleration was observed at the top surface of the landfill, which is underlain by a composite soil layer and bedrock, for all ground motions. Dynamic seismic properties with amplification and site response parameters for the MSW landfill in Bangalore, India, are presented in this paper. This study shows that MSW has less shear stiffness and more amplification due to loose filling and damping, which need to be accounted for in the seismic design of MSW landfills in India.

Relevance:

10.00% 10.00%

Publisher:

Abstract:

In this article, a Field Programmable Gate Array (FPGA)-based hardware accelerator for 3D electromagnetic extraction using the Method of Moments (MoM) is presented. As the number of nets or ports in a system increases, leading to a corresponding increase in the number of right-hand-side (RHS) vectors, the computational cost of multiple matrix-vector products becomes a time bottleneck in a linear-complexity fast solver framework. In this work, an FPGA-based hardware implementation is proposed for a two-level parallelization scheme: (i) matrix-level parallelization for a single RHS and (ii) pipelining for multiple RHS. The method is applied to accelerate electrostatic parasitic capacitance extraction of multiple nets in a Ball Grid Array (BGA) package. The acceleration is shown to be linearly scalable with FPGA resources, and speed-ups of over 10x against an equivalent software implementation on a 2.4 GHz Intel Core i5 processor are achieved using a Virtex-6 XC6VLX240T FPGA on Xilinx's ML605 board, with the implemented design operating at a 200 MHz clock frequency. (c) 2016 Wiley Periodicals, Inc. Microwave Opt Technol Lett 58:776-783, 2016

Relevance:

10.00% 10.00%

Publisher:

Abstract:

We perform global linear stability analysis and idealized numerical simulations in global thermal balance to understand the condensation of cold gas from hot/virial atmospheres (coronae), in particular the intracluster medium (ICM). We pay particular attention to geometry (e.g. spherical versus plane-parallel) and the nature of the gravitational potential. Global linear analysis gives a similar value for the fastest growing thermal instability modes in spherical and Cartesian geometries. Simulations and observations suggest that cooling in haloes critically depends on the ratio of the cooling time to the free-fall time (t(cool)/t(ff)). Extended cold gas condenses out of the ICM only if this ratio is smaller than a threshold value close to 10. Previous works highlighted the difference between the nature of cold gas condensation in spherical and plane-parallel atmospheres; namely, cold gas condensation appeared easier in spherical atmospheres. This apparent difference due to geometry arises because the previous plane-parallel simulations focused on in situ condensation of multiphase gas, whereas the spherical simulations studied condensation anywhere in the box. Unlike previous claims, our non-linear simulations show that there are only minor differences in cold gas condensation, either in situ or anywhere, for different geometries. The amount of cold gas depends on the shape of t(cool)/t(ff); gas has more time to condense if gravitational acceleration decreases towards the centre. In our idealized plane-parallel simulations with heating balancing cooling in each layer, there can be significant mass/energy/momentum transfer across layers that can trigger condensation and drive t(cool)/t(ff) far beyond the critical value close to 10.
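The condensation criterion described above can be written as a small check. This is a sketch only: the abstract does not define the free-fall time, so the common convention t_ff = sqrt(2 r / g) is assumed here, the function names are hypothetical, and the threshold of 10 is the approximate value quoted in the abstract rather than a sharp boundary.

```python
import math

def free_fall_time(r, g):
    """Free-fall time, assuming the common definition t_ff = sqrt(2 r / g).

    r is the radius (or height) and g the local gravitational acceleration,
    in any mutually consistent units.
    """
    return math.sqrt(2.0 * r / g)

def can_condense(t_cool, r, g, threshold=10.0):
    """Return (ratio, flag), where flag is True if t_cool / t_ff < threshold.

    t_cool must be in the same time units as the free-fall time.
    """
    ratio = t_cool / free_fall_time(r, g)
    return ratio, ratio < threshold
```

For example, with r = 2 and g = 1 in consistent units the free-fall time is 2, so a cooling time of 10 gives a ratio of 5 (below the threshold) while a cooling time of 40 gives a ratio of 20 (above it).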