971 resultados para Homogeneous Kernels
Resumo:
Knowledge about program worst case execution time (WCET) is essential in validating real-time systems and helps in effective scheduling. One popular approach used in industry is to measure execution time of program components on the target architecture and combine them using static analysis of the program. Measurements need to be taken in the least intrusive way in order to avoid affecting accuracy of estimated WCET. Several programs exhibit phase behavior, wherein program dynamic execution is observed to be composed of phases. Each phase being distinct from the other, exhibits homogeneous behavior with respect to cycles per instruction (CPI), data cache misses etc. In this paper, we show that phase behavior has important implications on timing analysis. We make use of the homogeneity of a phase to reduce instrumentation overhead at the same time ensuring that accuracy of WCET is not largely affected. We propose a model for estimating WCET using static worst case instruction counts of individual phases and a function of measured average CPI. We describe a WCET analyzer built on this model which targets two different architectures. The WCET analyzer is observed to give safe estimates for most benchmarks considered in this paper. The tightness of the WCET estimates are observed to be improved for most benchmarks compared to Chronos, a well known static WCET analyzer.
Resumo:
Present study performs the spatial and temporal trend analysis of annual, monthly and seasonal maximum and minimum temperatures (t(max), t(min)) in India. Recent trends in annual, monthly, winter, pre-monsoon, monsoon and post-monsoon extreme temperatures (t(max), t(min)) have been analyzed for three time slots viz. 1901-2003,1948-2003 and 1970-2003. For this purpose, time series of extreme temperatures of India as a whole and seven homogeneous regions, viz. Western Himalaya (WH), Northwest (NW), Northeast (NE), North Central (NC), East coast (EC), West coast (WC) and Interior Peninsula (IP) are considered. Rigorous trend detection analysis has been exercised using variety of non-parametric methods which consider the effect of serial correlation during analysis. During the last three decades minimum temperature trend is present in All India as well as in all temperature homogeneous regions of India either at annual or at any seasonal level (winter, pre-monsoon, monsoon, post-monsoon). Results agree with the earlier observation that the trend in minimum temperature is significant in the last three decades over India (Kothawale et al., 2010). Sequential MK test reveals that most of the trend both in maximum and minimum temperature began after 1970 either in annual or seasonal levels. (C) 2012 Elsevier B.V. All rights reserved.
Resumo:
High-level loop transformations are a key instrument in mapping computational kernels to effectively exploit the resources in modern processor architectures. Nevertheless, selecting required compositions of loop transformations to achieve this remains a significantly challenging task; current compilers may be off by orders of magnitude in performance compared to hand-optimized programs. To address this fundamental challenge, we first present a convex characterization of all distinct, semantics-preserving, multidimensional affine transformations. We then bring together algebraic, algorithmic, and performance analysis results to design a tractable optimization algorithm over this highly expressive space. Our framework has been implemented and validated experimentally on a representative set of benchmarks running on state-of-the-art multi-core platforms.
Resumo:
The solution of the forward equation that models the transport of light through a highly scattering tissue material in diffuse optical tomography (DOT) using the finite element method gives flux density (Phi) at the nodal points of the mesh. The experimentally measured flux (U-measured) on the boundary over a finite surface area in a DOT system has to be corrected to account for the system transfer functions (R) of various building blocks of the measurement system. We present two methods to compensate for the perturbations caused by R and estimate true flux density (Phi) from U-measured(cal). In the first approach, the measurement data with a homogeneous phantom (U-measured(homo)) is used to calibrate the measurement system. The second scheme estimates the homogeneous phantom measurement using only the measurement from a heterogeneous phantom, thereby eliminating the necessity of a homogeneous phantom. This is done by statistically averaging the data (U-measured(hetero)) and redistributing it to the corresponding detector positions. The experiments carried out on tissue mimicking phantom with single and multiple inhomogeneities, human hand, and a pork tissue phantom demonstrate the robustness of the approach. (C) 2013 Society of Photo-Optical Instrumentation Engineers (SPIE) DOI: 10.1117/1.JBO.18.2.026023]
Resumo:
MATLAB is an array language, initially popular for rapid prototyping, but is now being increasingly used to develop production code for numerical and scientific applications. Typical MATLAB programs have abundant data parallelism. These programs also have control flow dominated scalar regions that have an impact on the program's execution time. Today's computer systems have tremendous computing power in the form of traditional CPU cores and throughput oriented accelerators such as graphics processing units(GPUs). Thus, an approach that maps the control flow dominated regions to the CPU and the data parallel regions to the GPU can significantly improve program performance. In this paper, we present the design and implementation of MEGHA, a compiler that automatically compiles MATLAB programs to enable synergistic execution on heterogeneous processors. Our solution is fully automated and does not require programmer input for identifying data parallel regions. We propose a set of compiler optimizations tailored for MATLAB. Our compiler identifies data parallel regions of the program and composes them into kernels. The problem of combining statements into kernels is formulated as a constrained graph clustering problem. Heuristics are presented to map identified kernels to either the CPU or GPU so that kernel execution on the CPU and the GPU happens synergistically and the amount of data transfer needed is minimized. In order to ensure required data movement for dependencies across basic blocks, we propose a data flow analysis and edge splitting strategy. Thus our compiler automatically handles composition of kernels, mapping of kernels to CPU and GPU, scheduling and insertion of required data transfer. The proposed compiler was implemented and experimental evaluation using a set of MATLAB benchmarks shows that our approach achieves a geometric mean speedup of 19.8X for data parallel benchmarks over native execution of MATLAB.
Resumo:
Several experimental studies have shown that fracture surfaces in brittle metallic glasses (MGs) generally exhibit nanoscale corrugations which may be attributed to the nucleation and coalescence of nanovoids during crack propagation. Recent atomistic simulations suggest that this phenomenon is due to large spatial fluctuations in material properties in a brittle MG, which leads to void nucleation in regions of low atomic density and then catastrophic fracture through void coalescence. To explain this behavior, we propose a model of a heterogeneous solid containing a distribution of weak zones to represent a brittle MG. Plane strain continuum finite element analysis of cavitation in such an elastic-plastic solid is performed with the weak zones idealized as periodically distributed regions having lower yield strength than the background material. It is found that the presence of weak zones can significantly reduce the critical hydrostatic stress for the onset of cavitation which is controlled uniquely by the local yield properties of these zones. Also, the presence of weak zones diminishes the sensitivity of the cavitation stress to the volume fraction of a preexisting void. These results provide plausible explanations for the observations reported in recent atomistic simulations of brittle MGs. An analytical solution for a composite, incompressible elastic-plastic solid with a weak inner core is used to investigate the effect of volume fraction and yield strength of the core on the nature of cavitation bifurcation. It is shown that snap-cavitation may occur, giving rise to sudden formation of voids with finite size, which does not happen in a homogeneous plastic solid. (c) 2012 Elsevier Ltd. All rights reserved.
Resumo:
Intraseasonal time-scales play an important role in tropical variability. Two modes that contribute significantly to tropical intraseasonal variability (ISV) are the eastward-propagating MaddenJulian Oscillation (MJO), and westward-moving moist equatorial Rossby waves. This note reports on a correspondence between the longitudinal gradient of mean tropical precipitable water (PW), and the geographical regions of genesis, and convective activity, of both these large-scale tropical systems. Our finding is based on an analysis of PW from the MERRA reanalysis product. The data indicate that the mean tropical PW has a dominant wavenumber two (three) structure in longitude in the Northern (Southern) Hemisphere. Departures from a longitudinally homogeneous state are attributed to the influence of subtropical anticyclones, and are accentuated by monsoonal regions of both hemispheres. This mean structure results in a sharply localized longitudinal gradient of PW. Remarkably, regions with positive gradients (such as the Northern and Southern Hemisphere western Indian Ocean), i.e. they have larger PW to the east, are the very zones that are implicated in the formation, and show high levels of convective activity, of the eastward-moving MJO. On the other hand, regions with negative gradients (such as the Southern Hemisphere central Pacific) are the very regions where genesis, and maxima in variance, of westward-moving moist equatorial Rossby waves are known to occur. Apart from providing a first-order longitudinal footprint of the convective phase of these systems, this correspondence reinforces the role of the mean climatic state in tropical ISV. Copyright (c) 2012 Royal Meteorological Society
Resumo:
Various forms of carbon, especially the nanocarbons, have received considerable attention in recent years. There has also been some effort to investigate borocarbonitrides, BxCyNz, comprising besides carbon, the two elements on either side. Although uniformly homogeneous compositions of borocarbonitrides may be difficult to generate, there have been attempts to prepare them by solid state as well as gas phase reactions. Some of the products so obtained show evidence for the presence of BCN networks. Then, there are composites (G-BN) containing hexagonal BN (h-BN) and graphene (G) domains, G(1-x)(BN)(x), in varying proportions. Nanotubes of BxCyNz have been reported by several workers. The borocarbonitrides exhibit some interesting electronic and gas adsorption properties. Thus, some of the preparations show selective CO2 adsorption. They also exhibit excellent characteristics for supercapacitor applications. In order to understand the nature of these understudied materials, it is necessary to examine the results from first-principles calculations. These calculations throw light on the variation in the band gap of G-BN with the concentration of h-BN, for different geometries of the domains and their boundaries. The possibility of formation of Stone-Wales (SW) defects at the interfaces of graphene and h-BN has been studied and the estimates of the formation energies of SW defects at the interfaces are similar to 4 to 6 eV. The presence of such defects at the interfaces influences the electronic structure near the band gap and the associated properties. For example, adsorption of CH4 and CO2 occurs with significantly stronger binding at the interfacial defects.
Resumo:
The governing differential equation of the rotating beam reduces to that of a stiff string when the centrifugal force is assumed as constant. The solution of the static homogeneous part of this equation is enhanced with a polynomial term and used in the Rayleighs method. Numerical experiments show better agreement with converged finite element solutions compared to polynomials. Using this as an estimate for the first mode shape, higher mode shape approximations are obtained using Gram-Schmidt orthogonalization. Estimates for the first five natural frequencies of uniform and tapered beams are obtained accurately using a very low order Rayleigh-Ritz approximation.
Resumo:
Realistic and realtime computational simulation of soft biological organs (e.g., liver, kidney) is necessary when one tries to build a quality surgical simulator that can simulate surgical procedures involving these organs. Since the realistic simulation of these soft biological organs should account for both nonlinear material behavior and large deformation, achieving realistic simulations in realtime using continuum mechanics based numerical techniques necessitates the use of a supercomputer or a high end computer cluster which are costly. Hence there is a need to employ soft computing techniques like Support Vector Machines (SVMs) which can do function approximation, and hence could achieve physically realistic simulations in realtime by making use of just a desktop computer. Present work tries to simulate a pig liver in realtime. Liver is assumed to be homogeneous, isotropic, and hyperelastic. Hyperelastic material constants are taken from the literature. An SVM is employed to achieve realistic simulations in realtime, using just a desktop computer. The code for the SVM is obtained from [1]. The SVM is trained using the dataset generated by performing hyperelastic analyses on the liver geometry, using the commercial finite element software package ANSYS. The methodology followed in the present work closely follows the one followed in [2] except that [2] uses Artificial Neural Networks (ANNs) while the present work uses SVMs to achieve realistic simulations in realtime. Results indicate the speed and accuracy that is obtained by employing the SVM for the targeted realistic and realtime simulation of the liver.
Resumo:
Carbonaceous nickel oxide powder samples have been synthesized from an adducted nickel beta-ketoester complex used as a ``single source precursor'' through a solution-based microwave-assisted chemical route. Comprehensive analysis of the resulting powder material has been carried out using various characterization techniques. These analysis reveal that, depending on the solvent used, either NiO/C or Ni/NiO/C composites are formed, wherein Ni and/or NiO nanocrystals are enveloped in amorphous carbon. As the components emerge from the same molecular source, the composites are homogeneous on a fine scale, making them promising electrode materials for supercapacitors. Electrochemical capacitive behavior of these oxide composites is studied in a three-electrode configuration. With a specific capacitance of 113 F g(-1), Ni/NiO/C is superior to NiO/C as capacitor electrode material, in 0.1 M Na2SO4 electrolyte. This is confirmed by impedance measurements, which show that charge-transfer resistance and equivalent series resistance are lower in Ni/NiO/C than in NiO/C, presumably because of the presence of metallic nickel in the former. The cyclic voltammograms are nearly rectangular and the electrodes display excellent cyclability in different electrolytes: Na2SO4, KOH and Ca(NO3)(2)center dot 4H(2)O. Specific capacitance as high as 143 F g(-1), is measured in Ca(NO3)(2)center dot 4H(2)O electrolyte.
Resumo:
Let M be the completion of the polynomial ring C(z) under bar] with respect to some inner product, and for any ideal I subset of C (z) under bar], let I] be the closure of I in M. For a homogeneous ideal I, the joint kernel of the submodule I] subset of M is shown, after imposing some mild conditions on M, to be the linear span of the set of vectors {p(i)(partial derivative/partial derivative(w) over bar (1),...,partial derivative/partial derivative(w) over bar (m)) K-I] (., w)vertical bar(w=0), 1 <= i <= t}, where K-I] is the reproducing kernel for the submodule 2] and p(1),..., p(t) is some minimal ``canonical set of generators'' for the ideal I. The proof includes an algorithm for constructing this canonical set of generators, which is determined uniquely modulo linear relations, for homogeneous ideals. A short proof of the ``Rigidity Theorem'' using the sheaf model for Hilbert modules over polynomial rings is given. We describe, via the monoidal transformation, the construction of a Hermitian holomorphic line bundle for a large class of Hilbert modules of the form I]. We show that the curvature, or even its restriction to the exceptional set, of this line bundle is an invariant for the unitary equivalence class of I]. Several examples are given to illustrate the explicit computation of these invariants.
Resumo:
Regionalization of precipitation refers to delineation of rain gauges in an area into homogeneous groups (clusters or regions). Various regionalization procedures are employed by researchers in hydrometeorology for addressing a wide spectrum of problems. This paper provides an overview of underlying concepts as well as advantages and limitations of procedures that have been developed over the past six decades. Emphasis is given to studies that have been carried out in India. Following this, gaps where more research needs to be focussed are highlighted, and challenges for regionalization in a climate change scenario are discussed.
Resumo:
Cardiac fibroblasts, when coupled functionally with myocytes, can modulate the electrophysiological properties of cardiac tissue. We present systematic numerical studies of such modulation of electrophysiological properties in mathematical models for (a) single myocyte-fibroblast (MF) units and (b) two-dimensional (2D) arrays of such units; our models build on earlier ones and allow for zero-, one-, and two-sided MF couplings. Our studies of MF units elucidate the dependence of the action-potential (AP) morphology on parameters such as E-f, the fibroblast resting-membrane potential, the fibroblast conductance G(f), and the MF gap-junctional coupling G(gap). Furthermore, we find that our MF composite can show autorhythmic and oscillatory behaviors in addition to an excitable response. Our 2D studies use (a) both homogeneous and inhomogeneous distributions of fibroblasts, (b) various ranges for parameters such as G(gap), G(f), and E-f, and (c) intercellular couplings that can be zero-sided, one-sided, and two-sided connections of fibroblasts with myocytes. We show, in particular, that the plane-wave conduction velocity CV decreases as a function of G(gap), for zero-sided and one-sided couplings; however, for two-sided coupling, CV decreases initially and then increases as a function of G(gap), and, eventually, we observe that conduction failure occurs for low values of G(gap). In our homogeneous studies, we find that the rotation speed and stability of a spiral wave can be controlled either by controlling G(gap) or E-f. Our studies with fibroblast inhomogeneities show that a spiral wave can get anchored to a local fibroblast inhomogeneity. We also study the efficacy of a low-amplitude control scheme, which has been suggested for the control of spiral-wave turbulence in mathematical models for cardiac tissue, in our MF model both with and without heterogeneities.
Resumo:
Auction based mechanisms have become popular in industrial procurement settings. These mechanisms minimize the cost of procurement and at the same time achieve desirable properties such as truthful bidding by the suppliers. In this paper, we investigate the design of truthful procurement auctions taking into account an additional important issue namely carbon emissions. In particular, we focus on the following procurement problem: A buyer wishes to source multiple units of a homogeneous item from several competing suppliers who offer volume discount bids and who also provide emission curves that specify the cost of emissions as a function of volume of supply. We assume that emission curves are reported truthfully since that information is easily verifiable through standard sources. First we formulate the volume discount procurement auction problem with emission constraints under the assumption that the suppliers are honest (that is they report production costs truthfully). Next we describe a mechanism design formulation for green procurement with strategic suppliers. Our numerical experimentation shows that emission constraints can significantly alter sourcing decisions and affect the procurement costs dramatically. To the best of our knowledge, this is the first effort in explicitly taking into account carbon emissions in planning procurement auctions.