132 results for basic block reduce
at Indian Institute of Science - Bangalore - India
Abstract:
The performance of a program will ultimately be limited by its serial (scalar) portion, as pointed out by Amdahl's Law. Studies of instruction-level parallelism reported thus far have mixed data-parallel program portions with scalar program portions, often leading to contradictory and controversial results. We report an instruction-level behavioral characterization of scalar code containing minimal data-parallelism, extracted from highly vectorized programs of the PERFECT benchmark suite running on a Cray Y-MP system. We classify scalar basic blocks according to their instruction mix, characterize the data dependencies seen in each class, and, as a first step, measure the maximum intrablock instruction-level parallelism available. We observe skewed rather than balanced instruction distributions in scalar code and in individual basic block classes of scalar code; nonuniform distribution of parallelism across instruction classes; and, as expected, limited available intrablock parallelism. We identify frequently occurring data-dependence patterns and discuss new instructions to reduce latency. Toward effective scalar hardware, we study latency-pipelining trade-offs and restricted multiple instruction issue mechanisms.
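As an illustration of the two quantities this abstract refers to, the sketch below evaluates Amdahl's Law and estimates the maximum parallelism inside one basic block. It is not the authors' measurement method. The function names are hypothetical, and the ILP estimate assumes unit instruction latency, unlimited functional units, and a dependence list supplied by the caller.

```python
def amdahl_speedup(serial_fraction: float, parallel_speedup: float) -> float:
    """Overall speedup when only the non-serial part runs parallel_speedup times faster."""
    if not 0.0 <= serial_fraction <= 1.0:
        raise ValueError("serial_fraction must lie in [0, 1]")
    if parallel_speedup <= 0:
        raise ValueError("parallel_speedup must be positive")
    return 1.0 / (serial_fraction + (1.0 - serial_fraction) / parallel_speedup)


def intrablock_ilp(deps: list[list[int]]) -> float:
    """Upper bound on parallelism in one basic block (assumed model, see lead-in).

    deps[i] lists the earlier instructions (by index) that instruction i
    depends on. The bound is instruction count / critical-path length.
    """
    if not deps:
        raise ValueError("basic block must contain at least one instruction")
    depth: list[int] = []
    for i, preds in enumerate(deps):
        if any(p < 0 or p >= i for p in preds):
            raise ValueError("dependencies must point to earlier instructions")
        # An instruction can issue one step after its latest predecessor.
        depth.append(1 + max((depth[p] for p in preds), default=0))
    return len(deps) / max(depth)


if __name__ == "__main__":
    # Hypothetical numbers: 10% scalar code, vector part sped up 10x.
    print(amdahl_speedup(0.10, 10.0))
    # Hypothetical block: two independent loads, an add using both, a store.
    print(intrablock_ilp([[], [], [0, 1], [2]]))
```

With a 10% serial fraction the speedup can never exceed 10, however fast the parallel part becomes, which is why the scalar portion is studied separately.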
Abstract:
Fly ash has potential application in the construction of base liners for waste containment facilities. While most fly ashes gain strength with curing, the range of permeabilities they attain often does not meet the basic requirement of a liner material. An attempt has been made in the present context to reduce the hydraulic conductivity by adding up to 10% lime to two selected samples of class F fly ash. The use of gypsum, which is known to accelerate the gain in unconfined compressive strength by increasing the lime reactivity, has been investigated as a means of further improving the hydraulic conductivity. Hydraulic conductivities of the compacted specimens have been determined in the laboratory using the falling head method. It has been observed that the addition of gypsum reduces the hydraulic conductivity of the lime-treated fly ashes. The reduction in the hydraulic conductivity of the samples containing gypsum is significantly greater (by as much as 1000 times) for samples with high lime contents than for those with lower lime contents. However, the relative increase in strength with the inclusion of gypsum is greater at lower lime contents. This is due to the fact that excess lime added to fly ash is not effectively converted into pozzolanic compounds. Even the presence of gypsum is observed not to activate these reactions with excess lime. On the other hand, the higher amount of lime in the presence of sulphate is observed to produce more cementitious compounds, which block the pores in the fly ash. The consequent reduction in the hydraulic conductivity of fly ash would be beneficial in reducing the leachability of trace elements present in the fly ash when used as a base liner. (C) 2010 Elsevier Ltd. All rights reserved.
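The falling head method mentioned in this abstract is commonly evaluated with the textbook relation k = (a L / (A t)) ln(h1 / h2). The sketch below applies that relation; the abstract does not give the authors' apparatus or readings, so the function name and every numeric value here are hypothetical.

```python
import math


def falling_head_k(a: float, A: float, L: float, t: float,
                   h1: float, h2: float) -> float:
    """Hydraulic conductivity from a falling head test.

    a  : cross-sectional area of the standpipe
    A  : cross-sectional area of the specimen
    L  : length of the specimen
    t  : elapsed time for the head to fall from h1 to h2
    h1 : initial head, h2 : final head (h1 > h2 > 0)
    Lengths and areas must share one length unit; k comes out in that
    length unit per unit of t.
    """
    if min(a, A, L, t, h2) <= 0:
        raise ValueError("all inputs must be positive")
    if h1 <= h2:
        raise ValueError("h1 must exceed h2")
    return (a * L) / (A * t) * math.log(h1 / h2)


if __name__ == "__main__":
    # Hypothetical readings in cm and s, so k is in cm/s.
    k = falling_head_k(a=1.0, A=100.0, L=10.0, t=1000.0, h1=100.0, h2=50.0)
    print(f"k = {k:.3e} cm/s")
```

Because k is inversely proportional to the elapsed time, a 1000-fold drop in conductivity shows up as a 1000-fold longer time for the same fall in head on an otherwise identical specimen.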
Abstract:
Construction of high-rate Space Time Block Codes (STBCs) with low decoding complexity has been studied widely using techniques such as sphere decoding and non-Maximum-Likelihood (ML) decoders such as the QR decomposition decoder with M paths (QRDM decoder). Recently, Ren et al. presented a new class of STBCs known as the block orthogonal STBCs (BOSTBCs), which could be exploited by the QRDM decoders to achieve significant decoding complexity reduction without performance loss. The block orthogonal property of the codes constructed was, however, only shown via simulations. In this paper, we give analytical proofs for the block orthogonal structure of various existing codes in the literature, including the codes constructed in the paper by Ren et al. We show that codes formed as the sum of Clifford Unitary Weight Designs (CUWDs) or Coordinate Interleaved Orthogonal Designs (CIODs) exhibit block orthogonal structure. We also provide new constructions of block orthogonal codes from Cyclic Division Algebras (CDAs) and Crossed-Product Algebras (CPAs). In addition, we show how the block orthogonal property of the STBCs can be exploited to reduce the decoding complexity of a sphere decoder using a depth-first search approach. Simulation results of the decoding complexity show a 30% reduction in the number of floating point operations (FLOPS) of BOSTBCs as compared to STBCs without the block orthogonal structure.
Abstract:
In routine industrial design, fatigue life estimation is largely based on S-N curves and ad hoc cycle counting algorithms used with Miner's rule for predicting life under complex loading. However, there are well-known deficiencies of the conventional approach. Of the many cumulative damage rules that have been proposed, Manson's Double Linear Damage Rule (DLDR) has been the most successful. Here we follow up, through comparisons with experimental data from many sources, on a new approach to empirical fatigue life estimation ('A Constructive Empirical Theory for Metal Fatigue Under Block Cyclic Loading', Proceedings of the Royal Society A, in press). The basic modeling approach is first described: it depends on enforcing mathematical consistency between predictions of simple empirical models that include indeterminate functional forms, and published fatigue data from handbooks. This consistency is enforced through setting up and (with luck) solving a functional equation with three independent variables and six unknown functions. The model, after eliminating or identifying various parameters, retains three fitted parameters; for the experimental data available, one of these may be set to zero. On comparison against data from several different sources, with two fitted parameters, we find that our model works about as well as the DLDR and much better than Miner's rule.  We finally discuss some ways in which the model might be used, beyond the scope of the DLDR.
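For reference, the baseline this abstract compares against, Miner's rule, sums the cycle fractions n_i / N_i over all load levels and predicts failure when the sum reaches 1. The sketch below implements only that conventional rule, not the authors' model or the DLDR. The function names and the loading block are hypothetical.

```python
def miner_damage(block: list[tuple[float, float]]) -> float:
    """Linear cumulative damage for one loading block.

    Each entry is (n_i, N_i): cycles applied at a load level and the
    cycles to failure at that level (read from an S-N curve).
    """
    for applied, to_failure in block:
        if applied < 0 or to_failure <= 0:
            raise ValueError("need n_i >= 0 and N_i > 0")
    return sum(applied / to_failure for applied, to_failure in block)


def blocks_to_failure(block: list[tuple[float, float]]) -> float:
    """Number of block repetitions at which Miner's rule predicts failure (D = 1)."""
    damage = miner_damage(block)
    if damage == 0:
        raise ValueError("block causes no damage under Miner's rule")
    return 1.0 / damage


if __name__ == "__main__":
    # Hypothetical block: 1000 cycles at a level with N = 1e5,
    # followed by 100 cycles at a level with N = 1e4.
    block = [(1000.0, 1.0e5), (100.0, 1.0e4)]
    print(miner_damage(block), blocks_to_failure(block))
```

The rule is insensitive to the order in which load levels are applied, which is one of the commonly cited deficiencies that motivates alternatives such as the DLDR.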
Abstract:
A large share of the rural population of developing countries uses traditional biomass stoves to meet cooking and heating energy demands. These stoves have very low thermal efficiency; besides, most of them cannot handle agricultural wastes. Thus, there is a need to develop an alternative cooking device that is simple and efficient and can handle a range of biomass, including agricultural wastes. In the work reported here, a highly densified solid fuel block made from a range of low-cost agro residues has been developed to meet cooking and heating needs. A strategy was adopted to determine the most suitable raw materials, optimized in terms of cost and performance. Several experiments were conducted using solid fuel blocks manufactured from various raw materials in different proportions; it was found that a fuel block composed of 40% biomass, 40% charcoal powder, 15% binder and 5% oxidizer fulfilled the requirement. Based on this finding, fuel blocks of two different configurations, viz. cylindrical shape with single and multi-holes (3, 6, 9 and 13), were constructed and their performance was evaluated. For instance, the 13-hole solid fuel block met the requirement of domestic cooking; the mean thermal power was 1.6 kWth with a burn time of 1.5 h. Furthermore, the maximum thermal efficiency recorded for this particular design was 58%. The power level of the single-hole solid fuel block, by contrast, was found to be lower but adequate for barbecue cooking applications.
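The figures quoted in this abstract can be combined with simple arithmetic. The sketch below checks that the stated composition sums to 100% and converts the mean thermal power and burn time of the 13-hole block into an energy figure. It assumes the mean power holds over the whole burn time; the abstract does not say whether 1.6 kWth is released or delivered power, so the result is indicative only. The variable and function names are hypothetical.

```python
# Composition of the fuel block as stated in the abstract (percent).
COMPOSITION = {"biomass": 40, "charcoal powder": 40, "binder": 15, "oxidizer": 5}

KWH_TO_MJ = 3.6  # 1 kWh = 3.6 MJ


def thermal_energy_kwh(mean_power_kw: float, burn_time_h: float) -> float:
    """Energy over the burn, assuming the mean power applies for the full burn time."""
    if mean_power_kw < 0 or burn_time_h < 0:
        raise ValueError("power and time must be non-negative")
    return mean_power_kw * burn_time_h


if __name__ == "__main__":
    assert sum(COMPOSITION.values()) == 100
    energy = thermal_energy_kwh(1.6, 1.5)  # values from the abstract
    print(f"{energy:.2f} kWh thermal = {energy * KWH_TO_MJ:.2f} MJ")
```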
Abstract:
The conformational preferences of hydrazinecarbothioamide (HCTA, H2NNHCSNH2) in its basic and N-protonated (PHCTA, H3NNNHCSNH2) forms have been studied by 1H and 13C NMR spectroscopy and by theoretical LCAO-MO methods (ab initio, CNDO/2 and EHT). The hindered rotation around the C-N bond has been investigated by a total line shape analysis for the thioamide protons and by the three MO methods. Changes in the molecular conformation and electronic structure on protonation are briefly discussed.
Abstract:
Conventional thinking holds that increased energy consumption is a prerequisite for economic and social development. This belief, together with the prospect of dwindling global petroleum supplies and the high costs of expanding energy supply generally, leads many to believe that it is not feasible to improve living standards substantially in the developing countries. But by shifting to high-quality energy carriers and by exploiting cost-effective opportunities for more efficient energy use, it would be possible to satisfy basic human needs and to provide considerable further improvements in living standards without significantly increasing per-capita energy use above the present level.
Abstract:
Conformational preferences of thiocarbonohydrazide (H2NNHCSNHNH2) in its basic and N,N′-diprotonated forms are examined by calculating the barrier to internal rotation around the C-N bonds, using the theoretical LCAO-MO (ab initio and semiempirical CNDO and EHT) methods. The calculated and experimental results are compared with each other and also with values for N,N′-dimethylthiourea, which is isoelectronic with thiocarbonohydrazide. The suitability of these methods for studying rotational isomerism seems suspect when lone pair interactions are present.
Abstract:
The efficiency of dephosphorisation is governed by the thermodynamic behaviour of phosphorus and oxygen in molten metal, and P2O5 and FeO in slag. The equilibrium distribution of phosphorus and oxygen, for a wide range of chemical compositions simulating the evolution of slag composition during a typical BOF blow, has been experimentally determined. A mathematical model for estimation of the activity coefficients, as a function of the chemical composition, was also attempted.
Abstract:
The paper deals with the basic problem of adjusting a matrix gain in a discrete-time linear multivariable system. The object is to obtain a global convergence criterion, i.e. conditions under which a specified error signal asymptotically approaches zero and other signals in the system remain bounded for arbitrary initial conditions and for any bounded input to the system. It is shown that for a class of updating algorithms for the adjustable gain matrix, global convergence is crucially dependent on a transfer matrix G(z) which has a simple block diagram interpretation. When w(z)G(z) is strictly discrete positive real for a scalar w(z) such that w^{-1}(z) is strictly proper with poles and zeros within the unit circle, an augmented error scheme is suggested and is proved to result in global convergence. The solution avoids feeding back a quadratic term as recommended in other schemes for single-input single-output systems.
Abstract:
In this paper we develop compilation techniques for the realization of applications described in a High Level Language (HLL) onto a Runtime Reconfigurable Architecture. The compiler determines Hyper Operations (HyperOps) that are subgraphs of a data flow graph (of an application) and comprise elementary operations that have a strong producer-consumer relationship. These HyperOps are hosted on computation structures that are provisioned on demand at runtime. We also report compiler optimizations that collectively reduce the overheads of data-driven computations in runtime reconfigurable architectures. On average, HyperOps offer a 44% reduction in total execution time and an 18% reduction in management overheads as compared to using basic blocks as coarse-grained operations. We show that HyperOps formed using our compiler are suitable to support data flow software pipelining.
Abstract:
Abstract is not available.
Abstract:
The unsteady pseudo plane motions have been investigated in which each point of the parallel planes is subjected to non-torsional oscillations in their own plane and at any given instant the streamlines are concentric circles. Exact solutions are obtained and the form of the curve , the locus of the centers of these concentric circles, is discussed. The existence of three infinite sets of exact solutions, for the flow in the geometry of an orthogonal rheometer in which the above non-torsional oscillations are superposed on the disks, is established. Three cases arise according to whether is greater than, equal to or less than , where is angular velocity of the basic rotation and is the frequency of the superposed oscillations. For a symmetric solution of the flow these solutions reduce to a single unique solution. The nature of the curve is illustrated graphically by considering an example of the flow between coaxial rotating disks.
Abstract:
Prequantization has been put forward as a means to improve the performance of double phase holograms (DPHs). We show here that any improvement (even under the best of conditions) is not large enough to help the DPH to compete favourably with other holograms.
Abstract:
A new method of generating polynomials using microprocessors is proposed. The polynomial is generated as a 16-bit digital word. The algorithm for generating a variety of basic 'building block' functions and its implementation are discussed. A technique for generating a generalized polynomial based on the proposed algorithm is indicated. The performance of the proposed generator is evaluated using a commercially available microprocessor kit.