443 resultados para Supercomputer


Relevância:

10.00% 10.00%

Publicador:

Resumo:

An associative memory with parallel architecture is presented. The neurons are modelled by perceptrons having only binary, rather than continuous valued input. To store m elements each having n features, m neurons each with n connections are needed. The n features are coded as an n-bit binary vector. The weights of the n connections that store the n features of an element has only two values -1 and 1 corresponding to the absence or presence of a feature. This makes the learning very simple and straightforward. For an input corrupted by binary noise, the associative memory indicates the element that is closest (in terms of Hamming distance) to the noisy input. In the case where the noisy input is equidistant from two or more stored vectors, the associative memory indicates two or more elements simultaneously. From some simple experiments performed on the human memory and also on the associative memory, it can be concluded that the associative memory presented in this paper is in some respect more akin to a human memory than a Hopfield model.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Detailed molecular dynamics simulations of argon in zeolite NaCaA are reported. Thermodynamic, structural, and dynamical properties of the sorbate as a function of temperature have been obtained. The properties calculated include various site-site radial distribution functions, different energy distribution functions, selfdiffusion coefficients, the power spectra, and properties relating to cage-to-cage diffusion. The results suggest that sorbate is delocalized above 300 K. Both modes of cage-to-cage diffusion-the surface-mediated and centralized diffusion-are associated with negative barrier heights. Surprisingly, rate of cage-to-cage diffusion is associated with negative and positive activation energies below and above 500 K. The observed differences in the behavior of the rate of cage-to-cage diffusion between Xe-NaY and Ar-NaCaA systems and the nature of the potential energy surface are discussed. Presence of sorbatezeolite interactions results in significant enhancement in the rate of cage-to-cage diffusion and rate of cage visits. It is shown that properties dependent on the long-time behavior such as the diffusion coefficient and the rate of cages visited exhibit the expected Arrhenius dependence on temperature.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Mutation and/or dysfunction of signaling proteins in the mitogen activated protein kinase (MAPK) signal transduction pathway are frequently observed in various kinds of human cancer. Consistent with this fact, in the present study, we experimentally observe that the epidermal growth factor (EGF) induced activation profile of MAP kinase signaling is not straightforward dose-dependent in the PC3 prostate cancer cells. To find out what parameters and reactions in the pathway are involved in this departure from the normal dose-dependency, a model-based pathway analysis is performed. The pathway is mathematically modeled with 28 rate equations yielding those many ordinary differential equations (ODE) with kinetic rate constants that have been reported to take random values in the existing literature. This has led to us treating the ODE model of the pathways kinetics as a random differential equations (RDE) system in which the parameters are random variables. We show that our RDE model captures the uncertainty in the kinetic rate constants as seen in the behavior of the experimental data and more importantly, upon simulation, exhibits the abnormal EGF dose-dependency of the activation profile of MAP kinase signaling in PC3 prostate cancer cells. The most likely set of values of the kinetic rate constants obtained from fitting the RDE model into the experimental data is then used in a direct transcription based dynamic optimization method for computing the changes needed in these kinetic rate constant values for the restoration of the normal EGF dose response. The last computation identifies the parameters, i.e., the kinetic rate constants in the RDE model, that are the most sensitive to the change in the EGF dose response behavior in the PC3 prostate cancer cells. The reactions in which these most sensitive parameters participate emerge as candidate drug targets on the signaling pathway. (C) 2011 Elsevier Ireland Ltd. All rights reserved.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Software transactional memory (STM) is a promising programming paradigm for shared memory multithreaded programs as an alternative to traditional lock based synchronization. However adoption of STM in mainstream software has been quite low due to its considerable overheads and its poor cache/memory performance. In this paper, we perform a detailed study of the cache behavior of STM applications and quantify the impact of different STM factors on the cache misses experienced by the applications. Based on our analysis, we propose a compiler driven Lock-Data Colocation (LDC), targeted at reducing the cache overheads on STM. We show that LDC is effective in improving the cache behavior of STM applications by reducing the dcache miss latency and improving execution time performance.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

In this work, we propose a new organization for the last level shared cache of a rnulticore system. Our design is based on the observation that the Next-Use distance, measured in terms of intervening misses between the eviction of a line and its next use, for lines brought in by a given delinquent PC falls within a predictable range of values. We exploit this correlation to improve the performance of shared caches in multi-core architectures by proposing the NUcache organization.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Relentless CMOS scaling coupled with lower design tolerances is making ICs increasingly susceptible to wear-out related permanent faults and transient faults, necessitating on-chip fault tolerance in future chip microprocessors (CMPs). In this paper we introduce a new energy-efficient fault-tolerant CMP architecture known as Redundant Execution using Critical Value Forwarding (RECVF). RECVF is based on two observations: (i) forwarding critical instruction results from the leading to the trailing core enables the latter to execute faster, and (ii) this speedup can be exploited to reduce energy consumption by operating the trailing core at a lower voltage-frequency level. Our evaluation shows that RECVF consumes 37% less energy than conventional dual modular redundant (DMR) execution of a program. It consumes only 1.26 times the energy of a non-fault-tolerant baseline and has a performance overhead of just 1.2%.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

The performance of a program will ultimately be limited by its serial (scalar) portion, as pointed out by Amdahl′s Law. Reported studies thus far of instruction-level parallelism have mixed data-parallel program portions with scalar program portions, often leading to contradictory and controversial results. We report an instruction-level behavioral characterization of scalar code containing minimal data-parallelism, extracted from highly vectorized programs of the PERFECT benchmark suite running on a Cray Y-MP system. We classify scalar basic blocks according to their instruction mix, characterize the data dependencies seen in each class, and, as a first step, measure the maximum intrablock instruction-level parallelism available. We observe skewed rather than balanced instruction distributions in scalar code and in individual basic block classes of scalar code; nonuniform distribution of parallelism across instruction classes; and, as expected, limited available intrablock parallelism. We identify frequently occurring data-dependence patterns and discuss new instructions to reduce latency. Toward effective scalar hardware, we study latency-pipelining trade-offs and restricted multiple instruction issue mechanisms.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Diffuse optical tomographic image reconstruction uses advanced numerical models that are computationally costly to be implemented in the real time. The graphics processing units (GPUs) offer desktop massive parallelization that can accelerate these computations. An open-source GPU-accelerated linear algebra library package is used to compute the most intensive matrix-matrix calculations and matrix decompositions that are used in solving the system of linear equations. These open-source functions were integrated into the existing frequency-domain diffuse optical image reconstruction algorithms to evaluate the acceleration capability of the GPUs (NVIDIA Tesla C 1060) with increasing reconstruction problem sizes. These studies indicate that single precision computations are sufficient for diffuse optical tomographic image reconstruction. The acceleration per iteration can be up to 40, using GPUs compared to traditional CPUs in case of three-dimensional reconstruction, where the reconstruction problem is more underdetermined, making the GPUs more attractive in the clinical settings. The current limitation of these GPUs in the available onboard memory (4 GB) that restricts the reconstruction of a large set of optical parameters, more than 13, 377. (C) 2010 Society of Photo-Optical Instrumentation Engineers. DOI: 10.1117/1.3506216]

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Rotating shear flows, when angular momentum increases and angular velocity decreases as functions of radiation coordinate, are hydrodynamically stable under linear perturbation. The Keplerian flow is an example of such a system, which appears in an astrophysical context. Although decaying eigenmodes exhibit large transient energy growth of perturbation which could govern nonlinearity in the system, the feedback of inherent instability to generate turbulence seems questionable. We show that such systems exhibiting growing pseudo-eigenmodes easily reach an upper bound of growth rate in terms of the logarithmic norm of the involved non-normal operators, thus exhibiting feedback of inherent instability. This supports the existence of turbulence of hydrodynamic origin in the Keplerian accretion disc in astrophysics. Hence, this answers the question of the mismatch between the linear theory and experimental/observed data and helps in resolving the outstanding question of the origin of turbulence therein.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Presented here, in a vector formulation, is an O(mn2) direct concise algorithm that prunes/identifies the linearly dependent (ld) rows of an arbitrary m X n matrix A and computes its reflexive type minimum norm inverse A(mr)-, which will be the true inverse A-1 if A is nonsingular and the Moore-Penrose inverse A+ if A is full row-rank. The algorithm, without any additional computation, produces the projection operator P = (I - A(mr)- A) that provides a means to compute any of the solutions of the consistent linear equation Ax = b since the general solution may be expressed as x = A(mr)+b + Pz, where z is an arbitrary vector. The rank r of A will also be produced in the process. Some of the salient features of this algorithm are that (i) the algorithm is concise, (ii) the minimum norm least squares solution for consistent/inconsistent equations is readily computable when A is full row-rank (else, a minimum norm solution for consistent equations is obtainable), (iii) the algorithm identifies ld rows, if any, and reduces concerned computation and improves accuracy of the result, (iv) error-bounds for the inverse as well as the solution x for Ax = b are readily computable, (v) error-free computation of the inverse, solution vector, rank, and projection operator and its inherent parallel implementation are straightforward, (vi) it is suitable for vector (pipeline) machines, and (vii) the inverse produced by the algorithm can be used to solve under-/overdetermined linear systems.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Random Access Scan, which addresses individual flip-flops in a design using a memory array like row and column decoder architecture, has recently attracted widespread attention, due to its potential for lower test application time, test data volume and test power dissipation when compared to traditional Serial Scan. This is because typically only a very limited number of random ``care'' bits in a test response need be modified to create the next test vector. Unlike traditional scan, most flip-flops need not be updated. Test application efficiency can be further improved by organizing the access by word instead of by bit. In this paper we present a new decoder structure that takes advantage of basis vectors and linear algebra to further significantly optimize test application in RAS by performing the write operations on multiple bits consecutively. Simulations performed on benchmark circuits show an average of 2-3 times speed up in test write time compared to conventional RAS.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

A systematic investigation of monatomic spherical sorbates in the supercages of zeolites Y and A by molecular dynamics technique is presented. Rates of intercage diffusion, rates of cage visits, and the diffusion coefficients have been calculated as a function of the sorbate-zeolite interaction strength. These properties exhibit markedly different dependences on interaction strength for the two zeolites. The observed behavior is shown to be a consequence of the two principal mechanisms of intercage diffusion and the energetic barrier associated with them. The diffusion coefficient and other properties associated with intercage diffusion are found to be directly proportional to the reciprocal of the square of the sorbate diameter when the sorbate diameter is significantly smaller than the window diameter. As the sorbate diameter increases, a peak is observed in all the transport properties investigated including the diffusion coefficient. We call this surprising effect as the ring or levitation effect and it explains several anomalous results reported in the literature and suggests a breakdown of the geometrical criterion for diffusion of sorbates. It shows that under certain conditions nongeometrical factors play a major role and geometrical factors become secondary in the determination of the molecular sieve property. A generalized parameter has been proposed which suggests conditions under which one can expect the ring or levitation effect in any porous medium. Inverse size selectivity becomes operative under these conditions.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Recent computer simulations on zeolites Y and A have found that the diffusion coefficient and the rate of intercage diffusion exhibit, apart from a linear dependence on the reciprocal of the square of the sorbate diameter, an anomalous peak as sorbate diameter approaches the window diameter. Here we report molecular dynamics simulations of zeolite NaA incorporating framework flexibility as a function of sorbate diameter in order to verify the existence of anomalous diffusion. Results suggest persistence of anomalous diffusion or ring effect. This suggests that the anomalous behavior is a general effect characteristic of zeolites Y and A. The barrier for diffusion across the eight-ring window is seen to be negative and is found to decrease with sorbate size. The effect of sorbate on the cage motion has also been investigated. Results suggest that the window expands during intercage migration only if the sorbate size is comparable to the window diameter. Flexible cage simulations yield a higher value for the diffusion coefficient and also the rate of intercage diffusion. This increase has been shown to be due to an increase in the intercage diffusions via the centralized diffusion mode rather than the surface-mediated mode. It is shown that this increase arises from an increase in the single particle density distribution in the region near the cage center.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Various geometrical and energetic distribution functions and other properties connected with the cage-to-cage diffusion of xenon in sodium Y zeolite have been obtained from long molecular dynamics calculations. Analysis of diffusion pathways reveals two interesting mechanisms-surface-mediated and centralized modes for cage-to-cage diffusion. The surface-mediated mode of diffusion exhibits a small positive barrier, while the centralized diffusion exhibits a negative barrier for the sorbate to diffuse across the 12-ring window. In both modes, however, the sorbate has to be activated from the adsorption site to enable it to gain mobility. The centralized diffusion additionally requires the sorbate to be free of the influence of the surface of the cage as well. The overall rate for cage-to-cage diffusion shows an Arrhenius temperature dependence with E(a) = 3 kJ/mol. It is found that the decay in the dynamical correction factor occurs on a time scale comparable to the cage residence time. The distributions of barrier heights have been calculated. Functions reflecting the distribution of the sorbate-zeolite interaction at the window and the variations of the distance between the sorbate and the centers of the parent and daughter cages are presented.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

We present results for one-loop matching coefficients between continuum four-fermion operators, defined in the Naive Dimensional Regularization scheme, and staggered fermion operators of various types. We calculate diagrams involving gluon exchange between quark fines, and ''penguin'' diagrams containing quark loops. For the former we use Landau-gauge operators, with and without O(a) improvement, and including the tadpole improvement suggested by Lepage and Mackenzie. For the latter we use gauge-invariant operators. Combined with existing results for two-loop anomalous dimension matrices and one-loop matching coefficients, our results allow a lattice calculation of the amplitudes for KKBAR mixing and K --> pipi decays with all corrections of O(g2) included. We also discuss the mixing of DELTAS = 1 operators with lower dimension operators, and show that, with staggered fermions, only a single lower dimension operator need be removed by non-perturbative subtraction.