882 resultados para Parallel computing. Multilayer perceptron. OpenMP
Resumo:
Laminar separation bubbles are thought to be highly non-parallel, and hence global stability studies start from this premise. However, experimentalists have always realized that the flow is more parallel than is commonly believed, for pressure-gradient-induced bubbles, and this is why linear parallel stability theory has been successful in describing their early stages of transition. The present experimental/numerical study re-examines this important issue and finds that the base flow in such a separation bubble becomes nearly parallel due to a strong-interaction process between the separated boundary layer and the outer potential flow. The so-called dead-air region or the region of constant pressure is a simple consequence of this strong interaction. We use triple-deck theory to qualitatively explain these features. Next, the implications of global analysis for the linear stability of separation bubbles are considered. In particular we show that in the initial portion of the bubble, where the flow is nearly parallel, local stability analysis is sufficient to capture the essential physics. It appears that the real utility of the global analysis is perhaps in the rear portion of the bubble, where the flow is highly non-parallel, and where the secondary/nonlinear instability stages are likely to dominate the dynamics.
Resumo:
Fragment Finder 2.0 is a web-based interactive computing server which can be used to retrieve structurally similar protein fragments from 25 and 90% nonredundant data sets. The computing server identifies structurally similar fragments using the protein backbone C alpha angles. In addition, the identified fragments can be superimposed using either of the two structural superposition programs, STAMP and PROFIT, provided in the server. The freely available Java plug-in Jmol has been interfaced with the server for the visualization of the query and superposed fragments. The server is the updated version of a previously developed search engine and employs an in-house-developed fast pattern matching algorithm. This server can be accessed freely over the World Wide Web through the URL http://cluster.physics.iisc.ernet.in/ff/.
Resumo:
Many common activities, like reading, scanning scenes, or searching for an inconspicuous item in a cluttered environment, entail serial movements of the eyes that shift the gaze from one object to another. Previous studies have shown that the primate brain is capable of programming sequential saccadic eye movements in parallel. Given that the onset of saccades directed to a target are unpredictable in individual trials, what prevents a saccade during parallel programming from being executed in the direction of the second target before execution of another saccade in the direction of the first target remains unclear. Using a computational model, here we demonstrate that sequential saccades inhibit each other and share the brain's limited processing resources (capacity) so that the planning of a saccade in the direction of the first target always finishes first. In this framework, the latency of a saccade increases linearly with the fraction of capacity allocated to the other saccade in the sequence, and exponentially with the duration of capacity sharing. Our study establishes a link between the dual-task paradigm and the ramp-to-threshold model of response time to identify a physiologically viable mechanism that preserves the serial order of saccades without compromising the speed of performance.
Resumo:
AIN/CrN multilayer hard coatings with various bilayer thicknesses were fabricated by a reactive sputtering process. The microstructural and mechanical characterizations of multilayer coatings were investigated through transmission electron microscope (TEM) observations and the hardness measurements by nano indentation. In particular, the variation of chemical bonding states of the bilayer nitrides was elucidated by near edge X-ray absorption fine structure (NEXAFS) spectroscopy. Many broken nitrogen bonds were formed by decreasing the bilayer thickness of AIN/CrN multilayer coatings. Existence of optimum AIN/CrN multilayer coatings thickness for maximum hardness could be explained by the competition of softening by the formation of broken nitrogen bonds and strengthening induced by decreasing bilayer thickness.
Resumo:
AlxTi1-xN/CrN multilayer coatings were fabricated by magnetron sputtering and those hardness variations were studied by observing the crack propagation and measuring the chemical bonding state of nitrides by Ti addition. While AlN/CrN multilayer shown stair-like crack propagation, AlxTi1-xN/CrN multilayer illustrated straight crack propagation. Most interestingly, Ti addition induced more broken nitrogen bonds in the nitride multilayers, leading to the reduction of hardness. However, the hardness of Al0.25Ti0.75N/CrN multilayer, having high Ti contents, increased by the formation of many Ti-N bond again instead of Al-N bond. From these results, we found that linear crack propagation behavior was dominated by broken nitrogen bonds in the AlxT1-xN/CrN multilayer coatings.
Resumo:
In order to resolve some missing micromechanistic details regarding contact deformation in nitride multilayer coatings we report here observations from cross-sectional transmission electron microscopy and focused ion beam studies of the Vickers indentations on TiN/TiAlN multilayer films of various total thicknesses as well as bilayer periods. The study of damage induced by contact deformation in a nitride multilayer coating is complemented by stress calculated using an analytical model. Kinked boundaries of sliding columns give rise to cracks which propagate at an angle to the indentation axis under a combination of compressive and shear stresses. It is seen that multilayers provide more distributed columnar sliding, thereby reducing the stress intensity factor for shear cracking, while interfacial dislocations provide a stress relief mechanism by enabling lateral movement of material. (C) 2012 Acta Materialia Inc. Published by Elsevier Ltd. All rights reserved.
Resumo:
Novel composite graphene oxide (GO)/poly(allylamine hydrochloride) (PAH) multilayer capsules have been fabricated by layer-by-layer (LbL) assembly. They were found to possess unique permeability properties compared to traditional LbL capsules. These hybrid capsules showed special ``core-shell'' loading property for encapsulation of dual drugs simultaneously into the core and shell of the capsules respectively.
Resumo:
Carbon nanotubes dispersed in polymer matrix have been aligned in the form of fibers and interconnects and cured electrically and by UV light. Conductivity and effective semiconductor tunneling against reverse to forward bias field have been designed to have differentiable current-voltage response of each of the fiber/channel. The current-voltage response is a function of the strain applied to the fibers along axial direction. Biaxial and shear strains are correlated by differentiating signals from the aligned fibers/channels. Using a small doping of magnetic nanoparticles in these composite fibers, magneto-resistance properties are realized which are strong enough to use the resulting magnetostriction as a state variable for signal processing and computing. Various basic analog signal processing tasks such as addition, convolution and filtering etc. can be performed. These preliminary study shows promising application of the concept in combined analog-digital computation in carbon nanotube based fibers. Various dynamic effects such as relaxation, electric field dependent nonlinearities and hysteresis on the output signals are studied using experimental data and analytical model.
Resumo:
In this paper, we address a scheduling problem for minimizing total weighted flowtime, observed in automobile gear manufacturing. Specifically, the bottleneck operation of the pre-heat treatment stage of gear manufacturing process has been dealt with in scheduling. Many real-life scenarios like unequal release times, sequence dependent setup times, and machine eligibility restrictions have been considered. A mathematical model taking into account dynamic starting conditions has been proposed. The problem is derived to be NP-hard. To approach the problem, a few heuristic algorithms have been proposed. Based on planned computational experiments, the performance of the proposed heuristic algorithms is evaluated: (a) in comparison with optimal solution for small-size problem instances and (b) in comparison with the estimated optimal solution for large-size problem instances. Extensive computational analyses reveal that the proposed heuristic algorithms are capable of consistently yielding near-statistically estimated optimal solutions in a reasonable computational time.
Resumo:
Clustered architecture processors are preferred for embedded systems because centralized register file architectures scale poorly in terms of clock rate, chip area, and power consumption. Although clustering helps by improving the clock speed, reducing the energy consumption of the logic, and making the design simpler, it introduces extra overheads by way of inter-cluster communication. This communication happens over long global wires having high load capacitance which leads to delay in execution and significantly high energy consumption. Inter-cluster communication also introduces many short idle cycles, thereby significantly increasing the overall leakage energy consumption in the functional units. The trend towards miniaturization of devices (and associated reduction in threshold voltage) makes energy consumption in interconnects and functional units even worse, and limits the usability of clustered architectures in smaller technologies. However, technological advancements now permit the design of interconnects and functional units with varying performance and power modes. In this paper, we propose scheduling algorithms that aggregate the scheduling slack of instructions and communication slack of data values to exploit the low-power modes of functional units and interconnects. Finally, we present a synergistic combination of these algorithms that simultaneously saves energy in functional units and interconnects to improves the usability of clustered architectures by achieving better overall energy-performance trade-offs. Even with conservative estimates of the contribution of the functional units and interconnects to the overall processor energy consumption, the proposed combined scheme obtains on average 8% and 10% improvement in overall energy-delay product with 3.5% and 2% performance degradation for a 2-clustered and a 4-clustered machine, respectively. We present a detailed experimental evaluation of the proposed schemes. Our test bed uses the Trimaran compiler infrastructure. (C) 2012 Elsevier Inc. All rights reserved.
Resumo:
Multilayers of poly(diallyldimethylammonium chloride) (PDDA) and citrate capped Au nanoparticles (AuNPs) anchored on sodium 3-mercapto-1-propanesulfonate modified gold electrode by electrostatic layer-by-layer assembly (LbL) technique are shown to be an excellent architecture for the direct electrochemical oxidation of As(III) species. The growth of successive layers in the proposed LbL architecture is followed by atomic force microscopy, UV-vis spectroscopy, quartz crystal microbalance with energy dissipation, and electrochemistry. The first bilayer is found to show rather different physico-chemical characteristics as compared to the subsequent bilayers, and this is attributed to the difference in the adsorption environments. The analytical utility of the architecture with five bilayers is exploited for arsenic sensing via the direct electrocatalytic oxidation of As(III), and the detection limit is found to be well below the WHO guidelines of 10 ppb. When the non-redox active PDDA is replaced by the redoxactive Os(2,2'-bipyridine)(2)Cl-poly(4-vinylpyridine) polyelectrolyte (PVPOs) in the LbL assembly, the performance is found to be inferior, demonstrating that the redox activity of the polyelectrolyte is futile as far as the direct electro-oxidation of As(III) is concerned. (C) 2012 Elsevier Inc. All rights reserved.
Resumo:
This paper presents a decentralized/peer-to-peer architecture-based parallel version of the vector evaluated particle swarm optimization (VEPSO) algorithm for multi-objective design optimization of laminated composite plates using message passing interface (MPI). The design optimization of laminated composite plates being a combinatorially explosive constrained non-linear optimization problem (CNOP), with many design variables and a vast solution space, warrants the use of non-parametric and heuristic optimization algorithms like PSO. Optimization requires minimizing both the weight and cost of these composite plates, simultaneously, which renders the problem multi-objective. Hence VEPSO, a multi-objective variant of the PSO algorithm, is used. Despite the use of such a heuristic, the application problem, being computationally intensive, suffers from long execution times due to sequential computation. Hence, a parallel version of the PSO algorithm for the problem has been developed to run on several nodes of an IBM P720 cluster. The proposed parallel algorithm, using MPI's collective communication directives, establishes a peer-to-peer relationship between the constituent parallel processes, deviating from the more common master-slave approach, in achieving reduction of computation time by factor of up to 10. Finally we show the effectiveness of the proposed parallel algorithm by comparing it with a serial implementation of VEPSO and a parallel implementation of the vector evaluated genetic algorithm (VEGA) for the same design problem. (c) 2012 Elsevier Ltd. All rights reserved.
Resumo:
The Reeb graph of a scalar function tracks the evolution of the topology of its level sets. This paper describes a fast algorithm to compute the Reeb graph of a piecewise-linear (PL) function defined over manifolds and non-manifolds. The key idea in the proposed approach is to maximally leverage the efficient contour tree algorithm to compute the Reeb graph. The algorithm proceeds by dividing the input into a set of subvolumes that have loop-free Reeb graphs using the join tree of the scalar function and computes the Reeb graph by combining the contour trees of all the subvolumes. Since the key ingredient of this method is a series of union-find operations, the algorithm is fast in practice. Experimental results demonstrate that it outperforms current generic algorithms by a factor of up to two orders of magnitude, and has a performance on par with algorithms that are catered to restricted classes of input. The algorithm also extends to handle large data that do not fit in memory.
Resumo:
A novel and simple route for near-infrared (NIR)-light controlled release of drugs has been demonstrated using graphene oxide (GO) composite microcapsules based on the unique optical properties of GO. Upon NIR-laser irradiation, the microcapsules were ruptured in a point-wise fashion due to local heating which in turn triggers the light-controlled release of the encapsulated anticancer drug doxorubicin (Dox) from these capsules.
Resumo:
Surface electrodes are essentially required to be switched for boundary data collection in electrical impedance tomography (Ell). Parallel digital data bits are required to operate the multiplexers used, generally, for electrode switching in ELT. More the electrodes in an EIT system more the digital data bits are needed. For a sixteen electrode system. 16 parallel digital data bits are required to operate the multiplexers in opposite or neighbouring current injection method. In this paper a common ground current injection is proposed for EIT and the resistivity imaging is studied. Common ground method needs only two analog multiplexers each of which need only 4 digital data bits and hence only 8 digital bits are required to switch the 16 surface electrodes. Results show that the USB based data acquisition system sequentially generate digital data required for multiplexers operating in common ground current injection method. The profile of the boundary data collected from practical phantom show that the multiplexers are operating in the required sequence in common ground current injection protocol. The voltage peaks obtained for all the inhomogeneity configurations are found at the accurate positions in the boundary data matrix which proved the sequential operation of multiplexers. Resistivity images reconstructed from the boundary data collected from the practical phantom with different configurations also show that the entire digital data generation module is functioning properly. Reconstructed images and their image parameters proved that the boundary data are successfully acquired by the DAQ system which in turn indicates a sequential and proper operation of multiplexers.