40 resultados para Simulacao paralela
Resumo:
The last years have presented an increase in the acceptance and adoption of the parallel processing, as much for scientific computation of high performance as for applications of general intention. This acceptance has been favored mainly for the development of environments with massive parallel processing (MPP - Massively Parallel Processing) and of the distributed computation. A common point between distributed systems and MPPs architectures is the notion of message exchange, that allows the communication between processes. An environment of message exchange consists basically of a communication library that, acting as an extension of the programming languages that allow to the elaboration of applications parallel, such as C, C++ and Fortran. In the development of applications parallel, a basic aspect is on to the analysis of performance of the same ones. Several can be the metric ones used in this analysis: time of execution, efficiency in the use of the processing elements, scalability of the application with respect to the increase in the number of processors or to the increase of the instance of the treat problem. The establishment of models or mechanisms that allow this analysis can be a task sufficiently complicated considering parameters and involved degrees of freedom in the implementation of the parallel application. An joined alternative has been the use of collection tools and visualization of performance data, that allow the user to identify to points of strangulation and sources of inefficiency in an application. For an efficient visualization one becomes necessary to identify and to collect given relative to the execution of the application, stage this called instrumentation. In this work it is presented, initially, a study of the main techniques used in the collection of the performance data, and after that a detailed analysis of the main available tools is made that can be used in architectures parallel of the type to cluster Beowulf with Linux on X86 platform being used libraries of communication based in applications MPI - Message Passing Interface, such as LAM and MPICH. This analysis is validated on applications parallel bars that deal with the problems of the training of neural nets of the type perceptrons using retro-propagation. The gotten conclusions show to the potentiality and easinesses of the analyzed tools.
Resumo:
Artificial neural networks are usually applied to solve complex problems. In problems with more complexity, by increasing the number of layers and neurons, it is possible to achieve greater functional efficiency. Nevertheless, this leads to a greater computational effort. The response time is an important factor in the decision to use neural networks in some systems. Many argue that the computational cost is higher in the training period. However, this phase is held only once. Once the network trained, it is necessary to use the existing computational resources efficiently. In the multicore era, the problem boils down to efficient use of all available processing cores. However, it is necessary to consider the overhead of parallel computing. In this sense, this paper proposes a modular structure that proved to be more suitable for parallel implementations. It is proposed to parallelize the feedforward process of an RNA-type MLP, implemented with OpenMP on a shared memory computer architecture. The research consistes on testing and analizing execution times. Speedup, efficiency and parallel scalability are analyzed. In the proposed approach, by reducing the number of connections between remote neurons, the response time of the network decreases and, consequently, so does the total execution time. The time required for communication and synchronization is directly linked to the number of remote neurons in the network, and so it is necessary to investigate which one is the best distribution of remote connections
Resumo:
The objective of the dissertation was the realization of kinematic modeling of a robotic wheelchair using virtual chains, allowing the wheelchair modeling as a set of robotic manipulator arms forming a cooperative parallel kinematic chain. This document presents the development of a robotic wheelchair to transport people with special needs who overcomes obstacles like a street curb and barriers to accessibility in streets and avenues, including the study of assistive technology, parallel architecture, kinematics modeling, construction and assembly of the prototype robot with the completion of a checklist of problems and barriers to accessibility in several pathways, based on rules, ordinances and existing laws. As a result, simulations were performed on the chair in various states of operation to accomplish the task of going up and down stair with different measures, making the proportional control based on kinematics. To verify the simulated results we developed a prototype robotic wheelchair. This project was developed to provide a better quality of life for people with disabilities
Resumo:
The increasing demand for high performance wireless communication systems has shown the inefficiency of the current model of fixed allocation of the radio spectrum. In this context, cognitive radio appears as a more efficient alternative, by providing opportunistic spectrum access, with the maximum bandwidth possible. To ensure these requirements, it is necessary that the transmitter identify opportunities for transmission and the receiver recognizes the parameters defined for the communication signal. The techniques that use cyclostationary analysis can be applied to problems in either spectrum sensing and modulation classification, even in low signal-to-noise ratio (SNR) environments. However, despite the robustness, one of the main disadvantages of cyclostationarity is the high computational cost for calculating its functions. This work proposes efficient architectures for obtaining cyclostationary features to be employed in either spectrum sensing and automatic modulation classification (AMC). In the context of spectrum sensing, a parallelized algorithm for extracting cyclostationary features of communication signals is presented. The performance of this features extractor parallelization is evaluated by speedup and parallel eficiency metrics. The architecture for spectrum sensing is analyzed for several configuration of false alarm probability, SNR levels and observation time for BPSK and QPSK modulations. In the context of AMC, the reduced alpha-profile is proposed as as a cyclostationary signature calculated for a reduced cyclic frequencies set. This signature is validated by a modulation classification architecture based on pattern matching. The architecture for AMC is investigated for correct classification rates of AM, BPSK, QPSK, MSK and FSK modulations, considering several scenarios of observation length and SNR levels. The numerical results of performance obtained in this work show the eficiency of the proposed architectures
Resumo:
We have developed a theoretical study of magnetic bilayers composed by a ferromagnetic film grown in direct contact on an antiferromagnetic one. We have investigated the interface effects in this systems due to the interfilms coupling. We describe the interface effects by a Heisenberg like coupling with an additional unidirectional anisotropy. In the first approach we assume that the magnetic layers are thick enough to be described by the bulk parameters and they are coupled through the interaction between the magnetic moments located at the interface. We use this approach to calculate the modified dynamical response of each material. We use the magnetic permeability of the layers (with corrections introduced by interface interactions) to obtain a correlation between the interface characteristics and the physical behavior of the magnetic excitations propagating in the system. In the second model, we calculated an effective susceptibility of the system considering a nearly microscopical approach. The dynamic response obtained by this approach was used to study the modifications in the spectrum of the polaritons and its consequences on the attenuated total reflection (ATR). In addition, we have calculated the oblique reflectivity. We compare our result with those obtained for the dispersion relation of the magnetostatic modes in these systems
Resumo:
We study the critical behavior of the one-dimensional pair contact process (PCP), using the Monte Carlo method for several lattice sizes and three different updating: random, sequential and parallel. We also added a small modification to the model, called Monte Carlo com Ressucitamento" (MCR), which consists of resuscitating one particle when the order parameter goes to zero. This was done because it is difficult to accurately determine the critical point of the model, since the order parameter(particle pair density) rapidly goes to zero using the traditional approach. With the MCR, the order parameter becomes null in a softer way, allowing us to use finite-size scaling to determine the critical point and the critical exponents β, ν and z. Our results are consistent with the ones already found in literature for this model, showing that not only the process of resuscitating one particle does not change the critical behavior of the system, it also makes it easier to determine the critical point and critical exponents of the model. This extension to the Monte Carlo method has already been used in other contact process models, leading us to believe its usefulness to study several others non-equilibrium models
Resumo:
Neste trabalho investigamos aspectos da propagação de danos em sistemas cooperativos, descritos por modelos de variáveis discretas (spins), mutuamente interagentes, distribuídas nos sítios de uma rede regular. Os seguintes casos foram examinados: (i) A influência do tipo de atualização (paralela ou sequencial) das configurações microscópicas, durante o processo de simulação computacional de Monte Carlo, no modelo de Ising em uma rede triangular. Observamos que a atualização sequencial produz uma transição de fase dinâmica (Caótica- Congelada) a uma temperatura TD ≈TC (Temperatura de Curie), para acoplamentos ferromagnéticos (TC=3.6409J/Kb) e antiferromagnéticos (TC=0). A atualização paralela, que neste caso é incapaz de diferenciar os dois tipos de acoplamentos, leva a uma transição em TD ≠TC; (ii) Um estudo do modelo de Ising na rede quadrada, com diluição temperada de sítios, mostrou que a técnica de propagação de danos é um eficiente método para o cálculo da fronteira crítica e da dimensão fractal do aglomerado percolante, já que os resultados obtidos (apesar de um esforço computacional relativamente modesto), são comparáveis àqueles resultantes da aplicação de outros métodos analíticos e/ou computacionais de alto empenho; (iii) Finalmente, apresentamos resultados analíticos que mostram como certas combinações especiais de danos podem ser utilizadas para o cálculo de grandezas termodinâmicas (parâmetros de ordem, funções de correlação e susceptibilidades) do modelo Nα x Nβ, o qual contém como casos particulares alguns dos modelos mais estudados em Mecânica Estatística (Ising, Potts, Ashkin Teller e Cúbico)
Resumo:
Iron nitrite films, with hundred of nanometers thick, were deposited using the Cathodic cage plasma nitriding method, with a N2/H2 plasma, over a common glass substract. The structure, surface morphology and magnetic properties were investigated using X-ray diffractometry (XRD), atomic force microscopy (AFM) and vibrating sample magnetometer (VSM). XRD shows the formation of γ FeN phase and a combination of ζFe2N + ɛFe3N phases. The film s saturation magnetization and coercivity depends on morphology, composition, grain size and treatment temperature. Temperature raising from 250 ºC to 350 ºC were followed by an increase in saturation magnetization and film s surface coercivity on the parallel direction in relative proportion. This fact can be attributed to the grain sizes and to the different phases formed, since iron rich fases, like the ɛFe3N phase, emerges more frequently on more elevated treatment s temperature. Using this new and reasonably low cost method, it was possible to deposit films with both good adhesion and good magnetic properties, with wide application in magnetic devices
Resumo:
While providing physical and psychological benefits, excessive exercise could be or cause a compulsive behavior, making the individual dependent on it. In a parallel discussion, computerized psychological instruments, for a hand, reflects the development of information technology and your applicability to other areas, but also shows little advance for Psychological Assessment. In this perspective, this study aims to adapt the Exercise Dependence Scale (EDS-R) in two formats (paper-and-pencil and computerized) and evaluate evidence of factorial and convergent validity, and reliability of each version and compare them with each other. It is also proposed to observe the relationship of some bio-demographic (Sex, age, frequency, duration and intensity of practice exercise) and the exercise dependence (DEF). For this purpose, 709 regular physical activity practitioners, selected by procedures non-probabilistic sampling, responded a adapted version of EDS-R, Muscle Appearance Satisfaction Scale (MASS), Body Modification Scale (BMS) and a demographic questionnaire, analyzed through Exploratory Factor Analysis, Cronbach's Alpha and not parametric tests. Both the traditional version and the computer showed a seven factors structure, explaining 57 and 62% of the variance, respectively, and Cronbach's alphas of 0.83 and 0.89. Factors were: (1) intentionality, (2) continuity, (3) tolerance, (4) reduction of other activities, (5) lack of control, (6) abstinence and (7) time spent on exercise. Relationships were observed between the Exercise Dependence and the variables: age, diets, consumption of food supplements and medicines for weight change, desire to do plastic surgery and body satisfaction. We observed also a positive correlation between the DEF and the frequency, duration and intensity of exercise, and the factor "Dependence on exercising" from MASS, indicating convergent validity of the EDS-R. Finally, comparisons between the two formats were equivalent, with few changes: computerized version achieved higher DEF scores. Based on these results, it can be concluded that the EDS-R has factorial and convergent validity, reliability, to measure exerceise dependence on traditional e computerized formats. DEF is related to actions used to body modification and behaviors toward exercise. Finally, it was found equivalence between the formats, especially in psychometric parameters, thus suggesting feasibility of a computerized assessment. However, it was observed that the computerized data has sample recruiting strategies more limited
Resumo:
It bet on the next generation of computers as architecture with multiple processors and/or multicore processors. In this sense there are challenges related to features interconnection, operating frequency, the area on chip, power dissipation, performance and programmability. The mechanism of interconnection and communication it was considered ideal for this type of architecture are the networks-on-chip, due its scalability, reusability and intrinsic parallelism. The networks-on-chip communication is accomplished by transmitting packets that carry data and instructions that represent requests and responses between the processing elements interconnected by the network. The transmission of packets is accomplished as in a pipeline between the routers in the network, from source to destination of the communication, even allowing simultaneous communications between pairs of different sources and destinations. From this fact, it is proposed to transform the entire infrastructure communication of network-on-chip, using the routing mechanisms, arbitration and storage, in a parallel processing system for high performance. In this proposal, the packages are formed by instructions and data that represent the applications, which are executed on routers as well as they are transmitted, using the pipeline and parallel communication transmissions. In contrast, traditional processors are not used, but only single cores that control the access to memory. An implementation of this idea is called IPNoSys (Integrated Processing NoC System), which has an own programming model and a routing algorithm that guarantees the execution of all instructions in the packets, preventing situations of deadlock, livelock and starvation. This architecture provides mechanisms for input and output, interruption and operating system support. As proof of concept was developed a programming environment and a simulator for this architecture in SystemC, which allows configuration of various parameters and to obtain several results to evaluate it
Resumo:
The vascular segmentation is important in diagnosing vascular diseases like stroke and is hampered by noise in the image and very thin vessels that can pass unnoticed. One way to accomplish the segmentation is extracting the centerline of the vessel with height ridges, which uses the intensity as features for segmentation. This process can take from seconds to minutes, depending on the current technology employed. In order to accelerate the segmentation method proposed by Aylward [Aylward & Bullitt 2002] we have adapted it to run in parallel using CUDA architecture. The performance of the segmentation method running on GPU is compared to both the same method running on CPU and the original Aylward s method running also in CPU. The improvemente of the new method over the original one is twofold: the starting point for the segmentation process is not a single point in the blood vessel but a volume, thereby making it easier for the user to segment a region of interest, and; the overall gain method was 873 times faster running on GPU and 150 times more fast running on the CPU than the original CPU in Aylward
Resumo:
A remoção de inconsistências em um projeto é menos custosa quando realizadas nas etapas iniciais da sua concepção. A utilização de Métodos Formais melhora a compreensão dos sistemas além de possuir diversas técnicas, como a especificação e verificação formal, para identificar essas inconsistências nas etapas iniciais de um projeto. Porém, a transformação de uma especificação formal para uma linguagem de programação é uma tarefa não trivial. Quando feita manualmente, é uma tarefa passível da inserção de erros. O uso de ferramentas que auxiliem esta etapa pode proporcionar grandes benefícios ao produto final a ser desenvolvido. Este trabalho propõe a extensão de uma ferramenta cujo foco é a tradução automática de especificações em CSPm para Handel-C. CSP é uma linguagem de descrição formal adequada para trabalhar com sistemas concorrentes. Handel-C é uma linguagem de programação cujo resultado pode ser compilado diretamente para FPGA's. A extensão consiste no aumento no número de operadores CSPm aceitos pela ferramenta, permitindo ao usuário definir processos locais, renomear canais e utilizar guarda booleana em escolhas externas. Além disto, propomos também a implementação de um protocolo de comunicação que elimina algumas restrições da composição paralela de processos na tradução para Handel-C, permitindo que a comunicação entre múltiplos processos possa ser mapeada de maneira consistente e que a mesma somente ocorra quando for autorizada.
Resumo:
Removing inconsistencies in a project is a less expensive activity when done in the early steps of design. The use of formal methods improves the understanding of systems. They have various techniques such as formal specification and verification to identify these problems in the initial stages of a project. However, the transformation from a formal specification into a programming language is a non-trivial task and error prone, specially when done manually. The aid of tools at this stage can bring great benefits to the final product to be developed. This paper proposes the extension of a tool whose focus is the automatic translation of specifications written in CSPM into Handel-C. CSP is a formal description language suitable for concurrent systems, and CSPM is the notation used in tools support. Handel-C is a programming language whose result can be compiled directly into FPGA s. Our extension increases the number of CSPM operators accepted by the tool, allowing the user to define local processes, to rename channels in a process and to use Boolean guards on external choices. In addition, we also propose the implementation of a communication protocol that eliminates some restrictions on parallel composition of processes in the translation into Handel-C, allowing communication in a same channel between multiple processes to be mapped in a consistent manner and that improper communication in a channel does not ocurr in the generated code, ie, communications that are not allowed in the system specification
Resumo:
Baixo Vermelho area, situated on the northern portion of Umbuzeiro Graben (onshore Potiguar Basin), represents a typical example of a rift basin, characterized, in subsurface, by the sedimentary rift sequence, correlated to Pendência Formation (Valanginian-Barremian), and by the Carnaubais fault system. In this context, two main goals, the stratigraphic and the structural analysis, had guided the research. For this purpose, it was used the 3D seismic volume and eight wells located in the study area and adjacencies. The stratigraphic analysis of the Valanginian-Barremian interval was carried through in two distinct phases, 1D and 2D, in which the basic concepts of the sequence stratigraphy had been adapted. In these phases, the individual analysis of each well and the correlation between them, allowed to recognize the main lithofacies, to interpret the effective depositional systems and to identify the genetic units and key-surfaces of chronostratigraphic character. The analyzed lithofacies are represented predominantly by conglomerates, sandstones, siltites and shales, with carbonate rocks and marls occurring subordinately. According to these lithofacies associations, it is possible to interpret the following depositional systems: alluvial fan, fluvio-deltaic and lacustrine depositional systems. The alluvial fan system is mainly composed by conglomerates deposits, which had developed, preferentially in the south portion of the area, being directly associated to Carnaubais fault system. The fluvial-deltaic system, in turn, was mainly developed in the northwest portion of the area, at the flexural edge, being characterized by coarse sandstones with shales and siltites intercalated. On the other hand, the lacustrine system, the most dominant one in the study area, is formed mainly by shales that could occur intercalated with thin layers of fine to very fine sandstones, interpreted as turbidite deposits. The recognized sequence stratigraphy units in the wells are represented by parasequence sets, systems tracts and depositional sequences. The parasequence sets, which are progradational or retrogradational, had been grouped and related to the systems tracts. The predominance of the progradation parasequence sets (general trend with coarsening-upward) characterizes the Regressive Systems Tract, while the occurrence, more frequently, of the retrogradation parasequence sets (general trend with finning-upward) represents the Transgressive System Tract. In the seismic stratigraphic analysis, the lithofacies described in the wells had been related to chaotic, progradational and parallel/subparallel seismic facies, which are associated, frequently, to the alluvial fans, fluvial-deltaic and lacustrine depositional systems, respectively. In this analysis, it was possible to recognize fifteen seismic horizons that correspond to sequence boundaries and to maximum flooding surfaces, which separates Transgressive to Regressive systems tracts. The recognition of transgressive-regressive cycles allowed to identify nine, possibly, 3a order deposicional sequences, related to the tectonic-sedimentary cycles. The structural analysis, in turn, was done at Baixo Vermelho seismic volume, which shows, clearly, the structural complexity printed in the area, mainly related to Carnaubais fault system, acting as an important fault system of the rift edge. This fault system is characterized by a main arrangement of normal faults with trend NE-SO, where Carnaubais Fault represents the maximum expression of these lineations. Carnaubais Fault corresponds to a fault with typically listric geometry, with general trend N70°E, dipping to northwest. It is observed, throughout all the seismic volume, with variations in its surface, which had conditioned, in its evolutive stages, the formation of innumerable structural features that normally are identified in Pendencia Formation. In this unit, part of these features is related to the formation of longitudinal foldings (rollover structures and distentional folding associated), originated by the displacement of the main fault plan, propitiating variations in geometry and thickness of the adjacent layers, which had been deposited at the same time. Other structural features are related to the secondary faultings, which could be synthetic or antithetic to Carnaubais Fault. In a general way, these faults have limited lateral continuity, with listric planar format and, apparently, they play the role of the accomodation of the distentional deformation printed in the area. Thus, the interaction between the stratigraphic and structural analysis, based on an excellent quality of the used data, allowed to get one better agreement on the tectonicsedimentary evolution of the Valanginian-Barremian interval (Pendência Formation) in the studied area
Resumo:
Geological and geophysical studies (resistivity, self potential and VLF) were undertaken in the Tararaca and Santa Rita farms, respectively close to the Santo Antônio and Santa Cruz villages, eastern Rio Grande do Norte State, NE Brazil. Their aim was to characterize water acummulation structures in crystalline rocks. Based on geological and geophysical data, two models were characterized, the fracture-stream and the eluvio-alluvial through, in part already described in the literature. In the Tararaca Farm, a water well was located in a NW-trending streamlet; surrounding outcrops display fractures with the same orientation. Apparent resistivity sections, accross the stream channel, confirm fracturing at depth. The VLF profiles systematically display an alignment of equivalent current density anomalies, coinciding with the stream. Based on such data, the classical fracture-stream model seems to be well characterized at this place. In the Santa Rita Farm, a NE-trending stream display a metric-thick eluvioregolith-alluvial cover. The outcropping bedrock do not present fractures paralell to the stream direction, although the latter coincides with the trend of the gneiss foliation, which dips to the south. Geophysical data confirm the absence of a fracture zone at this place, but delineate the borders of a through-shaped structure filled with sediments (alluvium and regolith). The southern border of this structure dips steeper compared to the northern one. This water acummulation structure corresponds to an alternative model as regards to the classical fracture-stream, being named as the eluvio-alluvial trough. Its local controls are the drainage and relief, coupled with the bedrock weathering preferentially following foliation planes, generating the asymmetry of the through