998 resultados para parallel potential


Relevância:

60.00% 60.00%

Publicador:

Resumo:

Simulation has traditionally been used for analyzing the behavior of complex real world problems. Even though only some features of the problems are considered, simulation time tends to become quite high even for common simulation problems. Parallel and distributed simulation is a viable technique for accelerating the simulations. The success of parallel simulation depends heavily on the combination of the simulation application, algorithm and message population in the simulation is sufficient, no additional delay is caused by this environment. In this thesis a conservative, parallel simulation algorithm is applied to the simulation of a cellular network application in a distributed workstation environment. This thesis presents a distributed simulation environment, Diworse, which is based on the use of networked workstations. The distributed environment is considered especially hard for conservative simulation algorithms due to the high cost of communication. In this thesis, however, the distributed environment is shown to be a viable alternative if the amount of communication is kept reasonable. Novel ideas of multiple message simulation and channel reduction enable efficient use of this environment for the simulation of a cellular network application. The distribution of the simulation is based on a modification of the well known Chandy-Misra deadlock avoidance algorithm with null messages. The basic Chandy Misra algorithm is modified by using the null message cancellation and multiple message simulation techniques. The modifications reduce the amount of null messages and the time required for their execution, thus reducing the simulation time required. The null message cancellation technique reduces the processing time of null messages as the arriving null message cancels other non processed null messages. The multiple message simulation forms groups of messages as it simulates several messages before it releases the new created messages. If the message population in the simulation is suffiecient, no additional delay is caused by this operation A new technique for considering the simulation application is also presented. The performance is improved by establishing a neighborhood for the simulation elements. The neighborhood concept is based on a channel reduction technique, where the properties of the application exclusively determine which connections are necessary when a certain accuracy for simulation results is required. Distributed simulation is also analyzed in order to find out the effect of the different elements in the implemented simulation environment. This analysis is performed by using critical path analysis. Critical path analysis allows determination of a lower bound for the simulation time. In this thesis critical times are computed for sequential and parallel traces. The analysis based on sequential traces reveals the parallel properties of the application whereas the analysis based on parallel traces reveals the properties of the environment and the distribution.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Modern embedded systems embrace many-core shared-memory designs. Due to constrained power and area budgets, most of them feature software-managed scratchpad memories instead of data caches to increase the data locality. It is therefore programmers’ responsibility to explicitly manage the memory transfers, and this make programming these platform cumbersome. Moreover, complex modern applications must be adequately parallelized before they can the parallel potential of the platform into actual performance. To support this, programming languages were proposed, which work at a high level of abstraction, and rely on a runtime whose cost hinders performance, especially in embedded systems, where resources and power budget are constrained. This dissertation explores the applicability of the shared-memory paradigm on modern many-core systems, focusing on the ease-of-programming. It focuses on OpenMP, the de-facto standard for shared memory programming. In a first part, the cost of algorithms for synchronization and data partitioning are analyzed, and they are adapted to modern embedded many-cores. Then, the original design of an OpenMP runtime library is presented, which supports complex forms of parallelism such as multi-level and irregular parallelism. In the second part of the thesis, the focus is on heterogeneous systems, where hardware accelerators are coupled to (many-)cores to implement key functional kernels with orders-of-magnitude of speedup and energy efficiency compared to the “pure software” version. However, three main issues rise, namely i) platform design complexity, ii) architectural scalability and iii) programmability. To tackle them, a template for a generic hardware processing unit (HWPU) is proposed, which share the memory banks with cores, and the template for a scalable architecture is shown, which integrates them through the shared-memory system. Then, a full software stack and toolchain are developed to support platform design and to let programmers exploiting the accelerators of the platform. The OpenMP frontend is extended to interact with it.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

The 3-UPU three degrees of freedom fully parallel manipulator, where U and P are for universal and prismatic pair respectively, is a very well known manipulator that can provide the platform with three degrees of freedom of pure translation, pure rotation or mixed translation and rotation with respect to the base, according to the relative directions of the revolute pair axes (each universal pair comprises two revolute pairs with intersecting and perpendicular axes). In particular, pure translational parallel 3-UPU manipulators (3-UPU TPMs) received great attention. Many studies have been reported in the literature on singularities, workspace, and joint clearance influence on the platform accuracy of this manipulator. However, much work has still to be done to reveal all the features this topology can offer to the designer when different architecture, i.e. different geometry are considered. Therefore, this dissertation will focus on this type of the 3-UPU manipulators. The first part of the dissertation presents six new architectures of the 3-UPU TPMs which offer interesting features to the designer. In the second part, a procedure is presented which is based on some indexes, in order to allows the designer to select the best architecture of the 3-UPU TPMs for a given task. Four indexes are proposed as stiffness, clearance, singularity and size of the manipulator in order to apply the procedure.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Rubus niveus Thunb. plant belongs to Rosaceae family and have been used traditionally to treat wounds, burns, inflammation, dysentery, diarrhea and for curing excessive bleeding during menstrual cycle. The present study was undertaken to investigate the in vivo genotoxicity of Rubus niveus aerial parts extract and its possible chemoprotection on doxorubicin (DXR)-induced DNA damage. In parallel, the main phytochemicals constituents in the extract were determined. The animals were exposed to the extract for 24 and 48h, and the doses selected were 500, 1000 and 2000mg/kg b.w. administered by gavage alone or prior to DXR (30mg/kg b.w.) administered by intraperitoneal injection. The endpoints analyzed were DNA damage in bone marrow and peripheral blood cells assessed by the alkaline alkaline (pH>13) comet assay and bone marrow micronucleus test. The results of chemical analysis of the extract showed the presence of tormentic acid, stigmasterol, quercitinglucoronide (miquelianin) and niga-ichigoside F1 as main compounds. Both cytogenetic endpoints analyzed showed that there were no statistically significant differences (p>0.05) between the negative control and the treated groups with the two higher doses of Rubus niveus extract alone, demonstrating absence of genotoxic and mutagenic effects. Aneugenic/clastogenic effect was observed only at 2000mg/kg dose. On the other hand, in the both assays and all tested doses were observed a significant reduction of DNA damage and chromosomal aberrations in all groups co-treated with DXR and extract compared to those which received only DXR. These results indicate that Rubus niveus aerial parts extract did not revealed any genotoxic effect, but presented some aneugenic/clastogenic effect at higher dose; and suggest that it could be a potential adjuvant against development of second malignant neoplasms caused by the cancer chemotherapic DXR.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The main objective of this study was to evaluate the potential application of a lightweight concrete produced with lightweight coarse aggregate made of the water treatment sludge and sawdust (lightweight composite), by determining the thermal properties and possible environmental impact of future residue of this concrete. Two types of concrete were prepared: concrete produced with the lightweight composite dosed with cement/sand/composite/water in a mass ratio of 1:2.5:0.67:0.6 and conventional concrete dosed with cement/sand/crushed stone/water in a mass ratio of 1:4.8:5.8:0.8. The thermal properties were determined by the hot wire parallel technique. The possible environmental impact was measured using the procedures and guidelines of the Brazilian Association of Technical Standards - ABNT. The concrete produced with the lightweight composite presented a 23% lower thermal conductivity than the conventional concrete. The concrete produced with the lightweight composite presented a set of thermal properties suitable for the application of this concrete in non-structural sealing elements. The concentration of aluminum in the solubilized extract of the concrete produced with the lightweight composite was much lower than the concentration of aluminum in the water treatment sludge, confirming the possible reduction of environmental impact of this composite for use in concrete. (C) 2010 Elsevier Ltd. All rights reserved.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

This paper discusses the integrated design of parallel manipulators, which exhibit varying dynamics. This characteristic affects the machine stability and performance. The design methodology consists of four main steps: (i) the system modeling using flexible multibody technique, (ii) the synthesis of reduced-order models suitable for control design, (iii) the systematic flexible model-based input signal design, and (iv) the evaluation of some possible machine designs. The novelty in this methodology is to take structural flexibilities into consideration during the input signal design; therefore, enhancing the standard design process which mainly considers rigid bodies dynamics. The potential of the proposed strategy is exploited for the design evaluation of a two degree-of-freedom high-speed parallel manipulator. The results are experimentally validated. (C) 2010 Elsevier Ltd. All rights reserved.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Dissertação para obtenção do Grau de Mestre em Biotecnologia

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Article in Press, Corrected Proof

Relevância:

30.00% 30.00%

Publicador:

Resumo:

In this study we investigated the efficacy of hyperbaric oxygen (HBO) therapy, alone or combined with the pentavalent antimonial glucantime on Leishmania amazonensis infection. In parallel, the effect of Brazilian red propolis gel (propain) alone or combined with glucantime on L. amazonensis infection was evaluated. The inhibition of the infection in macrophages treated with glucantime in combination with HBO exposition was greater than that of macrophages treated with glucantime alone or HBO alone. The susceptible mouse strain BALB/c infected in the shaved rump with L. amazonensis treated with glucantime and exposed to HBO showed: time points in the course of the disease in which lesions were smaller than those of mice treated with glucantime alone and revascularization of the skin in the lesion site; interferon-gamma (IFN-g) levels were not elevated in lymph node cells from these animals. Propain alone was not efficient against lesions, although less exudative lesions were observed in animals treated with propain alone or combined with glucantime. These results reveal the potential value of HBO and red propolis in combination with glucantime for treating cutaneous leishmaniasis and encourage further studies on the effect of more aggressive HBO, propolis and glucantime therapies on different mouse models of leishmaniasis.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Dissertação para obtenção do Grau de Mestre em Engenharia Informática

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Face à estagnação da tecnologia uniprocessador registada na passada década, aos principais fabricantes de microprocessadores encontraram na tecnologia multi-core a resposta `as crescentes necessidades de processamento do mercado. Durante anos, os desenvolvedores de software viram as suas aplicações acompanhar os ganhos de performance conferidos por cada nova geração de processadores sequenciais, mas `a medida que a capacidade de processamento escala em função do número de processadores, a computação sequencial tem de ser decomposta em várias partes concorrentes que possam executar em paralelo, para que possam utilizar as unidades de processamento adicionais e completar mais rapidamente. A programação paralela implica um paradigma completamente distinto da programação sequencial. Ao contrário dos computadores sequenciais tipificados no modelo de Von Neumann, a heterogeneidade de arquiteturas paralelas requer modelos de programação paralela que abstraiam os programadores dos detalhes da arquitectura e simplifiquem o desenvolvimento de aplicações concorrentes. Os modelos de programação paralela mais populares incitam os programadores a identificar instruções concorrentes na sua lógica de programação, e a especificá-las sob a forma de tarefas que possam ser atribuídas a processadores distintos para executarem em simultâneo. Estas tarefas são tipicamente lançadas durante a execução, e atribuídas aos processadores pelo motor de execução subjacente. Como os requisitos de processamento costumam ser variáveis, e não são conhecidos a priori, o mapeamento de tarefas para processadores tem de ser determinado dinamicamente, em resposta a alterações imprevisíveis dos requisitos de execução. `A medida que o volume da computação cresce, torna-se cada vez menos viável garantir as suas restrições temporais em plataformas uniprocessador. Enquanto os sistemas de tempo real se começam a adaptar ao paradigma de computação paralela, há uma crescente aposta em integrar execuções de tempo real com aplicações interativas no mesmo hardware, num mundo em que a tecnologia se torna cada vez mais pequena, leve, ubíqua, e portável. Esta integração requer soluções de escalonamento que simultaneamente garantam os requisitos temporais das tarefas de tempo real e mantenham um nível aceitável de QoS para as restantes execuções. Para tal, torna-se imperativo que as aplicações de tempo real paralelizem, de forma a minimizar os seus tempos de resposta e maximizar a utilização dos recursos de processamento. Isto introduz uma nova dimensão ao problema do escalonamento, que tem de responder de forma correcta a novos requisitos de execução imprevisíveis e rapidamente conjeturar o mapeamento de tarefas que melhor beneficie os critérios de performance do sistema. A técnica de escalonamento baseado em servidores permite reservar uma fração da capacidade de processamento para a execução de tarefas de tempo real, e assegurar que os efeitos de latência na sua execução não afectam as reservas estipuladas para outras execuções. No caso de tarefas escalonadas pelo tempo de execução máximo, ou tarefas com tempos de execução variáveis, torna-se provável que a largura de banda estipulada não seja consumida por completo. Para melhorar a utilização do sistema, os algoritmos de partilha de largura de banda (capacity-sharing) doam a capacidade não utilizada para a execução de outras tarefas, mantendo as garantias de isolamento entre servidores. Com eficiência comprovada em termos de espaço, tempo, e comunicação, o mecanismo de work-stealing tem vindo a ganhar popularidade como metodologia para o escalonamento de tarefas com paralelismo dinâmico e irregular. O algoritmo p-CSWS combina escalonamento baseado em servidores com capacity-sharing e work-stealing para cobrir as necessidades de escalonamento dos sistemas abertos de tempo real. Enquanto o escalonamento em servidores permite partilhar os recursos de processamento sem interferências a nível dos atrasos, uma nova política de work-stealing que opera sobre o mecanismo de capacity-sharing aplica uma exploração de paralelismo que melhora os tempos de resposta das aplicações e melhora a utilização do sistema. Esta tese propõe uma implementação do algoritmo p-CSWS para o Linux. Em concordância com a estrutura modular do escalonador do Linux, ´e definida uma nova classe de escalonamento que visa avaliar a aplicabilidade da heurística p-CSWS em circunstâncias reais. Ultrapassados os obstáculos intrínsecos `a programação da kernel do Linux, os extensos testes experimentais provam que o p-CSWS ´e mais do que um conceito teórico atrativo, e que a exploração heurística de paralelismo proposta pelo algoritmo beneficia os tempos de resposta das aplicações de tempo real, bem como a performance e eficiência da plataforma multiprocessador.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Breast cancer is the most common cancer among women, being a major public health problem. Worldwide, X-ray mammography is the current gold-standard for medical imaging of breast cancer. However, it has associated some well-known limitations. The false-negative rates, up to 66% in symptomatic women, and the false-positive rates, up to 60%, are a continued source of concern and debate. These drawbacks prompt the development of other imaging techniques for breast cancer detection, in which Digital Breast Tomosynthesis (DBT) is included. DBT is a 3D radiographic technique that reduces the obscuring effect of tissue overlap and appears to address both issues of false-negative and false-positive rates. The 3D images in DBT are only achieved through image reconstruction methods. These methods play an important role in a clinical setting since there is a need to implement a reconstruction process that is both accurate and fast. This dissertation deals with the optimization of iterative algorithms, with parallel computing through an implementation on Graphics Processing Units (GPUs) to make the 3D reconstruction faster using Compute Unified Device Architecture (CUDA). Iterative algorithms have shown to produce the highest quality DBT images, but since they are computationally intensive, their clinical use is currently rejected. These algorithms have the potential to reduce patient dose in DBT scans. A method of integrating CUDA in Interactive Data Language (IDL) is proposed in order to accelerate the DBT image reconstructions. This method has never been attempted before for DBT. In this work the system matrix calculation, the most computationally expensive part of iterative algorithms, is accelerated. A speedup of 1.6 is achieved proving the fact that GPUs can accelerate the IDL implementation.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Nowadays a huge attention of the academia and research teams is attracted to the potential of the usage of the 60 GHz frequency band in the wireless communications. The use of the 60GHz frequency band offers great possibilities for wide variety of applications that are yet to be implemented. These applications also imply huge implementation challenges. Such example is building a high data rate transceiver which at the same time would have very low power consumption. In this paper we present a prototype of Single Carrier -SC transceiver system, illustrating a brief overview of the baseband design, emphasizing the most important decisions that need to be done. A brief overview of the possible approaches when implementing the equalizer, as the most complex module in the SC transceiver, is also presented. The main focus of this paper is to suggest a parallel architecture for the receiver in a Single Carrier communication system. This would provide higher data rates that the communication system canachieve, for a price of higher power consumption. The suggested architecture of such receiver is illustrated in this paper,giving the results of its implementation in comparison with its corresponding serial implementation.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The intraerythrocytic malarial parasite is involved in an extremely intensive anabolic activity while it resides in its metabolically quiescent host cell. The necessary fast uptake of nutrients and the discharge of waste product, are guaranteed by parasite-induced alterations of the constitutive transporters of the host cell and the production of new parallel pathways. The membrane of the host cell thus becomes permeable to phospholipids, purine bases and nucleosides, small non-electrolytes, anions and cations. When the new pathways are quantitatively unimportant, classical inhibitors of native transporters can be used to inhibit parasite growth. Several compounds were found to effectively inhibit the new pathways and consequently, parasite growth. The pathways have also been used to introduce cytotoxic agents. The parasitophorous membrane consists of channels which are highly permeable to small solutes and display no ion selectivity. Transport of some cations and anions across the parasite membrane is rapid and insensitive to classical inhibitors, and in some cases it is mediated by specific antiporters which respond to their respective inhibitors. Macromolecules have been shown to reach the parasitophorous space through a duct contiguous with the host cell membrane, and subsequently to be endocytosed at the parasite membrane. The simultaneous presence of the parasitophorous membrane channels and the duct, however, is incompatible with experimental evidences. No specific inhibitors were found as yet that would efficiently inhibit transport through the channels or the duct.