914 resultados para Acceleration


Relevância:

20.00% 20.00%

Publicador:

Resumo:

We consider a second-order variational problem depending on the covariant acceleration, which is related to the notion of Riemannian cubic polynomials. This problem and the corresponding optimal control problem are described in the context of higher order tangent bundles using geometric tools. The main tool, a presymplectic variant of Pontryagin’s maximum principle, allows us to study the dynamics of the control problem.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Reconfigurable platforms are a promising technology that offers an interesting trade-off between flexibility and performance, which many recent embedded system applications demand, especially in fields such as multimedia processing. These applications typically involve multiple ad-hoc tasks for hardware acceleration, which are usually represented using formalisms such as Data Flow Diagrams (DFDs), Data Flow Graphs (DFGs), Control and Data Flow Graphs (CDFGs) or Petri Nets. However, none of these models is able to capture at the same time the pipeline behavior between tasks (that therefore can coexist in order to minimize the application execution time), their communication patterns, and their data dependencies. This paper proves that the knowledge of all this information can be effectively exploited to reduce the resource requirements and the timing performance of modern reconfigurable systems, where a set of hardware accelerators is used to support the computation. For this purpose, this paper proposes a novel task representation model, named Temporal Constrained Data Flow Diagram (TCDFD), which includes all this information. This paper also presents a mapping-scheduling algorithm that is able to take advantage of the new TCDFD model. It aims at minimizing the dynamic reconfiguration overhead while meeting the communication requirements among the tasks. Experimental results show that the presented approach achieves up to 75% of resources saving and up to 89% of reconfiguration overhead reduction with respect to other state-of-the-art techniques for reconfigurable platforms.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Today, modern System-on-a-Chip (SoC) systems have grown rapidly due to the increased processing power, while maintaining the size of the hardware circuit. The number of transistors on a chip continues to increase, but current SoC designs may not be able to exploit the potential performance, especially with energy consumption and chip area becoming two major concerns. Traditional SoC designs usually separate software and hardware. Thus, the process of improving the system performance is a complicated task for both software and hardware designers. The aim of this research is to develop hardware acceleration workflow for software applications. Thus, system performance can be improved with constraints of energy consumption and on-chip resource costs. The characteristics of software applications can be identified by using profiling tools. Hardware acceleration can have significant performance improvement for highly mathematical calculations or repeated functions. The performance of SoC systems can then be improved, if the hardware acceleration method is used to accelerate the element that incurs performance overheads. The concepts mentioned in this study can be easily applied to a variety of sophisticated software applications. The contributions of SoC-based hardware acceleration in the hardware-software co-design platform include the following: (1) Software profiling methods are applied to H.264 Coder-Decoder (CODEC) core. The hotspot function of aimed application is identified by using critical attributes such as cycles per loop, loop rounds, etc. (2) Hardware acceleration method based on Field-Programmable Gate Array (FPGA) is used to resolve system bottlenecks and improve system performance. The identified hotspot function is then converted to a hardware accelerator and mapped onto the hardware platform. Two types of hardware acceleration methods – central bus design and co-processor design, are implemented for comparison in the proposed architecture. (3) System specifications, such as performance, energy consumption, and resource costs, are measured and analyzed. The trade-off of these three factors is compared and balanced. Different hardware accelerators are implemented and evaluated based on system requirements. 4) The system verification platform is designed based on Integrated Circuit (IC) workflow. Hardware optimization techniques are used for higher performance and less resource costs. Experimental results show that the proposed hardware acceleration workflow for software applications is an efficient technique. The system can reach 2.8X performance improvements and save 31.84% energy consumption by applying the Bus-IP design. The Co-processor design can have 7.9X performance and save 75.85% energy consumption.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Diffuse radio emission in galaxy clusters has been observed with different size and properties. Giant radio halos (RH), Mpc-size sources found in merging clusters, and mini halos (MH), 0.1-0.5 Mpc size sources located in relaxed cool-core clusters, are thought to be distinct classes of objects with different formation mechanisms. However, recent observations have revealed the unexpected presence of diffuse emission on Mpc-scales in relaxed clusters that host a central MH and show no signs of major mergers. The study of these sources is still at the beginning and it is not yet clear what could be the origin of their unusual emission. The main goal of this thesis is to test the occurrence of these peculiar sources and investigate their properties using low frequency radio observations. This thesis consists in the study of a sample of 12 cool-core galaxy clusters which present some level of dynamical disturbances on large-scale. The heterogeneity of sources in the sample allowed me to investigate under which conditions a halo-type emission is present in MH clusters; and also to study the connection between AGN bubbles and the local environment. Using high sensitivity LOFAR observations, I have detected large-scale emission in four non-merging clusters, in addition to the central MH. I have constrained for the first time the spectral properties of diffuse emission in these double radio component galaxy clusters, and I have investigated the connection between their thermal and non-thermal emission for a better comprehension of the acceleration mechanism. Furthermore, I derived upper limits to the halo power for the other clusters in the sample, which could present large-scale diffuse emission under the detection threshold. Finally, I have reconstructed the duty-cycle of one of the most powerful AGN known, located at the centre of a galaxy cluster of the sample.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Embedding intelligence in extreme edge devices allows distilling raw data acquired from sensors into actionable information, directly on IoT end-nodes. This computing paradigm, in which end-nodes no longer depend entirely on the Cloud, offers undeniable benefits, driving a large research area (TinyML) to deploy leading Machine Learning (ML) algorithms on micro-controller class of devices. To fit the limited memory storage capability of these tiny platforms, full-precision Deep Neural Networks (DNNs) are compressed by representing their data down to byte and sub-byte formats, in the integer domain. However, the current generation of micro-controller systems can barely cope with the computing requirements of QNNs. This thesis tackles the challenge from many perspectives, presenting solutions both at software and hardware levels, exploiting parallelism, heterogeneity and software programmability to guarantee high flexibility and high energy-performance proportionality. The first contribution, PULP-NN, is an optimized software computing library for QNN inference on parallel ultra-low-power (PULP) clusters of RISC-V processors, showing one order of magnitude improvements in performance and energy efficiency, compared to current State-of-the-Art (SoA) STM32 micro-controller systems (MCUs) based on ARM Cortex-M cores. The second contribution is XpulpNN, a set of RISC-V domain specific instruction set architecture (ISA) extensions to deal with sub-byte integer arithmetic computation. The solution, including the ISA extensions and the micro-architecture to support them, achieves energy efficiency comparable with dedicated DNN accelerators and surpasses the efficiency of SoA ARM Cortex-M based MCUs, such as the low-end STM32M4 and the high-end STM32H7 devices, by up to three orders of magnitude. To overcome the Von Neumann bottleneck while guaranteeing the highest flexibility, the final contribution integrates an Analog In-Memory Computing accelerator into the PULP cluster, creating a fully programmable heterogeneous fabric that demonstrates end-to-end inference capabilities of SoA MobileNetV2 models, showing two orders of magnitude performance improvements over current SoA analog/digital solutions.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The pervasive availability of connected devices in any industrial and societal sector is pushing for an evolution of the well-established cloud computing model. The emerging paradigm of the cloud continuum embraces this decentralization trend and envisions virtualized computing resources physically located between traditional datacenters and data sources. By totally or partially executing closer to the network edge, applications can have quicker reactions to events, thus enabling advanced forms of automation and intelligence. However, these applications also induce new data-intensive workloads with low-latency constraints that require the adoption of specialized resources, such as high-performance communication options (e.g., RDMA, DPDK, XDP, etc.). Unfortunately, cloud providers still struggle to integrate these options into their infrastructures. That risks undermining the principle of generality that underlies the cloud computing scale economy by forcing developers to tailor their code to low-level APIs, non-standard programming models, and static execution environments. This thesis proposes a novel system architecture to empower cloud platforms across the whole cloud continuum with Network Acceleration as a Service (NAaaS). To provide commodity yet efficient access to acceleration, this architecture defines a layer of agnostic high-performance I/O APIs, exposed to applications and clearly separated from the heterogeneous protocols, interfaces, and hardware devices that implement it. A novel system component embodies this decoupling by offering a set of agnostic OS features to applications: memory management for zero-copy transfers, asynchronous I/O processing, and efficient packet scheduling. This thesis also explores the design space of the possible implementations of this architecture by proposing two reference middleware systems and by adopting them to support interactive use cases in the cloud continuum: a serverless platform and an Industry 4.0 scenario. A detailed discussion and a thorough performance evaluation demonstrate that the proposed architecture is suitable to enable the easy-to-use, flexible integration of modern network acceleration into next-generation cloud platforms.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Negli ultimi anni la necessità di processare e mantenere dati di qualsiasi natura è aumentata considerevolmente, in aggiunta a questo, l’obsolescenza del modello centralizzato ha contribuito alla sempre più frequente adozione del modello distribuito. Inevitabile dunque l’aumento di traffico che attraversa i nodi appartenenti alle infrastrutture, un traffico sempre più in aumento e che con l’avvento dell’IoT, dei Big Data, del Cloud Computing, del Serverless Computing etc., ha raggiunto picchi elevatissimi. Basti pensare che se prima i dati erano contenuti in loco, oggi non è assurdo pensare che l’archiviazione dei propri dati sia completamente affidata a terzi. Così come cresce, quindi, il traffico che attraversa i nodi facenti parte di un’infrastruttura, cresce la necessità che questo traffico sia filtrato e gestito dai nodi stessi. L’obbiettivo di questa tesi è quello di estendere un Message-oriented Middleware, in grado di garantire diverse qualità di servizio per la consegna di messaggi, in modo da accelerarne la fase di routing verso i nodi destinazione. L’estensione consiste nell’aggiungere al Message-oriented Middleware, precedentemente implementato, la funzione di intercettare i pacchetti in arrivo (che nel caso del middleware in questione possono rappresentare la propagazione di eventi) e redirigerli verso un nuovo nodo in base ad alcuni parametri. Il Message-oriented Middleware oggetto di tesi sarà considerato il message broker di un modello pub/sub, pertanto la redirezione deve avvenire con tempi molto bassi di latenza e, a tal proposito, deve avvenire senza l’uscita dal kernel space del sistema operativo. Per questo motivo si è deciso di utilizzare eBPF, in particolare il modulo XDP, che permette di scrivere programmi che eseguono all’interno del kernel.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

This is an ecological, analytical and retrospective study comprising the 645 municipalities in the State of São Paulo, the scope of which was to determine the relationship between socioeconomic, demographic variables and the model of care in relation to infant mortality rates in the period from 1998 to 2008. The ratio of average annual change for each indicator per stratum coverage was calculated. Infant mortality was analyzed according to the model for repeated measures over time, adjusted for the following correction variables: the city's population, proportion of Family Health Programs (PSFs) deployed, proportion of Growth Acceleration Programs (PACs) deployed, per capita GDP and SPSRI (São Paulo social responsibility index). The analysis was performed by generalized linear models, considering the gamma distribution. Multiple comparisons were performed with the likelihood ratio with chi-square approximate distribution, considering a significance level of 5%. There was a decrease in infant mortality over the years (p < 0.05), with no significant difference from 2004 to 2008 (p > 0.05). The proportion of PSFs deployed (p < 0.0001) and per capita GDP (p < 0.0001) were significant in the model. The decline of infant mortality in this period was influenced by the growth of per capita GDP and PSFs.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

The association between tridimensional scaffolds to cells of interest has provided excellent perspectives for obtaining viable complex tissues in vitro, such as skin, resulting in impressive advances in the field of tissue engineering applied to regenerative therapies. The use of multipotent mesenchymal stromal cells in the treatment of dermo-epidermal wounds is particularly promising due to several relevant properties of these cells, such as high capacity of proliferation in culture, potential of differentiation in multiple skin cell types, important paracrine and immunomodulatory effects, among others. Membranes of chitosan complexed with xanthan may be potentially useful as scaffolds for multipotent mesenchymal stromal cells, given that they present suitable physico-chemical characteristics and have adequate tridimensional structure for the adhesion, growth, and maintenance of cell function. Therefore, the purpose of this work was to assess the applicability of bioactive dressings associating dense and porous chitosan-xanthan membranes to multipotent mesenchymal stromal cells for the treatment of skin wounds. The membranes showed to be non-mutagenic and allowed efficient adhesion and proliferation of the mesenchymal stromal cells in vitro. In vivo assays performed with mesenchymal stromal cells grown on the surface of the dense membranes showed acceleration of wound healing in Wistar rats, thus indicating that the use of this cell-scaffold association for tissue engineering purposes is feasible and attractive.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Tomatoes are one of the most important vegetable crops grown in Brazil and are among the crops that have one of the highest post-harvest losses indexes in the country. The present work aimed at evaluating impact damage observed in packing lines of fresh tomatoes as well as to determine, under laboratory conditions, quality alterations of tomato fruits submitted to impact damage in different surface types. Critical points evaluation was accomplished using an instrumented sphere. Critical transference points found showed variations in acceleration levels from 30 to 129 G (m s-2). Tests carried out under laboratory conditions showed that padded surfaces reduced up to 31% impact damage. Incidence of severe internal physical damage was evaluated by a subjective scale and increased by 79% on hard surfaces for the highest fall drop. On the other hand, it was observed an effective reduction in physical damage on fruits when padded surfaces were used. When a 10-cm drop was performed, the maximum reduction measured was 10% for hard surfaces and 5% for previously padded surfaces. For quality parameters, it was observed for high drops on hard surfaces, highest values for weight loss, total acidity, lower values for vitamin C and Soluble Solids.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Excessive and inadequate handling of fruits and vegetables provides high incidences of physical damage, consequently, post harvest losses. The main goal of this work was to evaluate the impact magnitude in persimmon packing lines, Rama Forte, and to determine, at the laboratory, its impact limits. For evaluating the critical points it was used an instrumented sphere of 76 mm of diameter (Technmark, Inc, Lansing, USA), which registered the impact magnitude in seven distinctive impact lines located in four packing houses. For determining physical damages, tests were carried out at the laboratory, where fruit drop was related to impact magnitude, physical damage incidence and fruit post harvest losses. At the packing lines, the values found varied from 21 to 87 G on the transfer points and the majority of registered impacts (over 94%) were down 50G. Drops from 20 cm caused an increase in weight losses after six days of storage at room temperature. Drops from 20 and 30 cm caused skin darkness (low L values), associated to a decrease in color intensity (chroma). Impact drop did not affect pulp fruit chemical features.