914 resultados para level of fault-tolerance
Resumo:
Technology scaling has proceeded into dimensions in which the reliability of manufactured devices is becoming endangered. The reliability decrease is a consequence of physical limitations, relative increase of variations, and decreasing noise margins, among others. A promising solution for bringing the reliability of circuits back to a desired level is the use of design methods which introduce tolerance against possible faults in an integrated circuit. This thesis studies and presents fault tolerance methods for network-onchip (NoC) which is a design paradigm targeted for very large systems-onchip. In a NoC resources, such as processors and memories, are connected to a communication network; comparable to the Internet. Fault tolerance in such a system can be achieved at many abstraction levels. The thesis studies the origin of faults in modern technologies and explains the classification to transient, intermittent and permanent faults. A survey of fault tolerance methods is presented to demonstrate the diversity of available methods. Networks-on-chip are approached by exploring their main design choices: the selection of a topology, routing protocol, and flow control method. Fault tolerance methods for NoCs are studied at different layers of the OSI reference model. The data link layer provides a reliable communication link over a physical channel. Error control coding is an efficient fault tolerance method especially against transient faults at this abstraction level. Error control coding methods suitable for on-chip communication are studied and their implementations presented. Error control coding loses its effectiveness in the presence of intermittent and permanent faults. Therefore, other solutions against them are presented. The introduction of spare wires and split transmissions are shown to provide good tolerance against intermittent and permanent errors and their combination to error control coding is illustrated. At the network layer positioned above the data link layer, fault tolerance can be achieved with the design of fault tolerant network topologies and routing algorithms. Both of these approaches are presented in the thesis together with realizations in the both categories. The thesis concludes that an optimal fault tolerance solution contains carefully co-designed elements from different abstraction levels
Resumo:
This paper presents an architecture (Multi-μ) being implemented to study and develop software based fault tolerant mechanisms for Real-Time Systems, using the Ada language (Ada 95) and Commercial Off-The-Shelf (COTS) components. Several issues regarding fault tolerance are presented and mechanisms to achieve fault tolerance by software active replication in Ada 95 are discussed. The Multi-μ architecture, based on a specifically proposed Fault Tolerance Manager (FTManager), is then described. Finally, some considerations are made about the work being done and essential future developments.
Resumo:
The occurrence of green soybean seed due to forced maturation or premature plant death caused by drought or foliar and/or root diseases has been common in several Brazilian production areas. Physiological quality of seed lots with green seed may have their germination and vigor potentials affected and therefore discarded by the grain industry. The objective of this experiment was to determine the maximum tolerated level of green seed in soybean seed lots, which is information of major importance for seed producers when taking the decision whether to sell these lots. Soybean seed of the cultivars CD 206, produced in Ubirata, Parana, and FMT Tucunare, produced in Alto Garças, Mato Grosso, were used in the study. Green seed and yellow seed of both cultivars were mixed in the following proportions: 0%, 3%, 6%, 9%, 12%, 15%, 20%, 30%, 40%, 50%, 75% and 100%. Seed quality was evaluated by the germination, accelerated aging, tetrazolium and electrical conductivity tests. The contents of a, b and total chlorophyll in the seed were also determined. A complete randomized block design in a factorial scheme (two cultivars x 12 levels of green seed) was used. Seed quality was negatively affected and chlorophyll contents incremented with the increase in the percentage of green seed. Seed germination, viability and vigor, measured by the accelerated aging test, were not reduced with levels of up to 3% green seed, for both cultivars. Levels above 6% green seed significantly reduced the quality of the seed. The quality of seed lots with 9% or more green seed was significantly reduced to the point that their commercialization is not recommended.
Resumo:
An n-dimensional Mobius cube, 0MQ(n) or 1MQ(n), is a variation of n-dimensional cube Q(n) which possesses many attractive properties such as significantly smaller communication delay and stronger graph-embedding capabilities. In some practical situations, the fault tolerance of a distributed memory multiprocessor system can be measured more precisely by the connectivity of the underlying graph under forbidden fault set models. This article addresses the connectivity of 0MQ(n)/1MQ(n), under two typical forbidden fault set models. We first prove that the connectivity of 0MQ(n)/1MQ(n) is 2n - 2 when the fault set does not contain the neighborhood of any vertex as a subset. We then prove that the connectivity of 0MQ(n)/1MQ(n) is 3n - 5 provided that the neighborhood of any vertex as well as that of any edge cannot fail simultaneously These results demonstrate that 0MQ(n)/1MQ(n) has the same connectivity as Q(n) under either of the previous assumptions.
Resumo:
This paper presents an analysis of the fault tolerance achieved by an autonomous, fully embedded evolvable hardware system, which uses a combination of partial dynamic reconfiguration and an evolutionary algorithm (EA). It demonstrates that the system may self-recover from both transient and cumulative permanent faults. This self-adaptive system, based on a 2D array of 16 (4×4) Processing Elements (PEs), is tested with an image filtering application. Results show that it may properly recover from faults in up to 3 PEs, that is, more than 18% cumulative permanent faults. Two fault models are used for testing purposes, at PE and CLB levels. Two self-healing strategies are also introduced, depending on whether fault diagnosis is available or not. They are based on scrubbing, fitness evaluation, dynamic partial reconfiguration and in-system evolutionary adaptation. Since most of these adaptability features are already available on the system for its normal operation, resource cost for self-healing is very low (only some code additions in the internal microprocessor core)
Resumo:
Immunization of mice with rat type II collagen (CII), a cartilage-specific protein, leads to development of collagen-induced arthritis (CIA), a model for rheumatoid arthritis. To define the interaction between the immune system and cartilage, we produced two sets of transgenic mice. In the first we point mutated the mouse CII gene to express an earlier defined T-cell epitope, CII-(256-270), present in rat CII. In the second we mutated the mouse type I collagen gene to express the same T-cell epitope. The mice with mutated type I collagen showed no T-cell reactivity to rat CII and were resistant to CIA. Thus, the CII-(256-270) epitope is immunodominant and critical for development of CIA. In contrast, the mice with mutated CII had an intact B-cell response and had T cells which could produce gamma interferon, but not proliferate, in response to CII. They developed CIA, albeit with a reduced incidence. Thus, we conclude that T cells recognize CII derived from endogenous cartilage and are partially tolerized but may still be capable of mediating CIA.
Resumo:
This work presents a theoretical-graph method of determining the fault tolerance degree of the computer network interconnections and nodes. Experimental results received from simulations of this method over a distributed computing network environment are also presented.
Resumo:
Dendritic cells (DC) have a key role in controlling the immune response, by determining the outcome of antigen presentation to T cells. Through costimulatory molecules and other factors, DC are involved in the maintenance of peripheral tolerance through modulation of the immune response. This modulation occurs both constitutively, and in inflammation, in order to prevent autoimmunity and to control established immune responses. Dendritic cell control of immune responses may be mediated through cytokine or cell-contact dependent mechanisms. The molecular and cellular basis of these controls is being understood at an increasingly more complex level. This understanding is reaching a level at which DC-based therapies for the induction of immune regulation in autoimmunity can be tested in vivo. This review outlines the current state of knowledge of DC in immune tolerance, and proposes how DC might control both T cell responses, and themselves, to prevent autoimmunity and maintain peripheral tolerance.
Resumo:
It is imperative to accept that failures can and will occur, even in meticulously designed distributed systems, and design proper measures to counter those failures. Passive replication minimises resource consumption by only activating redundant replicas in case of failures, as typically providing and applying state updates is less resource demanding than requesting execution. However, most existing solutions for passive fault tolerance are usually designed and configured at design time, explicitly and statically identifying the most critical components and their number of replicas, lacking the needed flexibility to handle the runtime dynamics of distributed component-based embedded systems. This paper proposes a cost-effective adaptive fault tolerance solution with a significant lower overhead compared to a strict active redundancy-based approach, achieving a high error coverage with the minimum amount of redundancy. The activation of passive replicas is coordinated through a feedback-based coordination model that reduces the complexity of the needed interactions among components until a new collective global service solution is determined, improving the overall maintainability and robustness of the system.
Resumo:
Aluminum (Al) toxicity is a major factor limiting barley growth in acid soils, and genotypes with adequate level of tolerance are needed for improving barley adaptation in Brazil. To study the inheritance of Al tolerance in Brazilian barleys, cultivars Antarctica 1, BR 1 and FM 404 were crossed to sensitive Kearney and PFC 8026, and intercrossed. Parental, F1, F2 and F6 generations were grown in nutrient solution containing 0.03, 0.05 and 0.07 mM of Al and classified for tolerance by the root tip hematoxylin staining assay. Tolerant by sensitive F2 progenies segregated three tolerant to one sensitive, fitting the 3:1 ratio expected for a single gene. The F6 populations segregated one tolerant to one sensitive also fitting a monogenic ratio. The F2 seedlings from crosses among tolerant genotypes scored the same as the parents. Since the population size used would allow detection of recombination as low as 7%, the complete absence of Al sensitive recombinants suggests that tolerance in these cultivars is most probably, controlled by the same gene. Thus, the potential for improving Al tolerance through recombination of these genotypes is very low and different gene sources should be evaluated.
Resumo:
Initial contacts with a T-dependent antigen by mucosal routes may result in oral tolerance, defined as the inhibition of specific antibody formation after subsequent parenteral immunizations with the same antigen. We describe here an additional and permanent consequence of these initial contacts, namely, the blockade of secondary-type responsiveness to subsequent parenteral contacts with the antigen. When repeatedly boosted ip with small doses (3 µg) of ovalbumin (OVA) (or lysozyme), primed B6D2F1 mice showed progressively higher antibody responses. In contrast, mice primed after a single oral exposure to the antigen, although repeatedly boosted, maintained their secondary antibody titers on a level which was inversely proportional to the dose of antigen in the oral pretreatment. This phenomenon also occurred in situations in which oral tolerance was not induced. For example, senile 70-week-old B6D2F1 mice pretreated with a single gavage of 20 mg OVA did not become tolerant, i.e., they formed the same secondary levels of anti-OVA antibodies as non-pretreated mice. However, after 4 weekly challenges with 3 µg OVA ip, orally pretreated mice maintained the same anti-OVA serum levels, whereas the levels of control mice increased sequentially. This "stabilizing" effect of mucosal exposure was dose dependent, occurred with different proteins and was triggered by single or multiple oral or nasal exposures to the antigen.
Resumo:
Processor virtualization for process migration in distributed parallel computing systems has formed a significant component of research on load balancing. In contrast, the potential of processor virtualization for fault tolerance has been addressed minimally. The work reported in this paper is motivated towards extending concepts of processor virtualization towards ‘intelligent cores’ as a means to achieve fault tolerance in distributed parallel computing systems. Intelligent cores are an abstraction of the hardware processing cores, with the incorporation of cognitive capabilities, on which parallel tasks can be executed and migrated. When a processing core executing a task is predicted to fail the task being executed is proactively transferred onto another core. A parallel reduction algorithm incorporating concepts of intelligent cores is implemented on a computer cluster using Adaptive MPI and Charm ++. Preliminary results confirm the feasibility of the approach.
Resumo:
Recent research in multi-agent systems incorporate fault tolerance concepts, but does not explore the extension and implementation of such ideas for large scale parallel computing systems. The work reported in this paper investigates a swarm array computing approach, namely 'Intelligent Agents'. A task to be executed on a parallel computing system is decomposed to sub-tasks and mapped onto agents that traverse an abstracted hardware layer. The agents intercommunicate across processors to share information during the event of a predicted core/processor failure and for successfully completing the task. The feasibility of the approach is validated by simulations on an FPGA using a multi-agent simulator, and implementation of a parallel reduction algorithm on a computer cluster using the Message Passing Interface.
Resumo:
Service-based architectures enable the development of new classes of Grid and distributed applications. One of the main capabilities provided by such systems is the dynamic and flexible integration of services, according to which services are allowed to be a part of more than one distributed system and simultaneously serve different applications. This increased flexibility in system composition makes it difficult to address classical distributed system issues such as fault-tolerance. While it is relatively easy to make an individual service fault-tolerant, improving fault-tolerance of services collaborating in multiple application scenarios is a challenging task. In this paper, we look at the issue of developing fault-tolerant service-based distributed systems, and propose an infrastructure to implement fault tolerance capabilities transparent to services.