Passive Fault-Tolerance Management in Component-based Embedded Systems


Autoria(s): Nogueira, Luis Miguel; Coelho, Jorge
Data(s)

09/12/2015

09/12/2015

2015

Resumo

It is imperative to accept that failures can and will occur, even in meticulously designed distributed systems, and design proper measures to counter those failures. Passive replication minimises resource consumption by only activating redundant replicas in case of failures, as typically providing and applying state updates is less resource demanding than requesting execution. However, most existing solutions for passive fault tolerance are usually designed and configured at design time, explicitly and statically identifying the most critical components and their number of replicas, lacking the needed flexibility to handle the runtime dynamics of distributed component-based embedded systems. This paper proposes a cost-effective adaptive fault tolerance solution with a significant lower overhead compared to a strict active redundancy-based approach, achieving a high error coverage with the minimum amount of redundancy. The activation of passive replicas is coordinated through a feedback-based coordination model that reduces the complexity of the needed interactions among components until a new collective global service solution is determined, improving the overall maintainability and robustness of the system.

Identificador

http://hdl.handle.net/10400.22/7124

Idioma(s)

eng

Publicador

Slovak Academy of Sciences

Relação

ENCOURAGE project, ref. ARTEMIS/0002/2010, JU grant nr. 269354

Computing and Informatics;Vol. 35

Direitos

openAccess

Palavras-Chave #Component-based systems #Embedded real-time systems #Coordination model #Fault-tolerance #Passive replication
Tipo

article