Fault-tolerant scheduling with dynamic number of replicas in heterogeneous systems


Autoria(s): Zhao, Laiping; Ren, Yizhi; Xiang, Yang; Sakurai, Kouichi
Contribuinte(s)

[Unknown]

Data(s)

01/01/2010

Resumo

<p>In the existing studies on fault-tolerant scheduling, the active replication schema makes use of <i>ε</i> + 1 replicas for each task to tolerate <i>ε </i>failures. However, in this paper, we show that it does not always lead to a higher reliability with more replicas. Besides, the more replicas implies more resource consumption and higher economic cost. To address this problem, with the target to satisfy the user’s reliability requirement with minimum resources, this paper proposes a new fault tolerant scheduling algorithm: <i>MaxRe</i>. In the algorithm, we incorporate the reliability analysis into the active replication schema and the theoretical analysis and experiments prove that the <i>MaxRe</i> algorithm’s schedule can certainly satisfy user’s reliability requirements. And the <i>MaxRe</i> scheduling algorithm can achieve the corresponding reliability with at most 70% fewer resources than the FTSA algorithm.</p>

Identificador

http://hdl.handle.net/10536/DRO/DU:30034376

Idioma(s)

eng

Publicador

IEEE

Relação

http://dro.deakin.edu.au/eserv/DU:30034376/xiang-HPCC-evidence-2010.pdf

http://dro.deakin.edu.au/eserv/DU:30034376/xiang-faulttolerant-2010.pdf

http://dx.doi.org/10.1109/HPCC.2010.72

Direitos

2010, IEEE

Palavras-Chave #resource scheduling #fault-tolerance #reliability #heterogeneous system
Tipo

Conference Paper