Reliability-aware distributed computing scheduling policy


Autoria(s): Abawajy, Jemal; Hassan, Mohammad Mehedi
Contribuinte(s)

Wang, Guojun

Zomaya, Albert

Perez, Gregorio Martinez

Li, Kenli

Data(s)

01/01/2015

Resumo

One of the primary issues associated with the efficient and effective utilization of distributed computing is resource management and scheduling. As distributed computing resource failure is a common occurrence, the issue of deploying support for integrated scheduling and fault-tolerant approaches becomes paramount importance. To this end, we propose a fault-tolerant dynamic scheduling policy that loosely couples dynamic job scheduling with job replication scheme such that jobs are efficiently and reliably executed. The novelty of the proposed algorithm is that it uses passive replication approach under high system load and active replication approach under low system loads. The switch between these two replication methods is also done dynamically and transparently. Performance evaluation of the proposed fault-tolerant scheduler and a comparison with similar fault-tolerant scheduling policy is presented and shown that the proposed policy performs better than the existing approach.

Identificador

http://hdl.handle.net/10536/DRO/DU:30084095

Idioma(s)

eng

Publicador

Springer International Publishing

Relação

http://www.dx.doi.org/10.1007/978-3-319-27161-3_57

Direitos

2015, Springer

Palavras-Chave #cloud computing #job scheduling #fault-tolerance #replication #performances
Tipo

Book Chapter