Realizing Accelerated Cost-Effective Distributed RAID


Author(s): Khasymski, Aleksandr; Rafique, M. Mustafa; Butt, Ali R.; Vazhkudai, Sudharshan S.; Nikolopoulos, Dimitrios S.
Contributor(s)

Khan, Samee Ullah

Zomaya, Albert Y.

Date(s)

2015

Abstract

The exponential growth in user and application data entails new means for providing fault tolerance and protection against data loss. High Performance Computing (HPC) storage systems, which are at the forefront of handling the data deluge, typically employ hardware RAID at the backend. However, such solutions are costly, do not ensure end-to-end data integrity, and can become a bottleneck during data reconstruction. In this paper, we design an innovative solution to achieve a flexible, fault-tolerant, and high-performance RAID-6 solution for a parallel file system (PFS). Our system utilizes low-cost, strategically placed GPUs, both on the client and server sides, to accelerate parity computation. In contrast to hardware-based approaches, we provide full control over the size, length and location of a RAID array on a per-file basis, end-to-end data integrity checking, and parallelization of RAID array reconstruction. We have deployed our system in conjunction with the widely used Lustre PFS, and show that our approach is feasible and imposes acceptable overhead.

Identifier

http://pure.qub.ac.uk/portal/en/publications/realizing-accelerated-costeffective-distributed-raid(29fc0b6b-2563-4254-b10c-d781487186a0).html

Language(s)

eng

Publisher

Springer

Rights

info:eu-repo/semantics/closedAccess

Source

Khasymski, A, Rafique, M M, Butt, A R, Vazhkudai, S S & Nikolopoulos, D S 2015, Realizing Accelerated Cost-Effective Distributed RAID. in S U Khan & A Y Zomaya (eds), Handbook on Data Centers. Springer, pp. 729-753.

Type

contributionToPeriodical