Streaming Support for Data Intensive Cloud-Based Sequence Analysis


Author(s): Issa, Shadi A.; Kienzler, Romeo; El-Kalioby, Mohamed; Tonellato, Peter J.; Wall, Dennis; Bruggmann, Rémy; Abouelhoda, Mohamed
Date(s)

24/04/2013

Abstract

Cloud computing provides a promising solution to the genomics data deluge problem resulting from the advent of next-generation sequencing (NGS) technology. Based on the concepts of “resources-on-demand” and “pay-as-you-go”, scientists with no or limited infrastructure can have access to scalable and cost-effective computational resources. However, the large size of NGS data causes a significant data transfer latency from the client’s site to the cloud, which presents a bottleneck for using cloud computing services. In this paper, we provide a streaming-based scheme to overcome this problem, where the NGS data is processed while being transferred to the cloud. Our scheme targets the wide class of NGS data analysis tasks, where the NGS sequences can be processed independently from one another. We also provide the elastream package that supports the use of this scheme with individual analysis programs or with workflow systems. Experiments presented in this paper show that our solution mitigates the effect of data transfer latency and saves both time and cost of computation.
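The idea of processing reads while they are still arriving can be illustrated with a minimal sketch. It is not the elastream package or its API; it only assumes that the input is four-line FASTQ text delivered in arbitrary byte chunks, and that each read can be handled independently, as the abstract describes. The names `fastq_records`, `gc_fraction`, and `process_stream` are hypothetical, and GC content stands in for whatever per-read analysis is actually run.

```python
from typing import Iterable, Iterator, List, Tuple


def fastq_records(chunks: Iterable[bytes]) -> Iterator[Tuple[str, str, str]]:
    """Yield (read_id, sequence, quality) as soon as a full record has arrived.

    Assumption: strict four-line FASTQ records (header, sequence, '+', quality).
    """
    buffer = b""            # bytes received so far that do not yet end in a newline
    lines: List[str] = []   # complete lines of the record currently being assembled

    def emit(raw: bytes) -> Iterator[Tuple[str, str, str]]:
        lines.append(raw.decode("ascii"))
        if len(lines) == 4:
            header, seq, _plus, qual = lines
            lines.clear()
            yield header[1:], seq, qual  # drop the leading '@'

    for chunk in chunks:
        buffer += chunk
        # Everything before the last newline is complete; the tail stays buffered.
        *complete, buffer = buffer.split(b"\n")
        for raw in complete:
            yield from emit(raw)

    # Handle a final line that was not terminated by a newline.
    if buffer:
        yield from emit(buffer)


def gc_fraction(seq: str) -> float:
    """Placeholder per-read analysis: fraction of G/C bases in one read."""
    return sum(base in "GC" for base in seq) / len(seq) if seq else 0.0


def process_stream(chunks: Iterable[bytes]) -> List[Tuple[str, float]]:
    """Analyse each read as it becomes available instead of waiting for the whole file."""
    return [(read_id, gc_fraction(seq)) for read_id, seq, _ in fastq_records(chunks)]
```

For example, `process_stream([b"@r1\nGG", b"CC\n+\nIIII\n"])` analyses read `r1` even though its sequence line was split across two chunks. In a real deployment the chunks would come from a network socket or upload stream rather than an in-memory list.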

Format

application/pdf

Identifier

http://boris.unibe.ch/47842/1/791051.pdf

Issa, Shadi A.; Kienzler, Romeo; El-Kalioby, Mohamed; Tonellato, Peter J.; Wall, Dennis; Bruggmann, Rémy; Abouelhoda, Mohamed (2013). Streaming Support for Data Intensive Cloud-Based Sequence Analysis. BioMed research international, 2013(8), pp. 1-16. Hindawi Publishing Corporation 10.1155/2013/791051 <http://dx.doi.org/10.1155/2013/791051>

doi:10.7892/boris.47842

info:doi:10.1155/2013/791051

urn:issn:2314-6133

Language(s)

eng

Publisher

Hindawi Publishing Corporation

Relation

http://boris.unibe.ch/47842/

Rights

info:eu-repo/semantics/openAccess

Source

Issa, Shadi A.; Kienzler, Romeo; El-Kalioby, Mohamed; Tonellato, Peter J.; Wall, Dennis; Bruggmann, Rémy; Abouelhoda, Mohamed (2013). Streaming Support for Data Intensive Cloud-Based Sequence Analysis. BioMed research international, 2013(8), pp. 1-16. Hindawi Publishing Corporation 10.1155/2013/791051 <http://dx.doi.org/10.1155/2013/791051>

Keywords

#570 Life sciences; biology

Type

info:eu-repo/semantics/article

info:eu-repo/semantics/publishedVersion

PeerReviewed