StreamCloud: An elastic and scalable data streaming system


Autoria(s): Gulisano, Vincenzo Massimiliano; Jiménez-Peris, Ricardo; Patiño-Martínez, M.; Soriente, Claudio; Valduriez, Patrick
Data(s)

01/12/2012

Resumo

Many applications in several domains such as telecommunications, network security, large scale sensor networks, require online processing of continuous data lows. They produce very high loads that requires aggregating the processing capacity of many nodes. Current Stream Processing Engines do not scale with the input load due to single-node bottlenecks. Additionally, they are based on static con?gurations that lead to either under or over-provisioning. In this paper, we present StreamCloud, a scalable and elastic stream processing engine for processing large data stream volumes. StreamCloud uses a novel parallelization technique that splits queries into subqueries that are allocated to independent sets of nodes in a way that minimizes the distribution overhead. Its elastic protocols exhibit low intrusiveness, enabling effective adjustment of resources to the incoming load. Elasticity is combined with dynamic load balancing to minimize the computational resources used. The paper presents the system design, implementation and a thorough evaluation of the scalability and elasticity of the fully implemented system.

Formato

application/pdf

Identificador

http://oa.upm.es/16848/

Idioma(s)

eng

Publicador

Facultad de Informática (UPM)

Relação

http://oa.upm.es/16848/1/INVE_MEM_2012_137816.pdf

http://www.computer.org/csdl/trans/td/2012/12/ttd2012122351-abs.html

info:eu-repo/grantAgreement/EC/FP7/257475

info:eu-repo/semantics/altIdentifier/doi/10.1109/TPDS.2012.24

Direitos

http://creativecommons.org/licenses/by-nc-nd/3.0/es/

info:eu-repo/semantics/openAccess

Fonte

IEEE Transactions on Parallel And Distributed Systems, ISSN 1045-9219, 2012-12, Vol. 23, No. 12

Palavras-Chave #Informática
Tipo

info:eu-repo/semantics/article

Artículo

PeerReviewed