Towards a parallel computationally efficient approach to scaling up data stream classification


Autoria(s): Tennant, Mark; Stahl, Frederic; Di Fatta, Giuseppe; Gomes, João Bártolo
Data(s)

2014

Resumo

Advances in hardware technologies allow to capture and process data in real-time and the resulting high throughput data streams require novel data mining approaches. The research area of Data Stream Mining (DSM) is developing data mining algorithms that allow us to analyse these continuous streams of data in real-time. The creation and real-time adaption of classification models from data streams is one of the most challenging DSM tasks. Current classifiers for streaming data address this problem by using incremental learning algorithms. However, even so these algorithms are fast, they are challenged by high velocity data streams, where data instances are incoming at a fast rate. This is problematic if the applications desire that there is no or only a very little delay between changes in the patterns of the stream and absorption of these patterns by the classifier. Problems of scalability to Big Data of traditional data mining algorithms for static (non streaming) datasets have been addressed through the development of parallel classifiers. However, there is very little work on the parallelisation of data stream classification techniques. In this paper we investigate K-Nearest Neighbours (KNN) as the basis for a real-time adaptive and parallel methodology for scalable data stream classification tasks.

Formato

text

Identificador

http://centaur.reading.ac.uk/38837/1/Paper263.pdf

Tennant, M., Stahl, F. <http://centaur.reading.ac.uk/view/creators/90005065.html>, Di Fatta, G. <http://centaur.reading.ac.uk/view/creators/90000558.html> and Gomes, J. B. (2014) Towards a parallel computationally efficient approach to scaling up data stream classification. In: Thirty-fourth SGAI International Conference on Artificial Intelligence, 9-11 Dec 2014, Cambridge, England, pp. 51-65.

Idioma(s)

en

Publicador

Springer International Publishing

Relação

http://centaur.reading.ac.uk/38837/

creatorInternal Stahl, Frederic

creatorInternal Di Fatta, Giuseppe

http://dx.doi.org/10.1007/978-3-319-12069-0_4

Tipo

Conference or Workshop Item

PeerReviewed