Collaborative data stream mining in ubiquitous environments using dynamic classifier selection


Autoria(s): Bártolo Gomes, Joao Paulo; Medhat Gaber, Mohamed; Sousa, Pedro; Menasalvas Ruiz, Ernestina
Data(s)

2013

Resumo

In ubiquitous data stream mining applications, different devices often aim to learn concepts that are similar to some extent. In these applications, such as spam filtering or news recommendation, the data stream underlying concept (e.g., interesting mail/news) is likely to change over time. Therefore, the resultant model must be continuously adapted to such changes. This paper presents a novel Collaborative Data Stream Mining (Coll-Stream) approach that explores the similarities in the knowledge available from other devices to improve local classification accuracy. Coll-Stream integrates the community knowledge using an ensemble method where the classifiers are selected and weighted based on their local accuracy for different partitions of the feature space. We evaluate Coll-Stream classification accuracy in situations with concept drift, noise, partition granularity and concept similarity in relation to the local underlying concept. The experimental results show that Coll-Stream resultant model achieves stability and accuracy in a variety of situations using both synthetic and real world datasets.

Formato

application/pdf

Identificador

http://oa.upm.es/19176/

Idioma(s)

eng

Publicador

Facultad de Informática (UPM)

Relação

http://oa.upm.es/19176/1/INVE_MEM_2013_142697.pdf

http://www.worldscientific.com/worldscinet/ijitdm

Direitos

http://creativecommons.org/licenses/by-nc-nd/3.0/es/

info:eu-repo/semantics/openAccess

Fonte

International Journal of Information Technology & Decision Making, ISSN 0219-6220, 2013, Vol. 12

Palavras-Chave #Informática
Tipo

info:eu-repo/semantics/article

Artículo

PeerReviewed