On the suitability of Prototype Selection methods for kNN classification with distributed data


Author(s): Valero Mas, José Javier; Calvo-Zaragoza, Jorge; Rico Juan, Juan Ramón
Contributor(s)

Universidad de Alicante. Departamento de Lenguajes y Sistemas Informáticos

Reconocimiento de Formas e Inteligencia Artificial

Date(s)

16/06/2016

16/06/2016

26/08/2016

Abstract

In the current Information Age, data production and processing demands are ever increasing. This has motivated the appearance of large-scale distributed information. This phenomenon also applies to Pattern Recognition, so that classic and common algorithms, such as the k-Nearest Neighbour classifier, cannot be used. To improve the efficiency of this classifier, Prototype Selection (PS) strategies can be used. Nevertheless, current PS algorithms were not designed to deal with distributed data, and their performance under these conditions is therefore unknown. This work presents an experimental study on a simulated framework in which PS strategies can be compared under classical conditions as well as those expected in distributed scenarios. Our results show a general degradation in performance as conditions approach more realistic scenarios. However, our experiments also show that some methods achieve performance fairly similar to that of the non-distributed scenario. Thus, although there is a clear need to develop specific PS methodologies and algorithms for these situations, the methods that proved more robust under such conditions may be good candidates from which to start.

This work was partially supported by the Spanish Ministerio de Educación, Cultura y Deporte through an FPU Fellowship (AP2012-0939), by the Vicerrectorado de Investigación, Desarrollo e Innovación de la Universidad de Alicante through the FPU program (UAFPU2014-5883), and by the Spanish Ministerio de Economía y Competitividad through Project TIMuL (No. TIN2013-48152-C2-1-R, supported by UE FEDER funds).

Identifier

Neurocomputing. 2016, 203: 150-160. doi:10.1016/j.neucom.2016.04.018

0925-2312 (Print)

1872-8286 (Online)

http://hdl.handle.net/10045/55947

10.1016/j.neucom.2016.04.018

Language(s)

eng

Publisher

Elsevier

Relation

http://dx.doi.org/10.1016/j.neucom.2016.04.018

Rights

© 2016 Elsevier B.V.

info:eu-repo/semantics/openAccess

Keywords

#Prototype Selection #Distributed data #k-Nearest Neighbour #Experimental study #Lenguajes y Sistemas Informáticos

Type

info:eu-repo/semantics/article