Improving classification using a Confidence Matrix based on weak classifiers applied to OCR


Autoria(s): Rico Juan, Juan Ramón; Calvo-Zaragoza, Jorge
Contribuinte(s)

Universidad de Alicante. Departamento de Lenguajes y Sistemas Informáticos

Reconocimiento de Formas e Inteligencia Artificial

Data(s)

19/01/2015

19/01/2015

03/03/2015

Resumo

This paper proposes a new feature representation method based on the construction of a Confidence Matrix (CM). This representation consists of posterior probability values provided by several weak classifiers, each one trained and used in different sets of features from the original sample. The CM allows the final classifier to abstract itself from discovering underlying groups of features. In this work the CM is applied to isolated character image recognition, for which several set of features can be extracted from each sample. Experimentation has shown that the use of CM permits a significant improvement in accuracy in most cases, while the others remain the same. The results were obtained after experimenting with four well-known corpora, using evolved meta-classifiers with the k-Nearest Neighbor rule as a weak classifier and by applying statistical significance tests.

This work was partially supported by the Spanish CICyT through the project TIN2013-48152-C2-1-R, the Consejería de Educación de la Comunidad Valenciana through Project PROMETEO/2012/017 and a FPU fellowship (AP2012-0939) from the Spanish Ministerio de Educación Cultura y Deporte.

Identificador

Neurocomputing. 2015, 151(3): 1354-1361. doi:10.1016/j.neucom.2014.10.058

0925-2312 (Print)

1872-8286 (Online)

http://hdl.handle.net/10045/44107

10.1016/j.neucom.2014.10.058

Idioma(s)

eng

Publicador

Elsevier

Relação

http://dx.doi.org/10.1016/j.neucom.2014.10.058

Direitos

info:eu-repo/semantics/openAccess

Palavras-Chave #Confidence Matrix #Posterior probability #Weak classifiers #Feature spaces #Lenguajes y Sistemas Informáticos
Tipo

info:eu-repo/semantics/article