A hybrid approach to learn with imbalanced classes using evolutionary algorithms


Autoria(s): MILARE, Claudia R.; BATISTA, Gustavo E. A. P. A.; CARVALHO, Andre C. P. L. F.
Contribuinte(s)

UNIVERSIDADE DE SÃO PAULO

Data(s)

20/10/2012

20/10/2012

2011

Resumo

There is an increasing interest in the application of Evolutionary Algorithms (EAs) to induce classification rules. This hybrid approach can benefit areas where classical methods for rule induction have not been very successful. One example is the induction of classification rules in imbalanced domains. Imbalanced data occur when one or more classes heavily outnumber other classes. Frequently, classical machine learning (ML) classifiers are not able to learn in the presence of imbalanced data sets, inducing classification models that always predict the most numerous classes. In this work, we propose a novel hybrid approach to deal with this problem. We create several balanced data sets with all minority class cases and a random sample of majority class cases. These balanced data sets are fed to classical ML systems that produce rule sets. The rule sets are combined creating a pool of rules and an EA is used to build a classifier from this pool of rules. This hybrid approach has some advantages over undersampling, since it reduces the amount of discarded information, and some advantages over oversampling, since it avoids overfitting. The proposed approach was experimentally analysed and the experimental results show an improvement in the classification performance measured as the area under the receiver operating characteristics (ROC) curve.

FAPESP

Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)

Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq)

CNPq

Identificador

LOGIC JOURNAL OF THE IGPL, v.19, n.2, p.293-303, 2011

1367-0751

http://producao.usp.br/handle/BDPI/28803

10.1093/jigpal/jzq027

http://dx.doi.org/10.1093/jigpal/jzq027

Idioma(s)

eng

Publicador

OXFORD UNIV PRESS

Relação

Logic Journal of the Igpl

Direitos

restrictedAccess

Copyright OXFORD UNIV PRESS

Palavras-Chave #Evolutionary algorithms #treatment of imbalanced classes #CLASSIFIERS
Tipo

article

original article

publishedVersion