A hybrid approach to learn with imbalanced classes using evolutionary algorithms
Contribuinte(s) |
UNIVERSIDADE DE SÃO PAULO |
---|---|
Data(s) |
20/10/2012
20/10/2012
2011
|
Resumo |
There is an increasing interest in the application of Evolutionary Algorithms (EAs) to induce classification rules. This hybrid approach can benefit areas where classical methods for rule induction have not been very successful. One example is the induction of classification rules in imbalanced domains. Imbalanced data occur when one or more classes heavily outnumber other classes. Frequently, classical machine learning (ML) classifiers are not able to learn in the presence of imbalanced data sets, inducing classification models that always predict the most numerous classes. In this work, we propose a novel hybrid approach to deal with this problem. We create several balanced data sets with all minority class cases and a random sample of majority class cases. These balanced data sets are fed to classical ML systems that produce rule sets. The rule sets are combined creating a pool of rules and an EA is used to build a classifier from this pool of rules. This hybrid approach has some advantages over undersampling, since it reduces the amount of discarded information, and some advantages over oversampling, since it avoids overfitting. The proposed approach was experimentally analysed and the experimental results show an improvement in the classification performance measured as the area under the receiver operating characteristics (ROC) curve. FAPESP Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP) Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq) CNPq |
Identificador |
LOGIC JOURNAL OF THE IGPL, v.19, n.2, p.293-303, 2011 1367-0751 http://producao.usp.br/handle/BDPI/28803 10.1093/jigpal/jzq027 |
Idioma(s) |
eng |
Publicador |
OXFORD UNIV PRESS |
Relação |
Logic Journal of the Igpl |
Direitos |
restrictedAccess Copyright OXFORD UNIV PRESS |
Palavras-Chave | #Evolutionary algorithms #treatment of imbalanced classes #CLASSIFIERS |
Tipo |
article original article publishedVersion |