Evolutionary model trees for handling continuous classes in machine learning


Author(s): BARROS, Rodrigo C.; RUIZ, Duncan D.; BASGALUPP, Marcio P.
Contributor(s)

UNIVERSIDADE DE SÃO PAULO

Date(s)

20/10/2012

2011

Abstract

Model trees are a particular case of decision trees employed to solve regression problems. They have the advantage of presenting an interpretable output, helping the end-user to get more confidence in the prediction and providing the basis for the end-user to have new insight about the data, confirming or rejecting hypotheses previously formed. Moreover, model trees present an acceptable level of predictive performance in comparison to most techniques used for solving regression problems. Since generating the optimal model tree is an NP-Complete problem, traditional model tree induction algorithms make use of a greedy top-down divide-and-conquer strategy, which may not converge to the global optimal solution. In this paper, we propose a novel algorithm based on the use of the evolutionary algorithms paradigm as an alternate heuristic to generate model trees in order to improve the convergence to globally near-optimal solutions. We call our new approach evolutionary model tree induction (E-Motion). We test its predictive performance using public UCI data sets, and we compare the results to traditional greedy regression/model trees induction algorithms, as well as to other evolutionary approaches. Results show that our method presents a good trade-off between predictive performance and model comprehensibility, which may be crucial in many machine learning applications. (C) 2010 Elsevier Inc. All rights reserved.

Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)

European Research Consortium for Informatics and Mathematics (ERCIM)

Identifier

INFORMATION SCIENCES, v.181, n.5, p.954-971, 2011

0020-0255

http://producao.usp.br/handle/BDPI/29004

10.1016/j.ins.2010.11.010

http://dx.doi.org/10.1016/j.ins.2010.11.010

Language(s)

eng

Publisher

ELSEVIER SCIENCE INC

Relation

Information Sciences

Rights

restrictedAccess

Copyright ELSEVIER SCIENCE INC

Keywords

Evolutionary algorithms; Genetic programming; Model trees; Continuous classes; Machine learning; Regression; Optimization; Selection; Target; Computer Science, Information Systems

Type

article

original article

publishedVersion