Automatic design of decision-tree induction algorithms tailored to flexible-receptor docking data


Autoria(s): Barros, Rodrigo Coelho; Winck, Ana T.; Machado, Karina S.; Basgalupp, Marcio P.; Carvalho, André C. P. L. F. de; Ruiz, Duncan D.; Souza, Osmar Norberto de
Contribuinte(s)

UNIVERSIDADE DE SÃO PAULO

Data(s)

24/10/2013

24/10/2013

2012

Resumo

Background: This paper addresses the prediction of the free energy of binding of a drug candidate with enzyme InhA associated with Mycobacterium tuberculosis. This problem is found within rational drug design, where interactions between drug candidates and target proteins are verified through molecular docking simulations. In this application, it is important not only to correctly predict the free energy of binding, but also to provide a comprehensible model that could be validated by a domain specialist. Decision-tree induction algorithms have been successfully used in drug-design related applications, specially considering that decision trees are simple to understand, interpret, and validate. There are several decision-tree induction algorithms available for general-use, but each one has a bias that makes it more suitable for a particular data distribution. In this article, we propose and investigate the automatic design of decision-tree induction algorithms tailored to particular drug-enzyme binding data sets. We investigate the performance of our new method for evaluating binding conformations of different drug candidates to InhA, and we analyze our findings with respect to decision tree accuracy, comprehensibility, and biological relevance. Results: The empirical analysis indicates that our method is capable of automatically generating decision-tree induction algorithms that significantly outperform the traditional C4.5 algorithm with respect to both accuracy and comprehensibility. In addition, we provide the biological interpretation of the rules generated by our approach, reinforcing the importance of comprehensible predictive models in this particular bioinformatics application. Conclusions: We conclude that automatically designing a decision-tree algorithm tailored to molecular docking data is a promising alternative for the prediction of the free energy from the binding of a drug candidate with a flexible-receptor.

Fundacao de Amparo a Pesquisa do Estado de Sao Paulo (FAPESP)

Conselho Nacional de Desenvolvimento Cientifico e Tecnologico (CNPq)

Identificador

BMC BIOINFORMATICS, LONDON, v. 13, 2012

1471-2105

http://www.producao.usp.br/handle/BDPI/35838

10.1186/1471-2105-13-310

http://dx.doi.org/10.1186/1471-2105-13-310

Idioma(s)

eng

Publicador

BIOMED CENTRAL LTD

LONDON

Relação

BMC BIOINFORMATICS

Direitos

openAccess

Copyright BIOMED CENTRAL LTD

Palavras-Chave #RESISTANT MYCOBACTERIUM-TUBERCULOSIS #FUNCTION PREDICTION #MOLECULAR DOCKING #PROTEIN FUNCTION #ENOYL REDUCTASE #DRUG DESIGN #WILD-TYPE #COMPLEX #DISCRETIZATION #FLEXIBILITY #BIOCHEMICAL RESEARCH METHODS #BIOTECHNOLOGY & APPLIED MICROBIOLOGY #MATHEMATICAL & COMPUTATIONAL BIOLOGY
Tipo

article

original article

publishedVersion