Automatic classification of enzyme family in protein annotation


Autoria(s): Dos Santos, Cássia T.; Bazzan, Ana L. C.; Lemke, Ney
Contribuinte(s)

Universidade Estadual Paulista (UNESP)

Data(s)

27/05/2014

27/05/2014

14/09/2009

Resumo

Most of the tasks in genome annotation can be at least partially automated. Since this annotation is time-consuming, facilitating some parts of the process - thus freeing the specialist to carry out more valuable tasks - has been the motivation of many tools and annotation environments. In particular, annotation of protein function can benefit from knowledge about enzymatic processes. The use of sequence homology alone is not a good approach to derive this knowledge when there are only a few homologues of the sequence to be annotated. The alternative is to use motifs. This paper uses a symbolic machine learning approach to derive rules for the classification of enzymes according to the Enzyme Commission (EC). Our results show that, for the top class, the average global classification error is 3.13%. Our technique also produces a set of rules relating structural to functional information, which is important to understand the protein tridimensional structure and determine its biological function. © 2009 Springer Berlin Heidelberg.

Formato

86-96

Identificador

http://dx.doi.org/10.1007/978-3-642-03223-3_8

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), v. 5676 LNBI, p. 86-96.

0302-9743

1611-3349

http://hdl.handle.net/11449/71147

10.1007/978-3-642-03223-3_8

2-s2.0-69949190117

Idioma(s)

eng

Relação

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

Direitos

closedAccess

Palavras-Chave #Automatic classification #Biological functions #Classification errors #Enzymatic process #Enzyme commissions #Functional information #Genome annotation #Protein annotation #Protein functions #Sequence homology #Set of rules #Symbolic machine learning #Tri-dimensional structure #Automatic indexing #Biology #Enzymes #Bioinformatics
Tipo

info:eu-repo/semantics/conferencePaper