The applicability of recurrent neural networks for biological sequence analysis


Autoria(s): Hawkins, J.; Boden, M.
Contribuinte(s)

D. Gusfidd

C. X. Ling

W. S. Noble

Q. Yang

Data(s)

01/01/2005

Resumo

Selection of machine learning techniques requires a certain sensitivity to the requirements of the problem. In particular, the problem can be made more tractable by deliberately using algorithms that are biased toward solutions of the requisite kind. In this paper, we argue that recurrent neural networks have a natural bias toward a problem domain of which biological sequence analysis tasks are a subset. We use experiments with synthetic data to illustrate this bias. We then demonstrate that this bias can be exploitable using a data set of protein sequences containing several classes of subcellular localization targeting peptides. The results show that, compared with feed forward, recurrent neural networks will generally perform better on sequence analysis tasks. Furthermore, as the patterns within the sequence become more ambiguous, the choice of specific recurrent architecture becomes more critical.

Identificador

http://espace.library.uq.edu.au/view/UQ:77510

Idioma(s)

eng

Publicador

IEEE Computer Society

Palavras-Chave #No Category #Machine Learning #Neural Network Architecture #Recurrent Neural Network #Bias #Biological Sequence Analysis #Motif #Subcellular Localization #Pattern Recognition #Classifier Design #Protein Secondary Structure #Architectural Bias #Prediction #C1 #280207 Pattern Recognition #780101 Mathematical sciences
Tipo

Journal Article