Identification of protein coding regions using the modified Gabor-wavelet transform


Autoria(s): MENA-CHALCO, Jesus P.; CARRER, Helaine; ZANA, Yossi; CESAR JR., Roberto M.
Contribuinte(s)

UNIVERSIDADE DE SÃO PAULO

Data(s)

18/10/2012

18/10/2012

2008

Resumo

An important topic in genomic sequence analysis is the identification of protein coding regions. In this context, several coding DNA model-independent methods based on the occurrence of specific patterns of nucleotides at coding regions have been proposed. Nonetheless, these methods have not been completely suitable due to their dependence on an empirically predefined window length required for a local analysis of a DNA region. We introduce a method based on a modified Gabor-wavelet transform (MGWT) for the identification of protein coding regions. This novel transform is tuned to analyze periodic signal components and presents the advantage of being independent of the window length. We compared the performance of the MGWT with other methods by using eukaryote data sets. The results show that MGWT outperforms all assessed model-independent methods with respect to identification accuracy. These results indicate that the source of at least part of the identification errors produced by the previous methods is the fixed working scale. The new method not only avoids this source of errors but also makes a tool available for detailed exploration of the nucleotide occurrence.

Identificador

IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, v.5, n.2, p.198-207, 2008

1545-5963

http://producao.usp.br/handle/BDPI/18910

10.1109/TCBB.2007.70259

http://dx.doi.org/10.1109/TCBB.2007.70259

Idioma(s)

eng

Publicador

IEEE COMPUTER SOC

Relação

Ieee-acm Transactions on Computational Biology and Bioinformatics

Direitos

restrictedAccess

Copyright IEEE COMPUTER SOC

Palavras-Chave #sequence analysis #wavelet transform #coding regions #signal processing #pattern recognition #DNA-SEQUENCES #GENE PREDICTION #3-BASE PERIODICITY #FOURIER-ANALYSIS #NONCODING DNA #BIOINFORMATICS #RECOGNITION #PROGRAMS #Biochemical Research Methods #Computer Science, Interdisciplinary Applications #Mathematics, Interdisciplinary Applications #Statistics & Probability
Tipo

article

original article

publishedVersion