Identification of human chromosome 22 transcribed sequences with ORF expressed sequence tags


Autoria(s): de Souza, S. J.; Camargo, A. A.; Briones, MRS; Costa, F. F.; Nagai, M. A.; Verjovski-Almeida, S.; Zago, M. A.; Andrade, LEC; Carrer, H.; El-Dorry, HFA; Espreafico, E. M.; Habr-Gama, A.; Giannella-Neto, D.; Goldman, G. H.; Gruber, A.; Hackel, C.; Kimura, E. T.; Maciel, RMB; Marie, SKN; Martins, EAL; Nobrega, M. P.; Paco-Larson, M. L.; Pardini, MIMC; Pereira, G. G.; Pesquero, J. B.; Rodrigues, V; Rogatto, Silvia Regina; da Silva, IDCG; Sogayar, M. C.; Sonati, M. D.; Tajara, E. H.; Valentini, SR; Acencio, M.; Alberto, F. L.; Amaral, MEJ; Aneas, I; Bengtson, M. H.; Carraro, D. M.; Carvalho, A. F.; Carvalho, L. H.; Cerutti, J. M.; Correa, MLC; Costa, MCR; Curcio, C.; Gushiken, T.; Ho, P. L.; Kimura, E.; Leite, LCC; Maia, G.; Majumder, P.; Marins, M.; Matsukuma, A.; Melo, ASA; Mestriner, C. A.; Miracca, E. C.; Miranda, D. C.; Nascimento, ALTO; Nobrega, F. G.; Ojopi, EPB; Pandolfi, JRC; Pessoa, L. G.; Rahal, Paula; Rainho, C. A.; da Ro's, N.; de Sa, R. G.; Sales, M. M.; da Silva, N. P.; Silva, T. C.; da Silva, W.; Simao, D. F.; Sousa, J. F.; Stecconi, D.; Tsukumo, F.; Valente, V; Zalcberg, H.; Bretani, R. R.; Reis, LFL; Dias-Neto, E.; Simpson, AJG
Contribuinte(s)

Universidade Estadual Paulista (UNESP)

Data(s)

20/05/2014

20/05/2014

07/11/2000

Resumo

Transcribed sequences in the human genome can be identified with confidence only by alignment with sequences derived from cDNAs synthesized from naturally occurring mRNAs. We constructed a set of 250,000 cDNAs that represent partial expressed gene sequences and that are biased toward the central coding regions of the resulting transcripts. They are termed ORF expressed sequence tags (ORESTES). The 250,000 ORESTEs were assembled into 81,429 contigs. of these, 1,181 (1.45%) were found to match sequences in chromosome 22 with at least one ORESTES contig for 162 (65.6%) of the 247 known genes, for 67 (44.6%) of the 150 related genes, and for 45 of the 148 (30.4%) EST-predicted genes on this chromosome. Using a set of stringent criteria to validate our sequences, we identified a further 219 previously unannotated transcribed sequences on chromosome 22. of these, 171 were in fact also defined by EST or full length cDNA sequences available in GenBank but not utilized in the initial annotation of the first human chromosome sequence. Thus despite representing less than 15% of all expressed human sequences in the public databases at the time of the present analysis, ORESTEs sequences defined 48 transcribed sequences on chromosome 22 not defined by other sequences. All of the transcribed sequences defined by ORESTEs coincided with DNA regions predicted as encoding exons by GENSCAN.

Formato

12690-12693

Identificador

http://dx.doi.org/10.1073/pnas.97.23.12690

Proceedings of the National Academy of Sciences of the United States of America. Washington: Natl Acad Sciences, v. 97, n. 23, p. 12690-12693, 2000.

0027-8424

http://hdl.handle.net/11449/17867

10.1073/pnas.97.23.12690

WOS:000165225800062

Idioma(s)

eng

Publicador

Natl Acad Sciences

Relação

Proceedings of the National Academy of Sciences of the United States of America

Direitos

closedAccess

Tipo

info:eu-repo/semantics/article