Construction of representative transcript and protein sets of human, mouse, and rat as a platform for their transcriptome and proteome analysis


Author(s): Kasukawa, T.; Katayama, S.; Kawaji, H.; Suzuki, H.; Hume, D. A.; Hayashizaki, Y.
Date(s)

01/01/2004

Abstract

The number of mammalian transcripts identified by full-length cDNA projects and genome sequencing projects is increasing remarkably. Clustering them into a strictly nonredundant and comprehensive set provides a platform for functional analysis of the transcriptome and proteome, but the quality of the clustering and predictive usefulness have previously required manual curation to identify truncated transcripts and inappropriate clustering of closely related sequences. A Representative Transcript and Protein Sets (RTPS) pipeline was previously designed to identify the nonredundant and comprehensive set of mouse transcripts based on clustering of a large mouse full-length cDNA set (FANTOM2). Here we propose an alternative method that is more robust, requires less manual curation, and is applicable to other organisms in addition to mouse. RTPSs of human, mouse, and rat have been produced by this method and used for validation. Their comprehensiveness and quality are discussed by comparison with other clustering approaches. The RTPSs are available at ftp://fantom2.gsc.riken.go.jp/RTPS/. © 2004 Elsevier Inc. All rights reserved.

Identifier

http://espace.library.uq.edu.au/view/UQ:73595

Language(s)

eng

Publisher

Academic Press

Keywords

#Biotechnology & Applied Microbiology #Genetics & Heredity #Transcriptome #Transcript #Splicing Variant #Mammal #Clustering #Full-length Human #Noncoding RNAs #Genome #Identification #Annotation #Sequences #cDNAs #C1 #270202 Genome Structure #780106 Political science and public policy

Type

Journal Article