Sequence-specific reconstruction from fragmentary databases using seed sequences: implementation and validation on SAGE, proteome and generic sequencing data
Contribuinte(s) |
UNIVERSIDADE DE SÃO PAULO |
---|---|
Data(s) |
20/10/2012
20/10/2012
2008
|
Resumo |
Motivation: DNA assembly programs classically perform an all-against-all comparison of reads to identify overlaps, followed by a multiple sequence alignment and generation of a consensus sequence. If the aim is to assemble a particular segment, instead of a whole genome or transcriptome, a target-specific assembly is a more sensible approach. GenSeed is a Perl program that implements a seed-driven recursive assembly consisting of cycles comprising a similarity search, read selection and assembly. The iterative process results in a progressive extension of the original seed sequence. GenSeed was tested and validated on many applications, including the reconstruction of nuclear genes or segments, full-length transcripts, and extrachromosomal genomes. The robustness of the method was confirmed through the use of a variety of DNA and protein seeds, including short sequences derived from SAGE and proteome projects. |
Identificador |
BIOINFORMATICS, v.24, n.15, p.1676-1680, 2008 1367-4803 http://producao.usp.br/handle/BDPI/28564 10.1093/bioinformatics/btn283 |
Idioma(s) |
eng |
Publicador |
OXFORD UNIV PRESS |
Relação |
Bioinformatics |
Direitos |
restrictedAccess Copyright OXFORD UNIV PRESS |
Palavras-Chave | #TOXOPLASMA-GONDII #Biochemical Research Methods #Biotechnology & Applied Microbiology #Computer Science, Interdisciplinary Applications #Mathematical & Computational Biology #Statistics & Probability |
Tipo |
article original article publishedVersion |