3 resultados para Multi objective evolutionary algorithms
em National Center for Biotechnology Information - NCBI
Resumo:
Pairwise sequence comparison methods have been assessed using proteins whose relationships are known reliably from their structures and functions, as described in the scop database [Murzin, A. G., Brenner, S. E., Hubbard, T. & Chothia C. (1995) J. Mol. Biol. 247, 536–540]. The evaluation tested the programs blast [Altschul, S. F., Gish, W., Miller, W., Myers, E. W. & Lipman, D. J. (1990). J. Mol. Biol. 215, 403–410], wu-blast2 [Altschul, S. F. & Gish, W. (1996) Methods Enzymol. 266, 460–480], fasta [Pearson, W. R. & Lipman, D. J. (1988) Proc. Natl. Acad. Sci. USA 85, 2444–2448], and ssearch [Smith, T. F. & Waterman, M. S. (1981) J. Mol. Biol. 147, 195–197] and their scoring schemes. The error rate of all algorithms is greatly reduced by using statistical scores to evaluate matches rather than percentage identity or raw scores. The E-value statistical scores of ssearch and fasta are reliable: the number of false positives found in our tests agrees well with the scores reported. However, the P-values reported by blast and wu-blast2 exaggerate significance by orders of magnitude. ssearch, fasta ktup = 1, and wu-blast2 perform best, and they are capable of detecting almost all relationships between proteins whose sequence identities are >30%. For more distantly related proteins, they do much less well; only one-half of the relationships between proteins with 20–30% identity are found. Because many homologs have low sequence similarity, most distant relationships cannot be detected by any pairwise comparison method; however, those which are identified may be used with confidence.
Resumo:
The recent sequencing of several complete genomes has made it possible to track the evolution of large gene families by their genomic structure. Following the large-scale association of exons encoding domains with well defined functions in invertebrates could be useful in predicting the function of complex multidomain proteins in mammals produced by accretion of domains. With this objective, we have determined the genomic structure of the 14 genes in invertebrates and vertebrates that contain rel domains. The sequence encoding the rel domain is defined by intronic boundaries and has been recombined with at least three structurally and functionally distinct genomic sequences to generate coding sequences for: (i) the rel/Dorsal/NFκB proteins that are retained in the cytoplasm by IkB-like proteins; (ii) the NFATc proteins that sense calcium signals and undergo cytoplasmic-to-nuclear translocation in response to dephosphorylation by calcineurin; and (iii) the TonEBP tonicity-responsive proteins. Remarkably, a single exon in each NFATc family member encodes the entire Ca2+/calcineurin sensing region, including nuclear import/export, calcineurin-binding, and substrate regions. The Rel/Dorsal proteins and the TonEBP proteins are present in Drosophila but not Caenorhabditis elegans. On the other hand, the calcium-responsive NFATc proteins are present only in vertebrates, suggesting that the NFATc family is dedicated to functions specific to vertebrates such as a recombinational immune response, cardiovascular development, and vertebrate-specific aspects of the development and function of the nervous system.
Resumo:
The reconstruction of multitaxon trees from molecular sequences is confounded by the variety of algorithms and criteria used to evaluate trees, making it difficult to compare the results of different analyses. A global method of multitaxon phylogenetic reconstruction described here, Bootstrappers Gambit, can be used with any four-taxon algorithm, including distance, maximum likelihood, and parsimony methods. It incorporates a Bayesian-Jeffreys'-bootstrap analysis to provide a uniform probability-based criterion for comparing the results from diverse algorithms. To examine the usefulness of the method, the origin of the eukaryotes has been investigated by the analysis of ribosomal small subunit RNA sequences. Three common algorithms (paralinear distances, Jukes-Cantor distances, and Kimura distances) support the eocyte topology, whereas one (maximum parsimony) supports the archaebacterial topology, suggesting that the eocyte prokaryotes are the closest prokaryotic relatives of the eukaryotes.