952 results for Vertebrate Genomes
Abstract:
The global amino acid compositions as deduced from the complete genomic sequences of six thermophilic archaea, two thermophilic bacteria, 17 mesophilic bacteria and two eukaryotic species were analysed by hierarchical clustering and principal components analysis. Both methods showed an influence of several factors on amino acid composition. Although GC content has a dominant effect, thermophilic species can be identified by their global amino acid compositions alone. This study presents a careful statistical analysis of factors that affect amino acid composition and also yields specific features of the average amino acid composition of thermophilic species. Moreover, we introduce the first example of a ‘compositional tree’ of species that takes into account not only homologous proteins, but also proteins unique to particular species. We expect this simple yet novel approach to be a useful additional tool for the study of phylogeny at the genome level.
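As a rough illustration of the kind of input such an analysis starts from, the sketch below computes a global amino acid composition vector per species and the pairwise Euclidean distances between those vectors, which is one possible input to hierarchical clustering. The species names, the toy sequences, and the choice of Euclidean distance are assumptions made for illustration only; the abstract does not state which distance measure or software was used.

```python
from collections import Counter
from math import sqrt

# The 20 standard amino acids, in a fixed order so vectors are comparable.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def composition(proteins):
    """Return the global amino acid composition of a set of protein sequences.

    The result is a list of 20 fractions (one per amino acid) that sum to 1.
    Characters outside the 20 standard amino acids are ignored.
    """
    counts = Counter()
    for seq in proteins:
        counts.update(aa for aa in seq.upper() if aa in AMINO_ACIDS)
    total = sum(counts.values())
    return [counts[aa] / total for aa in AMINO_ACIDS]


def distance(a, b):
    """Euclidean distance between two composition vectors (an assumed metric)."""
    return sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


# Hypothetical toy proteomes; real input would be every predicted protein
# from each complete genome.
proteomes = {
    "species_A": ["MKKLEEV", "MEEKRIV"],
    "species_B": ["MKKLEEV", "MEDKRIV"],
    "species_C": ["MQQSTNA", "MSTQNAH"],
}

vectors = {name: composition(seqs) for name, seqs in proteomes.items()}
names = sorted(vectors)

# Pairwise distance matrix; a clustering routine would take this as input.
matrix = {(a, b): distance(vectors[a], vectors[b]) for a in names for b in names}

for a in names:
    print(a, [round(matrix[(a, b)], 3) for b in names])
```

Because every protein contributes to the composition vector, species-specific proteins are included alongside homologous ones, which is the property the abstract highlights for its compositional tree.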
Abstract:
The evolution of novelty in tightly integrated biological systems, such as hormones and their receptors, seems to challenge the theory of natural selection: it has not been clear how a new function for any one part (such as a ligand) can be selected for unless the other members of the system (e.g., a receptor) are already present. Here I show, based on identification and phylogenetic analysis of steroid receptors in basal vertebrates and reconstruction of the sequences and functional attributes of ancestral proteins, that the first steroid receptor was an estrogen receptor, followed by a progesterone receptor. Genome mapping and phylogenetic analyses indicate that the full complement of mammalian steroid receptors evolved from these ancient receptors by two large-scale genome expansions, one before the advent of jawed vertebrates and one after. Specific regulation of physiological processes by androgens and corticoids is a relatively recent innovation that emerged after these duplications. These findings support a model of ligand exploitation in which the terminal ligand in a biosynthetic pathway is the first for which a receptor evolves; selection for this hormone also selects for the synthesis of intermediates despite the absence of receptors, and duplicated receptors then evolve affinity for these substances. In this way, novel hormone-receptor pairs are created, and an integrated system of increasing complexity is elaborated. This model suggests that ligands for some “orphan” receptors may be found among intermediates in the synthesis of ligands for phylogenetically related receptors.
Abstract:
Understanding the factors responsible for variations in mutation patterns and selection efficacy along chromosomes is a prerequisite for deciphering genome sequences. Population genetics models predict a positive correlation between the efficacy of selection at a given locus and the local rate of recombination because of Hill–Robertson effects. Codon usage is considered one of the most striking examples that support this prediction at the molecular level. In a wide range of species including Caenorhabditis elegans and Drosophila melanogaster, codon usage is essentially shaped by selection acting for translational efficiency. Codon usage bias correlates positively with recombination rate in Drosophila, apparently supporting the hypothesis that selection on codon usage is improved by recombination. Here we present an exhaustive analysis of codon usage in C. elegans and D. melanogaster complete genomes. We show that in both genomes there is a positive correlation between recombination rate and the frequency of optimal codons. However, we demonstrate that in both species, this effect is due to a mutational bias toward G and C bases in regions of high recombination rate, possibly as a direct consequence of the recombination process. The correlation between codon usage bias and recombination rate in these species appears to be essentially determined by recombination-dependent mutational patterns, rather than selective effects. This result highlights that it is necessary to take into account the mutagenic effect of recombination to understand the evolutionary role and impact of recombination.
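The two quantities at the centre of this argument can be sketched for a single coding sequence: the frequency of optimal codons (Fop) and the G+C content at third codon positions (GC3). The set of optimal codons and the toy gene below are hypothetical placeholders, not values taken from the study. Excluding the Met, Trp and stop codons from Fop follows the common convention and is likewise an assumption about how the authors computed it.

```python
# Codons that carry no synonymous choice (Met, Trp) and stop codons are
# conventionally left out of Fop.
IGNORED = frozenset({"ATG", "TGG", "TAA", "TAG", "TGA"})


def codons(cds):
    """Split a coding sequence into complete codons, dropping any trailing bases."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    return [cds[i:i + 3] for i in range(0, usable, 3)]


def gc3(cds):
    """Fraction of codons whose third base is G or C."""
    cs = codons(cds)
    return sum(c[2] in "GC" for c in cs) / len(cs)


def fop(cds, optimal):
    """Frequency of optimal codons among the codons that are scored."""
    scored = [c for c in codons(cds) if c not in IGNORED]
    return sum(c in optimal for c in scored) / len(scored)


# Hypothetical optimal codons for illustration; real sets are species-specific.
HYPOTHETICAL_OPTIMAL = {"CTG", "GCC", "AAG"}

# Toy gene: ATG CTG GCC AAG TTA GCT AAA TAA
gene = "ATGCTGGCCAAGTTAGCTAAATAA"

print("GC3:", gc3(gene))
print("Fop:", fop(gene, HYPOTHETICAL_OPTIMAL))
```

In this toy set every optimal codon ends in G or C, so any mutational bias that raises GC3 also raises Fop without any selection on translation. That is the confounding effect the abstract attributes to recombination-associated mutation.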
Abstract:
The recent sequencing of several complete genomes has made it possible to track the evolution of large gene families by their genomic structure. Following the large-scale association of exons encoding domains with well-defined functions in invertebrates could be useful in predicting the function of complex multidomain proteins in mammals produced by accretion of domains. With this objective, we have determined the genomic structure of the 14 genes in invertebrates and vertebrates that contain rel domains. The sequence encoding the rel domain is defined by intronic boundaries and has been recombined with at least three structurally and functionally distinct genomic sequences to generate coding sequences for: (i) the rel/Dorsal/NFκB proteins that are retained in the cytoplasm by IκB-like proteins; (ii) the NFATc proteins that sense calcium signals and undergo cytoplasmic-to-nuclear translocation in response to dephosphorylation by calcineurin; and (iii) the TonEBP tonicity-responsive proteins. Remarkably, a single exon in each NFATc family member encodes the entire Ca2+/calcineurin sensing region, including nuclear import/export, calcineurin-binding, and substrate regions. The Rel/Dorsal proteins and the TonEBP proteins are present in Drosophila but not Caenorhabditis elegans. On the other hand, the calcium-responsive NFATc proteins are present only in vertebrates, suggesting that the NFATc family is dedicated to functions specific to vertebrates such as a recombinational immune response, cardiovascular development, and vertebrate-specific aspects of the development and function of the nervous system.
Abstract:
Concerted evolution is often invoked to explain the diversity and evolution of the multigene families of major histocompatibility complex (MHC) genes and immunoglobulin (Ig) genes. However, this hypothesis has been controversial because the member genes of these families from the same species are not necessarily more closely related to one another than to the genes from different species. To resolve this controversy, we conducted phylogenetic analyses of several multigene families of the MHC and Ig systems. The results show that the evolutionary pattern of these families is quite different from that of concerted evolution but is in agreement with the birth-and-death model of evolution in which new genes are created by repeated gene duplication and some duplicate genes are maintained in the genome for a long time but others are deleted or become nonfunctional by deleterious mutations. We found little evidence that interlocus gene conversion plays an important role in the evolution of MHC and Ig multigene families.
Abstract:
For the most part, studies of grass genome structure have been limited to the generation of whole-genome genetic maps or the fine structure and sequence analysis of single genes or gene clusters. We have investigated large contiguous segments of the genomes of maize, sorghum, and rice, primarily focusing on intergenic spaces. Our data indicate that much (>50%) of the maize genome is composed of interspersed repetitive DNAs, primarily nested retrotransposons that insert between genes. These retroelements are less abundant in smaller genome plants, including rice and sorghum. Although 5- to 200-kb blocks of methylated, presumably heterochromatic, retrotransposons flank most maize genes, rice and sorghum genes are often adjacent. Similar genes are commonly found in the same relative chromosomal locations and orientations in each of these three species, although there are numerous exceptions to this collinearity (i.e., rearrangements) that can be detected at the levels of both the recombinational map and cloned DNA. Evolutionarily conserved sequences are largely confined to genes and their regulatory elements. Our results indicate that a knowledge of grass genome structure will be a useful tool for gene discovery and isolation, but the general rules and biological significance of grass genome organization remain to be determined. Moreover, the nature and frequency of exceptions to the general patterns of grass genome structure and collinearity are still largely unknown and will require extensive further investigation.
Abstract:
Progress in agricultural and environmental technologies is hampered by a slower rate of gene discovery in plants than in animals. The vast pool of genes in plants, however, will be an important resource for insertion of genes, via biotechnological procedures, into an array of plants, generating unique germ plasms not achievable by conventional breeding. It has recently become clear that genomes of grasses have evolved in a manner analogous to Lego blocks. Large chromosome segments have been reshuffled and stuffer pieces added between genes. Although some genomes have become very large, the genome with the fewest stuffer pieces, the rice genome, is the Rosetta Stone of all the bigger grass genomes. This means that sequencing the rice genome as the anchor genome of the grasses will provide instantaneous access to the same genes in the same relative physical position in other grasses (e.g., corn and wheat), without the need to sequence each of these genomes independently. (i) The sequencing of the entire genome of rice as the anchor genome for the grasses will accelerate plant gene discovery in many important crops (e.g., corn, wheat, and rice) by several orders of magnitude and reduce research and development costs for government and industry at a faster pace. (ii) Costs for sequencing entire genomes have come down significantly. The rice genome is only 12% the size of the human or the corn genome, and technology improvements from the human genome project are completely transferable, translating into another 50% reduction in costs. (iii) The physical mapping of the rice genome by a group of Japanese researchers provides a jump start for sequencing the genome and forming an international consortium. Otherwise, other countries would do it alone and own proprietary positions.
Abstract:
Vertebrate innovations include neural crest cells and their derivatives, neurogenic placodes, an elaborate segmented brain, endoskeleton, and an increase in the number of genes in the genome. Comparative molecular and developmental data give new insights into the evolutionary origins of these characteristics and the complexity of the vertebrate body.
Abstract:
Deflection of the hair bundle atop a sensory hair cell modulates the open probability of mechanosensitive ion channels. In response to sustained deflections, hair cells adapt. Two fundamentally distinct models have been proposed to explain transducer adaptation. Both models support the notion that channel open probability is modulated by calcium that enters via the transduction channels. Both also suggest that the primary effect of adaptation is to shift the deflection-response [I(X)] relationship in the direction of the applied stimulus, thus maintaining hair bundle sensitivity. The models differ in several respects. They operate on different time scales: the faster on the order of a few milliseconds or less and the slower on the order of 10 ms or more. The model proposed to explain fast adaptation suggests that calcium enters and binds at or near the transduction channels to stabilize a closed conformation. The model proposed to explain the slower adaptation suggests that adaptation is mediated by an active, force-generating process that regulates the effective stimulus applied to the transduction channels. Here we discuss the evidence in support of each model and consider the possibility that both may function to varying degrees in hair cells of different species and sensory organs.
Abstract:
We summarize our recent studies showing that angiosperm mitochondrial (mt) genomes have experienced remarkably high rates of gene loss and concomitant transfer to the nucleus and of intron acquisition by horizontal transfer. Moreover, we find substantial lineage-specific variation in rates of these structural mutations and also point mutations. These findings mostly arise from a Southern blot survey of gene and intron distribution in 281 diverse angiosperms. These blots reveal numerous losses of mt ribosomal protein genes but, with one exception, only rare loss of respiratory genes. Some lineages of angiosperms have kept all of their mt ribosomal protein genes whereas others have lost most of them. These many losses appear to reflect remarkably high (and variable) rates of functional transfer of mt ribosomal protein genes to the nucleus in angiosperms. The recent transfer of cox2 to the nucleus in legumes provides both an example of interorganellar gene transfer in action and a starting point for discussion of the roles of mechanistic and selective forces in determining the distribution of genetic labor between organellar and nuclear genomes. Plant mt genomes also acquire sequences by horizontal transfer. A striking example of this is a homing group I intron in the mt cox1 gene. This extraordinarily invasive mobile element has probably been acquired over 1,000 times separately during angiosperm evolution via a recent wave of cross-species horizontal transfers. Finally, whereas all previously examined angiosperm mtDNAs have low rates of synonymous substitutions, mtDNAs of two distantly related angiosperms have highly accelerated substitution rates.