12 resultados para Prokaryotic Genomes
em University of Queensland eSpace - Australia
Resumo:
Non-tree-based ('surrogate') methods have been used to identify instances of lateral genetic transfer in microbial genomes but agreement among predictions of different methods can be poor. It has been proposed that this disagreement arises because different surrogate methods are biased towards the detection of certain types of transfer events. This conjecture is supported by a rigorous phylogenetic analysis of 3776 proteins in Escherichia coli K12 MG1655 to map the ages of transfer events relative to one another.
Resumo:
We generated draft genome sequences for two cold-adapted Archaea, Methanogenium frigidum and Methanococcoides burtonii, to identify genotypic characteristics that distinguish them from Archaea with a higher optimal growth temperature (OGT). Comparative genomics revealed trends in amino acid and tRNA composition, and structural features of proteins. Proteins from the cold-adapted Archaea are characterized by a higher content of noncharged polar amino acids, particularly Gin and Thr and a lower content of hydrophobic amino acids, particularly Leu. Sequence data from nine methanogen genomes (OGT 15degrees-98degreesC) were used to generate IIII modeled protein structures. Analysis of the models from the cold-adapted Archaea showed a strong tendency in the solvent-accessible area for more Gin, Thr, and hydrophobic residues and fewer charged residues. A cold shock domain (CSD) protein (CspA homolog) was identified in M. frigidum, two hypothetical proteins with CSD-folds in M. burtonii, and a unique winged helix DNA-binding domain protein in M. burtonii. This suggests that these types of nucleic acid binding proteins have a critical role in cold-adapted Archaea. Structural analysis of tRNA sequences from the Archaea indicated that GC content is the major factor influencing tRNA stability in hyperthermophiles, but not in the psychrophiles, mesophiles or moderate thermophiles. Below an OGT of 60degreesC, the GC content in tRNA was largely unchanged, indicating that any requirement for flexibility of tRNA in psychrophiles is mediated by other means. This is the first time that comparisons have been performed with genome data from Archaea spanning the growth temperature extremes. from psychrophiles to hyperthermophiles
Resumo:
There are two major groups of ticks: soft ticks and hard ticks. The hard ticks comprise the prostriate ticks and the metastriate ticks. The mitochondrial (mt) genomes of one species of prostriate tick and two species of metastriate ticks had been sequenced prior to our study. The prostriate tick has the ancestral arrangement of mt genes of arthropods, whereas the two metastriate ticks have rearrangements of eight genes and duplicate control regions. However, the arrangement of genes in the mt genomes of soft ticks had not been studied. We sequenced the mt genomes of two species of soft ticks, Carios capensis and Ornithodoros moubata, and a metastriate tick, Haemaphysalis flava. We found that the soft ticks have the ancestral arrangement of mt genes of arthropods, whereas the metastriate tick, H. flava, shares the rearrangements of mt genes and duplicate control regions with the other two metastriate ticks that have previously been studied. Our study indicates that gene rearrangements and duplicate control regions in mt genomes occurred once in the most recent common ancestor of metastriate ticks, whereas the ancestral arrangement of arthropods has remained unchanged for over 400 million years in the lineages leading to the soft ticks and the prostriate ticks.
Resumo:
The extent to which lateral genetic transfer has shaped microbial genomes has major implications for the emergence of community structures. We have performed a rigorous phylogenetic analysis of > 220,000 proteins from genomes of 144 prokaryotes to determine the contribution of gene sharing to current prokaryotic diversity, and to identify highways of sharing between lineages. The inferred relationships suggest a pattern of inheritance that is largely vertical, but with notable exceptions among closely related taxa, and among distantly related organisms that live in similar environments.
Resumo:
To investigate the evolution pattern and phylogenetic utility of duplicate control regions (CRs) in mitochondrial (mt) genomes, we sequenced the entire mt genomes of three Ixodes species and part of the mt genomes of another I I species. All the species from the Australasian lineage have duplicate CRs, whereas the other species have one CR. Sequence analyses indicate that the two CRs of the Australasian Ixodes ticks have evolved in concert in each species. In addition to the Australasian Ixodes ticks, species from seven other lineages of metazoa also have mt genomes with duplicate CRs. Accumulated mtDNA sequence data from these metazoans and two recent experiments on replication of mt genomes in human cell lines with duplicate CRs allowed us to re-examine four intriguing questions about the presence of duplicate CRs in the mt genomes of metazoa: (1) Why do some mt genomes, but not others, have duplicate CRs? (2) How did mt genomes with duplicate CRs evolve? (3) How could the nucleotide sequences of duplicate CRs remain identical or very similar over evolutionary time? (4) Are duplicate CRs phylogenetic markers? It appears that mt genomes with duplicate CRs have a selective advantage in replication over mt genomes with one CR. Tandem duplication followed by deletion of genes is the most plausible mechanism for the generation of mt genomes with duplicate CRs. Once duplicate CRs occur in an mt genome, they tend to evolve in concert, probably by gene conversion. However, there are lineages where gene conversion may not always occur, and, thus, the two CRs may evolve independently in these lineages. Duplicate CRs have much potential as phylogenetic markers at low taxonomic levels, such as within genera, within families, or among families, but not at high taxonomic levels, such as among orders.
Resumo:
Recently, we identified a large number of ultraconserved (uc) sequences in noncoding regions of human, mouse, and rat genomes that appear to be essential for vertebrate and amniote ontogeny. Here, we used similar methods to identify ultraconserved genomic regions between the insect species Drosophila melanogaster and Drosophila pseudoobscura, as well as the more distantly related Anopheles gambiae. As with vertebrates, ultraconserved sequences in insects appear to Occur primarily in intergenic and intronic sequences, and at intron-exon junctions. The sequences are significantly associated with genes encoding developmental regulators and transcription factors, but are less frequent and are smaller in size than in vertebrates. The longest identical, nongapped orthologous match between the three genomes was found within the homothorax (hth) gene. This sequence spans an internal exon-intron junction, with the majority located within the intron, and is predicted to form a highly stable stem-loop RNA structure. Real-time quantitative PCR analysis of different hth splice isoforms and Northern blotting showed that the conserved element is associated with a high incidence of intron retention in hth pre-mRNA, suggesting that the conserved intronic element is critically important in the post-transcriptional regulation of hth expression in Diptera.
Resumo:
The arrangement of genes in the mitochondrial (mt) genomes of most insects is the same, or near-identical, to that inferred to be ancestral for insects. We sequenced the entire mt genome of the small pigeon louse, Campanulotes bidentatus compar, and part of the mt genomes of nine other species of lice. These species were from six families and the three main suborders of the order Phthiraptera. There was no variation in gene arrangement among species within a family but there was much variation in gene arrangement among the three suborders of lice. There has been an extraordinary number of gene rearrangements in the mitochondrial genomes of lice!
Resumo:
Eukaryotic genomes display segmental patterns of variation in various properties, including GC content and degree of evolutionary conservation. DNA segmentation algorithms are aimed at identifying statistically significant boundaries between such segments. Such algorithms may provide a means of discovering new classes of functional elements in eukaryotic genomes. This paper presents a model and an algorithm for Bayesian DNA segmentation and considers the feasibility of using it to segment whole eukaryotic genomes. The algorithm is tested on a range of simulated and real DNA sequences, and the following conclusions are drawn. Firstly, the algorithm correctly identifies non-segmented sequence, and can thus be used to reject the null hypothesis of uniformity in the property of interest. Secondly, estimates of the number and locations of change-points produced by the algorithm are robust to variations in algorithm parameters and initial starting conditions and correspond to real features in the data. Thirdly, the algorithm is successfully used to segment human chromosome 1 according to GC content, thus demonstrating the feasibility of Bayesian segmentation of eukaryotic genomes. The software described in this paper is available from the author's website (www.uq.edu.au/similar to uqjkeith/) or upon request to the author.
Resumo:
Despite the presence of over 3 million transposons separated on average by similar to 500 bp, the human and mouse genomes each contain almost 1000 transposon-free regions (TFRs) over 10 kb in length. The majority of human TFRs correlate with orthologous TFRs in the mouse, despite the fact that most transposons are lineage specific. Many human TFRs also overlap with orthologous TFRs in the marsupial opossum, indicating that these regions have remained refractory to transposon insertion for long evolutionary periods. Over 90% of the bases covered by TFRs are noncoding, much of which is not highly conserved. Most TFRs are not associated with unusual nucleotide composition, but are significantly associated with genes encoding developmental regulators, suggesting that they represent extended regions of regulatory information that are largely unable to tolerate insertions, a conclusion difficult to reconcile with current conceptions of gene regulation.