45 resultados para Prokaryotic Genomes
em National Center for Biotechnology Information - NCBI
Resumo:
Increasingly, studies of genes and genomes are indicating that considerable horizontal transfer has occurred between prokaryotes. Extensive horizontal transfer has occurred for operational genes (those involved in housekeeping), whereas informational genes (those involved in transcription, translation, and related processes) are seldomly horizontally transferred. Through phylogenetic analysis of six complete prokaryotic genomes and the identification of 312 sets of orthologous genes present in all six genomes, we tested two theories describing the temporal flow of horizontal transfer. We show that operational genes have been horizontally transferred continuously since the divergence of the prokaryotes, rather than having been exchanged in one, or a few, massive events that occurred early in the evolution of prokaryotes. In agreement with earlier studies, we found that differences in rates of evolution between operational and informational genes are minimal, suggesting that factors other than rate of evolution are responsible for the observed differences in horizontal transfer. We propose that a major factor in the more frequent horizontal transfer of operational genes is that informational genes are typically members of large, complex systems, whereas operational genes are not, thereby making horizontal transfer of informational gene products less probable (the complexity hypothesis).
Resumo:
Analyses of complete genomes indicate that a massive prokaryotic gene transfer (or transfers) preceded the formation of the eukaryotic cell. In comparisons of the entire set of Methanococcus jannaschii genes with their orthologs from Escherichia coli, Synechocystis 6803, and the yeast Saccharomyces cerevisiae, it is shown that prokaryotic genomes consist of two different groups of genes. The deeper, diverging informational lineage codes for genes which function in translation, transcription, and replication, and also includes GTPases, vacuolar ATPase homologs, and most tRNA synthetases. The more recently diverging operational lineage codes for amino acid synthesis, the biosynthesis of cofactors, the cell envelope, energy metabolism, intermediary metabolism, fatty acid and phospholipid biosynthesis, nucleotide biosynthesis, and regulatory functions. In eukaryotes, the informational genes are most closely related to those of Methanococcus, whereas the majority of operational genes are most closely related to those of Escherichia, but some are closest to Methanococcus or to Synechocystis.
Resumo:
With more than 10 fully sequenced, publicly available prokaryotic genomes, it is now becoming possible to gain useful insights into genome evolution. Before the genome era, many evolutionary processes were evaluated from limited data sets and evolutionary models were constructed on the basis of small amounts of evidence. In this paper, I show that genes on the Borrelia burgdorferi genome have two separate, distinct, and significantly different codon usages, depending on whether the gene is transcribed on the leading or lagging strand of replication. Asymmetrical replication is the major source of codon usage variation. Replicational selection is responsible for the higher number of genes on the leading strands, and transcriptional selection appears to be responsible for the enrichment of highly expressed genes on these strands. Replicational–transcriptional selection, therefore, has an influence on the codon usage of a gene. This is a new paradigm of codon selection in prokaryotes.
Resumo:
The Ribosomal RNA Operon Copy Number Database (rrndb) is an Internet-accessible database containing annotated information on rRNA operon copy number among prokaryotes. Gene redundancy is uncommon in prokaryotic genomes, yet the rRNA genes can vary from one to as many as 15 copies. Despite the widespread use of 16S rRNA gene sequences for identification of prokaryotes, information on the number and sequence of individual rRNA genes in a genome is not readily accessible. In an attempt to understand the evolutionary implications of rRNA operon redundancy, we have created a phylogenetically arranged report on rRNA gene copy number for a diverse collection of prokaryotic microorganisms. Each entry (organism) in the rrndb contains detailed information linked directly to external websites including the Ribosomal Database Project, GenBank, PubMed and several culture collections. Data contained in the rrndb will be valuable to researchers investigating microbial ecology and evolution using 16S rRNA gene sequences. The rrndb web site is directly accessible on the WWW at http://rrndb.cme.msu.edu.
Resumo:
Predicted highly expressed (PHX) and putative alien genes determined by codon usages are characterized in the genome of Deinococcus radiodurans (strain R1). Deinococcus radiodurans (DEIRA) can survive very high doses of ionizing radiation that are lethal to virtually all other organisms. It has been argued that DEIRA is endowed with enhanced repair systems that provide protection and stability. However, predicted expression levels of DNA repair proteins with the exception of RecA tend to be low and do not distinguish DEIRA from other prokaryotes. In this paper, the capability of DEIRA to resist extreme doses of ionizing and UV radiation is attributed to an unusually high number of PHX chaperone/degradation, protease, and detoxification genes. Explicitly, compared with all current complete prokaryotic genomes, DEIRA contains the greatest number of PHX detoxification and protease proteins. Other sources of environmental protection against severe conditions of UV radiation, desiccation, and thermal effects for DEIRA are the several S-layer (surface structure) PHX proteins. The top PHX gene of DEIRA is the multifunctional tricarboxylic acid (TCA) gene aconitase, which, apart from its role in respiration, also alerts the cell to oxidative damage.
Resumo:
Two RNases H of mammalian tissues have been described: RNase HI, the activity of which was found to rise during DNA replication, and RNase HII, which may be involved in transcription. RNase HI is the major mammalian enzyme representing around 85% of the total RNase H activity in the cell. By using highly purified calf thymus RNase HI we identified the sequences of several tryptic peptides. This information enabled us to determine the sequence of the cDNA coding for the large subunit of human RNase HI. The corresponding ORF of 897 nt defines a polypeptide of relative molecular mass of 33,367, which is in agreement with the molecular mass obtained earlier by SDS/PAGE. Expression of the cloned ORF in Escherichia coli leads to a polypeptide, which is specifically recognized by an antiserum raised against calf thymus RNase HI. Interestingly, the deduced amino acid sequence of this subunit of human RNase HI displays significant homology to RNase HII from E. coli, an enzyme of unknown function and previously judged as a minor activity. This finding suggests an evolutionary link between the mammalian RNases HI and the prokaryotic RNases HII. The idea of a mammalian RNase HI large subunit being a strongly conserved protein is substantiated by the existence of homologous ORFs in the genomes of other eukaryotes and of all eubacteria and archaebacteria that have been completely sequenced.
Resumo:
A satellite DNA sequence, As120a, specific to the A-genome chromosomes in the hexaploid oat, Avena sativa L., was isolated by subcloning a fragment with internal tandem repeats from a plasmid, pAs120, that had been obtained from an Avena strigosa (As genome) genomic library. Southern and in situ hybridization showed that sequences with homology to sequences within pAs120 were dispersed throughout the genome of diploid (A and C genomes), tetraploid (AC genomes), and hexaploid (ACD genomes) Avena species. In contrast, sequences homologous to As120a were found in two A-genome species (A. strigosa and Avena longiglumis) and in the hexaploid A. sativa whereas this sequence was little amplified in the tetraploid Avena murphyi and was absent in the remaining A- and C-genome diploid species. In situ hybridization of pAs120a to hexaploid oat species revealed the distribution of elements of the As120a repeated family over both arms of 14 of 42 chromosomes of this species. By using double in situ hybridization with pAs120a and a C genome-specific probe, three sets of 14 chromosomes were revealed corresponding to the A, C, and D genomes of the hexaploid species. Simultaneous in situ hybridizations with pAs120a and ribosomal probes were used to assign the SAT chromosomes of hexaploid species to their correct genomes. This work reports a sequence able to distinguish between the closely related A and D genomes of hexaploid oats. This sequence offers new opportunities to analyze the relationships of Avena species and to explore the possible evolution of various polyploid oat species.
Resumo:
Unmethylated CpG dinucleotides in particular base contexts (CpG-S motifs) are relatively common in bacterial DNA but are rare in vertebrate DNA. B cells and monocytes have the ability to detect such CpG-S motifs that trigger innate immune defenses with production of Th1-like cytokines. Despite comparable levels of unmethylated CpG dinucleotides, DNA from serotype 12 adenovirus is immune-stimulatory, but serotype 2 is nonstimulatory and can even inhibit activation by bacterial DNA. In type 12 genomes, the distribution of CpG-flanking bases is similar to that predicted by chance. However, in type 2 adenoviral DNA the immune stimulatory CpG-S motifs are outnumbered by a 15- to 30-fold excess of CpG dinucleotides in clusters of direct repeats or with a C on the 5′ side or a G on the 3′ side. Synthetic oligodeoxynucleotides containing these putative neutralizing (CpG-N) motifs block immune activation by CpG-S motifs in vitro and in vivo. Eliminating 52 of the 134 CpG-N motifs present in a DNA vaccine markedly enhanced its Th1-like function in vivo, which was increased further by the addition of CpG-S motifs. Thus, depending on the CpG motif, prokaryotic DNA can be either immune-stimulatory or neutralizing. These results have important implications for understanding microbial pathogenesis and molecular evolution and for the clinical development of DNA vaccines and gene therapy vectors.
Resumo:
Eukaryotic genome similarity relationships are inferred using sequence information derived from large aggregates of genomic sequences. Comparisons within and between species sample sequences are based on the profile of dinucleotide relative abundance values (The profile is ρ*XY = f*XY/f*Xf*Y for all XY, where f*X denotes the frequency of the nucleotide X and f*XY denotes the frequency of the dinucleotide XY, both computed from the sequence concatenated with its inverted complement). Previous studies with respect to prokaryotes and this study document that profiles of different DNA sequence samples (sample size ≥50 kb) from the same organism are generally much more similar to each other than they are to profiles from other organisms, and that closely related organisms generally have more similar profiles than do distantly related organisms. On this basis we refer to the collection {ρ*XY} as the genome signature. This paper identifies ρ*XY extremes and compares genome signature differences for a diverse range of eukaryotic species. Interpretations on the mechanisms maintaining these profile differences center on genome-wide replication, repair, DNA structures, and context-dependent mutational biases. It is also observed that mitochondrial genome signature differences between species parallel the corresponding nuclear genome signature differences despite large differences between corresponding mitochondrial and nuclear signatures. The genome signature differences also have implications for contrasts between rodents and other mammals, and between monocot and dicot plants, as well as providing evidence for similarities among fungi and the diversity of protists.
Resumo:
Understanding the effects of the external environment on bacterial gene expression can provide valuable insights into an array of cellular mechanisms including pathogenesis, drug resistance, and, in the case of Mycobacterium tuberculosis, latency. Because of the absence of poly(A)+ mRNA in prokaryotic organisms, studies of differential gene expression currently must be performed either with large amounts of total RNA or rely on amplification techniques that can alter the proportional representation of individual mRNA sequences. We have developed an approach to study differences in bacterial mRNA expression that enables amplification by the PCR of a complex mixture of cDNA sequences in a reproducible manner that obviates the confounding effects of selected highly expressed sequences, e.g., ribosomal RNA. Differential expression using customized amplification libraries (DECAL) uses a library of amplifiable genomic sequences to convert total cellular RNA into an amplified probe for gene expression screens. DECAL can detect 4-fold differences in the mRNA levels of rare sequences and can be performed on as little as 10 ng of total RNA. DECAL was used to investigate the in vitro effect of the antibiotic isoniazid on M. tuberculosis, and three previously uncharacterized isoniazid-induced genes, iniA, iniB, and iniC, were identified. The iniB gene has homology to cell wall proteins, and iniA contains a phosphopantetheine attachment site motif suggestive of an acyl carrier protein. The iniA gene is also induced by the antibiotic ethambutol, an agent that inhibits cell wall biosynthesis by a mechanism that is distinct from isoniazid. The DECAL method offers a powerful new tool for the study of differential gene expression.
Resumo:
The rice genus, Oryza, which comprises 23 species and 9 recognized genome types, represents an enormous gene pool for genetic improvement of rice cultivars. Clarification of phylogenetic relationships of rice genomes is critical for effective utilization of the wild rice germ plasm. By generating and comparing two nuclear gene (Adh1 and Adh2) trees and a chloroplast gene (matK) tree of all rice species, phylogenetic relationships among the rice genomes were inferred. Origins of the allotetraploid species, which constitute more than one-third of rice species diversity, were reconstructed based on the Adh gene phylogenies. Genome types of the maternal parents of allotetraploid species were determined based on the matK gene tree. The phylogenetic reconstruction largely supports the previous recognition of rice genomes. It further revealed that the EE genome species is most closely related to the DD genome progenitor that gave rise to the CCDD genome. Three species of the CCDD genome may have originated through a single hybridization event, and their maternal parent had the CC genome. The BBCC genome species had different origins, and their maternal parents had either a BB or CC genome. An additional genome type, HHKK, was recognized for Oryza schlechteri and Porteresia coarctata, suggesting that P. coarctata is an Oryza species. The AA genome lineage, which contains cultivated rice, is a recently diverged and rapidly radiated lineage within the rice genus.
Resumo:
Using computer programs developed for this purpose, we searched for various repeated sequences including inverted, direct tandem, and homopurine–homopyrimidine mirror repeats in various prokaryotes, eukaryotes, and an archaebacterium. Comparison of observed frequencies with expectations revealed that in bacterial genomes and organelles the frequency of different repeats is either random or enriched for inverted and/or direct tandem repeats. By contrast, in all eukaryotic genomes studied, we observed an overrepresentation of all repeats, especially homopurine–homopyrimidine mirror repeats. Analysis of the genomic distribution of all abundant repeats showed that they are virtually excluded from coding sequences. Unexpectedly, the frequencies of abundant repeats normalized for their expectations were almost perfect exponential functions of their size, and for a given repeat this function was indistinguishable between different genomes.
Resumo:
We created a simulation based on experimental data from bacteriophage T7 that computes the developmental cycle of the wild-type phage and also of mutants that have an altered genome order. We used the simulation to compute the fitness of more than 105 mutants. We tested these computations by constructing and experimentally characterizing T7 mutants in which we repositioned gene 1, coding for T7 RNA polymerase. Computed protein synthesis rates for ectopic gene 1 strains were in moderate agreement with observed rates. Computed phage-doubling rates were close to observations for two of four strains, but significantly overestimated those of the other two. Computations indicate that the genome organization of wild-type T7 is nearly optimal for growth: only 2.8% of random genome permutations were computed to grow faster, the highest 31% faster, than wild type. Specific discrepancies between computations and observations suggest that a better understanding of the translation efficiency of individual mRNAs and the functions of qualitatively “nonessential” genes will be needed to improve the T7 simulation. In silico representations of biological systems can serve to assess and advance our understanding of the underlying biology. Iteration between computation, prediction, and observation should increase the rate at which biological hypotheses are formulated and tested.
Resumo:
To determine whether pathogenic mutations in mtDNA are involved in phenotypic expression of Alzheimer’s disease (AD), the transfer of mtDNA from elderly patients with AD into mtDNA-less (ρ0) HeLa cells was carried out by fusion of platelets or synaptosomal fractions of autopsied brain tissues with ρ0 HeLa cells. The results showed that mtDNA in postmortem brain tissue survives for a long time without degradation and could be rescued in ρ0 HeLa cells. Next, the cybrid clones repopulated with exogenously imported mtDNA from patients with AD were used for examination of respiratory enzyme activity and transfer of mtDNA with the pathogenic mutations that induce mitochondrial dysfunction. The presence of the mutated mtDNA was restricted to brain tissues and their cybrid clones that formed with synaptosomes as mtDNA donors, whereas no cybrid clones that isolated with platelets as mtDNA donors had detectable mutated mtDNA. However, biochemical analyses showed that all cybrid clones with mtDNA imported from platelets or brain tissues of patients with AD restored mitochondrial respiration activity to almost the same levels as those of cybrid clones with mtDNA from age-matched normal controls, suggesting functional integrity of mtDNA in both platelets and brain tissues of elderly patients with AD. These observations warrant the reassessment of the conventional concept that the accumulation of pathogenic mutations in mtDNA throughout the aging process is responsible for the decrease of mitochondrial respiration capacity with age and with the development of age-associated neurodegenerative diseases.
Resumo:
Isopentenyl diphosphate (IPP) is the central intermediate in the biosynthesis of isoprenoids, the most ancient and diverse class of natural products. Two distinct routes of IPP biosynthesis occur in nature: the mevalonate pathway and the recently discovered deoxyxylulose 5-phosphate (DXP) pathway. The evolutionary history of the enzymes involved in both routes and the phylogenetic distribution of their genes across genomes suggest that the mevalonate pathway is germane to archaebacteria, that the DXP pathway is germane to eubacteria, and that eukaryotes have inherited their genes for IPP biosynthesis from prokaryotes. The occurrence of genes specific to the DXP pathway is restricted to plastid-bearing eukaryotes, indicating that these genes were acquired from the cyanobacterial ancestor of plastids. However, the individual phylogenies of these genes, with only one exception, do not provide evidence for a specific affinity between the plant genes and their cyanobacterial homologues. The results suggest that lateral gene transfer between eubacteria subsequent to the origin of plastids has played a major role in the evolution of this pathway.