11 resultados para Bacterial Genomes
em National Center for Biotechnology Information - NCBI
Resumo:
The recently sequenced genome of the parasitic bacterium Mycoplasma genitalium contains only 468 identified protein-coding genes that have been dubbed a minimal gene complement [Fraser, C.M., Gocayne, J.D., White, O., Adams, M.D., Clayton, R.A., et al. (1995) Science 270, 397-403]. Although the M. genitalium gene complement is indeed the smallest among known cellular life forms, there is no evidence that it is the minimal self-sufficient gene set. To derive such a set, we compared the 468 predicted M. genitalium protein sequences with the 1703 protein sequences encoded by the other completely sequenced small bacterial genome, that of Haemophilus influenzae. M. genitalium and H. influenzae belong to two ancient bacterial lineages, i.e., Gram-positive and Gram-negative bacteria, respectively. Therefore, the genes that are conserved in these two bacteria are almost certainly essential for cellular function. It is this category of genes that is most likely to approximate the minimal gene set. We found that 240 M. genitalium genes have orthologs among the genes of H. influenzae. This collection of genes falls short of comprising the minimal set as some enzymes responsible for intermediate steps in essential pathways are missing. The apparent reason for this is the phenomenon that we call nonorthologous gene displacement when the same function is fulfilled by nonorthologous proteins in two organisms. We identified 22 nonorthologous displacements and supplemented the set of orthologs with the respective M. genitalium genes. After examining the resulting list of 262 genes for possible functional redundancy and for the presence of apparently parasite-specific genes, 6 genes were removed. We suggest that the remaining 256 genes are close to the minimal gene set that is necessary and sufficient to sustain the existence of a modern-type cell. Most of the proteins encoded by the genes from the minimal set have eukaryotic or archaeal homologs but seven key proteins of DNA replication do not. We speculate that the last common ancestor of the three primary kingdoms had an RNA genome. Possibilities are explored to further reduce the minimal set to model a primitive cell that might have existed at a very early stage of life evolution.
Resumo:
Using computer programs developed for this purpose, we searched for various repeated sequences including inverted, direct tandem, and homopurine–homopyrimidine mirror repeats in various prokaryotes, eukaryotes, and an archaebacterium. Comparison of observed frequencies with expectations revealed that in bacterial genomes and organelles the frequency of different repeats is either random or enriched for inverted and/or direct tandem repeats. By contrast, in all eukaryotic genomes studied, we observed an overrepresentation of all repeats, especially homopurine–homopyrimidine mirror repeats. Analysis of the genomic distribution of all abundant repeats showed that they are virtually excluded from coding sequences. Unexpectedly, the frequencies of abundant repeats normalized for their expectations were almost perfect exponential functions of their size, and for a given repeat this function was indistinguishable between different genomes.
Resumo:
Operon structure is an important organization feature of bacterial genomes. Many sets of genes occur in the same order on multiple genomes; these conserved gene groupings represent candidate operons. This study describes a computational method to estimate the likelihood that such conserved gene sets form operons. The method was used to analyze 34 bacterial and archaeal genomes, and yielded more than 7600 pairs of genes that are highly likely (P ≥ 0.98) to belong to the same operon. The sensitivity of our method is 30–50% for the Escherichia coli genome. The predicted gene pairs are available from our World Wide Web site http://www.tigr.org/tigr-scripts/operons/operons.cgi.
Resumo:
The 4,188-kb circular genome of Bacillus subtilis 168 was artificially dissected into two stable circular chromosomes in vivo, one being the 3,878-kb main genome and the other the 310-kb subgenome that was recovered as covalently closed circular DNA in CsCl-ethidium bromide ultracentrifugation. The minimal requirements to physically separate the 310-kb DNA segment out of the genome were two interrepeat homologous sequences and an origin of DNA replication between them. The subgenome originated from the 1,255–1,551-kb region of the B. subtilis genome was essential for the cell to survive because the subgenome was not lost from the cell. The finding that the B. subtilis genome has a potential to be divided and the resulting two replicons stably maintained may shed light on origins and formation mechanisms of giant plasmids or second chromosomes present in many bacteria. Similar excision or its reversal process, i.e., integration of large sized covalently closed circular DNA pieces into the main genome, implies significant roles of subgenomes in the exchange of genetic information and size variation of bacterial genomes in bacterial evolution.
Resumo:
We present a method for discovering conserved sequence motifs from families of aligned protein sequences. The method has been implemented as a computer program called emotif (http://motif.stanford.edu/emotif). Given an aligned set of protein sequences, emotif generates a set of motifs with a wide range of specificities and sensitivities. emotif also can generate motifs that describe possible subfamilies of a protein superfamily. A disjunction of such motifs often can represent the entire superfamily with high specificity and sensitivity. We have used emotif to generate sets of motifs from all 7,000 protein alignments in the blocks and prints databases. The resulting database, called identify (http://motif.stanford.edu/identify), contains more than 50,000 motifs. For each alignment, the database contains several motifs having a probability of matching a false positive that range from 10−10 to 10−5. Highly specific motifs are well suited for searching entire proteomes, while generating very few false predictions. identify assigns biological functions to 25–30% of all proteins encoded by the Saccharomyces cerevisiae genome and by several bacterial genomes. In particular, identify assigned functions to 172 of proteins of unknown function in the yeast genome.
Resumo:
We have investigated genetic differences between the closely related pathogenic Neisseria species, Neisseria meningitidis and Neisseria gonorrhoeae, as a novel approach to the elucidation of the genetic basis for their different pathogenicities. N. meningitidis is a major cause of cerebrospinal meningitis, whereas N. gonorrhoeae is the agent of gonorrhoea. The technique of representational difference analysis was adapted to the search for genes present in the meningococcus but absent from the gonococcus. The libraries achieved are comprehensive and specific in that they contain sequences corresponding to the presently identified meningococcus-specific genes (capsule, frp, rotamase, and opc) but lack genes more or less homologous between the two species, e.g., ppk and pilC1. Of 35 randomly chosen clones specific to N. meningitidis, DNA sequence analysis has confirmed that the large majority have no homology with published neisserial sequences. Mapping of the cloned DNA fragments onto the chromosome of N. meningitidis strain Z2491 has revealed a nonrandom distribution of meningococcus-specific sequences. Most of the genetic differences between the meningococcus and gonococcus appear to be clustered in three distinct regions, one of which (region 1) contains the capsule-related genes. Region 3 was found only in strains of serogroup A, whereas region 2 is present in a variety of meningococci belonging to different serogroups. At a time when bacterial genomes are being sequenced, we believe that this technique is a powerful tool for a rapid and directed analysis of the genetic basis of inter- or intraspecific phenotypic variations.
Resumo:
A loxP-transposon retrofitting strategy for generating large nested deletions from one end of the insert DNA in bacterial artificial chromosomes and P1 artificial chromosomes was described recently [Chatterjee, P. K. & Coren, J. S. (1997) Nucleic Acids Res. 25, 2205–2212]. In this report, we combine this procedure with direct sequencing of nested-deletion templates by using primers located in the transposon end to illustrate its value for position-specific single-nucleotide polymorphism (SNP) discovery from chosen regions of large insert clones. A simple ampicillin sensitivity screen was developed to facilitate identification and recovery of deletion clones free of transduced transposon plasmid. This directed approach requires minimal DNA sequencing, and no in vitro subclone library generation; positionally oriented SNPs are a consequence of the method. The procedure is used to discover new SNPs as well as physically map those identified from random subcloned libraries or sequence databases. The deletion templates, positioned SNPs, and markers are also used to orient large insert clones into a contig. The deletion clone can serve as a ready resource for future functional genomic studies because each carries a mammalian cell-specific antibiotic resistance gene from the transposon. Furthermore, the technique should be especially applicable to the analysis of genomes for which a full genome sequence or radiation hybrid cell lines are unavailable.
Resumo:
A strategy for cloning and mutagenesis of an infectious herpesvirus genome is described. The mouse cytomegalovirus genome was cloned and maintained as a 230 kb bacterial artificial chromosome (BAC) in E. coli. Transfection of the BAC plasmid into eukaryotic cells led to a productive virus infection. The feasibility to introduce targeted mutations into the BAC cloned virus genome was shown by mutation of the immediate-early 1 gene and generation of a mutant virus. Thus, the complete construction of a mutant herpesvirus genome can now be carried out in a controlled manner prior to the reconstitution of infectious progeny. The described approach should be generally applicable to the mutagenesis of genomes of other large DNA viruses.
Resumo:
The construction of cDNA clones encoding large-size RNA molecules of biological interest, like coronavirus genomes, which are among the largest mature RNA molecules known to biology, has been hampered by the instability of those cDNAs in bacteria. Herein, we show that the application of two strategies, cloning of the cDNAs into a bacterial artificial chromosome and nuclear expression of RNAs that are typically produced within the cytoplasm, is useful for the engineering of large RNA molecules. A cDNA encoding an infectious coronavirus RNA genome has been cloned as a bacterial artificial chromosome. The rescued coronavirus conserved all of the genetic markers introduced throughout the sequence and showed a standard mRNA pattern and the antigenic characteristics expected for the synthetic virus. The cDNA was transcribed within the nucleus, and the RNA translocated to the cytoplasm. Interestingly, the recovered virus had essentially the same sequence as the original one, and no splicing was observed. The cDNA was derived from an attenuated isolate that replicates exclusively in the respiratory tract of swine. During the engineering of the infectious cDNA, the spike gene of the virus was replaced by the spike gene of an enteric isolate. The synthetic virus replicated abundantly in the enteric tract and was fully virulent, demonstrating that the tropism and virulence of the recovered coronavirus can be modified. This demonstration opens up the possibility of employing this infectious cDNA as a vector for vaccine development in human, porcine, canine, and feline species susceptible to group 1 coronaviruses.
Resumo:
Fluorescence in situ hybridization (FISH) is a powerful tool for physical mapping in human and other mammalian species. However, application of the FISH technique has been limited in plant species, especially for mapping single- or low-copy DNA sequences, due to inconsistent signal production in plant chromosome preparations. Here we demonstrate that bacterial artificial chromosome (BAC) clones can be mapped readily on rice (Oryza sativa L.) chromosomes by FISH. Repetitive DNA sequences in BAC clones can be suppressed efficiently by using rice genomic DNA as a competitor in the hybridization mixture. BAC clones as small as 40 kb were successfully mapped. To demonstrate the application of the FISH technique in physical mapping of plant genomes, both anonymous BAC clones and clones closely linked to a rice bacterial blight-resistance locus, Xa21, were chosen for analysis. The physical location of Xa21 and the relationships among the linked clones were established, thus demonstrating the utility of FISH in plant genome analysis.