89 resultados para Klebsiella pneumoniae genome sequence
em National Center for Biotechnology Information - NCBI
Resumo:
The complete genome sequence of Caulobacter crescentus was determined to be 4,016,942 base pairs in a single circular chromosome encoding 3,767 genes. This organism, which grows in a dilute aquatic environment, coordinates the cell division cycle and multiple cell differentiation events. With the annotated genome sequence, a full description of the genetic network that controls bacterial differentiation, cell growth, and cell cycle progression is within reach. Two-component signal transduction proteins are known to play a significant role in cell cycle progression. Genome analysis revealed that the C. crescentus genome encodes a significantly higher number of these signaling proteins (105) than any bacterial genome sequenced thus far. Another regulatory mechanism involved in cell cycle progression is DNA methylation. The occurrence of the recognition sequence for an essential DNA methylating enzyme that is required for cell cycle regulation is severely limited and shows a bias to intergenic regions. The genome contains multiple clusters of genes encoding proteins essential for survival in a nutrient poor habitat. Included are those involved in chemotaxis, outer membrane channel function, degradation of aromatic ring compounds, and the breakdown of plant-derived carbon sources, in addition to many extracytoplasmic function sigma factors, providing the organism with the ability to respond to a wide range of environmental fluctuations. C. crescentus is, to our knowledge, the first free-living α-class proteobacterium to be sequenced and will serve as a foundation for exploring the biology of this group of bacteria, which includes the obligate endosymbiont and human pathogen Rickettsia prowazekii, the plant pathogen Agrobacterium tumefaciens, and the bovine and human pathogen Brucella abortus.
Resumo:
The 1,852,442-bp sequence of an M1 strain of Streptococcus pyogenes, a Gram-positive pathogen, has been determined and contains 1,752 predicted protein-encoding genes. Approximately one-third of these genes have no identifiable function, with the remainder falling into previously characterized categories of known microbial function. Consistent with the observation that S. pyogenes is responsible for a wider variety of human disease than any other bacterial species, more than 40 putative virulence-associated genes have been identified. Additional genes have been identified that encode proteins likely associated with microbial “molecular mimicry” of host characteristics and involved in rheumatic fever or acute glomerulonephritis. The complete or partial sequence of four different bacteriophage genomes is also present, with each containing genes for one or more previously undiscovered superantigen-like proteins. These prophage-associated genes encode at least six potential virulence factors, emphasizing the importance of bacteriophages in horizontal gene transfer and a possible mechanism for generating new strains with increased pathogenic potential.
Resumo:
The rpoH regulatory region of different members of the enteric bacteria family was sequenced or downloaded from GenBank and compared. In addition, the transcriptional start sites of rpoH of Yersinia frederiksenii and Proteus mirabilis, two distant members of this family, were determined. Sequences similar to the σ70 promoters P1, P4 and P5, to the σE promoter P3 and to boxes DnaA1, DnaA2, cAMP receptor protein (CRP) boxes CRP1, CRP2 and box CytR present in Escherichia coli K12, were identified in sequences of closely related bacteria such as: E.coli, Shigella flexneri, Salmonella enterica serovar Typhimurium, Citrobacter freundii, Enterobacter cloacae and Klebsiella pneumoniae. In more distant bacteria, Y.frederiksenii and P.mirabilis, the rpoH regulatory region has a distal P1-like σ70 promoter and two proximal promoters: a heat-induced σE-like promoter and a σ70 promoter. Sequences similar to the regulatory boxes were not identified in these bacteria. This study suggests that the general pattern of transcription of the rpoH gene in enteric bacteria includes a distal σ70 promoter, >200 nt upstream of the initiation codon, and two proximal promoters: a heat-induced σE-like promoter and a σ70 promoter. A second proximal σ70 promoter under catabolite-regulation is probably present only in bacteria closely related to E.coli.
Resumo:
Whole-genome duplication approximately 108 years ago was proposed as an explanation for the many duplicated chromosomal regions in Saccharomyces cerevisiae. Here we have used computer simulations and analytic methods to estimate some parameters describing the evolution of the yeast genome after this duplication event. Computer simulation of a model in which 8% of the original genes were retained in duplicate after genome duplication, and 70–100 reciprocal translocations occurred between chromosomes, produced arrangements of duplicated chromosomal regions very similar to the map of real duplications in yeast. An analytical method produced an independent estimate of 84 map disruptions. These results imply that many smaller duplicated chromosomal regions exist in the yeast genome in addition to the 55 originally reported. We also examined the possibility of determining the original order of chromosomal blocks in the ancestral unduplicated genome, but this cannot be done without information from one or more additional species. If the genome sequence of one other species (such as Kluyveromyces lactis) were known it should be possible to identify 150–200 paired regions covering the whole yeast genome and to reconstruct approximately two-thirds of the original order of blocks of genes in yeast. Rates of interchromosome translocation in yeast and mammals appear similar despite their very different rates of homologous recombination per kilobase.
Resumo:
Despite more than a century of debate, the evolutionary position of turtles (Testudines) relative to other amniotes (reptiles, birds, and mammals) remains uncertain. One of the major impediments to resolving this important evolutionary problem is the highly distinctive and enigmatic morphology of turtles that led to their traditional placement apart from diapsid reptiles as sole descendants of presumably primitive anapsid reptiles. To address this question, the complete (16,787-bp) mitochondrial genome sequence of the African side-necked turtle (Pelomedusa subrufa) was determined. This molecule contains several unusual features: a (TA)n microsatellite in the control region, the absence of an origin of replication for the light strand in the WANCY region of five tRNA genes, an unusually long noncoding region separating the ND5 and ND6 genes, an overlap between ATPase 6 and COIII genes, and the existence of extra nucleotides in ND3 and ND4L putative ORFs. Phylogenetic analyses of the complete mitochondrial genome sequences supported the placement of turtles as the sister group of an alligator and chicken (Archosauria) clade. This result clearly rejects the Haematothermia hypothesis (a sister-group relationship between mammals and birds), as well as rejecting the placement of turtles as the most basal living amniotes. Moreover, evidence from both complete mitochondrial rRNA genes supports a sister-group relationship of turtles to Archosauria to the exclusion of Lepidosauria (tuatara, snakes, and lizards). These results challenge the classic view of turtles as the only survivors of primary anapsid reptiles and imply that turtles might have secondarily lost their skull fenestration.
Resumo:
We present an approach to map large numbers of Tc1 transposon insertions in the genome of Caenorhabditis elegans. Strains have been described that contain up to 500 polymorphic Tc1 insertions. From these we have cloned and shotgun sequenced over 2000 Tc1 flanks, resulting in an estimated set of 400 or more distinct Tc1 insertion alleles. Alignment of these sequences revealed a weak Tc1 insertion site consensus sequence that was symmetric around the invariant TA target site and reads CAYATATRTG. The Tc1 flanking sequences were compared with 40 Mbp of a C. elegans genome sequence. We found 151 insertions within the sequenced area, a density of ≈1 Tc1 insertion in every 265 kb. As the rest of the C. elegans genome sequence is obtained, remaining Tc1 alleles will fall into place. These mapped Tc1 insertions can serve two functions: (i) insertions in or near genes can be used to isolate deletion derivatives that have that gene mutated; and (ii) they represent a dense collection of polymorphic sequence-tagged sites. We demonstrate a strategy to use these Tc1 sequence-tagged sites in fine-mapping mutations.
Resumo:
We present here the complete genome sequence of a common avian clone of Pasteurella multocida, Pm70. The genome of Pm70 is a single circular chromosome 2,257,487 base pairs in length and contains 2,014 predicted coding regions, 6 ribosomal RNA operons, and 57 tRNAs. Genome-scale evolutionary analyses based on pairwise comparisons of 1,197 orthologous sequences between P. multocida, Haemophilus influenzae, and Escherichia coli suggest that P. multocida and H. influenzae diverged ≈270 million years ago and the γ subdivision of the proteobacteria radiated about 680 million years ago. Two previously undescribed open reading frames, accounting for ≈1% of the genome, encode large proteins with homology to the virulence-associated filamentous hemagglutinin of Bordetella pertussis. Consistent with the critical role of iron in the survival of many microbial pathogens, in silico and whole-genome microarray analyses identified more than 50 Pm70 genes with a potential role in iron acquisition and metabolism. Overall, the complete genomic sequence and preliminary functional analyses provide a foundation for future research into the mechanisms of pathogenesis and host specificity of this important multispecies pathogen.
Sequence similarity analysis of Escherichia coli proteins: functional and evolutionary implications.
Resumo:
A computer analysis of 2328 protein sequences comprising about 60% of the Escherichia coli gene products was performed using methods for database screening with individual sequences and alignment blocks. A high fraction of E. coli proteins--86%--shows significant sequence similarity to other proteins in current databases; about 70% show conservation at least at the level of distantly related bacteria, and about 40% contain ancient conserved regions (ACRs) shared with eukaryotic or Archaeal proteins. For > 90% of the E. coli proteins, either functional information or sequence similarity, or both, are available. Forty-six percent of the E. coli proteins belong to 299 clusters of paralogs (intraspecies homologs) defined on the basis of pairwise similarity. Another 10% could be included in 70 superclusters using motif detection methods. The majority of the clusters contain only two to four members. In contrast, nearly 25% of all E. coli proteins belong to the four largest superclusters--namely, permeases, ATPases and GTPases with the conserved "Walker-type" motif, helix-turn-helix regulatory proteins, and NAD(FAD)-binding proteins. We conclude that bacterial protein sequences generally are highly conserved in evolution, with about 50% of all ACR-containing protein families represented among the E. coli gene products. With the current sequence databases and methods of their screening, computer analysis yields useful information on the functions and evolutionary relationships of the vast majority of genes in a bacterial genome. Sequence similarity with E. coli proteins allows the prediction of functions for a number of important eukaryotic genes, including several whose products are implicated in human diseases.
Resumo:
The genome sequence of the extremely thermophilic archaeon Methanococcus jannaschii provides a wealth of data on proteins from a thermophile. In this paper, sequences of 115 proteins from M. jannaschii are compared with their homologs from mesophilic Methanococcus species. Although the growth temperatures of the mesophiles are about 50°C below that of M. jannaschii, their genomic G+C contents are nearly identical. The properties most correlated with the proteins of the thermophile include higher residue volume, higher residue hydrophobicity, more charged amino acids (especially Glu, Arg, and Lys), and fewer uncharged polar residues (Ser, Thr, Asn, and Gln). These are recurring themes, with all trends applying to 83–92% of the proteins for which complete sequences were available. Nearly all of the amino acid replacements most significantly correlated with the temperature change are the same relatively conservative changes observed in all proteins, but in the case of the mesophile/thermophile comparison there is a directional bias. We identify 26 specific pairs of amino acids with a statistically significant (P < 0.01) preferred direction of replacement.
Resumo:
Members of the bacterial families Haemophilus and Neisseria, important human pathogens that commonly colonize the nasopharynx, are naturally competent for DNA uptake from their environment. In each genus this process is discriminant in favor of its own and against foreign DNA through sequence specificity of DNA receptors. The Haemophilus DNA uptake apparatus binds a 29-bp oligonucleotide domain containing a highly conserved 9-bp core sequence, whereas the neisserial apparatus binds a 10-bp motif. Each motif (“uptake sequence”, US) is highly over-represented in the chromosome of the corresponding genus, particularly concentrated with core sequences in inverted pairs forming gene terminators. Two Haemophilus core USs were unexpectedly found forming the terminator of sodC in Neisseria meningitidis (meningococcus), and sequence analysis strongly suggests that this virulence gene, located next to IS1106, arose through horizontal transfer from Haemophilus. By using USs as search strings in a computer-based analysis of genome sequence, it was established that while USs of the “wrong” genus do not occur commonly in Neisseria or Haemophilus, where they do they are highly likely to flag domains of chromosomal DNA that have been transferred from Haemophilus. Three independent domains of Haemophilus-like DNA were found in the meningococcal chromosome, associated respectively with the virulence gene sodC, the bio gene cluster, and an unidentified orf. This report identifies intergenerically transferred DNA and its source in bacteria, and further identifies transformation with heterologous chromosomal DNA as a way of establishing potentially important chromosomal mosaicism in these pathogenic bacteria.
Resumo:
The alternative bacterial σN RNA polymerase holoenzyme binds promoters as a transcriptionally inactive complex that is activated by enhancer-binding proteins. Little is known about how sigma factors respond to their ligands or how the responses lead to transcription. To examine the liganded state of σN, the assembly of end-labeled Klebsiella pneumoniae σN into holoenzyme, closed promoter complexes, and initiated transcription complexes was analyzed by enzymatic protein footprinting. V8 protease-sensitive sites in free σN were identified in the acidic region II and bordering or within the minimal DNA binding domain. Interaction with core RNA polymerase prevented cleavage at noncontiguous sites in region II and at some DNA binding domain sites, probably resulting from conformational changes. Formation of closed complexes resulted in further protections within the DNA binding domain, suggesting close contact to promoter DNA. Interestingly, residue E36 becomes sensitive to proteolysis in initiated transcription complexes, indicating a conformational change in holoenzyme during initiation. Residue E36 is located adjacent to an element involved in nucleating strand separation and in inhibiting polymerase activity in the absence of activation. The sensitivity of E36 may reflect one or both of these functions. Changing patterns of protease sensitivity strongly indicate that σN can adjust conformation upon interaction with ligands, a property likely important in the dynamics of the protein during transcription initiation.
Resumo:
A loxP-transposon retrofitting strategy for generating large nested deletions from one end of the insert DNA in bacterial artificial chromosomes and P1 artificial chromosomes was described recently [Chatterjee, P. K. & Coren, J. S. (1997) Nucleic Acids Res. 25, 2205–2212]. In this report, we combine this procedure with direct sequencing of nested-deletion templates by using primers located in the transposon end to illustrate its value for position-specific single-nucleotide polymorphism (SNP) discovery from chosen regions of large insert clones. A simple ampicillin sensitivity screen was developed to facilitate identification and recovery of deletion clones free of transduced transposon plasmid. This directed approach requires minimal DNA sequencing, and no in vitro subclone library generation; positionally oriented SNPs are a consequence of the method. The procedure is used to discover new SNPs as well as physically map those identified from random subcloned libraries or sequence databases. The deletion templates, positioned SNPs, and markers are also used to orient large insert clones into a contig. The deletion clone can serve as a ready resource for future functional genomic studies because each carries a mammalian cell-specific antibiotic resistance gene from the transposon. Furthermore, the technique should be especially applicable to the analysis of genomes for which a full genome sequence or radiation hybrid cell lines are unavailable.
Resumo:
Bacillus subtilis strain ATCC6633 has been identified as a producer of mycosubtilin, a potent antifungal peptide antibiotic. Mycosubtilin, which belongs to the iturin family of lipopeptide antibiotics, is characterized by a β-amino fatty acid moiety linked to the circular heptapeptide Asn-Tyr-Asn-Gln-Pro-Ser-Asn, with the second, third, and sixth position present in the D-configuration. The gene cluster from B. subtilis ATCC6633 specifying the biosynthesis of mycosubtilin was identified. The putative operon spans 38 kb and consists of four ORFs, designated fenF, mycA, mycB, and mycC, with strong homologies to the family of peptide synthetases. Biochemical characterization showed that MycB specifically adenylates tyrosine, as expected for mycosubtilin synthetase, and insertional mutagenesis of the operon resulted in a mycosubtilin-negative phenotype. The mycosubtilin synthetase reveals features unique for peptide synthetases as well as for fatty acid synthases: (i) The mycosubtilin synthase subunit A (MycA) combines functional domains derived from peptide synthetases, amino transferases, and fatty acid synthases. MycA represents the first example of a natural hybrid between these enzyme families. (ii) The organization of the synthetase subunits deviates from that commonly found in peptide synthetases. On the basis of the described characteristics of the mycosubtilin synthetase, we present a model for the biosynthesis of iturin lipopeptide antibiotics. Comparison of the sequences flanking the mycosubtilin operon of B. subtilis ATCC6633, with the complete genome sequence of B. subtilis strain 168 indicates that the fengycin and mycosubtilin lipopeptide synthetase operons are exchanged between the two B. subtilis strains.
Resumo:
The molecular identity and function of the Drosophila melanogaster Y-linked fertility factors have long eluded researchers. Although the D. melanogaster genome sequence was recently completed, the fertility factors still were not identified, in part because of low cloning efficiency of heterochromatic Y sequences. Here we report a method for iterative blast searching to assemble heterochromatic genes from shotgun assemblies, and we successfully identify kl-2 and kl-3 as 1β- and γ-dynein heavy chains, respectively. Our conclusions are supported by formal genetics with X-Y translocation lines. Reverse transcription–PCR was successful in linking together unmapped sequence fragments from the whole-genome shotgun assembly, although some sequences were missing altogether from the shotgun effort and had to be generated de novo. We also found a previously undescribed Y gene, polycystine-related (PRY). The closest paralogs of kl-2, kl-3, and PRY (and also of kl-5) are autosomal and not X-linked, suggesting that the evolution of the Drosophila Y chromosome has been driven by an accumulation of male-related genes arising de novo from the autosomes.
Resumo:
With the completion of the determination of its entire genome sequence, one of the next major targets of Bacillus subtilis genomics is to clarify the whole gene regulatory network. To this end, the results of systematic experiments should be compared with the rich source of individual experimental results accumulated so far. Thus, we constructed a database of the upstream regulatory information of B.subtilis (DBTBS). The current version was constructed by surveying 291 references and contains information on 90 binding factors and 403 promoters. For each promoter, all of its known cis-elements are listed according to their positions, while these cis-elements are aligned to illustrate their consensus sequence for each transcription factor. All probable transcription factors coded in the genome were classified with the Pfam motifs. Using this database, we compared the character of B.subtilis promoters with that of Escherichia coli promoters. Our database is accessible at http://elmo.ims.u-tokyo.ac.jp/dbtbs/.