84 results for genome sequence

in National Center for Biotechnology Information - NCBI


Relevance:

100.00%

Abstract:

The complete genome sequence of Caulobacter crescentus was determined to be 4,016,942 base pairs in a single circular chromosome encoding 3,767 genes. This organism, which grows in a dilute aquatic environment, coordinates the cell division cycle and multiple cell differentiation events. With the annotated genome sequence, a full description of the genetic network that controls bacterial differentiation, cell growth, and cell cycle progression is within reach. Two-component signal transduction proteins are known to play a significant role in cell cycle progression. Genome analysis revealed that the C. crescentus genome encodes a significantly higher number of these signaling proteins (105) than any bacterial genome sequenced thus far. Another regulatory mechanism involved in cell cycle progression is DNA methylation. The occurrence of the recognition sequence for an essential DNA methylating enzyme that is required for cell cycle regulation is severely limited and shows a bias to intergenic regions. The genome contains multiple clusters of genes encoding proteins essential for survival in a nutrient poor habitat. Included are those involved in chemotaxis, outer membrane channel function, degradation of aromatic ring compounds, and the breakdown of plant-derived carbon sources, in addition to many extracytoplasmic function sigma factors, providing the organism with the ability to respond to a wide range of environmental fluctuations. C. crescentus is, to our knowledge, the first free-living α-class proteobacterium to be sequenced and will serve as a foundation for exploring the biology of this group of bacteria, which includes the obligate endosymbiont and human pathogen Rickettsia prowazekii, the plant pathogen Agrobacterium tumefaciens, and the bovine and human pathogen Brucella abortus.

Relevance:

100.00%

Abstract:

The 1,852,442-bp sequence of an M1 strain of Streptococcus pyogenes, a Gram-positive pathogen, has been determined and contains 1,752 predicted protein-encoding genes. Approximately one-third of these genes have no identifiable function, with the remainder falling into previously characterized categories of known microbial function. Consistent with the observation that S. pyogenes is responsible for a wider variety of human disease than any other bacterial species, more than 40 putative virulence-associated genes have been identified. Additional genes have been identified that encode proteins likely associated with microbial “molecular mimicry” of host characteristics and involved in rheumatic fever or acute glomerulonephritis. The complete or partial sequence of four different bacteriophage genomes is also present, with each containing genes for one or more previously undiscovered superantigen-like proteins. These prophage-associated genes encode at least six potential virulence factors, emphasizing the importance of bacteriophages in horizontal gene transfer and a possible mechanism for generating new strains with increased pathogenic potential.

Relevance:

70.00%

Abstract:

Whole-genome duplication approximately 10⁸ years ago was proposed as an explanation for the many duplicated chromosomal regions in Saccharomyces cerevisiae. Here we have used computer simulations and analytic methods to estimate some parameters describing the evolution of the yeast genome after this duplication event. Computer simulation of a model in which 8% of the original genes were retained in duplicate after genome duplication, and 70–100 reciprocal translocations occurred between chromosomes, produced arrangements of duplicated chromosomal regions very similar to the map of real duplications in yeast. An analytical method produced an independent estimate of 84 map disruptions. These results imply that many smaller duplicated chromosomal regions exist in the yeast genome in addition to the 55 originally reported. We also examined the possibility of determining the original order of chromosomal blocks in the ancestral unduplicated genome, but this cannot be done without information from one or more additional species. If the genome sequence of one other species (such as Kluyveromyces lactis) were known it should be possible to identify 150–200 paired regions covering the whole yeast genome and to reconstruct approximately two-thirds of the original order of blocks of genes in yeast. Rates of interchromosome translocation in yeast and mammals appear similar despite their very different rates of homologous recombination per kilobase.

Relevance:

70.00%

Abstract:

Despite more than a century of debate, the evolutionary position of turtles (Testudines) relative to other amniotes (reptiles, birds, and mammals) remains uncertain. One of the major impediments to resolving this important evolutionary problem is the highly distinctive and enigmatic morphology of turtles that led to their traditional placement apart from diapsid reptiles as sole descendants of presumably primitive anapsid reptiles. To address this question, the complete (16,787-bp) mitochondrial genome sequence of the African side-necked turtle (Pelomedusa subrufa) was determined. This molecule contains several unusual features: a (TA)n microsatellite in the control region, the absence of an origin of replication for the light strand in the WANCY region of five tRNA genes, an unusually long noncoding region separating the ND5 and ND6 genes, an overlap between ATPase 6 and COIII genes, and the existence of extra nucleotides in ND3 and ND4L putative ORFs. Phylogenetic analyses of the complete mitochondrial genome sequences supported the placement of turtles as the sister group of an alligator and chicken (Archosauria) clade. This result clearly rejects the Haematothermia hypothesis (a sister-group relationship between mammals and birds), as well as rejecting the placement of turtles as the most basal living amniotes. Moreover, evidence from both complete mitochondrial rRNA genes supports a sister-group relationship of turtles to Archosauria to the exclusion of Lepidosauria (tuatara, snakes, and lizards). These results challenge the classic view of turtles as the only survivors of primary anapsid reptiles and imply that turtles might have secondarily lost their skull fenestration.

Relevance:

70.00%

Abstract:

We present an approach to map large numbers of Tc1 transposon insertions in the genome of Caenorhabditis elegans. Strains have been described that contain up to 500 polymorphic Tc1 insertions. From these we have cloned and shotgun sequenced over 2000 Tc1 flanks, resulting in an estimated set of 400 or more distinct Tc1 insertion alleles. Alignment of these sequences revealed a weak Tc1 insertion site consensus sequence that was symmetric around the invariant TA target site and reads CAYATATRTG. The Tc1 flanking sequences were compared with 40 Mbp of a C. elegans genome sequence. We found 151 insertions within the sequenced area, a density of ≈1 Tc1 insertion in every 265 kb. As the rest of the C. elegans genome sequence is obtained, remaining Tc1 alleles will fall into place. These mapped Tc1 insertions can serve two functions: (i) insertions in or near genes can be used to isolate deletion derivatives that have that gene mutated; and (ii) they represent a dense collection of polymorphic sequence-tagged sites. We demonstrate a strategy to use these Tc1 sequence-tagged sites in fine-mapping mutations.
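The consensus CAYATATRTG is written in IUPAC nucleotide codes (Y = C or T, R = A or G). The following is a minimal Python sketch, not the authors' method, that checks the stated symmetry around the central TA by comparing the consensus with its reverse complement, turns the consensus into a regular expression, and reproduces the reported spacing from the abstract's figures (151 insertions in 40 Mbp). The function names are illustrative assumptions.

```python
import re

# Standard IUPAC nucleotide codes mapped to regex character classes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "[AG]", "Y": "[CT]", "S": "[CG]", "W": "[AT]",
    "K": "[GT]", "M": "[AC]", "B": "[CGT]", "D": "[AGT]",
    "H": "[ACT]", "V": "[ACG]", "N": "[ACGT]",
}

# Complement of each IUPAC code (R <-> Y, K <-> M, and so on).
COMPLEMENT = {
    "A": "T", "C": "G", "G": "C", "T": "A",
    "R": "Y", "Y": "R", "S": "S", "W": "W",
    "K": "M", "M": "K", "B": "V", "V": "B",
    "D": "H", "H": "D", "N": "N",
}


def reverse_complement(seq):
    """Reverse complement of a sequence written in IUPAC codes."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))


def consensus_to_regex(consensus):
    """Compile an IUPAC consensus into a regular expression."""
    return re.compile("".join(IUPAC[base] for base in consensus))


def insertion_spacing_kb(sequenced_bp, insertions):
    """Average spacing between insertions, in kilobases."""
    return sequenced_bp / insertions / 1000


CONSENSUS = "CAYATATRTG"  # consensus quoted in the abstract
is_symmetric = reverse_complement(CONSENSUS) == CONSENSUS
pattern = consensus_to_regex(CONSENSUS)
spacing = insertion_spacing_kb(40_000_000, 151)  # figures from the abstract
```

Because the consensus equals its own reverse complement, a site matching it on one strand also matches on the other, which is what "symmetric around the invariant TA" implies.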

Relevance:

70.00%

Abstract:

We present here the complete genome sequence of a common avian clone of Pasteurella multocida, Pm70. The genome of Pm70 is a single circular chromosome 2,257,487 base pairs in length and contains 2,014 predicted coding regions, 6 ribosomal RNA operons, and 57 tRNAs. Genome-scale evolutionary analyses based on pairwise comparisons of 1,197 orthologous sequences between P. multocida, Haemophilus influenzae, and Escherichia coli suggest that P. multocida and H. influenzae diverged ≈270 million years ago and the γ subdivision of the proteobacteria radiated about 680 million years ago. Two previously undescribed open reading frames, accounting for ≈1% of the genome, encode large proteins with homology to the virulence-associated filamentous hemagglutinin of Bordetella pertussis. Consistent with the critical role of iron in the survival of many microbial pathogens, in silico and whole-genome microarray analyses identified more than 50 Pm70 genes with a potential role in iron acquisition and metabolism. Overall, the complete genomic sequence and preliminary functional analyses provide a foundation for future research into the mechanisms of pathogenesis and host specificity of this important multispecies pathogen.

Relevance:

70.00%

Abstract:

A computer analysis of 2328 protein sequences comprising about 60% of the Escherichia coli gene products was performed using methods for database screening with individual sequences and alignment blocks. A high fraction of E. coli proteins (86%) shows significant sequence similarity to other proteins in current databases; about 70% show conservation at least at the level of distantly related bacteria, and about 40% contain ancient conserved regions (ACRs) shared with eukaryotic or Archaeal proteins. For >90% of the E. coli proteins, either functional information or sequence similarity, or both, are available. Forty-six percent of the E. coli proteins belong to 299 clusters of paralogs (intraspecies homologs) defined on the basis of pairwise similarity. Another 10% could be included in 70 superclusters using motif detection methods. The majority of the clusters contain only two to four members. In contrast, nearly 25% of all E. coli proteins belong to the four largest superclusters, namely permeases, ATPases and GTPases with the conserved "Walker-type" motif, helix-turn-helix regulatory proteins, and NAD(FAD)-binding proteins. We conclude that bacterial protein sequences generally are highly conserved in evolution, with about 50% of all ACR-containing protein families represented among the E. coli gene products. With the current sequence databases and methods of their screening, computer analysis yields useful information on the functions and evolutionary relationships of the vast majority of genes in a bacterial genome. Sequence similarity with E. coli proteins allows the prediction of functions for a number of important eukaryotic genes, including several whose products are implicated in human diseases.
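The abstract says paralog clusters were defined from pairwise similarity but does not name the clustering procedure. As an illustration only, the sketch below assumes single-linkage grouping: any two proteins connected by a chain of significant pairwise hits end up in the same cluster. The function name, the toy protein identifiers, and the choice of union-find are all assumptions, not details from the paper.

```python
def cluster_paralogs(proteins, similar_pairs):
    """Group proteins into single-linkage clusters with union-find.

    proteins      -- iterable of protein identifiers
    similar_pairs -- iterable of (a, b) pairs judged significantly similar
    Returns clusters with at least two members; singletons are dropped.
    """
    proteins = list(proteins)
    parent = {p: p for p in proteins}

    def find(x):
        # Follow parent links to the root, halving the path as we go.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for a, b in similar_pairs:
        root_a, root_b = find(a), find(b)
        if root_a != root_b:
            parent[root_b] = root_a

    groups = {}
    for p in proteins:
        groups.setdefault(find(p), set()).add(p)

    clusters = [g for g in groups.values() if len(g) > 1]
    return sorted(clusters, key=lambda g: sorted(g))


# Toy data (hypothetical identifiers): six proteins, three similar pairs.
toy_proteins = ["a", "b", "c", "d", "e", "f"]
toy_pairs = [("a", "b"), ("b", "c"), ("d", "e")]
clusters = cluster_paralogs(toy_proteins, toy_pairs)
fraction_clustered = sum(len(c) for c in clusters) / len(toy_proteins)
```

On real data the fraction of clustered proteins corresponds to the kind of figure the abstract reports (46% of proteins in 299 clusters), although the result depends heavily on the similarity threshold used to define a pair.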

Relevance:

60.00%

Abstract:

The genome sequence of the extremely thermophilic archaeon Methanococcus jannaschii provides a wealth of data on proteins from a thermophile. In this paper, sequences of 115 proteins from M. jannaschii are compared with their homologs from mesophilic Methanococcus species. Although the growth temperatures of the mesophiles are about 50°C below that of M. jannaschii, their genomic G+C contents are nearly identical. The properties most correlated with the proteins of the thermophile include higher residue volume, higher residue hydrophobicity, more charged amino acids (especially Glu, Arg, and Lys), and fewer uncharged polar residues (Ser, Thr, Asn, and Gln). These are recurring themes, with all trends applying to 83–92% of the proteins for which complete sequences were available. Nearly all of the amino acid replacements most significantly correlated with the temperature change are the same relatively conservative changes observed in all proteins, but in the case of the mesophile/thermophile comparison there is a directional bias. We identify 26 specific pairs of amino acids with a statistically significant (P < 0.01) preferred direction of replacement.
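The compositional trends described above can be measured per protein pair with a few lines of code. The sketch below is a simplified illustration, not the authors' statistical procedure: it computes the fraction of residues in a class and the thermophile-minus-mesophile difference. The uncharged polar set (Ser, Thr, Asn, Gln) follows the abstract; treating Asp, Glu, Lys, and Arg as the charged set is an assumption, and the sequences are invented toy strings.

```python
# One-letter codes. The uncharged polar set is taken from the abstract;
# the charged set (Asp, Glu, Lys, Arg) is an assumption for this sketch.
CHARGED = frozenset("DEKR")
UNCHARGED_POLAR = frozenset("STNQ")


def class_fraction(protein, residues):
    """Fraction of positions in `protein` whose residue is in `residues`."""
    protein = protein.upper()
    if not protein:
        return 0.0
    return sum(1 for aa in protein if aa in residues) / len(protein)


def composition_shift(thermophile, mesophile):
    """Thermophile minus mesophile fraction for each residue class."""
    return {
        "charged": class_fraction(thermophile, CHARGED)
        - class_fraction(mesophile, CHARGED),
        "uncharged_polar": class_fraction(thermophile, UNCHARGED_POLAR)
        - class_fraction(mesophile, UNCHARGED_POLAR),
    }


# Invented toy homolog pair, chosen only to exercise the functions.
shift = composition_shift("MKEEKRLV", "MSTNQALV")
```

A positive "charged" value and a negative "uncharged_polar" value for a homolog pair would match the direction of the trends reported in the abstract.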

Relevance:

60.00%

Abstract:

Members of the bacterial families Haemophilus and Neisseria, important human pathogens that commonly colonize the nasopharynx, are naturally competent for DNA uptake from their environment. In each genus this process is discriminant in favor of its own and against foreign DNA through sequence specificity of DNA receptors. The Haemophilus DNA uptake apparatus binds a 29-bp oligonucleotide domain containing a highly conserved 9-bp core sequence, whereas the neisserial apparatus binds a 10-bp motif. Each motif (“uptake sequence”, US) is highly over-represented in the chromosome of the corresponding genus, particularly concentrated with core sequences in inverted pairs forming gene terminators. Two Haemophilus core USs were unexpectedly found forming the terminator of sodC in Neisseria meningitidis (meningococcus), and sequence analysis strongly suggests that this virulence gene, located next to IS1106, arose through horizontal transfer from Haemophilus. By using USs as search strings in a computer-based analysis of genome sequence, it was established that while USs of the “wrong” genus do not occur commonly in Neisseria or Haemophilus, where they do they are highly likely to flag domains of chromosomal DNA that have been transferred from Haemophilus. Three independent domains of Haemophilus-like DNA were found in the meningococcal chromosome, associated respectively with the virulence gene sodC, the bio gene cluster, and an unidentified orf. This report identifies intergenerically transferred DNA and its source in bacteria, and further identifies transformation with heterologous chromosomal DNA as a way of establishing potentially important chromosomal mosaicism in these pathogenic bacteria.
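Using an uptake sequence as a search string amounts to scanning a genome for the motif on both strands. The sketch below shows one simple way to do that in Python; it is not the authors' software. The abstract does not list the uptake sequences themselves, so the motif and the short "genome" here are placeholders chosen for illustration, and the function names are assumptions.

```python
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A", "N": "N"}


def reverse_complement(seq):
    """Reverse complement of an uppercase DNA string."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))


def find_motif_both_strands(genome, motif):
    """Return sorted (position, strand) hits for `motif` in `genome`.

    Positions are 0-based offsets on the forward strand. A hit on the
    "-" strand means the reverse complement of the motif occurs there.
    Overlapping occurrences are reported.
    """
    genome = genome.upper()
    motif = motif.upper()
    patterns = [("+", motif)]
    rc = reverse_complement(motif)
    if rc != motif:  # avoid double-counting palindromic motifs
        patterns.append(("-", rc))

    hits = []
    for strand, pattern in patterns:
        start = genome.find(pattern)
        while start != -1:
            hits.append((start, strand))
            start = genome.find(pattern, start + 1)
    return sorted(hits)


# Placeholder inputs: a 9-bp motif and a toy sequence containing one
# forward copy and one inverted copy.
toy_genome = "TTAAGTGCGGTCCACCGCACTTGG"
hits = find_motif_both_strands(toy_genome, "AAGTGCGGT")
```

A forward hit followed closely by a reverse-strand hit, as in the toy sequence, is the inverted-pair arrangement the abstract describes for uptake sequences that form gene terminators.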

Relevance:

60.00%

Abstract:

A loxP-transposon retrofitting strategy for generating large nested deletions from one end of the insert DNA in bacterial artificial chromosomes and P1 artificial chromosomes was described recently [Chatterjee, P. K. & Coren, J. S. (1997) Nucleic Acids Res. 25, 2205–2212]. In this report, we combine this procedure with direct sequencing of nested-deletion templates by using primers located in the transposon end to illustrate its value for position-specific single-nucleotide polymorphism (SNP) discovery from chosen regions of large insert clones. A simple ampicillin sensitivity screen was developed to facilitate identification and recovery of deletion clones free of transduced transposon plasmid. This directed approach requires minimal DNA sequencing, and no in vitro subclone library generation; positionally oriented SNPs are a consequence of the method. The procedure is used to discover new SNPs as well as physically map those identified from random subcloned libraries or sequence databases. The deletion templates, positioned SNPs, and markers are also used to orient large insert clones into a contig. The deletion clone can serve as a ready resource for future functional genomic studies because each carries a mammalian cell-specific antibiotic resistance gene from the transposon. Furthermore, the technique should be especially applicable to the analysis of genomes for which a full genome sequence or radiation hybrid cell lines are unavailable.

Relevance:

60.00%

Abstract:

Bacillus subtilis strain ATCC6633 has been identified as a producer of mycosubtilin, a potent antifungal peptide antibiotic. Mycosubtilin, which belongs to the iturin family of lipopeptide antibiotics, is characterized by a β-amino fatty acid moiety linked to the circular heptapeptide Asn-Tyr-Asn-Gln-Pro-Ser-Asn, with the second, third, and sixth position present in the D-configuration. The gene cluster from B. subtilis ATCC6633 specifying the biosynthesis of mycosubtilin was identified. The putative operon spans 38 kb and consists of four ORFs, designated fenF, mycA, mycB, and mycC, with strong homologies to the family of peptide synthetases. Biochemical characterization showed that MycB specifically adenylates tyrosine, as expected for mycosubtilin synthetase, and insertional mutagenesis of the operon resulted in a mycosubtilin-negative phenotype. The mycosubtilin synthetase reveals features unique for peptide synthetases as well as for fatty acid synthases: (i) The mycosubtilin synthase subunit A (MycA) combines functional domains derived from peptide synthetases, amino transferases, and fatty acid synthases. MycA represents the first example of a natural hybrid between these enzyme families. (ii) The organization of the synthetase subunits deviates from that commonly found in peptide synthetases. On the basis of the described characteristics of the mycosubtilin synthetase, we present a model for the biosynthesis of iturin lipopeptide antibiotics. Comparison of the sequences flanking the mycosubtilin operon of B. subtilis ATCC6633, with the complete genome sequence of B. subtilis strain 168 indicates that the fengycin and mycosubtilin lipopeptide synthetase operons are exchanged between the two B. subtilis strains.

Relevance:

60.00%

Abstract:

The molecular identity and function of the Drosophila melanogaster Y-linked fertility factors have long eluded researchers. Although the D. melanogaster genome sequence was recently completed, the fertility factors still were not identified, in part because of low cloning efficiency of heterochromatic Y sequences. Here we report a method for iterative blast searching to assemble heterochromatic genes from shotgun assemblies, and we successfully identify kl-2 and kl-3 as 1β- and γ-dynein heavy chains, respectively. Our conclusions are supported by formal genetics with X-Y translocation lines. Reverse transcription–PCR was successful in linking together unmapped sequence fragments from the whole-genome shotgun assembly, although some sequences were missing altogether from the shotgun effort and had to be generated de novo. We also found a previously undescribed Y gene, polycystine-related (PRY). The closest paralogs of kl-2, kl-3, and PRY (and also of kl-5) are autosomal and not X-linked, suggesting that the evolution of the Drosophila Y chromosome has been driven by an accumulation of male-related genes arising de novo from the autosomes.

Relevance:

60.00%

Abstract:

With the completion of the determination of its entire genome sequence, one of the next major targets of Bacillus subtilis genomics is to clarify the whole gene regulatory network. To this end, the results of systematic experiments should be compared with the rich source of individual experimental results accumulated so far. Thus, we constructed a database of the upstream regulatory information of B. subtilis (DBTBS). The current version was constructed by surveying 291 references and contains information on 90 binding factors and 403 promoters. For each promoter, all of its known cis-elements are listed according to their positions, while these cis-elements are aligned to illustrate their consensus sequence for each transcription factor. All probable transcription factors coded in the genome were classified with the Pfam motifs. Using this database, we compared the character of B. subtilis promoters with that of Escherichia coli promoters. Our database is accessible at http://elmo.ims.u-tokyo.ac.jp/dbtbs/.

Relevance:

60.00%

Abstract:

The extremely halophilic archaeon Halobacterium sp. NRC-1 can grow phototrophically by means of light-driven proton pumping by bacteriorhodopsin in the purple membrane. Here, we show by genetic analysis of the wild type, and insertion and double-frame shift mutants of Bat that this transcriptional regulator coordinates synthesis of a structural protein and a chromophore for purple membrane biogenesis in response to both light and oxygen. Analysis of the complete Halobacterium sp. NRC-1 genome sequence showed that the regulatory site, upstream activator sequence (UAS), the putative binding site for Bat upstream of the bacterio-opsin gene (bop), is also present upstream to the other Bat-regulated genes. The transcription regulator Bat contains a photoresponsive cGMP-binding (GAF) domain, and a bacterial AraC type helix–turn–helix DNA binding motif. We also provide evidence for involvement of the PAS/PAC domain of Bat in redox-sensing activity by genetic analysis of a purple membrane overproducer. Five additional Bat-like putative regulatory genes were found, which together are likely to be responsible for orchestrating the complex response of this archaeon to light and oxygen. Similarities of the bop-like UAS and transcription factors in diverse organisms, including a plant and a γ-proteobacterium, suggest an ancient origin for this regulon capable of coordinating light and oxygen responses in the three major branches of the evolutionary tree of life. Finally, sensitivity of four of five regulon genes to DNA supercoiling is demonstrated and correlated to presence of alternating purine–pyrimidine sequences (RY boxes) near the regulated promoters.
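RY boxes are stretches in which purines (A, G) and pyrimidines (C, T) strictly alternate. As a rough illustration of how such stretches could be located near a promoter, the Python sketch below reports maximal alternating runs. The minimum length of 6 bp is an arbitrary assumption, not a threshold from the abstract, and the sequence is a toy example.

```python
PURINES = frozenset("AG")
PYRIMIDINES = frozenset("CT")


def base_class(base):
    """Return 'R' for purine, 'Y' for pyrimidine, None for anything else."""
    if base in PURINES:
        return "R"
    if base in PYRIMIDINES:
        return "Y"
    return None


def alternating_runs(seq, min_len=6):
    """Find maximal runs of strictly alternating purine/pyrimidine bases.

    Returns half-open (start, end) spans with end - start >= min_len.
    Ambiguous bases such as N end a run. The default min_len is an
    assumption made for this sketch.
    """
    seq = seq.upper()
    runs = []
    start = 0
    for i in range(1, len(seq) + 1):
        run_ends = (
            i == len(seq)
            or base_class(seq[i]) is None
            or base_class(seq[i - 1]) is None
            or base_class(seq[i]) == base_class(seq[i - 1])
        )
        if run_ends:
            if i - start >= min_len and base_class(seq[start]) is not None:
                runs.append((start, i))
            start = i
    return runs


toy_promoter = "AAACGCGCGTATTT"  # toy sequence, not from Halobacterium
runs = alternating_runs(toy_promoter)
boxes = [toy_promoter[s:e] for s, e in runs]
```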

Relevance:

60.00%

Abstract:

A gene encoding a product with substantial similarity to ribulose-1,5-bisphosphate carboxylase/oxygenase (RubisCO) was identified in the preliminary genome sequence of the green sulfur bacterium Chlorobium tepidum. A highly similar gene was subsequently isolated and sequenced from Chlorobium limicola f.sp. thiosulfatophilum strain Tassajara. Analysis of these amino acid sequences indicated that they lacked several conserved RubisCO active site residues. The Chlorobium RubisCO-like proteins are most closely related to deduced sequences in Bacillus subtilis and Archaeoglobus fulgidus, which also lack some typical RubisCO active site residues. When the C. tepidum gene encoding the RubisCO-like protein was disrupted, the resulting mutant strain displayed a pleiotropic phenotype with defects in photopigment content, photoautotrophic growth and carbon fixation rates, and sulfur metabolism. Most important, the mutant strain showed substantially enhanced accumulation of two oxidative stress proteins. These results indicated that the C. tepidum RubisCO-like protein might be involved in oxidative stress responses and/or sulfur metabolism. This protein might be an evolutional link to bona fide RubisCO and could serve as an important tool to analyze how the RubisCO active site developed.