84 results for Prokaryotic Genomes
Abstract:
The genome of the soil-dwelling heterotrophic N2-fixing Gram-negative bacterium Azotobacter chroococcum NCIMB 8003 (ATCC 4412) (Ac-8003) has been determined. It consists of 7 circular replicons totalling 5,192,291 bp comprising a circular chromosome of 4,591,803 bp and six plasmids, pAcX50a, b, c, d, e, and f, of 10,435, 13,852, 62,783, 69,713, 132,724, and 311,724 bp, respectively. The chromosome has a G+C content of 66.27% and the six plasmids have G+C contents of 58.1, 55.3, 56.7, 59.2, 61.9, and 62.6%, respectively. The methylome has also been determined and 5 methylation motifs have been identified. The genome also contains a very high number of transposase/inactivated transposase genes from at least 12 of the 17 recognised insertion sequence families. The Ac-8003 genome has been compared with that of Azotobacter vinelandii ATCC BAA-1303 (Av-DJ), a derivative of strain O and the only other member of the Azotobacteraceae whose genome has been determined so far, which has a single chromosome of 5,365,318 bp and no plasmids. The chromosomes show significant stretches of synteny throughout but also reveal a history of many deletion/insertion events. The Ac-8003 genome encodes 4628 predicted protein-encoding genes of which 568 (12.2%) are plasmid-borne. 3048 (65%) of these show >85% identity to the 5050 protein-encoding genes identified in Av-DJ, and of these 99 are plasmid-borne. The core biosynthetic and metabolic pathways and macromolecular architectures and machineries of these organisms appear largely conserved including genes for CO-dehydrogenase, formate dehydrogenase and a soluble NiFe-hydrogenase. The genetic bases for many of the detailed phenotypic differences reported for these organisms have also been identified, and many other potential phenotypic differences have been uncovered. Properties endowed by the plasmids are described including the presence of an entire aerobic corrin synthesis pathway in pAcX50f and the presence of genes for retro-conjugation in pAcX50c. 
All these findings are related to the potentially different environmental niches from which these organisms were isolated and to emerging theories about how microbes contribute to their communities.
Abstract:
The human gut microbiota comprises a diverse microbial consortium closely co-evolved with the human genome and diet. The importance of the gut microbiota in regulating human health and disease has, however, been largely overlooked due to the inaccessibility of the intestinal habitat, the complexity of the gut microbiota itself and the fact that many of its members resist cultivation and are in fact new to science. However, with the emergence of 16S rRNA molecular tools and "post-genomics" high-resolution technologies for examining microorganisms as they occur in nature without the need for prior laboratory culture, this limited view of the gut microbiota is rapidly changing. This review will discuss the application of molecular microbiological tools to study the human gut microbiota in a culture-independent manner. Genomics or metagenomics approaches have a tremendous capability to generate compositional data and to measure the metabolic potential encoded by the combined genomes of the gut microbiota. Another post-genomics approach, metabonomics, has the capacity to measure the metabolic kinetics, or flux of metabolites, through an ecosystem at a particular point in time or over a time course. Metabonomics thus derives data on the function of the gut microbiota in situ and how it responds to different environmental stimuli, e.g. substrates like prebiotics, antibiotics and other drugs, and in response to disease. Recently these two culture-independent, high-resolution approaches have been combined into a single "transgenomic" approach which allows correlation of changes in metabolite profiles within human biofluids with microbiota compositional metagenomic data. Such approaches are providing novel insight into the composition, function and evolution of our gut microbiota.
Abstract:
BACKGROUND: Serial Analysis of Gene Expression (SAGE) is a powerful tool for genome-wide transcription studies. Unlike microarrays, it has the ability to detect novel forms of RNA such as alternatively spliced and antisense transcripts, without the need for prior knowledge of their existence. One limitation of using SAGE on an organism with a complex genome and lacking detailed sequence information, such as the hexaploid bread wheat Triticum aestivum, is accurate annotation of the tags generated. Without accurate annotation it is impossible to fully understand the dynamic processes involved in such complex polyploid organisms. Hence we have developed and utilised novel procedures to characterise, in detail, SAGE tags generated from the whole grain transcriptome of hexaploid wheat. RESULTS: Examination of 71,930 Long SAGE tags generated from six libraries derived from two wheat genotypes grown under two different conditions suggested that SAGE is a reliable and reproducible technique for use in studying the hexaploid wheat transcriptome. However, our results also showed that in poorly annotated and/or poorly sequenced genomes, such as hexaploid wheat, considerably more information can be extracted from SAGE data by carrying out a systematic analysis of both perfect and "fuzzy" (partially matched) tags. This detailed analysis of the SAGE data shows first that while there is evidence of alternative polyadenylation this appears to occur exclusively within the 3' untranslated regions. Secondly, we found no strong evidence for widespread alternative splicing in the developing wheat grain transcriptome. However, analysis of our SAGE data shows that antisense transcripts are probably widespread within the transcriptome and appear to be derived from numerous locations within the genome. 
Examination of antisense transcripts showing sequence similarity to the Puroindoline a and Puroindoline b genes suggests that such antisense transcripts might have a role in the regulation of gene expression. CONCLUSION: Our results indicate that the detailed analysis of transcriptome data, such as SAGE tags, is essential to understand fully the factors that regulate gene expression and that such analysis of the wheat grain transcriptome reveals that antisense transcripts may be widespread and hence probably play a significant role in the regulation of gene expression during grain development.
Abstract:
Land plants have had the reputation of being problematic for DNA barcoding for two general reasons: (i) the standard DNA regions used in algae, animals and fungi have exceedingly low levels of variability and (ii) the typically used land plant plastid phylogenetic markers (e.g. rbcL, trnL-F, etc.) appear to have too little variation. However, no one has assessed how well current phylogenetic resources might work in the context of identification (versus phylogeny reconstruction). In this paper, we make such an assessment, particularly with two of the markers commonly sequenced in land plant phylogenetic studies, plastid rbcL and internal transcribed spacers of the large subunits of nuclear ribosomal DNA (ITS), and find that both of these DNA regions perform well even though the data currently available in GenBank/EBI were not produced to be used as barcodes and BLAST searches are not an ideal tool for this purpose. These results bode well for the use of even more variable regions of plastid DNA (such as, for example, psbA-trnH) as barcodes, once they have been widely sequenced. In the short term, efforts to bring land plant barcoding up to the standards being used now in other organisms should make swift progress. There are two categories of DNA barcode users, scientists in fields other than taxonomy and taxonomists. For the former, the use of mitochondrial and plastid DNA, the two most easily assessed genomes, is at least in the short term a useful tool that permits them to get on with their studies, which depend on knowing roughly which species or species groups they are dealing with, but these same DNA regions have important drawbacks for use in taxonomic studies (i.e. studies designed to elucidate species limits). For these purposes, DNA markers from uniparentally (usually maternally) inherited genomes can only provide half of the story required to improve taxonomic standards being used in DNA barcoding. 
In the long term, we will need to develop more sophisticated barcoding tools, which would be multiple, low-copy nuclear markers with sufficient genetic variability and PCR-reliability; these would permit the detection of hybrids and permit researchers to identify the 'genetic gaps' that are useful in assessing species limits.
Abstract:
The cupin superfamily is a group of functionally diverse proteins that are found in all three kingdoms of life, Archaea, Eubacteria, and Eukaryota. These proteins have a characteristic signature domain comprising two histidine-containing motifs separated by an intermotif region of variable length. This domain consists of six beta strands within a conserved beta barrel structure. Most cupins, such as microbial phosphomannose isomerases (PMIs), AraC-type transcriptional regulators, and cereal oxalate oxidases (OXOs), contain only a single domain, whereas others, such as seed storage proteins and oxalate decarboxylases (OXDCs), are bi-cupins with two pairs of motifs. Although some cupins have known functions and have been characterized at the biochemical level, the majority are known only from gene cloning or sequencing projects. In this study, phylogenetic analyses were conducted on the conserved domain to investigate the evolution and structure/function relationships of cupins, with an emphasis on single-domain plant germin-like proteins (GLPs). An unrooted phylogeny of cupins from a wide spectrum of evolutionary lineages identified three main clusters, microbial PMIs, OXDCs, and plant GLPs. The sister group to the plant GLPs in the global analysis was then used to root a phylogeny of all available plant GLPs. The resulting phylogeny contained three main clades, classifying the GLPs into distinct subfamilies. It is suggested that these subfamilies correlate with functional categories, one of which contains the bifunctional barley germin that has both OXO and superoxide dismutase (SOD) activity. It is proposed that GLPs function primarily as SODs, enzymes that protect plants from the effects of oxidative stress. Closer inspection of the DNA sequence encoding the intermotif region in plant GLPs showed global conservation of thymine in the second codon position, a character associated with hydrophobic residues. 
Since many of these proteins are multimeric and enzymatically inactive in their monomeric state, this conservation of hydrophobicity is thought to be associated with the need to maintain the various monomer-monomer interactions. The type of structure-based predictive analysis presented in this paper is an important approach for understanding gene function and evolution in an era when genomes from a wide range of organisms are being sequenced at a rapid rate.
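The link between a thymine in the second codon position and hydrophobic residues can be checked against the standard genetic code. The Python sketch below is illustrative only and is not part of the study; the names `CODON_TABLE` and `residues_with_second_base` are hypothetical. It lists the residues encoded by every codon of the form NTN.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons in the order TTT, TTC, TTA, TTG, TCT, ...
# (first base varies slowest, third base fastest; '*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Map each of the 64 codons to its one-letter amino acid code.
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def residues_with_second_base(base):
    """Return the sorted residues encoded by codons whose second base is `base`."""
    return sorted({aa for codon, aa in CODON_TABLE.items() if codon[1] == base})


print(residues_with_second_base("T"))  # ['F', 'I', 'L', 'M', 'V']
```

All sixteen NTN codons encode Phe, Leu, Ile, Met or Val, which are hydrophobic residues, so a conserved second-position thymine constrains the encoded residue to this set.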
Abstract:
This review summarizes the recent discovery of the cupin superfamily (from the Latin term "cupa," a small barrel) of functionally diverse proteins that initially were limited to several higher plant proteins such as seed storage proteins, germin (an oxalate oxidase), germin-like proteins, and auxin-binding protein. Knowledge of the three-dimensional structure of two vicilins, seed proteins with a characteristic beta-barrel core, led to the identification of a small number of conserved residues and thence to the discovery of several microbial proteins which share these key amino acids. In particular, there is a highly conserved pattern of two histidine-containing motifs with a varied intermotif spacing. This cupin signature is found as a central component of many microbial proteins including certain types of phosphomannose isomerase, polyketide synthase, epimerase, and dioxygenase. In addition, the signature has been identified within the N-terminal effector domain in a subgroup of bacterial AraC transcription factors. As well as these single-domain cupins, this survey has identified other classes of two-domain bicupins including bacterial gentisate 1,2-dioxygenases and 1-hydroxy-2-naphthoate dioxygenases, fungal oxalate decarboxylases, and legume sucrose-binding proteins. Cupin evolution is discussed from the perspective of the structure-function relationships, using data from the genomes of several prokaryotes, especially Bacillus subtilis. Many of these functions involve aspects of sugar metabolism and cell wall synthesis and are concerned with responses to abiotic stress such as heat, desiccation, or starvation. Particular emphasis is also given to the oxalate-degrading enzymes from microbes, their biological significance, and their value in a range of medical and other applications.
Abstract:
Plant storage proteins comprise a major part of the human diet. Sequence analysis has revealed that these proteins probably share a common ancestor with a fungal oxalate decarboxylase and/or related bacterial genes. Additionally, all these proteins share a central core sequence with several other functionally diverse enzymes and binding proteins, many of which are associated with synthesis of the extracellular matrix during sporulation/encystment. A possible prokaryotic relative of this sequence is a bacterial protein (SASP) known to bind to DNA and thereby protect spores from extreme environmental conditions. This ability to maintain cell viability during periods of dehydration in spores and seeds may relate to absolute conservation of residues involved in structure determination.
Abstract:
The emergence in 2009 of a swine-origin H1N1 influenza virus as the first pandemic of the 21st Century is a timely reminder of the international public health impact of influenza viruses, even those associated with mild disease. The widespread distribution of highly pathogenic H5N1 influenza virus in the avian population has spawned concern that it may give rise to a human influenza pandemic. The mortality rate associated with occasional human infection by H5N1 virus approximates 60%, suggesting that an H5N1 pandemic would be devastating to global health and economy. To date, the H5N1 virus has not acquired the propensity to transmit efficiently between humans. The reasons behind this are unclear, especially given the high mutation rate associated with influenza virus replication. Here we used a panel of recombinant H5 hemagglutinin (HA) variants to demonstrate the potential for H5 HA to bind human airway epithelium, the predominant target tissue for influenza virus infection and spread. While parental H5 HA exhibited limited binding to human tracheal epithelium, introduction of selected mutations converted the binding profile to that of a current human influenza strain HA. Strikingly, these amino-acid changes required multiple simultaneous mutations in the genomes of naturally occurring H5 isolates. Moreover, H5 HAs bearing intermediate sequences failed to bind airway tissues and likely represent mutations that are an evolutionary "dead end." We conclude that, although genetic changes that adapt H5 to human airways can be demonstrated, they may not readily arise during natural virus replication. This genetic barrier limits the likelihood that current H5 viruses will originate a human pandemic.
Abstract:
Our understanding of the evolution of microbial pathogens has been advanced by the discovery of "islands" of DNA that differ from core genomes and contain determinants of virulence [1, 2]. The acquisition of genomic islands (GIs) by horizontal gene transfer (HGT) is thought to have played a major role in microbial evolution. There are, however, few practical demonstrations of the acquisition of genes that control virulence, and, significantly, all have been achieved outside the animal or plant host. Loss of a GI from the bean pathogen Pseudomonas syringae pv. phaseolicola (Pph) is driven by exposure to the stress imposed by the plant's resistance response [3]. Here, we show that the complete episomal island, which carries pathogenicity genes including the effector avrPphB, transfers between strains of Pph by transformation in planta and inserts at a specific att site in the genome of the recipient. Our results show that the evolution of bacterial pathogens by HGT may be achieved via transformation, the simplest mechanism of DNA exchange. This process is activated by exposure to plant defenses, when the pathogen is in greatest need of acquiring new genetic traits to alleviate the antimicrobial stress imposed by plant innate immunity [4].
Abstract:
Background: Pseudomonas fluorescens are common soil bacteria that can improve plant health through nutrient cycling, pathogen antagonism and induction of plant defenses. The genome sequences of strains SBW25 and Pf0-1 were determined and compared to each other and with P. fluorescens Pf-5. A functional genomic in vivo expression technology (IVET) screen provided insight into genes used by P. fluorescens in its natural environment and an improved understanding of the ecological significance of diversity within this species. Results: Comparisons of three P. fluorescens genomes (SBW25, Pf0-1, Pf-5) revealed considerable divergence: 61% of genes are shared, the majority located near the replication origin. Phylogenetic and average amino acid identity analyses showed a low overall relationship. A functional screen of SBW25 defined 125 plant-induced genes including a range of functions specific to the plant environment. Orthologues of 83 of these exist in Pf0-1 and Pf-5, with 73 shared by both strains. The P. fluorescens genomes carry numerous complex repetitive DNA sequences, some resembling Miniature Inverted-repeat Transposable Elements (MITEs). In SBW25, repeat density and distribution revealed 'repeat deserts' lacking repeats, covering approximately 40% of the genome. Conclusions: P. fluorescens genomes are highly diverse. Strain-specific regions around the replication terminus suggest genome compartmentalization. The genomic heterogeneity among the three strains is reminiscent of a species complex rather than a single species. That 42% of plant-inducible genes were not shared by all strains reinforces this conclusion and shows that ecological success requires specialized and core functions. The diversity also indicates the significant size of genetic information within the Pseudomonas pan genome.
Abstract:
We know little about the genomic events that led to the advent of a multicellular grade of organization in animals, one of the most dramatic transitions in evolution. Metazoan multicellularity is correlated with the evolution of embryogenesis, which presumably was underpinned by a gene regulatory network reliant on the differential activation of signaling pathways and transcription factors. Many transcription factor genes that play critical roles in bilaterian development largely appear to have evolved before the divergence of cnidarian and bilaterian lineages. In contrast, sponges seem to have a more limited suite of transcription factors, suggesting that the developmental regulatory gene repertoire changed markedly during early metazoan evolution. Using whole-genome information from the sponge Amphimedon queenslandica, a range of eumetazoans, and the choanoflagellate Monosiga brevicollis, we investigate the genesis and expansion of homeobox, Sox, T-box, and Fox transcription factor genes. Comparative analyses reveal that novel transcription factor domains (such as Paired, POU, and T-box) arose very early in metazoan evolution, prior to the separation of extant metazoan phyla but after the divergence of choanoflagellate and metazoan lineages. Phylogenetic analyses indicate that transcription factor classes then gradually expanded at the base of Metazoa before the bilaterian radiation, with each class following a different evolutionary trajectory. Based on the limited number of transcription factors in the Amphimedon genome, we infer that the genome of the metazoan last common ancestor included fewer gene members in each class than are present in extant eumetazoans. Transcription factor orthologues present in sponge, cnidarian, and bilaterian genomes may represent part of the core metazoan regulatory network underlying the origin of animal development and multicellularity.
Abstract:
An attenuated strain (263) of the tick-borne encephalitis virus, isolated from field ticks, was serially subcultured either 5 times in mice or at 40 °C in PS cells, producing 2 independent strains, 263-m5 and 263-TR, with identical genomes; both strains exhibited increased plaque size, neuroinvasiveness and temperature-resistance. Sequencing revealed two unique amino acid substitutions, one mapping close to the catalytic site of the viral protease. These observations imply that virus adaptation from ticks to mammals occurs by selection of pre-existing virulent variants from the quasispecies population rather than by the emergence of new random mutations. The significance of these observations is discussed. (c) 2008 Elsevier Inc. All rights reserved.
Abstract:
Motivation: We compare phylogenetic approaches for inferring functional gene links. The approaches detect independent instances of the correlated gain and loss of pairs of genes from species' genomes. We investigate the effect on results of basing evidence of correlations on two phylogenetic approaches, Dollo parsimony and maximum likelihood (ML). We further examine the effect of constraining the ML model by fixing the rate of gene gain at a low value, rather than estimating it from the data. Results: We detect correlated evolution among a test set of pairs of yeast (Saccharomyces cerevisiae) genes, with a case study of 21 eukaryotic genomes and test data derived from known yeast protein complexes. If the rate at which genes are gained is constrained to be low, ML achieves by far the best results at detecting known functional links. The model then has fewer parameters but it is more realistic by preventing genes from being gained more than once. Availability: BayesTraits by M. Pagel and A. Meade, and a script to configure and repeatedly launch it by D. Barker and M. Pagel, are available at http://www.evolution.reading.ac.uk .
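The Dollo parsimony idea mentioned above (a gene is gained once and can afterwards only be lost) can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the authors' pipeline and not BayesTraits: the six-species tree, the two presence profiles, and the function names `leaves` and `dollo_events` are all hypothetical, and counting coincident loss branches is a crude stand-in for the statistical tests compared in the paper.

```python
def leaves(tree):
    """Return the set of species names at the tips of a nested-tuple tree."""
    if isinstance(tree, str):
        return {tree}
    return set().union(*(leaves(child) for child in tree))


def dollo_events(tree, present):
    """Place one gain and the fewest losses for a gene under Dollo parsimony.

    Returns (gain_clade, loss_clades), each clade given as a frozenset of
    the species below the branch on which the event is placed.
    """
    present = set(present)
    if not present:
        return None, set()
    # The single gain sits on the branch leading to the smallest clade
    # that contains every species carrying the gene.
    node = tree
    while not isinstance(node, str):
        holders = [child for child in node if present <= leaves(child)]
        if not holders:
            break
        node = holders[0]
    gain = frozenset(leaves(node))
    losses = set()

    def walk(sub):
        tips = leaves(sub)
        if not (tips & present):
            # No carrier below this branch: one loss explains the whole clade.
            losses.add(frozenset(tips))
        elif not isinstance(sub, str):
            for child in sub:
                walk(child)

    walk(node)
    return gain, losses


# Hypothetical toy data: a six-species tree and two gene presence profiles.
TREE = ((("A", "B"), ("C", "D")), ("E", "F"))
gain_x, losses_x = dollo_events(TREE, {"A", "B", "E"})
gain_y, losses_y = dollo_events(TREE, {"A", "E"})

shared_losses = losses_x & losses_y
print(len(shared_losses))  # 2
```

In this toy case both genes are lost on the branch leading to C and D and on the branch leading to F, giving two independent coincident losses, which is the kind of signal the correlated gain/loss approaches look for.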
Abstract:
An example of the evolution of the interacting behaviours of parents and progeny is studied using iterative equations linking the frequencies of the gametes produced by the progeny to the frequencies of the gametes in the parental generation. This population genetics approach shows that a model in which both behaviours are determined by a single locus can lead to a stable equilibrium in which the two behaviours continue to segregate. A model in which the behaviours are determined by genes at two separate loci leads eventually to fixation of the alleles at both loci but this can take many generations of selection. Models of the type described in this paper will be needed to understand the evolution of complex behaviour when genomic or experimental information is available about the genetic determinants of behaviour and the selective values of different genomes. (c) 2007 Elsevier Inc. All rights reserved.
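The general shape of such iterative equations can be illustrated with a textbook one-locus, two-allele recursion. The Python sketch below is not the parent-progeny model of the paper: it assumes simple viability selection with heterozygote advantage, and the fitness values and the function names `next_freq` and `iterate` are hypothetical. It shows only how iterating allele (gamete) frequencies from one generation to the next can settle at a stable equilibrium in which both alleles continue to segregate.

```python
def next_freq(p, w_AA, w_Aa, w_aa):
    """Frequency of allele A among gametes of the next generation."""
    q = 1.0 - p
    # Mean fitness of the population under random mating.
    mean_w = p * p * w_AA + 2 * p * q * w_Aa + q * q * w_aa
    return p * (p * w_AA + q * w_Aa) / mean_w


def iterate(p0, fitnesses, generations):
    """Apply the recursion repeatedly, starting from frequency p0."""
    p = p0
    for _ in range(generations):
        p = next_freq(p, *fitnesses)
    return p


# Assumed fitnesses (AA, Aa, aa) with the heterozygote fittest.
FITNESSES = (0.8, 1.0, 0.6)

# Predicted interior equilibrium: (w_Aa - w_aa) / (2*w_Aa - w_AA - w_aa).
p_star = (1.0 - 0.6) / (2 * 1.0 - 0.8 - 0.6)

for start in (0.05, 0.95):
    print(start, round(iterate(start, FITNESSES, 2000), 4), round(p_star, 4))
```

Starting from either a low or a high initial frequency, the recursion approaches the same interior equilibrium, so neither allele is fixed.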
Direct repeats in the flavivirus 3' untranslated region; a strategy for survival in the environment?
Abstract:
Previously, direct repeats (DRs) of 20-70 nucleotides were identified in the 3' untranslated regions (3'UTR) of flavivirus sequences. To address their functional significance, we have manually generated a pan-flavivirus 3'UTR alignment and correlated it with the corresponding predicted RNA secondary structures. This approach revealed that intra-group-conserved DRs evolved from six long repeated sequences (LRSs) which, as approximately 200-nucleotide domains, were preserved only in the genomes of the slowly evolving tick-borne flaviviruses. We propose that short DRs represent the evolutionary remnants of LRSs rather than distinct molecular duplications. The relevance of DRs to virus replication enhancer function, and thus survival, is discussed.