952 resultados para Prokaryotic Genomes
Resumo:
Sorghum is a food and feed cereal crop adapted to heat and drought and a staple for 500 million of the world’s poorest people. Its small diploid genome and phenotypic diversity make it an ideal C4 grass model as a complement to C3 rice. Here we present high coverage (16–45 × ) resequenced genomes of 44 sorghum lines representing the primary gene pool and spanning dimensions of geographic origin, end-use and taxonomic group. We also report the first resequenced genome of S. propinquum, identifying 8 M high-quality SNPs, 1.9 M indels and specific gene loss and gain events in S. bicolor. We observe strong racial structure and a complex domestication history involving at least two distinct domestication events. These assembled genomes enable the leveraging of existing cereal functional genomics data against the novel diversity available in sorghum, providing an unmatched resource for the genetic improvement of sorghum and other grass species.
Resumo:
The mitochondrial (mt) genome is, to date, the most extensively studied genomic system in insects, outnumbering nuclear genomes tenfold and representing all orders versus very few. Phylogenomic analysis methods have been tested extensively, identifying compositional bias and rate variation, both within and between lineages, as the principal issues confronting accurate analyses. Major studies at both inter- and intraordinal levels have contributed to our understanding of phylogenetic relationships within many groups. Genome rearrangements are an additional data type for defining relationships, with rearrangement synapomorphies identified across multiple orders and at many different taxonomic levels. Hymenoptera and Psocodea have greatly elevated rates of rearrangement offering both opportunities and pitfalls for identifying rearrangement synapomorphies in each group. Finally, insects are model systems for studying aberrant mt genomes, including truncated tRNAs and multichromosomal genomes. Greater integration of nuclear and mt genomic studies is necessary to further our understanding of insect genomic evolution.
Resumo:
Transposable elements, which are DNA sequences that can move between different sites in genomes, comprise approximately 40% of the genome of mammals and are emerging as important contributors to biological diversity. Here we report a transcription unit lying within intron 1 of the murine Magi1 (membrane associated guanylate kinase inverted 1) gene that codes for a cell-cell junction scaffolding protein. The transcription unit, termed Magi1OS (Magi1 Opposite Strand), originates from a region with tandem B1 short interspersed nuclear elements (SINEs) and is an antisense gene to Magi1. Mag1OS transcription initiates in a proximal B1 element that shows only 4% divergence from the consensus sequence, indicating that it has been recently inserted into the mouse genome and could be replication competent. Moreover, a chimaeric transcript may result from intra-chromosomal interaction and trans-splicing of the Magi1 antisense transcript (Magi1OS) and Ghrl, which codes for the multifunctional peptide hormone ghrelin. These two genes are 20 megabases apart on chromosome 6 and are transcribed in opposite directions. We propose that the Magi1OS locus may serve as a useful model system to study exaptation and retrotransposition of B1 SINEs, as well as to examine the mechanisms of intra-chromosomal trans-splicing.
Resumo:
An in vivo screen has been devised for NF-κB p50 activity in Escherichia coli exploiting the ability of the mammalian transcription factor to emulate a prokaryotic repressor. Active intracellular p50 was shown to repress the expression of a green fluorescent protein reporter gene allowing for visual screening of colonies expressing active p50 on agar plates. A library of mutants was constructed in which the residues Y267, L269, A308 and V310 of the dimer interface were simultaneously randomised and twenty-five novel functional interfaces were selected which repressed the reporter gene to similar levels as the wild-type protein. The leucine-269 alanine-308 core was repeatedly, but not exclusively, selected from the library whilst a diversity of predominantly non-polar residues were selected at positions 267 and 310. These results indicate that L269 and A308 may form a hot spot of interaction and allow an insight into the processes of dimer selectivity and evolution within this family of transcription factors.
Resumo:
Gene expression profiling using microarrays and xenograft transplants of human cancer cell lines are both popular tools to investigate human cancer. However, the undefined degree of cross hybridization between the mouse and human genomes hinders the use of microarrays to characterize gene expression of both the host and the cancer cell within the xenograft. Since an increasingly recognized aspect of cancer is the host response (or cancer-stroma interaction), we describe here a bioinformatic manipulation of the Affymetrix profiling that allows interrogation of the gene expression of both the mouse host and the human tumour. Evidence of microenvironmental regulation of epithelial mesenchymal transition of the tumour component in vivo is resolved against a background of mesenchymal gene expression. This tool could allow deeper insight to the mechanism of action of anti-cancer drugs, as typically novel drug efficacy is being tested in xenograft systems.
Resumo:
The expression of transgenes in plant genomes can be inhibited by either transcriptional gene silencing or posttranscriptional gene silencing (PTGS). Overexpression of the chalcone synthase-A (CHS-A) transgene triggers PTGS of CHS-A and thus results in loss of flower pigmentation in petunia. We previously demonstrated that epigenetic inactivation of CHS-A transgene transcription leads to a reversion of the PTGS phenotype. Although neomycin phosphotransferase II (nptII), a marker gene co-introduced into the genome with the CHS-A transgene, is not normally silenced in petunia, even when CHS-A is silenced, here we found that nptII was silenced in a petunia line in which CHS-A PTGS was induced, but not in the revertant plants that had no PTGS of CHS-A. Transcriptional activity, accumulation of short interfering RNAs, and restoration of mRNA level after infection with viruses that had suppressor proteins of gene silencing indicated that the mechanism for nptII silencing was posttranscriptional. Read-through transcripts of the CHS-A gene toward the nptII gene were detected. Deep-sequencing analysis revealed a striking difference between the predominant size class of small RNAs produced from the read-through transcripts (22 nt) and that from the CHS-A RNAs (21 nt). These results implicate the involvement of read-through transcription and distinct phases of RNA degradation in the coincident PTGS of linked transgenes and provide new insights into the destabilization of transgene expression.
Resumo:
In a recent paper, Wang and colleagues described the genomes of two turtles, the Chinese soft-shell turtle (Pelodiscus sinensis) and the green sea turtle (Chelonia mydas)1. A salient finding was an apparent absence of GHRL, the gene encoding the only known circulating orexigen, the peptide hormone ghrelin. The highly conserved GHRL encodes at least two bioactive peptide hormones, ghrelin2 and obestatin3, which are recognized to have a diverse range of functions in a number of cell types and physiological systems4, 5. Wang and colleagues hypothesized that the absence of ghrelin was associated with the low metabolic rate observed in these turtle species1.
Resumo:
In this paper, the complete mitochondrial genome of Acraea issoria (Lepidoptera: Nymphalidae: Heliconiinae: Acraeini) is reported; a circular molecule of 15,245 bp in size. For A. issoria, genes are arranged in the same order and orientation as the complete sequenced mitochondrial genomes of the other lepidopteran species, except for the presence of an extra copy of tRNAIle(AUR)b in the control region. All protein-coding genes of A. issoria mitogenome start with a typical ATN codon and terminate in the common stop codon TAA, except that COI gene uses TTG as its initial codon and terminates in a single T residue. All tRNA genes possess the typical clover leaf secondary structure except for tRNASer(AGN), which has a simple loop with the absence of the DHU stem. The sequence, organization and other features including nucleotide composition and codon usage of this mitochondrial genome were also reported and compared with those of other sequenced lepidopterans mitochondrial genomes. There are some short microsatellite-like repeat regions (e.g., (TA)9, polyA and polyT) scattered in the control region, however, the conspicuous macro-repeats units commonly found in other insect species are absent.
Resumo:
Marsupials exhibit great diversity in ecology and morphology. However, compared to their sister group, the placental mammals, our understanding of many aspects of marsupial evolution remains limited. We use 101 mitochondrial genomes and data from 26 nuclear loci to reconstruct a dated phylogeny including 97% of extant genera and 58% of modern marsupial species. This tree allows us to analyze the evolution of habitat preference and geographic distributions of marsupial species through time. We found a pattern of mesic-adapted lineages evolving to use more arid and open habitats, which is broadly consistent with regional climate and environmental change. However, contrary to the general trend, several lineages subsequently appear to have reverted from drier to more mesic habitats. Biogeographic reconstructions suggest that current views on the connectivity between Australia and New Guinea/Wallacea during the Miocene and Pliocene need to be revised. The antiquity of several endemic New Guinean clades strongly suggests a substantially older period of connection stretching back to the Middle Miocene, and implies that New Guinea was colonized by multiple clades almost immediately after its principal formation.
Resumo:
We present entire sequences of two hymenopteran mitochondrial genomes and the major portion of three others. We combined these data with nine previously sequenced hymenopteran mitochondrial genomes. This allowed us to infer and analyze the evolution of the 67 mitochondrial gene rearrangements so far found in this order. All of these involve tRNA genes, whereas four also involve larger (protein-coding or ribosomal RNA) genes. We find that the vast majority of mitochondrial gene rearrangements are independently derived. A maximum of four of these rearrangements represent shared, derived organizations, whereas three are convergently derived. The remaining mitochondrial gene rearrangements represent new mitochondrial genome organizations. These data are consistent with the proposal that there are an enormous number of alternative mitochondrial genome organizations possible and that mitochondrial genome organization is, for the most part, selectively neutral. Nevertheless, some mitochondrial genes appear less mobile than others. Genes close to the noncoding region are generally more mobile but only marginally so. Some mitochondrial genes rearrange in a pattern consistent with the duplication/random loss model, but more mitochondrial genes move in a pattern inconsistent with this model. An increased rate of mitochondrial gene rearrangement is not tightly associated with the evolution of parasitism. Although parasitic lineages tend to have more mitochondrial gene rearrangements than nonparasitic lineages, there are exceptions (e.g., Orussus and Schlettererius). It is likely that only a small proportion of the total number of mitochondrial gene rearrangements that have occurred during the evolution of the Hymenoptera have been sampled in the present study.
Resumo:
Escherichia coli ST131 is now recognised as a leading contributor to urinary tract and bloodstream infections in both community and clinical settings. Here we present the complete, annotated genome of E. coli EC958, which was isolated from the urine of a patient presenting with a urinary tract infection in the Northwest region of England and represents the most well characterised ST131 strain. Sequencing was carried out using the Pacific Biosciences platform, which provided sufficient depth and read-length to produce a complete genome without the need for other technologies. The discovery of spurious contigs within the assembly that correspond to site-specific inversions in the tail fibre regions of prophages demonstrates the potential for this technology to reveal dynamic evolutionary mechanisms. E. coli EC958 belongs to the major subgroup of ST131 strains that produce the CTX-M-15 extended spectrum β-lactamase, are fluoroquinolone resistant and encode the fimH30 type 1 fimbrial adhesin. This subgroup includes the Indian strain NA114 and the North American strain JJ1886. A comparison of the genomes of EC958, JJ1886 and NA114 revealed that differences in the arrangement of genomic islands, prophages and other repetitive elements in the NA114 genome are not biologically relevant and are due to misassembly. The availability of a high quality uropathogenic E. coli ST131 genome provides a reference for understanding this multidrug resistant pathogen and will facilitate novel functional, comparative and clinical studies of the E. coli ST131 clonal lineage.
Resumo:
Chaperone-usher (CU) fimbriae are adhesive surface organelles common to many Gram-negative bacteria. Escherichia coli genomes contain a large variety of characterised and putative CU fimbrial operons, however, the classification and annotation of individual loci remains problematic. Here we describe a classification model based on usher phylogeny and genomic locus position to categorise the CU fimbrial types of E. coli. Using the BLASTp algorithm, an iterative usher protein search was performed to identify CU fimbrial operons from 35 E. coli (and one Escherichia fergusonnii) genomes representing different pathogenic and phylogenic lineages, as well as 132 Escherichia spp. plasmids. A total of 458 CU fimbrial operons were identified, which represent 38 distinct fimbrial types based on genomic locus position and usher phylogeny. The majority of fimbrial operon types occupied a specific locus position on the E. coli chromosome; exceptions were associated with mobile genetic elements. A group of core-associated E. coli CU fimbriae were defined and include the Type 1, Yad, Yeh, Yfc, Mat, F9 and Ybg fimbriae. These genes were present as intact or disrupted operons at the same genetic locus in almost all genomes examined. Evaluation of the distribution and prevalence of CU fimbrial types among different pathogenic and phylogenic groups provides an overview of group specific fimbrial profiles and insight into the ancestry and evolution of CU fimbriae in E. coli.
Resumo:
Trimeric autotransporter proteins (TAAs) are important virulence factors of many Gram-negative bacterial pathogens. A common feature of most TAAs is the ability to mediate adherence to eukaryotic cells or extracellular matrix (ECM) proteins via a cell surface-exposed passenger domain. Here we describe the characterization of EhaG, a TAA identified from enterohemorrhagic Escherichia coli (EHEC) O157:H7. EhaG is a positional orthologue of the recently characterized UpaG TAA from uropathogenic E. coli (UPEC). Similarly to UpaG, EhaG localized at the bacterial cell surface and promoted cell aggregation, biofilm formation, and adherence to a range of ECM proteins. However, the two orthologues display differential cellular binding: EhaG mediates specific adhesion to colorectal epithelial cells while UpaG promotes specific binding to bladder epithelial cells. The EhaG and UpaG TAAs contain extensive sequence divergence in their respective passenger domains that could account for these differences. Indeed, sequence analyses of UpaG and EhaG homologues from several E. coli genomes revealed grouping of the proteins in clades almost exclusively represented by distinct E. coli pathotypes. The expression of EhaG (in EHEC) and UpaG (in UPEC) was also investigated and shown to be significantly enhanced in an hns isogenic mutant, suggesting that H-NS acts as a negative regulator of both TAAs. Thus, while the EhaG and UpaG TAAs contain some conserved binding and regulatory features, they also possess important differences that correlate with the distinct pathogenic lifestyles of EHEC and UPEC.
Resumo:
A new strategy for rapidly selecting and testing genetic vaccines has been developed, in which a whole genome library is cloned into a bacteriophage λ ZAP Express vector which contains both prokaryotic (Plac) and eukaryotic (PCMV) promoters upstream of the insertion site. The phage library is plated on Escherichia coli cells, immunoblotted, and probed with hyperimmune and/or convalescent-phase antiserum to rapidly identify vaccine candidates. These are then plaque purified and grown as liquid lysates, and whole bacteriophage particles are then used directly to immunize the host, following which PCMV-driven expression of the candidate vaccine gene occurs. In the example given here, a semirandom genome library of the bovine pathogen Mycoplasma mycoides subsp. mycoides small colony (SC) biotype was cloned into λ ZAP Express, and two strongly immunodominant clones, λ-A8 and λ-B1, were identified and subsequently tested for vaccine potential against M. mycoides subsp. mycoides SC biotype-induced mycoplasmemia. Sequencing and immunoblotting indicated that clone λ-A8 expressed an isopropyl-β-d-thiogalactopyranoside (IPTG)-inducible M. mycoides subsp. mycoides SC biotype protein with a 28-kDa apparent molecular mass, identified as a previously uncharacterized putative lipoprotein (MSC_0397). Clone λ-B1 contained several full-length genes from the M. mycoides subsp. mycoides SC biotype pyruvate dehydrogenase region, and two IPTG-independent polypeptides, of 29 kDa and 57 kDa, were identified on immunoblots. Following vaccination, significant anti-M. mycoides subsp. mycoides SC biotype responses were observed in mice vaccinated with clones λ-A8 and λ-B1. A significant stimulation index was observed following incubation of splenocytes from mice vaccinated with clone λ-A8 with whole live M. mycoides subsp. mycoides SC biotype cells, indicating cellular proliferation. After challenge, mice vaccinated with clone λ-A8 also exhibited a reduced level of mycoplasmemia compared to controls, suggesting that the MSC_0397 lipoprotein has a protective effect in the mouse model when delivered as a bacteriophage DNA vaccine. Bacteriophage-mediated immunoscreening using an appropriate vector system offers a rapid and simple technique for the identification and immediate testing of putative candidate vaccines from a variety of pathogens.
Resumo:
Determination of sequence similarity is a central issue in computational biology, a problem addressed primarily through BLAST, an alignment based heuristic which has underpinned much of the analysis and annotation of the genomic era. Despite their success, alignment-based approaches scale poorly with increasing data set size, and are not robust under structural sequence rearrangements. Successive waves of innovation in sequencing technologies – so-called Next Generation Sequencing (NGS) approaches – have led to an explosion in data availability, challenging existing methods and motivating novel approaches to sequence representation and similarity scoring, including adaptation of existing methods from other domains such as information retrieval. In this work, we investigate locality-sensitive hashing of sequences through binary document signatures, applying the method to a bacterial protein classification task. Here, the goal is to predict the gene family to which a given query protein belongs. Experiments carried out on a pair of small but biologically realistic datasets (the full protein repertoires of families of Chlamydia and Staphylococcus aureus genomes respectively) show that a measure of similarity obtained by locality sensitive hashing gives highly accurate results while offering a number of avenues which will lead to substantial performance improvements over BLAST..