953 resultados para Myoviridae genomes
Resumo:
UNLABELLED We previously showed that close relatives of human coronavirus 229E (HCoV-229E) exist in African bats. The small sample and limited genomic characterizations have prevented further analyses so far. Here, we tested 2,087 fecal specimens from 11 bat species sampled in Ghana for HCoV-229E-related viruses by reverse transcription-PCR (RT-PCR). Only hipposiderid bats tested positive. To compare the genetic diversity of bat viruses and HCoV-229E, we tested historical isolates and diagnostic specimens sampled globally over 10 years. Bat viruses were 5- and 6-fold more diversified than HCoV-229E in the RNA-dependent RNA polymerase (RdRp) and spike genes. In phylogenetic analyses, HCoV-229E strains were monophyletic and not intermixed with animal viruses. Bat viruses formed three large clades in close and more distant sister relationships. A recently described 229E-related alpaca virus occupied an intermediate phylogenetic position between bat and human viruses. According to taxonomic criteria, human, alpaca, and bat viruses form a single CoV species showing evidence for multiple recombination events. HCoV-229E and the alpaca virus showed a major deletion in the spike S1 region compared to all bat viruses. Analyses of four full genomes from 229E-related bat CoVs revealed an eighth open reading frame (ORF8) located at the genomic 3' end. ORF8 also existed in the 229E-related alpaca virus. Reanalysis of HCoV-229E sequences showed a conserved transcription regulatory sequence preceding remnants of this ORF, suggesting its loss after acquisition of a 229E-related CoV by humans. These data suggested an evolutionary origin of 229E-related CoVs in hipposiderid bats, hypothetically with camelids as intermediate hosts preceding the establishment of HCoV-229E. IMPORTANCE The ancestral origins of major human coronaviruses (HCoVs) likely involve bat hosts. Here, we provide conclusive genetic evidence for an evolutionary origin of the common cold virus HCoV-229E in hipposiderid bats by analyzing a large sample of African bats and characterizing several bat viruses on a full-genome level. Our evolutionary analyses show that animal and human viruses are genetically closely related, can exchange genetic material, and form a single viral species. We show that the putative host switches leading to the formation of HCoV-229E were accompanied by major genomic changes, including deletions in the viral spike glycoprotein gene and loss of an open reading frame. We reanalyze a previously described genetically related alpaca virus and discuss the role of camelids as potential intermediate hosts between bat and human viruses. The evolutionary history of HCoV-229E likely shares important characteristics with that of the recently emerged highly pathogenic Middle East respiratory syndrome (MERS) coronavirus.
Resumo:
Mycoplasma mycoides subsp. capri (Mmc) and subsp. mycoides (Mmm) are important ruminant pathogens worldwide causing diseases such as pleuropneumonia, mastitis and septicaemia. They express galactofuranose residues on their surface, but their role in pathogenesis has not yet been determined. The M. mycoides genomes contain up to several copies of the glf gene, which encodes an enzyme catalysing the last step in the synthesis of galactofuranose. We generated a deletion of the glf gene in a strain of Mmc using genome transplantation and tandem repeat endonuclease coupled cleavage (TREC) with yeast as an intermediary host for the genome editing. As expected, the resulting YCp1.1-Δglf strain did not produce the galactofuranose-containing glycans as shown by immunoblots and immuno-electronmicroscopy employing a galactofuranose specific monoclonal antibody. The mutant lacking galactofuranose exhibited a decreased growth rate and a significantly enhanced adhesion to small ruminant cells. The mutant was also 'leaking' as revealed by a β-galactosidase-based assay employing a membrane impermeable substrate. These findings indicate that galactofuranose-containing polysaccharides conceal adhesins and are important for membrane integrity. Unexpectedly, the mutant strain showed increased serum resistance.
Resumo:
Puumala virus (PUUV) is one of the predominant hantavirus species in Europe causing mild to moderate cases of haemorrhagic fever with renal syndrome. Parts of Lower Saxony in north-western Germany are endemic for PUUV infections. In this study, the complete PUUV genome sequence of a bank vole-derived tissue sample from the 2007 outbreak was determined by a combined primer-walking and RNA ligation strategy. The S, M and L genome segments were 1,828, 3,680 and 6,550 nucleotides in length, respectively. Sliding-window analyses of the nucleotide sequences of all available complete PUUV genomes indicated a non-homogenous distribution of variability with hypervariable regions located at the 3′-ends of the S and M segments. The overall similarity of the coding genome regions to the other PUUV strains ranged between 80.1 and 84.7 % at the level of the nucleotide sequence and between 89.5 and 98.1 % for the deduced amino acid sequences. In comparison to the phylogenetic trees of the complete coding sequences, trees based on partial segments revealed a general drop in phylogenetic support and a lower resolution. The Astrup strain S and M segment sequences showed the highest similarity to sequences of strains from geographically close sites in the Osnabrück Hills region. In conclusion, a primer-walking-mediated strategy resulted in the determination of the first complete nucleotide sequence of a PUUV strain from Central Europe. Different levels of variability along the genome provide the opportunity to choose regions for analyses according to the particular research question, e.g., large-scale phylogenetics or within-host evolution.
Resumo:
INTRODUCTION blaOXA-48, blaNDM-1 and blaCTX-M-3 are clinically relevant resistance genes, frequently associated with the broad-host range plasmids of the IncL/M group. The L and M plasmids belong to two compatible groups, which were incorrectly classified together by molecular methods. In order to understand their evolution, we fully sequenced four IncL/M plasmids, including the reference plasmids R471 and R69, the recently described blaOXA-48-carrying plasmid pKPN-El.Nr7 from a Klebsiella pneumoniae isolated in Bern (Switzerland), and the blaSHV-5 carrying plasmid p202c from a Salmonella enterica from Tirana (Albania). METHODS Sequencing was performed using 454 Junior Genome Sequencer (Roche). Annotation was performed using Sequin and Artemis software. Plasmid sequences were compared with 13 fully sequenced plasmids belonging to the IncL/M group available in GenBank. RESULTS Comparative analysis of plasmid genomes revealed two distinct genetic lineages, each containing one of the R471 (IncL) and R69 (IncM) reference plasmids. Conjugation experiments demonstrated that plasmids representative of the IncL and IncM groups were compatible with each other. The IncL group is constituted by the blaOXA-48-carrying plasmids and R471. The IncM group contains two sub-types of plasmids named IncM1 and IncM2 that are each incompatible. CONCLUSION This work re-defines the structure of the IncL and IncM families and ascribes a definitive designation to the fully sequenced IncL/M plasmids available in GenBank.
Resumo:
Alveolar echinococcosis, caused by the tapeworm Echinococcus multilocularis, is one of the most severe parasitic diseases in humans and represents one of the 17 neglected diseases prioritised by the World Health Organisation (WHO) in 2012. Considering the major medical and veterinary importance of this parasite, the phylogeny of the genus Echinococcus is of considerable importance; yet, despite numerous efforts with both mitochondrial and nuclear data, it has remained unresolved. The genus is clearly complex, and this is one of the reasons for the incomplete understanding of its taxonomy. Although taxonomic studies have recognised E. multilocularis as a separate entity from the Echinococcus granulosus complex and other members of the genus, it would be premature to draw firm conclusions about the taxonomy of the genus before the phylogeny of the whole genus is fully resolved. The recent sequencing of E. multilocularis and E. granulosus genomes opens new possibilities for performing in-depth phylogenetic analyses. In addition, whole genome data provide the possibility of inferring phylogenies based on a large number of functional genes, i.e. genes that trace the evolutionary history of adaptation in E. multilocularis and other members of the genus. Moreover, genomic data open new avenues for studying the molecular epidemiology of E. multilocularis: genotyping studies with larger panels of genetic markers allow the genetic diversity and spatial dynamics of parasites to be evaluated with greater precision. There is an urgent need for international coordination of genotyping of E. multilocularis isolates from animals and human patients. This could be fundamental for a better understanding of the transmission of alveolar echinococcosis and for designing efficient healthcare strategies.
Resumo:
Completion of fungal, plant and human genomes paved the way to the identification of erythrocytic rhesus proteins and their kidney homologs as ammonium transporters. Ammonium is the preferred nitrogen source of bacteria and fungi, and plants acquire nitrogen from the soil in the form of ammonium [1]. In animals and humans, assimilated forms of nitrogen - amino acids - are much preferred for nutrition, and, in the case of ammonotelic animals, ammonium is used for the excretion of nitrogen instead. In the human kidney, ammonium is produced, reabsorbed and excreted as a means to maintain pH balance and to get rid of surplus inorganic nitrogen. Whether ammonium transport also has a role in the pH regulation of other organs is not known and the molecular mechanisms were not, up to now, understood.
Resumo:
Theoretical and empirical studies were conducted on the pattern of nucleotide and amino acid substitution in evolution, taking into account the effects of mutation at the nucleotide level and purifying selection at the amino acid level. A theoretical model for predicting the evolutionary change in electrophoretic mobility of a protein was also developed by using information on the pattern of amino acid substitution. The specific problems studied and the main results obtained are as follows: (1) Estimation of the pattern of nucleotide substitution in DNA nuclear genomes. The pattern of point mutations and nucleotide substitutions among the four different nucleotides are inferred from the evolutionary changes of pseudogenes and functional genes, respectively. Both patterns are non-random, the rate of change varying considerably with nucleotide pair, and that in both cases transitions occur somewhat more frequently than transversions. In protein evolution, substitution occurs more often between amino acids with similar physico-chemical properties than between dissimilar amino acids. (2) Estimation of the pattern of nucleotide substitution in RNA genomes. The majority of mutations in retroviruses accumulate at the reverse transcription stage. Selection at the amino acid level is very weak, and almost non-existent between synonymous codons. The pattern of mutation is very different from that in DNA genomes. Nevertheless, the pattern of purifying selection at the amino acid level is similar to that in DNA genomes, although selection intensity is much weaker. (3) Evaluation of the determinants of molecular evolutionary rates in protein-coding genes. Based on rates of nucleotide substitution for mammalian genes, the rate of amino acid substitution of a protein is determined by its amino acid composition. The content of glycine is shown to correlate strongly and negatively with the rate of substitution. Empirical formulae, called indices of mutability, are developed in order to predict the rate of molecular evolution of a protein from data on its amino acid sequence. (4) Studies on the evolutionary patterns of electrophoretic mobility of proteins. A theoretical model was constructed that predicts the electric charge of a protein at any given pH and its isoelectric point from data on its primary and quaternary structures. Using this model, the evolutionary change in electrophoretic mobilities of different proteins and the expected amount of electrophoretically hidden genetic variation were studied. In the absence of selection for the pI value, proteins will on the average evolve toward a mildly basic pI. (Abstract shortened with permission of author.) ^
Resumo:
Retroviruses uniquely co-package two copies of their genomic RNA within each virion. The two copies are used as templates for synthesis of the proviral DNA during the process of reverse transcription. Two template switches are required to complete retroviral DNA synthesis by the retroviral enzyme, reverse transcriptase. With two RNA genomes present in the virion, reverse transcriptase can make template switches utilizing only one of the RNA templates (intramolecular) or utilizing both RNA templates (intermolecular) during the process of reverse transcription. The results presented in this study show that during a single cycle of Moloney murine leukemia virus replication, both nonrecombinant and recombinant proviruses predominantly underwent intramolecular minus- and plus-strand transfers during the process of reverse transcription. This is the first study to examine the nature of the required template switches occurring during MLV replication and these results support the previous findings for SNV, and the hypothesis that the required template switches are ordered events. This study also determined rates for deletion and a rate of recombination for a single cycle of MLV replication. The rates reported here are comparable to the rates previously reported for both SNV and MLV. ^
Resumo:
Historically morphological features were used as the primary means to classify organisms. However, the age of molecular genetics has allowed us to approach this field from the perspective of the organism's genetic code. Early work used highly conserved sequences, such as ribosomal RNA. The increasing number of complete genomes in the public data repositories provides the opportunity to look not only at a single gene, but at organisms' entire parts list. ^ Here the Sequence Comparison Index (SCI) and the Organism Comparison Index (OCI), algorithms and methods to compare proteins and proteomes, are presented. The complete proteomes of 104 sequenced organisms were compared. Over 280 million full Smith-Waterman alignments were performed on sequence pairs which had a reasonable expectation of being related. From these alignments a whole proteome phylogenetic tree was constructed. This method was also used to compare the small subunit (SSU) rRNA from each organism and a tree constructed from these results. The SSU rRNA tree by the SCI/OCI method looks very much like accepted SSU rRNA trees from sources such as the Ribosomal Database Project, thus validating the method. The SCI/OCI proteome tree showed a number of small but significant differences when compared to the SSU rRNA tree and proteome trees constructed by other methods. Horizontal gene transfer does not appear to affect the SCI/OCI trees until the transferred genes make up a large portion of the proteome. ^ As part of this work, the Database of Related Local Alignments (DaRLA) was created and contains over 81 million rows of sequence alignment information. DaRLA, while primarily used to build the whole proteome trees, can also be applied shared gene content analysis, gene order analysis, and creating individual protein trees. ^ Finally, the standard BLAST method for analyzing shared gene content was compared to the SCI method using 4 spirochetes. The SCI system performed flawlessly, finding all proteins from one organism against itself and finding all the ribosomal proteins between organisms. The BLAST system missed some proteins from its respective organism and failed to detect small ribosomal proteins between organisms. ^
Resumo:
Over the past decade the topic of genetic engineering has been has been readily debated in the media, but often these debates consist of political rhetoric and fail to offer objective information on the methods and the potential benefits to human health and their environment. In truth, humans have been manipulating the genomes of organisms for thousands of years, and it has been an evolution of scientific knowledge that has led to the more precise methods of genetic engineering. This paper discusses how scientists utilize natural processes to alter the genetic constituents of both prokaryotic and eukaryotic organisms, benefits to human health and the environment, as well as potential misuses of biotechnology such as bioterrorism.
Resumo:
The genomes of Fusobacterium nucleatum subspecies polymorphum strain ATCC 10953, Rickettsia typhi strain Wilmington, and Francisella tularensis subspecies holarctica strain OSU18 were sequenced, annotated, and analyzed. Each genome was then compared to the sequenced genomes of closely related bacteria. The genome of F. nucleatum ATCC 10953 was compared to two additional F. nucleatum subspecies, subspecies nucleatum and subspecies vincentii. This analysis revealed substantial evidence of horizontal gene transfer along with considerable genetic diversity within the species of F. nucleatum. R. typhi was compared to R. prowazekii and R. conorii. This analysis uncovered a hotspot for chromosomal rearrangements in the Spotted Fever Group but not the Typhus Group Rickettsia and revealed the close genetic relationship between the Typhus Group rickettsial species. F. tularensis OSU18 was compared to two additional F. tularensis strains. These comparisons uncovered significant chromosomal rearrangements between F. tularensis subspecies due to recombination between insertion sequence elements. ^
Resumo:
DNA interstrand crosslinks (ICLs) are among the most toxic type of damage to a cell. Many ICL-inducing agents are widely used as therapeutic agents, e.g. cisplatin, psoralen. A bettor understanding of the cellular mechanism that eliminates ICLs is important for the improvement of human health. However, ICL repair is still poorly understood in mammals. Using a triplex-directed site-specific ICL model, we studied the roles of mismatch repair (MMR) proteins in ICL repair in human cells. We are also interested in using psoralen-conjugated triplex-forming oligonucleotides (TFOs) to direct ICLs to a specific site in targeted DNA and in the mammalian genomes. ^ MSH2 protein is the common subunit of two MMR recognition complexes, and MutSα and MutSβ. We showed that MSH2 deficiency renders human cell hypersensitive to psoralen ICLs. MMR recognition complexes bind specifically to triplex-directed psoralen ICLs in vitro. Together with the fact that psoralen ICL-induced repair synthesis is dramatically decreased in MSH2 deficient cell extracts, we demonstrated that MSH2 function is critical for the recognition and processing of psoralen ICLs in human cells. Interestingly, lack of MSH2 does not reduce the level of psoralen ICL-induced mutagenesis in human cells, suggesting that MSH2 does not contribute to error-generating repair of psoralen ICLs, and therefore, may represent a novel error-free mechanism for repairing ICLs. We also studied the role of MLH1, anther key protein in MMR, in the processing of psoralen ICLs. MLH1-deficient human cells are more resistant to psoralen plus UVA treatment. Importantly, MLH1 function is not required for the mutagenic repair of psoralen ICLs, suggesting that it is not involved in the error-generating repair of this type of DNA damage in human cells. ^ These are the first data indicating mismatch repair proteins may participate in a relatively error-free mechanism for processing psoralen ICL in human cells. Enhancement of MMR protein function relative to nucleotide excision repair proteins may reduce the mutagenesis caused by DNA ICLs in humans. ^ In order to specifically target ICLs to mammalian genes, we identified novel TFO target sequences in mouse and human genomes. Using this information, many critical mammalian genes can now be targeted by TFOs.^
Resumo:
The susceptibility of most Bacillus anthracis strains to β-lactam antibiotics is intriguing considering that the B. anthracis genome harbors two β-lactamase genes, bla1 and bla2, and closely-related species, Bacillus cereus and Bacillus thuringiensis, typically produce β-lactamases. This work demonstrates that B. anthracis bla expression is affected by two genes, sigP and rsp, predicted to encode an extracytoplasmic function sigma factor and an antisigma factor, respectively. Deletion of the sigP/rsp locus abolished bla expression in a penicillin-resistant clinical isolate and had no effect on bla expression in a prototypical penicillin-susceptible strain. Complementation with sigP/rsp from the penicillin-resistant strain, but not the penicillin-susceptible strain, conferred β-lactamase activity upon both mutants. These results are attributed to a nucleotide deletion near the 5' end of rsp in the penicillin-resistant strain that is predicted to result in a nonfunctional protein. B. cereus and B. thuringiensis sigP and rsp homologues are required for inducible penicillin resistance in those species. Expression of the B. cereus or B. thuringiensis sigP and rsp genes in a B. anthracis sigP/rsp-null mutant confers resistance to β-lactam antibiotics, suggesting that while B. anthracis contains the genes necessary for sensing β-lactam antibiotics, the B. anthracis sigP/rsp gene products are insufficient for bla induction. ^ Because alternative sigma factors recognize unique promoter sequence, direct targets can be elucidated by comparing transcriptional profiling results with an in silico search using the sigma factor binding sequence. Potential σP -10 and -35 promoter elements were identified upstream from bla1 bla2 and sigP. Results obtained from searching the B. anthracis genome with the conserved sequences were evaluated against transcriptional profiling results comparing B. anthracis 32 and an isogenic sigP/rsp -null strain. Results from these analyses indicate that while the absence of the sigP gene significantly affects the transcript levels of 16 genes, only bla1, bla2 and sigP are directly regulated by σP. The genomes of B. cereus and B. thuringiensis strains were also analyzed for the potential σP binding elements. The sequence was located upstream from the sigP and bla genes, and previously unidentified genes predicted to encode a penicillin-binding protein (PBP) and a D-alanyl-D-alanine carboxypeptidase, indicating that the σ P regulon in these species responds to cell-wall stress caused by β-lactam antibiotics. ^ β-lactam antibiotics prevent attachment of new peptidoglycan to the cell wall by blocking the active site of PBPs. A B. cereus and B. thuringiensis pbp-encoding gene located near bla1 contains a potential σP recognition sequence upstream from the annotated translational start. Deletion of this gene abolished β-lactam resistance in both strains. Mutations in the active site of the PBP were detrimental to β-lactam resistance in B. cereus, but not B. thuringiensis, indicating that the transpeptidase activity is only important in B. cereus. I also found that transcript levels of the PBP-encoding gene are not significantly affected by the presence of β-lactam antibiotic. Based on these data I hypothesize that the gene product acts a sensor of β-lactam antibiotic. ^
Resumo:
Pancreatic cancer is the 4th most common cause for cancer death in the United States, accompanied by less than 5% five-year survival rate based on current treatments, particularly because it is usually detected at a late stage. Identifying a high-risk population to launch an effective preventive strategy and intervention to control this highly lethal disease is desperately needed. The genetic etiology of pancreatic cancer has not been well profiled. We hypothesized that unidentified genetic variants by previous genome-wide association study (GWAS) for pancreatic cancer, due to stringent statistical threshold or missing interaction analysis, may be unveiled using alternative approaches. To achieve this aim, we explored genetic susceptibility to pancreatic cancer in terms of marginal associations of pathway and genes, as well as their interactions with risk factors. We conducted pathway- and gene-based analysis using GWAS data from 3141 pancreatic cancer patients and 3367 controls with European ancestry. Using the gene set ridge regression in association studies (GRASS) method, we analyzed 197 pathways from the Kyoto Encyclopedia of Genes and Genomes (KEGG) database. Using the logistic kernel machine (LKM) test, we analyzed 17906 genes defined by University of California Santa Cruz (UCSC) database. Using the likelihood ratio test (LRT) in a logistic regression model, we analyzed 177 pathways and 17906 genes for interactions with risk factors in 2028 pancreatic cancer patients and 2109 controls with European ancestry. After adjusting for multiple comparisons, six pathways were marginally associated with risk of pancreatic cancer ( P < 0.00025): Fc epsilon RI signaling, maturity onset diabetes of the young, neuroactive ligand-receptor interaction, long-term depression (Ps < 0.0002), and the olfactory transduction and vascular smooth muscle contraction pathways (P = 0.0002; Nine genes were marginally associated with pancreatic cancer risk (P < 2.62 × 10−5), including five reported genes (ABO, HNF1A, CLPTM1L, SHH and MYC), as well as four novel genes (OR13C4, OR 13C3, KCNA6 and HNF4 G); three pathways significantly interacted with risk factors on modifying the risk of pancreatic cancer (P < 2.82 × 10−4): chemokine signaling pathway with obesity ( P < 1.43 × 10−4), calcium signaling pathway (P < 2.27 × 10−4) and MAPK signaling pathway with diabetes (P < 2.77 × 10−4). However, none of the 17906 genes tested for interactions survived the multiple comparisons corrections. In summary, our current GWAS study unveiled unidentified genetic susceptibility to pancreatic cancer using alternative methods. These novel findings provide new perspectives on genetic susceptibility to and molecular mechanisms of pancreatic cancer, once confirmed, will shed promising light on the prevention and treatment of this disease. ^
Resumo:
The basis for the recent transition of Enterococcus faecium from a primarily commensal organism to one of the leading causes of hospital-acquired infections in the United States is not yet understood. To address this, the first part of my project assessed isolates from early outbreaks in the USA and South America using sequence analysis, colony hybridizations, and minimal inhibitory concentrations (MICs) which showed clinical isolates possess virulence and antibiotic resistance determinants that are less abundant or lacking in community isolates. I also revealed that the level of ampicillin resistance increased over time in clinical strains. By sequencing the pbp5 gene, I demonstrated an ~5% difference in the pbp5 gene between strains with MICs <4ug/ml and those with MICs >4µg/ml, but no specific sequence changes correlated with increases in MICs within the latter group. A 3-10% nucleotide difference was also seen in three other genes analyzed, which suggested the existence of two distinct subpopulations of E. faecium. This led to the second part of my project analyzing concatenated core gene sequences, SNPs, the 16S rRNA, and phylogenetics of 21 E. faecium genomes confirming two distinct clades; a community-associated (CA) clade and hospital-associated (HA) clade. Molecular clock calculations indicate that these two clades likely diverged ~ 300,000 to > 1 million years ago, long before the modern antibiotic era. Genomic analysis also showed that, in addition to core genomic differences, HA E. faecium harbor specific accessory genetic elements that may confer selection advantages over CA E. faecium. The third part of my project discovered 6 E. faecium genes with the newly identified “WxL” domain. My analyses, using RT-PCR, western blots, patient sera, whole-cell ELISA, and immunogold electron microscopy, indicated that E. faecium WxL genes exist in operons, encode bacterial cell surface localized proteins, that WxL proteins are antigenic in humans, and are more exposed on the surface of clinical isolates versus community isolates (even though they are ubiquitous in both clades). ELISAs and BIAcore analyses also showed that proteins encoded by these operons bind several different host extracellular matrix proteins, as well as to each other, suggesting a novel cell-surface complex. In summary, my studies provide new insights into the evolution of E. faecium by showing that there are two distantly related clades; one being more successful in the hospital setting. My studies also identified operons encoding WxL proteins whose characteristics could also contribute to colonization and virulence within this species.