995 results for sequenced-based typing
Abstract:
Sequences of the gene encoding the beta-subunit of the RNA polymerase (rpoB) were used to delineate the phylogeny of the family Pasteurellaceae. A total of 72 strains, including the type strains of the major described species as well as selected field isolates, were included in the study. Selection of universal rpoB-derived primers for the family allowed straightforward amplification and sequencing of a 560 bp fragment of the rpoB gene. In parallel, 16S rDNA was sequenced from all strains. The phylogenetic tree obtained with the rpoB sequences reflected the major branches of the tree obtained with the 16S rDNA, especially at the genus level. Only a few discrepancies between the trees were observed. In certain cases the rpoB phylogeny was in better agreement with DNA-DNA hybridization studies than the phylogeny derived from 16S rDNA. The rpoB gene is strongly conserved within the various species of the family of Pasteurellaceae. Hence, rpoB gene sequence analysis in conjunction with 16S rDNA sequencing is a valuable tool for phylogenetic studies of the Pasteurellaceae and may also prove useful for reorganizing the current taxonomy of this bacterial family.
Abstract:
We describe a microarray based broad-range screening technique for Escherichia coli virulence typing. Gene probes were amplified by PCR from a plasmid bank of characterised E. coli virulence genes and were spotted onto a glass slide to form an array of capture probes. Genomic DNA from E. coli strains which were to be tested for the presence of these virulence gene sequences was labelled with fluorescent cyanine dyes by random amplification and then hybridised against the array of probes. The hybridisation, washing and data analysis conditions were optimised for glass slides, and the applicability of the method for identifying the presence of the virulence genes was determined using reference strains and clinical isolates. It was found to be a sensitive screening method for detecting virulence genes, and a powerful tool for determining the pathotype of E. coli. It will be possible to expand and automate this microarray technique to make it suitable for rapid and reliable diagnostic screening of bacterial isolates.
Abstract:
Campylobacteriosis is the most frequent zoonosis in developed countries, and various domestic animals can act as reservoirs for the main pathogens Campylobacter jejuni and Campylobacter coli. In the present study we compared the population structures of 730 C. jejuni and C. coli isolates from human cases with those of 610 chicken, 159 dog, 360 pig and 23 cattle isolates collected between 2001 and 2012 in Switzerland. All isolates had been typed with multilocus sequence typing (MLST) and flaB-typing, and their genotypic resistance to quinolones was determined. We used complementary approaches by testing for differences between isolates from different hosts with the proportion similarity as well as the fixation index and by attributing the source of the human isolates with Bayesian assignment using the software STRUCTURE. Analyses were done with MLST and flaB data in parallel, and both typing methods were tested for associations of genotypes with quinolone resistance. Results obtained with MLST and flaB data corresponded remarkably well, both indicating chickens as the main source for human infection for both Campylobacter species. Based on MLST, 70.9% of the human cases were attributed to chickens, 19.3% to cattle, 8.6% to dogs and 1.2% to pigs. Furthermore, we found a host-independent association between sequence type (ST) and quinolone resistance. The most notable examples were ST-45, all isolates of which were susceptible, and ST-464, all isolates of which were resistant.
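The proportion similarity mentioned in this abstract is commonly computed as the overlap between two genotype frequency distributions. The following is a minimal sketch under that assumption; the function name and the toy isolate lists (including the placeholder labels "ST-X" and "ST-Y") are hypothetical and are not data from the study.

```python
from collections import Counter

def proportional_similarity(types_a, types_b):
    """Overlap of two genotype frequency distributions (0 = disjoint, 1 = identical).

    Computed as the sum over genotypes of the smaller relative frequency,
    which equals 1 - 0.5 * sum(|p_i - q_i|).
    """
    counts_a, counts_b = Counter(types_a), Counter(types_b)
    n_a, n_b = sum(counts_a.values()), sum(counts_b.values())
    if n_a == 0 or n_b == 0:
        raise ValueError("both samples must contain at least one isolate")
    all_types = set(counts_a) | set(counts_b)
    return sum(min(counts_a[t] / n_a, counts_b[t] / n_b) for t in all_types)

# Hypothetical toy samples (not from the study): one genotype label per isolate.
human = ["ST-21"] * 5 + ["ST-45"] * 3 + ["ST-464"] * 2
chicken = ["ST-21"] * 4 + ["ST-45"] * 4 + ["ST-X"] * 2
pig = ["ST-Y"] * 10

psi_human_chicken = proportional_similarity(human, chicken)
psi_human_pig = proportional_similarity(human, pig)
print(psi_human_chicken, psi_human_pig)
```

A higher value for one host than another only indicates greater overlap in genotype frequencies; source attribution as described in the abstract additionally relies on the fixation index and Bayesian assignment.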
Abstract:
Staphylococcus pseudintermedius is an opportunistic pathogen in dogs. Four housekeeping genes with allelic polymorphisms were identified and used to develop an expanded multilocus sequence typing (MLST) scheme. The new seven-locus technique shows S. pseudintermedius to have greater genetic diversity than previous methods and discriminates more isolates based upon host origin.
Abstract:
In cattle, at least 39 variants of the 4 casein proteins (α(S1)-, β-, α(S2)- and κ-casein) have been described to date. Many of these variants are known to affect milk-production traits, cheese-processing properties, and the nutritive value of milk. They also provide valuable information for phylogenetic studies. So far, the majority of studies exploring the genetic variability of bovine caseins considered European taurine cattle breeds and were carried out at the protein level by electrophoretic techniques. This only allows the identification of variants that, due to amino acid exchanges, differ in their electric charge, molecular weight, or isoelectric point. In this study, the open reading frames of the casein genes CSN1S1, CSN2, CSN1S2, and CSN3 of 356 animals belonging to 14 taurine and 3 indicine cattle breeds were sequenced. With this approach, we identified 23 alleles, including 5 new DNA sequence variants, with a predicted effect on the protein sequence. The new variants were only found in indicine breeds and in one local Iranian breed, which has been phenotypically classified as a taurine breed. A multidimensional scaling approach based on available SNP chip data, however, revealed an admixture of taurine and indicine populations in this breed as well as in the local Iranian breed Golpayegani. Specific indicine casein alleles were also identified in a few European taurine breeds, indicating the introgression of indicine breeds into these populations. This study shows the existence of substantial undiscovered genetic variability of bovine casein loci, especially in indicine cattle breeds. The identification of new variants is a valuable tool for phylogenetic studies and investigations into the evolution of the milk protein genes.
Abstract:
In this study, we present a trilocus sequence typing (TLST) scheme based on intragenic regions of two antigenic genes, ace and salA (encoding a collagen/laminin adhesin and a cell wall-associated antigen, respectively), and a gene associated with antibiotic resistance, lsa (encoding a putative ABC transporter), for subspecies differentiation of Enterococcus faecalis. Each of the alleles was analyzed using 50 E. faecalis isolates representing 42 diverse multilocus sequence types (ST(M); based on seven housekeeping genes) and four groups of clonally linked (by pulsed-field gel electrophoresis [PFGE]) isolates. The allelic profiles and/or concatenated sequences of the three genes agreed with multilocus sequence typing (MLST) results for typing of 49 of the 50 isolates; in addition to the one exception, two isolates were found to have identical TLST types but were single-locus variants (differing by a single nucleotide) by MLST and were therefore also classified as clonally related by MLST. TLST was also comparable to PFGE for establishing short-term epidemiological relationships, typing all isolates classified as clonally related by PFGE with the same type. TLST was then applied to representative isolates (of each PFGE subtype and isolation year) of a collection of 48 hospital isolates and demonstrated the same relationships between isolates of an outbreak strain as those found by MLST and PFGE. In conclusion, the TLST scheme described here was shown to be successful for investigating short-term epidemiology in a hospital setting and may provide an alternative to MLST for discriminating isolates.
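Sequence typing schemes of this kind usually reduce each isolate to an allelic profile: every distinct sequence at a locus gets an allele number, and every distinct combination of allele numbers gets a type number. The sketch below illustrates that bookkeeping for the three loci named in the abstract (ace, salA, lsa). The function name, the numbering order and the short toy sequences are assumptions for illustration, not the authors' implementation or data.

```python
def assign_types(isolates, loci):
    """Map each isolate to (allelic profile, type number).

    isolates: dict of isolate name -> dict of locus -> sequence string.
    Allele and type numbers are assigned in order of first appearance.
    """
    allele_ids = {locus: {} for locus in loci}  # per locus: sequence -> allele number
    profile_ids = {}                            # profile tuple -> type number
    result = {}
    for name, seqs in isolates.items():
        profile = tuple(
            allele_ids[locus].setdefault(seqs[locus], len(allele_ids[locus]) + 1)
            for locus in loci
        )
        type_id = profile_ids.setdefault(profile, len(profile_ids) + 1)
        result[name] = (profile, type_id)
    return result

# Hypothetical toy sequences, far shorter than real intragenic regions.
toy_isolates = {
    "iso1": {"ace": "AAC", "salA": "GGT", "lsa": "TTA"},
    "iso2": {"ace": "AAC", "salA": "GGT", "lsa": "TTA"},
    "iso3": {"ace": "AAT", "salA": "GGT", "lsa": "TTA"},
}
typed = assign_types(toy_isolates, ["ace", "salA", "lsa"])
print(typed)
```

In this toy input, iso1 and iso2 share a type, while iso3 differs at one locus and therefore receives a new type.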
Abstract:
BACKGROUND Whole genome sequencing (WGS) is increasingly used in molecular-epidemiological investigations of bacterial pathogens, despite cost- and time-intensive analyses. We combined strain-specific single nucleotide polymorphism (SNP)-typing and targeted WGS to investigate a tuberculosis cluster spanning 21 years in Bern, Switzerland. METHODS Based on genome sequences of three historical outbreak Mycobacterium tuberculosis isolates, we developed a strain-specific SNP-typing assay to identify further cases. We screened 1,642 patient isolates, and performed WGS on all identified cluster isolates. We extracted SNPs to construct genomic networks. Clinical and social data were retrospectively collected. RESULTS We identified 68 patients associated with the outbreak strain. Most were diagnosed in 1991-1995, but cases were observed until 2011. Two thirds belonged to the homeless and substance abuser milieu. Targeted WGS revealed 133 variable SNP positions among outbreak isolates. Genomic network analyses suggested a single origin of the outbreak, with subsequent division into three sub-clusters. Isolates from patients with confirmed epidemiological links differed by 0-11 SNPs. CONCLUSIONS Strain-specific SNP-genotyping allowed rapid and inexpensive identification of M. tuberculosis outbreak isolates in a population-based strain collection. Subsequent targeted WGS provided detailed insights into transmission dynamics. This combined approach could be applied to track bacterial pathogens in real-time and at high resolution.
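The SNP differences between isolates reported in this abstract can be pictured as pairwise distances over the variable positions. The following is a minimal sketch assuming each isolate is represented by a string of bases at the same ordered SNP positions, with ambiguous calls (for example "N") ignored. The isolate names and sequences are hypothetical, and a real pipeline would involve read mapping and variant filtering that are not shown here.

```python
from itertools import combinations

def snp_distance(seq_a, seq_b):
    """Count positions where both isolates have an unambiguous base and differ."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must cover the same SNP positions")
    valid = set("ACGT")
    return sum(
        1 for a, b in zip(seq_a, seq_b)
        if a in valid and b in valid and a != b
    )

def pairwise_distances(isolates):
    """Return {(name_1, name_2): SNP distance} for all unordered pairs."""
    return {
        (x, y): snp_distance(isolates[x], isolates[y])
        for x, y in combinations(sorted(isolates), 2)
    }

# Hypothetical toy data: bases at eight variable positions for three isolates.
toy = {"P1": "ACGTACGT", "P2": "ACGTACGA", "P3": "ACCTACNA"}
distances = pairwise_distances(toy)
print(distances)
```

Such a distance table is one possible input for building a genomic network, for instance by linking each isolate to its closest neighbours.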
Abstract:
PURPOSE The microRNA miR-27a was recently shown to directly regulate dihydropyrimidine dehydrogenase (DPD), the key enzyme in fluoropyrimidine catabolism. A common polymorphism (rs895819A>G) in the miR-27a genomic region (MIR27A) was associated with reduced DPD activity in healthy volunteers, but the clinical relevance of this effect is still unknown. Here, we assessed the association of MIR27A germline variants with early-onset fluoropyrimidine toxicity. EXPERIMENTAL DESIGN MIR27A was sequenced in 514 patients with cancer receiving fluoropyrimidine-based chemotherapy. Associations of MIR27A polymorphisms with early-onset (cycles 1-2) fluoropyrimidine toxicity were assessed in the context of known risk variants in the DPD gene (DPYD) and additional covariates associated with toxicity. RESULTS The association of rs895819A>G with early-onset fluoropyrimidine toxicity was strongly dependent on DPYD risk variant carrier status (Pinteraction = 0.0025). In patients carrying DPYD risk variants, rs895819G was associated with a strongly increased toxicity risk [OR, 7.6; 95% confidence interval (CI), 1.7-34.7; P = 0.0085]. Overall, 71% (12/17) of patients who carried both rs895819G and a DPYD risk variant experienced severe toxicity. In patients without DPYD risk variants, rs895819G was associated with a modest decrease in toxicity risk (OR, 0.62; 95% CI, 0.43-0.9; P = 0.012). CONCLUSIONS These results indicate that miR-27a and rs895819A>G may be clinically relevant for further toxicity risk stratification in carriers of DPYD risk variants. Our data suggest that direct suppression of DPD by miR-27a is primarily relevant in the context of fluoropyrimidine toxicity in patients with reduced DPD activity. However, miR-27a regulation of additional targets may outweigh its effect on DPD in patients without DPYD risk variants.
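For readers unfamiliar with the odds ratios (OR) and confidence intervals quoted here, the sketch below shows how an unadjusted OR and a Wald-type 95% interval can be derived from a 2x2 table. This is an illustrative assumption only: the abstract's estimates were assessed together with additional covariates, so they would not be reproduced by this simple calculation, and the counts used below are hypothetical.

```python
import math

def odds_ratio_ci(a, b, c, d, z=1.96):
    """Unadjusted odds ratio with a Wald confidence interval on the log scale.

    a: exposed with event      b: exposed without event
    c: unexposed with event    d: unexposed without event
    """
    if min(a, b, c, d) <= 0:
        raise ValueError("all four cells must be positive for this approximation")
    odds_ratio = (a * d) / (b * c)
    se_log = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    lower = math.exp(math.log(odds_ratio) - z * se_log)
    upper = math.exp(math.log(odds_ratio) + z * se_log)
    return odds_ratio, lower, upper

# Hypothetical counts, not taken from the study.
or_value, ci_low, ci_high = odds_ratio_ci(a=10, b=5, c=20, d=40)
print(f"OR = {or_value:.2f}, 95% CI {ci_low:.2f}-{ci_high:.2f}")
```

An interval that excludes 1 corresponds to a nominally significant association at roughly the 5% level.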
Abstract:
The genetic variability of milk protein genes may influence the nutritive value or processing and functional properties of the milk. While numerous protein variants are known in ruminants, knowledge about milk protein variability in horses is still limited. Mare's milk is, however, produced for human consumption in many countries. Beta-lactoglobulin, which belongs to the protein family of lipocalins, known as common food- and airborne allergens, is a major whey protein. It is absent from human milk and thus a key agent in provoking cow's milk protein allergy. Mare's milk is, however, usually better tolerated by most affected people. Several functions of β-lactoglobulin have been discussed, but its ultimate physiological role remains unclear. In the current study, the open reading frames of the two equine β-lactoglobulin paralogues LGB1 and LGB2 were re-sequenced in 249 horses belonging to 14 different breeds in order to predict the existence of protein variants at the DNA level. Only a single signal peptide variant of LGB1, but 10 different putative protein variants of LGB2, were identified. In horses, both genes are expressed, and as such this is a striking, previously unknown difference in genetic variability between the two genes. It can be assumed that LGB1 is the ancestral paralogue, which has an essential function causing a high selection pressure. As horses have very low milk fat content, this unknown function might well be related to vitamin uptake. Further studies are, however, needed to elucidate the properties of the different gene products.
Abstract:
Knowledge about the quality characteristics (QoS) of service compositions is crucial for determining their usability and economic value. Service quality is usually regulated using Service Level Agreements (SLA). While end-to-end SLAs are well suited for request-reply interactions, more complex, decentralized, multiparticipant compositions (service choreographies) typically involve multiple message exchanges between stateful parties and the corresponding SLAs thus encompass several cooperating parties with interdependent QoS. The usual approaches to determining QoS ranges structurally (which are by construction easily composable) are not applicable in this scenario. Additionally, the intervening SLAs may depend on the exchanged data. We present an approach to data-aware QoS assurance in choreographies through the automatic derivation of composable QoS models from participant descriptions. Such models are based on a message typing system with size constraints and are derived using abstract interpretation. The models obtained have multiple uses including run-time prediction, adaptive participant selection, or design-time compliance checking. We also present an experimental evaluation and discuss the benefits of the proposed approach.
Abstract:
Genes that are characteristic of only certain strains of a bacterial species can be of great biologic interest. Here we describe a PCR-based subtractive hybridization method for efficiently detecting such DNAs and apply it to the gastric pathogen Helicobacter pylori. Eighteen DNAs specific to a monkey-colonizing strain (J166) were obtained by subtractive hybridization against an unrelated strain whose genome has been fully sequenced (26695). Seven J166-specific clones had no DNA sequence match to the 26695 genome, and 11 other clones were mixed, with adjacent patches that did and did not match any sequences in 26695. At the protein level, seven clones had homology to putative DNA restriction-modification enzymes, and two had homology to putative metabolic enzymes. Nine others had no database match with proteins of assigned function. PCR tests of 13 unrelated H. pylori strains by using primers specific for 12 subtracted clones and complementary Southern blot hybridizations indicated that these DNAs are highly polymorphic in the H. pylori population, with each strain yielding a different pattern of gene-specific PCR amplification. The search for polymorphic DNAs, as described here, should help identify previously unknown virulence genes in pathogens and provide new insights into microbial genetic diversity and evolution.
Abstract:
Many small bacterial, archaebacterial, and eukaryotic genomes have been sequenced, and the larger eukaryotic genomes are predicted to be completely sequenced within the next decade. In all genomes sequenced to date, a large portion of these organisms’ predicted protein coding regions encode polypeptides of unknown biochemical, biophysical, and/or cellular functions. Three-dimensional structures of these proteins may suggest biochemical or biophysical functions. Here we report the crystal structure of one such protein, MJ0577, from a hyperthermophile, Methanococcus jannaschii, at 1.7-Å resolution. The structure contains a bound ATP, suggesting MJ0577 is an ATPase or an ATP-mediated molecular switch, which we confirm by biochemical experiments. Furthermore, the structure reveals different ATP binding motifs that are shared among many homologous hypothetical proteins in this family. This result indicates that structure-based assignment of molecular function is a viable approach for the large-scale biochemical assignment of proteins and for discovering new motifs, a basic premise of structural genomics.
Abstract:
Arabidopsis thaliana, a small annual plant belonging to the mustard family, is the subject of study by an estimated 7000 researchers around the world. In addition to the large body of genetic, physiological and biochemical data gathered for this plant, it will be the first higher plant genome to be completely sequenced, with completion expected at the end of the year 2000. The sequencing effort has been coordinated by an international collaboration, the Arabidopsis Genome Initiative (AGI). The rationale for intensive investigation of Arabidopsis is that it is an excellent model for higher plants. In order to maximize use of the knowledge gained about this plant, there is a need for a comprehensive database and information retrieval and analysis system that will provide user-friendly access to Arabidopsis information. This paper describes the initial steps we have taken toward realizing these goals in a project called The Arabidopsis Information Resource (TAIR) (www.arabidopsis.org).
Abstract:
As the number of protein folds is quite limited, a mode of analysis that will be increasingly common in the future, especially with the advent of structural genomics, is to survey and re-survey the finite parts list of folds from an expanding number of perspectives. We have developed a new resource, called PartsList, that lets one dynamically perform these comparative fold surveys. It is available on the web at http://bioinfo.mbb.yale.edu/partslist and http://www.partslist.org. The system is based on the existing fold classifications and functions as a form of companion annotation for them, providing ‘global views’ of many already completed fold surveys. The central idea in the system is that of comparison through ranking; PartsList will rank the approximately 420 folds based on more than 180 attributes. These include: (i) occurrence in a number of completely sequenced genomes (e.g. it will show the most common folds in the worm versus yeast); (ii) occurrence in the structure databank (e.g. most common folds in the PDB); (iii) both absolute and relative gene expression information (e.g. most changing folds in expression over the cell cycle); (iv) protein–protein interactions, based on experimental data in yeast and comprehensive PDB surveys (e.g. most interacting fold); (v) sensitivity to inserted transposons; (vi) the number of functions associated with the fold (e.g. most multi-functional folds); (vii) amino acid composition (e.g. most Cys-rich folds); (viii) protein motions (e.g. most mobile folds); and (ix) the level of similarity based on a comprehensive set of structural alignments (e.g. most structurally variable folds). The integration of whole-genome expression and protein–protein interaction data with structural information is a particularly novel feature of our system. 
We provide three ways of visualizing the rankings: a profiler emphasizing the progression of high and low ranks across many pre-selected attributes, a dynamic comparer for custom comparisons and a numerical rankings correlator. These allow one to directly compare very different attributes of a fold (e.g. expression level, genome occurrence and maximum motion) in the uniform numerical format of ranks. This uniform framework, in turn, highlights the way that the frequency of many of the attributes falls off with approximate power-law behavior (i.e. according to V^(-b), for attribute value V and constant exponent b), with a few folds having large values and most having small values.
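The power-law fall-off described here, with frequency proportional to the attribute value V raised to the power -b, can be checked on any attribute by fitting a straight line in log-log space. The sketch below does this with ordinary least squares. The function name and the synthetic data are hypothetical, and least squares on logs is only one simple way to estimate b, not necessarily the method used by PartsList.

```python
import math

def fit_power_law(values, frequencies):
    """Fit frequency = a * V**(-b) by least squares on log-transformed data.

    Returns (a, b). All values and frequencies must be positive.
    """
    xs = [math.log(v) for v in values]
    ys = [math.log(f) for f in frequencies]
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx                      # slope in log-log space equals -b
    a = math.exp(mean_y - slope * mean_x)  # intercept gives log(a)
    return a, -slope

# Synthetic data generated from an exact power law with a = 1000 and b = 1.5.
attribute_values = [1, 2, 4, 8, 16]
observed = [1000 * v ** -1.5 for v in attribute_values]
a_hat, b_hat = fit_power_law(attribute_values, observed)
print(a_hat, b_hat)
```

On real attribute data the points would scatter around the line, and the quality of the fit should be inspected before describing the behavior as a power law.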
Abstract:
Universal trees based on sequences of single gene homologs cannot be rooted. Iwabe et al. [Iwabe, N., Kuma, K.-I., Hasegawa, M., Osawa, S. & Miyata, T. (1989) Proc. Natl. Acad. Sci. USA 86, 9355-9359] circumvented this problem by using ancient gene duplications that predated the last common ancestor of all living things. Their separate, reciprocally rooted gene trees for elongation factors and ATPase subunits showed Bacteria (eubacteria) as branching first from the universal tree with Archaea (archaebacteria) and Eucarya (eukaryotes) as sister groups. Given its topical importance to evolutionary biology and concerns about the appropriateness of the ATPase data set, an evaluation of the universal tree root using other ancient gene duplications is essential. In this study, we derive a rooting for the universal tree using aminoacyl-tRNA synthetase genes, an extensive multigene family whose divergence likely preceded that of prokaryotes and eukaryotes. An approximately 1600-bp conserved region was sequenced from the isoleucyl-tRNA synthetases of several species representing deep evolutionary branches of eukaryotes (Nosema locustae), Bacteria (Aquifex pyrophilus and Thermotoga maritima) and Archaea (Pyrococcus furiosus and Sulfolobus acidocaldarius). In addition, a new valyl-tRNA synthetase was characterized from the protist Trichomonas vaginalis. Different phylogenetic methods were used to generate trees of isoleucyl-tRNA synthetases rooted by valyl- and leucyl-tRNA synthetases. All isoleucyl-tRNA synthetase trees showed Archaea and Eucarya as sister groups, providing strong confirmation for the universal tree rooting reported by Iwabe et al. As well, there was strong support for the monophyly (sensu Hennig) of Archaea. The valyl-tRNA synthetase gene from Tr. vaginalis clustered with other eukaryotic ValRS genes, which may have been transferred from the mitochondrial genome to the nuclear genome, suggesting that this amitochondrial trichomonad once harbored an endosymbiotic bacterium.