978 results for Genomic sequencing


Relevance:

30.00% 30.00%

Publisher:

Abstract:

The initial step in coronavirus-mouse hepatitis virus (MHV) replication is the synthesis of negative strand RNA from a positive strand genomic RNA template. Our approach to studying MHV RNA replication is to identify the cis-acting signals for RNA synthesis and the protein(s) that recognize these signals at the 3′ end of the genomic RNA of MHV. To determine whether host cellular and/or virus-specific proteins interact with the 3′ end of the coronavirus genome, an RNase T1 protection/gel mobility shift electrophoresis assay was used to examine cytoplasmic extracts from either mock- or MHV-JHM-infected 17Cl-1 murine cells for the ability to form complexes with defined regions of the genomic RNA. Using a series of RNA probes with deletions and mutations in this region, a conserved 11 nucleotide sequence, UGAAUGAAGUU, at nucleotide positions 36 to 26 from the 3′ end of genomic RNA was identified as responsible for the specific binding of host proteins. The RNA probe containing the 11 nucleotide sequence bound approximately four host cellular proteins, a highly labeled 120 kDa protein and three minor species with sizes of 103, 81 and 55 kDa, as assayed by UV-induced covalent cross-linking. Mutation of the 11 nucleotide motif strongly inhibited cellular protein binding, decreased the amount of the 103 and 81 kDa proteins in the complex to undetectable levels, and strongly reduced the binding of the 120 kDa protein. Less extensive mutations within this 11 nucleotide motif resulted in variable decreases in RNA-protein complex formation, depending on the probe tested. The RNA-protein complexes observed with cytoplasmic extracts from MHV-JHM-infected cells in both RNase protection/gel mobility shift and UV cross-linking assays were indistinguishable from those observed with extracts from uninfected cells.

To investigate the possible role of this 3′ protein binding element in viral RNA replication in vivo, defective interfering (DI) RNA molecules with complete or partial mutations of the 11 nucleotide conserved sequence were transcribed in vitro, transfected into host 17Cl-1 cells in the presence of helper virus MHV-JHM, and analyzed by agarose gel electrophoresis, competitive RT-PCR and direct sequencing of the RT-PCR products. Both negative strand synthesis and positive strand replication of DI RNA were affected by mutations that disrupt RNA-protein complex formation, even though the 11 mutated nucleotides were converted to wild type sequence, presumably by recombination with helper virus. Kinetic analysis indicated that recombination between DI RNA and helper virus occurred 5.5 to 7.5 hours post infection, when replication of positive strand DI RNA was barely observed. Replication of positive strand DI RNAs carrying partial mutations within the 11 nucleotide motif was dependent upon recombination events after transfection. Replication was strongly inhibited when reversion to wild type sequence did not occur and, after recombination, reached levels similar to those of wild type DI RNA. A DI RNA with a mutation upstream of the protein binding motif replicated as efficiently as wild type without undergoing recombination. Thus the conserved 11 nucleotide host protein binding motif appears to play an important role in viral RNA replication.

Relevance:

30.00% 30.00%

Publisher:

Abstract:

(Full text is available at http://www.manu.edu.mk/prilozi). New generation genomic platforms enable us to decipher, in a high-throughput manner, the genetic basis of complex diseases, including Balkan Endemic Nephropathy (BEN). They give valuable information about predisposing Single Nucleotide Polymorphisms (SNPs), Copy Number Variations (CNVs) or Loss of Heterozygosity (LOH) (using SNP-array) and about disease-causing mutations along the whole sequence of candidate genes (using Next Generation Sequencing). This information could be used for screening individuals in at-risk families and for shifting mainstream medicine toward prevention. It might also lead to more effective treatment. Here we discuss these genomic platforms and report some applications of SNP-array technology in a case with familial nephrotic syndrome. Key words: complex diseases, genome wide association studies, SNP, genomic arrays, next generation sequencing.

Relevance:

30.00% 30.00%

Publisher:

Abstract:

BACKGROUND  Whole genome sequencing (WGS) is increasingly used in molecular-epidemiological investigations of bacterial pathogens, despite cost- and time-intensive analyses. We combined strain-specific single nucleotide polymorphism (SNP)-typing and targeted WGS to investigate a tuberculosis cluster spanning 21 years in Bern, Switzerland. METHODS  Based on genome sequences of three historical outbreak Mycobacterium tuberculosis isolates, we developed a strain-specific SNP-typing assay to identify further cases. We screened 1,642 patient isolates, and performed WGS on all identified cluster isolates. We extracted SNPs to construct genomic networks. Clinical and social data were retrospectively collected. RESULTS  We identified 68 patients associated with the outbreak strain. Most were diagnosed in 1991-1995, but cases were observed until 2011. Two thirds belonged to the homeless and substance abuser milieu. Targeted WGS revealed 133 variable SNP positions among outbreak isolates. Genomic network analyses suggested a single origin of the outbreak, with subsequent division into three sub-clusters. Isolates from patients with confirmed epidemiological links differed by 0-11 SNPs. CONCLUSIONS  Strain-specific SNP-genotyping allowed rapid and inexpensive identification of M. tuberculosis outbreak isolates in a population-based strain collection. Subsequent targeted WGS provided detailed insights into transmission dynamics. This combined approach could be applied to track bacterial pathogens in real-time and at high resolution.
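The comparison behind "isolates differed by 0-11 SNPs" can be sketched as a pairwise count of differing positions. This is a minimal illustration and not the study's pipeline: the isolate names and the short SNP profiles below are hypothetical, and skipping positions with an ambiguous call (`N`) is an assumption made here.

```python
from itertools import combinations


def snp_distance(a: str, b: str) -> int:
    """Count positions where two equal-length SNP profiles differ.

    Positions where either profile has an ambiguous call ('N') are skipped.
    """
    if len(a) != len(b):
        raise ValueError("profiles must have equal length")
    return sum(1 for x, y in zip(a, b) if x != y and "N" not in (x, y))


def pairwise_distances(profiles: dict[str, str]) -> dict[tuple[str, str], int]:
    """Return the SNP distance for every unordered pair of isolates."""
    return {
        (p, q): snp_distance(profiles[p], profiles[q])
        for p, q in combinations(sorted(profiles), 2)
    }


# Hypothetical profiles: one character per variable SNP position.
profiles = {
    "isolate_A": "ACGTA",
    "isolate_B": "ACGTT",
    "isolate_C": "TCGNT",
}
distances = pairwise_distances(profiles)
```

A distance table of this kind is the usual input for building a genomic network, for example by linking each isolate to its closest neighbours.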

Relevance:

30.00% 30.00%

Publisher:

Abstract:

In order to detect a large spectrum of small ruminant lentiviruses, primers for PCR were chosen in conserved parts of the LTR and GAG genes of Icelandic Visna virus 1514 and of the POL gene of caprine arthritis-encephalitis virus. This set of primers was tested in six different caprine arthritis-encephalitis virus (CAEV)- and Maedi-Visna virus isolates of Dutch, American and Swiss origin. The LTR primers allowed the detection of the corresponding fragments of all isolates. The GAG primers allowed amplification of the corresponding fragments of all but the Swiss Maedi-Visna virus strain OLV. Using the POL primers, one Maedi-Visna- and two caprine arthritis-encephalitis virus strains were detected after one round of amplification. Sequencing of the GAG and POL amplification products and comparison to Icelandic Visna virus and CAEV strain CO revealed total heterogeneity of 38% for the GAG- and 28% for the POL fragment. The virus strains studied fall into two groups which are more closely related to one another than to Icelandic Visna virus.

Relevance:

30.00% 30.00%

Publisher:

Abstract:

BACKGROUND A novel Gram-negative, non-haemolytic, non-motile, rod-shaped bacterium was discovered in the lungs of a dead parakeet (Melopsittacus undulatus) that was kept in captivity in a pet shop in Basel, Switzerland. The organism is described with a chemotaxonomic profile and the nearly complete genome sequence obtained through the assembly of short sequence reads. RESULTS Genome sequence analysis and characterization of respiratory quinones, fatty acids, polar lipids, and biochemical phenotype are presented here. Comparison of gene sequences revealed that the most similar species is Pelistega europaea, with BLAST identities of only 93% for the 16S rDNA gene and 76% for the rpoB gene, and a GC content (~43%) similar to that of the organism isolated from the parakeet, DSM 24701 (40%). The closest full genome sequences are those of Bordetella spp. and Taylorella spp. High-throughput sequencing reads from the Illumina-Solexa platform were assembled with the Edena de novo assembler to form 195 contigs comprising the ~2 Mb genome. Genome annotation with RAST, construction of phylogenetic trees with the 16S rDNA (rrs) gene sequence and the rpoB gene, and phylogenetic placement using other highly conserved marker genes with ML Tree all suggest that the bacterial species belongs to the Alcaligenaceae family. Analysis of samples from cages with healthy parakeets suggested that the newly discovered bacterial species is not widespread in parakeet living quarters. CONCLUSIONS Classification of this organism in the current taxonomy system requires the formation of a new genus and species. We designate the new genus Basilea and the new species psittacipulmonis. The type strain of Basilea psittacipulmonis is DSM 24701 (= CIP 110308 T, 16S rDNA gene sequence Genbank accession number JX412111 and GI 406042063).
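The GC content figures quoted above are simple base-composition ratios. As a minimal sketch, assuming the assembled contigs are available as plain strings (the two short contigs below are hypothetical placeholders, not data from the study), the genome-wide value can be computed like this:

```python
def gc_content(sequence: str) -> float:
    """Return the fraction of G and C among unambiguous bases (A, C, G, T)."""
    seq = sequence.upper()
    gc = sum(seq.count(base) for base in "GC")
    total = gc + sum(seq.count(base) for base in "AT")
    if total == 0:
        raise ValueError("sequence contains no A, C, G or T bases")
    return gc / total


# Hypothetical contigs; ambiguous bases (N) are ignored by gc_content.
contigs = ["ATGCGC", "AATTNN"]
genome_gc = gc_content("".join(contigs))
```

Concatenating the contigs before counting weights each contig by its length, which is what a genome-wide percentage normally means.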

Relevance:

30.00% 30.00%

Publisher:

Abstract:

Over 250 Mendelian traits and disorders caused by rare alleles have been mapped in the canine genome. Although each disease is rare in the dog as a species, they are collectively common and have a major impact on canine health. With SNP-based genotyping arrays, genome-wide association studies (GWAS) have proven to be a powerful method to map the genomic region of interest when 10-20 cases and 10-20 controls are available. However, to identify the genetic variant in associated regions, fine-mapping and targeted re-sequencing are required. Here we present a new approach using whole-genome sequencing (WGS) of a family trio without prior GWAS. As a proof of concept, we chose an autosomal recessive disease known as hereditary footpad hyperkeratosis (HFH) in Kromfohrländer dogs. To our knowledge, this is the first time this family trio WGS approach has successfully been used to identify a genetic variant that perfectly segregates with a canine disorder. The sequencing of three Kromfohrländer dogs from a family trio (an affected offspring and both its healthy parents) resulted in an average genome coverage of 9.2X per individual. After applying stringent filtering criteria for candidate causative coding variants, 527 single nucleotide variants (SNVs) and 15 indels were found to be homozygous in the affected offspring and heterozygous in the parents. Using the software packages ANNOVAR and SIFT to functionally annotate coding sequence differences and to predict their functional effect yielded seven candidate variants located in six different genes. Of these, only FAM83G:c.155G>C (p.R52P) was found to be concordant in eight additional cases and 16 healthy Kromfohrländer dogs.
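The segregation filter described above (homozygous in the affected offspring, heterozygous in both healthy parents) can be sketched as follows. This is an illustrative reduction, not the authors' pipeline: genotypes are encoded here as counts of the non-reference allele (0, 1 or 2), and the variant names and calls are hypothetical.

```python
# Genotype encoding (an assumption for this sketch):
# 0 = homozygous reference, 1 = heterozygous, 2 = homozygous alternate.
Trio = tuple[int, int, int]  # (affected offspring, sire, dam)


def recessive_candidates(calls: dict[str, Trio]) -> list[str]:
    """Keep variants consistent with autosomal recessive inheritance in a trio."""
    return [
        variant
        for variant, (child, sire, dam) in calls.items()
        if child == 2 and sire == 1 and dam == 1
    ]


# Hypothetical calls for four variants.
calls = {
    "var_1": (2, 1, 1),  # fits the recessive model
    "var_2": (2, 2, 1),  # sire is also homozygous, so it is excluded
    "var_3": (1, 1, 0),  # offspring is only a carrier
    "var_4": (2, 1, 1),  # fits the recessive model
}
candidates = recessive_candidates(calls)
```

In practice this step would run after quality filtering and before functional annotation, which is where the abstract's seven candidate variants come from.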

Relevance:

30.00% 30.00%

Publisher:

Abstract:

High-throughput molecular profiling approaches have emerged as valuable research tools in the field of head and neck translational oncology. Such approaches have identified and/or confirmed the role of several genes or pathways in the acquisition/maintenance of an invasive phenotype and the execution of cellular programs related to cell invasion. Recently published new-generation sequencing studies in head and neck squamous cell carcinoma (HNSCC) have unveiled prominent roles in carcinogenesis and cell invasion for mutations involving NOTCH1 and PI3K-pathway components. Gene-expression profiling studies combined with systems biology approaches have made it possible to identify, and gain further mechanistic understanding of, pathways commonly enriched in invasive HNSCC. These pathways include antigen-presenting and leucocyte adhesion molecules, as well as genes involved in cell-extracellular matrix interactions. Here we review the major insights into invasiveness in head and neck cancer provided by high-throughput molecular profiling approaches.

Relevance:

30.00% 30.00%

Publisher:

Abstract:

Herein we provide a detailed molecular analysis of the spatial heterogeneity of clinically localized, multifocal prostate cancer to delineate new oncogenes or tumor suppressors. We initially determined the copy number aberration (CNA) profiles of 74 patients with index tumors of Gleason score 7. Of these, 5 patients were subjected to whole-genome sequencing using DNA quantities achievable in diagnostic biopsies, with detailed spatial sampling of 23 distinct tumor regions to assess intraprostatic heterogeneity in focal genomics. Multifocal tumors are highly heterogeneous for single-nucleotide variants (SNVs), CNAs and genomic rearrangements. We identified and validated a new recurrent amplification of MYCL, which is associated with TP53 deletion and unique profiles of DNA damage and transcriptional dysregulation. Moreover, we demonstrate divergent tumor evolution in multifocal cancer and, in some cases, tumors of independent clonal origin. These data represent the first systematic relation of intraprostatic genomic heterogeneity to predicted clinical outcome and inform the development of novel biomarkers that reflect individual prognosis.

Relevance:

30.00% 30.00%

Publisher:

Abstract:

Next-generation DNA sequencing platforms can effectively detect the entire spectrum of genomic variation and are emerging as a major tool for systematic exploration of the universe of variants and interactions in the entire genome. However, the data produced by next-generation sequencing technologies suffer from three basic problems: sequence errors, assembly errors, and missing data. Current statistical methods for genetic analysis are well suited for detecting the association of common variants, but are less suitable for rare variants. This poses a great challenge for sequence-based genetic studies of complex diseases.

This research dissertation utilized the genome continuum model as a general principle, and stochastic calculus and functional data analysis as tools, for developing novel and powerful statistical methods for the next generation of association studies of both qualitative and quantitative traits in the context of sequencing data. This ultimately shifts the paradigm of association analysis from the current locus-by-locus analysis to the collective analysis of genome regions.

In this project, functional principal component (FPC) methods coupled with high-dimensional data reduction techniques were used to develop novel and powerful methods for testing the associations of the entire spectrum of genetic variation within a segment of the genome or a gene, regardless of whether the variants are common or rare.

Classical quantitative genetics methods suffer from high type I error rates and low power for rare variants. To overcome these limitations for resequencing data, this project used functional linear models with scalar response to develop statistics for identifying quantitative trait loci (QTLs) for both common and rare variants. To illustrate their applications, the functional linear models were applied to five quantitative traits in the Framingham Heart Study.

This project proposed a novel concept of gene-gene co-association, in which a gene or a genomic region is taken as a unit of association analysis, and used stochastic calculus to develop a unified framework for testing the association of multiple genes or genomic regions for both common and rare alleles. The proposed methods were applied to gene-gene co-association analysis of psoriasis in two independent GWAS datasets, which led to the discovery of networks significantly associated with psoriasis.

Relevance:

30.00% 30.00%

Publisher:

Abstract:

The basis for the recent transition of Enterococcus faecium from a primarily commensal organism to one of the leading causes of hospital-acquired infections in the United States is not yet understood. To address this, the first part of my project assessed isolates from early outbreaks in the USA and South America using sequence analysis, colony hybridizations, and minimal inhibitory concentrations (MICs), which showed that clinical isolates possess virulence and antibiotic resistance determinants that are less abundant or lacking in community isolates. I also revealed that the level of ampicillin resistance increased over time in clinical strains. By sequencing the pbp5 gene, I demonstrated an ~5% difference in the pbp5 gene between strains with MICs <4 µg/ml and those with MICs >4 µg/ml, but no specific sequence changes correlated with increases in MICs within the latter group. A 3-10% nucleotide difference was also seen in three other genes analyzed, which suggested the existence of two distinct subpopulations of E. faecium. This led to the second part of my project, which analyzed concatenated core gene sequences, SNPs, the 16S rRNA, and phylogenetics of 21 E. faecium genomes and confirmed two distinct clades: a community-associated (CA) clade and a hospital-associated (HA) clade. Molecular clock calculations indicate that these two clades likely diverged ~300,000 to >1 million years ago, long before the modern antibiotic era. Genomic analysis also showed that, in addition to core genomic differences, HA E. faecium harbor specific accessory genetic elements that may confer selection advantages over CA E. faecium. The third part of my project discovered 6 E. faecium genes with the newly identified "WxL" domain. My analyses, using RT-PCR, western blots, patient sera, whole-cell ELISA, and immunogold electron microscopy, indicated that E. faecium WxL genes exist in operons and encode bacterial cell surface localized proteins, and that WxL proteins are antigenic in humans and are more exposed on the surface of clinical isolates than of community isolates (even though they are ubiquitous in both clades). ELISAs and BIAcore analyses also showed that proteins encoded by these operons bind several different host extracellular matrix proteins, as well as each other, suggesting a novel cell-surface complex. In summary, my studies provide new insights into the evolution of E. faecium by showing that there are two distantly related clades, one being more successful in the hospital setting. My studies also identified operons encoding WxL proteins whose characteristics could contribute to colonization and virulence within this species.

Relevance:

30.00% 30.00%

Publisher:

Abstract:

My dissertation focuses on two aspects of RNA sequencing technology. The first is the methodology for modeling the overdispersion inherent in RNA-seq data for differential expression analysis. This aspect is addressed in three sections. The second aspect is the application of RNA-seq data to identify the CpG island methylator phenotype (CIMP) by integrating datasets of mRNA expression level and DNA methylation status.

Section 1: The cost of DNA sequencing has fallen dramatically in the past decade. Consequently, genomic research increasingly depends on sequencing technology. However, it remains unclear how sequencing capacity influences the accuracy of mRNA expression measurement. We observe that accuracy improves with increasing sequencing depth. To model the overdispersion, we use the beta-binomial distribution with a new parameter indicating the dependency between overdispersion and sequencing depth. Our modified beta-binomial model performs better than the binomial or the pure beta-binomial model, with a lower false discovery rate.

Section 2: Although a number of methods have been proposed to accurately analyze differential RNA expression at the gene level, modeling at the base pair level is required. Here, we find that the overdispersion rate decreases as the sequencing depth increases at the base pair level. We also propose four models and compare them with each other. As expected, our beta-binomial model with a dynamic overdispersion rate is shown to be superior.

Section 3: We investigate biases in RNA-seq by exploring the measurement of the external control, spike-in RNA. This study is based on two datasets with spike-in controls obtained from a recent study. We observe a previously undiscovered bias in the measurement of the spike-in transcripts that arises from the influence of the sample transcripts in RNA-seq. We also find that this influence is related to the local sequence of the random hexamer that is used in priming. We suggest a model of the inequality between samples to correct this type of bias.

Section 4: The expression of a gene can be turned off when its promoter is highly methylated. Several studies have reported that a clear threshold effect exists in gene silencing that is mediated by DNA methylation. It is reasonable to assume that the thresholds are specific to each gene. It is also intriguing to investigate genes that are largely controlled by DNA methylation. These genes are called "L-shaped" genes. We develop a method to determine the DNA methylation threshold and identify a new CIMP of BRCA.

In conclusion, we provide a detailed understanding of the relationship between the overdispersion rate and sequencing depth. We also reveal a new bias in RNA-seq and provide a detailed understanding of the relationship between this new bias and the local sequence. Finally, we develop a powerful method to dichotomize methylation status and consequently identify a new CIMP of breast cancer with a distinct classification of molecular characteristics and clinical features.
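To make the notion of overdispersion concrete, the sketch below compares the variance of a binomial count with that of a beta-binomial count under the standard textbook parameterization, where rho is the intra-class correlation. This is not the dissertation's modified model: how its extra parameter ties overdispersion to sequencing depth is not given in the abstract, so that dependency is not reproduced here, and the numbers are hypothetical.

```python
def binomial_variance(n: int, p: float) -> float:
    """Variance of a Binomial(n, p) count."""
    return n * p * (1.0 - p)


def beta_binomial_variance(n: int, p: float, rho: float) -> float:
    """Variance of a beta-binomial count with mean n*p.

    rho is the overdispersion (intra-class correlation), 0 <= rho <= 1.
    rho = 0 recovers the binomial variance.
    """
    if not 0.0 <= rho <= 1.0:
        raise ValueError("rho must lie in [0, 1]")
    return binomial_variance(n, p) * (1.0 + (n - 1) * rho)


# Hypothetical example: 100 reads at a site, success probability 0.5.
plain = binomial_variance(100, 0.5)
inflated = beta_binomial_variance(100, 0.5, rho=0.01)
```

Even a small rho nearly doubles the variance at a depth of 100 reads, which is why ignoring overdispersion tends to inflate false discoveries in differential expression tests.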

Relevance:

30.00% 30.00%

Publisher:

Abstract:

A loxP-transposon retrofitting strategy for generating large nested deletions from one end of the insert DNA in bacterial artificial chromosomes and P1 artificial chromosomes was described recently [Chatterjee, P. K. & Coren, J. S. (1997) Nucleic Acids Res. 25, 2205–2212]. In this report, we combine this procedure with direct sequencing of nested-deletion templates by using primers located in the transposon end to illustrate its value for position-specific single-nucleotide polymorphism (SNP) discovery from chosen regions of large insert clones. A simple ampicillin sensitivity screen was developed to facilitate identification and recovery of deletion clones free of transduced transposon plasmid. This directed approach requires minimal DNA sequencing, and no in vitro subclone library generation; positionally oriented SNPs are a consequence of the method. The procedure is used to discover new SNPs as well as physically map those identified from random subcloned libraries or sequence databases. The deletion templates, positioned SNPs, and markers are also used to orient large insert clones into a contig. The deletion clone can serve as a ready resource for future functional genomic studies because each carries a mammalian cell-specific antibiotic resistance gene from the transposon. Furthermore, the technique should be especially applicable to the analysis of genomes for which a full genome sequence or radiation hybrid cell lines are unavailable.

Relevance:

30.00% 30.00%

Publisher:

Abstract:

Pax proteins are a family of transcription factors with a highly conserved paired domain; many members also contain a paired-type homeodomain and/or an octapeptide. Nine mammalian Pax genes are known and classified into four subgroups: Pax-1/9, Pax-2/5/8, Pax-3/7, and Pax-4/6. Most of these genes are involved in nervous system development. In particular, Pax-6 is a key regulator that controls eye development in vertebrates and Drosophila. Although the Pax-4/6 subgroup seems to be more closely related to Pax-2/5/8 than to Pax-3/7 or Pax-1/9, its evolutionary origin is unknown. We therefore searched for a Pax-6 homolog and related genes in Cnidaria, which is the lowest phylum of animals that possess a nervous system and eyes. A sea nettle (a jellyfish) genomic library was constructed and two pax genes (Pax-A and -B) were isolated and partially sequenced. Surprisingly, unlike most known Pax genes, the paired box in these two genes contains no intron. In addition, the complete cDNA sequences of hydra Pax-A and -B were obtained. Hydra Pax-B contains both the homeodomain and the octapeptide, whereas hydra Pax-A contains neither. DNA binding assays showed that sea nettle Pax-A and -B and hydra Pax-A paired domains bound to a Pax-5/6 site and a Pax-5 site, although hydra Pax-B paired domain bound neither. An alignment of all available paired domain sequences revealed two highly conserved regions, which cover the DNA binding contact positions. Phylogenetic analysis showed that Pax-A and especially Pax-B were more closely related to Pax-2/5/8 and Pax-4/6 than to Pax-1/9 or Pax-3/7 and that the Pax genes can be classified into two supergroups: Pax-A/Pax-B/Pax-2/5/8/4/6 and Pax-1/9/3/7. From this analysis and the gene structure, we propose that modern Pax-4/6 and Pax-2/5/8 genes evolved from an ancestral gene similar to cnidarian Pax-B, having both the homeodomain and the octapeptide.

Relevance:

30.00% 30.00%

Publisher:

Abstract:

An mAb was raised to the C5 phagosomal antigen in Paramecium multimicronucleatum. To determine its function, the cDNA and genomic DNA encoding C5 were cloned. This antigen consisted of 315 amino acid residues with a predicted molecular weight of 36,594, a value similar to that determined by SDS-PAGE. Sequence comparisons uncovered a low but significant homology with a Schizosaccharomyces pombe protein and the C-terminal half of the β-fructofuranosidase protein of Zymomonas mobilis. Lacking an obvious transmembrane domain or a possible signal sequence at the N terminus, C5 was predicted to be a soluble protein, whereas immunofluorescence data showed that it was present on the membranes of vesicles and digestive vacuoles (DVs). In cells that were minimally permeabilized but with intact DVs, C5 was found to be located on the cytosolic surface of the DV membranes. Immunoblotting of proteins from the purified and KCl-washed DVs showed that C5 was tightly bound to the DV membranes. Cryoelectron microscopy also confirmed that C5 was on the cytosolic surface of the discoidal vesicles, acidosomes, and lysosomes, organelles known to fuse with the membranes of the cytopharynx, the DVs of stages I (DV-I) and II (DV-II), respectively. Although C5 was concentrated more on the mature than on the young DV membranes, the striking observation was that the cytopharyngeal membrane that is derived from the discoidal vesicles was almost devoid of C5. Approximately 80% of the C5 was lost from the discoidal vesicle-derived membrane after this membrane fused with the cytopharyngeal membrane. Microinjection of the mAb to C5 greatly inhibited the fusion of the discoidal vesicles with the cytopharyngeal membrane and thus the incorporation of the discoidal vesicle membranes into the DV membranes. Taken together, these results suggest that C5 is a membrane protein that is involved in binding and/or fusion of the discoidal vesicles with the cytopharyngeal membrane that leads to DV formation.

Relevance:

30.00% 30.00%

Publisher:

Abstract:

A typing method for bacteria was developed and applied to several species, including Escherichia coli and Actinobacillus actinomycetemcomitans. Total genomic DNA was digested with a restriction endonuclease, and fragments were end-labeled with [α-32P]dATP by using the Klenow fragment of DNA polymerase and separated by electrophoresis in 6% polyacrylamide/8 M urea (sequencing gel). Depending on the restriction endonuclease and the bacterium, the method produced approximately 30-50 well-separated fragments in the size range of 100-400 nucleotides. For A. actinomycetemcomitans, all strains had bands in common. Nevertheless, many polymorphisms could be observed, and the 31 strains tested could be classified into 29 distinct types. Furthermore, serotype-specific fragments could be assigned for the three serotypes investigated. The method described is very sensitive, allowing more distinct types to be distinguished than other commonly used typing methods. When the method was applied to 10 other clinically relevant bacterial species, both species-specific bands and strain-specific bands were found. Isolates from different locations of one patient showed indistinguishable patterns. Computer-assisted analysis of the DNA fingerprints allowed the determination of similarity coefficients. It is concluded that genomic fingerprinting by restriction fragment end labeling (RFEL) is a powerful and generally applicable technique to type bacterial species.
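The abstract does not say which similarity coefficient was computed from the fingerprints. As one common choice for banding patterns, the Dice coefficient is sketched below under that assumption; the band sizes are hypothetical, and a real analysis would also need a tolerance for matching bands of nearly equal size.

```python
def dice_similarity(bands_a: set[int], bands_b: set[int]) -> float:
    """Dice coefficient between two fingerprints given as sets of band sizes.

    Returns 2 * shared / (bands in A + bands in B), a value in [0, 1].
    """
    total = len(bands_a) + len(bands_b)
    if total == 0:
        raise ValueError("both fingerprints are empty")
    return 2 * len(bands_a & bands_b) / total


# Hypothetical fragment sizes (nucleotides) for two strains.
strain_1 = {100, 150, 200, 310}
strain_2 = {100, 150, 220, 310}
similarity = dice_similarity(strain_1, strain_2)
```

Indistinguishable patterns, such as isolates from different sites in one patient, give a coefficient of 1.0 under this measure.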