954 resultados para Metagenomic Sequencing
Resumo:
Pediatric acute myeloid leukemia (AML) is a molecularly heterogeneous disease that arises from genetic alterations in pathways that regulate self-renewal and myeloid differentiation. While the majority of patients carry recurrent chromosomal translocations, almost 20% of childhood AML do not show any recognizable cytogenetic alteration and are defined as cytogenetically normal (CN)-AML. CN-AML patients have always showed a great variability in response to therapy and overall outcome, underlining the presence of unknown genetic changes, not detectable by conventional analyses, but relevant for pathogenesis, and outcome of AML. The development of novel genome-wide techniques such as next-generation sequencing, have tremendously improved our ability to interrogate the cancer genome. Based on this background, the aim of this research study was to investigate the mutational landscape of pediatric CN-AML patients negative for all the currently known somatic mutations reported in AML through whole-transcriptome sequencing (RNA-seq). RNA-seq performed on diagnostic leukemic blasts from 19 pediatric CN-AML cases revealed a considerable incidence of cryptic chromosomal rearrangements, with the identification of 21 putative fusion genes. Several of the fusion genes that were identified in this study are recurrent and might have a prognostic and/or therapeutic relevance. A paradigm of that is the CBFA2T3-GLIS2 fusion, which has been demonstrated to be a common alteration in pediatric CN-AML, predicting poor outcome. Important findings have been also obtained in the identification of novel therapeutic targets. On one side, the identification of NUP98-JARID1A fusion suggests the use of disulfiram; on the other, here we describe alteration-activating tyrosine kinases, providing functional data supporting the use of tyrosine kinase inhibitors to specifically inhibit leukemia cells. This study provides new insights in the knowledge of genetic alterations underlying pediatric AML, defines novel prognostic markers and putative therapeutic targets, and prospectively ensures a correct risk stratification and risk-adapted therapy also for the “all-neg” AML subgroup.
Resumo:
I sottotipi H1N1, H1N2 e H3N2 di influenza A virus sono largamente diffusi nella popolazione suina di tutto il mondo. Nel presente lavoro è stato sviluppato un protocollo di sequenziamento di c.d. nuova generazione, su piattaforma Ion Torrent PGM, idoneo per l’analisi di tutti i virus influenzali suini (SIV). Per valutare l’evoluzione molecolare dei SIV italiani, sono stati sequenziati ed analizzati mediante analisi genomica e filogenetica un totale di sessantadue ceppi di SIV appartenenti ai sottotipi H1N1, H1N2 e H3N2, isolati in Italia dal 1998 al 2014. Sono stati evidenziati in sei campioni due fenomeni di riassortimento: tutti i SIV H1N2 esaminati presentavano una neuraminidasi di derivazione umana, diversa da quella dei SIV H1N2 circolanti in Europa, inoltre l’emoagglutinina (HA) di due isolati H1N2 era originata dal riassortimento con un SIV H1N1 avian-like. L’analisi molecolare dell’HA ha permesso di rivelare un’inserzione di due amminoacidi in quattro SIV H1N1 pandemici e una delezione di due aminoacidi in quattro SIV H1N2, entrambe a livello del sito di legame con il recettore cellulare. E’ stata inoltre evidenziata un’elevata omologia di un SIV H1N1 con ceppi europei isolati negli anni ’80, suggerendo la possibile origine vaccinale di questo virus. E’ stato possibile, in aggiunta, applicare il nuovo protocollo sviluppato per sequenziare un virus influenzale aviare altamente patogeno trasmesso all’uomo, direttamente da campione biologico. La diversità genetica nei SIV esaminati in questo studio conferma l’importanza di un continuo monitoraggio della costellazione genomica dei virus influenzali nella popolazione suina.
Resumo:
The cytidine deaminase AID hypermutates immunoglobulin genes but can also target oncogenes, leading to tumorigenesis. The extent of AID's promiscuity and its predilection for immunoglobulin genes are unknown. We report here that AID interacted broadly with promoter-proximal sequences associated with stalled polymerases and chromatin-activating marks. In contrast, genomic occupancy of replication protein A (RPA), an AID cofactor, was restricted to immunoglobulin genes. The recruitment of RPA to the immunoglobulin loci was facilitated by phosphorylation of AID at Ser38 and Thr140. We propose that stalled polymerases recruit AID, thereby resulting in low frequencies of hypermutation across the B cell genome. Efficient hypermutation and switch recombination required AID phosphorylation and correlated with recruitment of RPA. Our findings provide a rationale for the oncogenic role of AID in B cell malignancy.
Resumo:
Mycobacterium abscessus, Mycobacterium bolletii, and Mycobacterium massiliense (Mycobacterium abscessus sensu lato) are closely related species that currently are identified by the sequencing of the rpoB gene. However, recent studies show that rpoB sequencing alone is insufficient to discriminate between these species, and some authors have questioned their current taxonomic classification. We studied here a large collection of M. abscessus (sensu lato) strains by partial rpoB sequencing (752 bp) and multilocus sequence analysis (MLSA). The final MLSA scheme developed was based on the partial sequences of eight housekeeping genes: argH, cya, glpK, gnd, murC, pgm, pta, and purH. The strains studied included the three type strains (M. abscessus CIP 104536(T), M. massiliense CIP 108297(T), and M. bolletii CIP 108541(T)) and 120 isolates recovered between 1997 and 2007 in France, Germany, Switzerland, and Brazil. The rpoB phylogenetic tree confirmed the existence of three main clusters, each comprising the type strain of one species. However, divergence values between the M. massiliense and M. bolletii clusters all were below 3% and between the M. abscessus and M. massiliense clusters were from 2.66 to 3.59%. The tree produced using the concatenated MLSA gene sequences (4,071 bp) also showed three main clusters, each comprising the type strain of one species. The M. abscessus cluster had a bootstrap value of 100% and was mostly compact. Bootstrap values for the M. massiliense and M. bolletii branches were much lower (71 and 61%, respectively), with the M. massiliense cluster having a fuzzy aspect. Mean (range) divergence values were 2.17% (1.13 to 2.58%) between the M. abscessus and M. massiliense clusters, 2.37% (1.5 to 2.85%) between the M. abscessus and M. bolletii clusters, and 2.28% (0.86 to 2.68%) between the M. massiliense and M. bolletii clusters. Adding the rpoB sequence to the MLSA-concatenated sequence (total sequence, 4,823 bp) had little effect on the clustering of strains. We found 10/120 (8.3%) isolates for which the concatenated MLSA gene sequence and rpoB sequence were discordant (e.g., M. massiliense MLSA sequence and M. abscessus rpoB sequence), suggesting the intergroup lateral transfers of rpoB. In conclusion, our study strongly supports the recent proposal that M. abscessus, M. massiliense, and M. bolletii should constitute a single species. Our findings also indicate that there has been a horizontal transfer of rpoB sequences between these subgroups, precluding the use of rpoB sequencing alone for the accurate identification of the two proposed M. abscessus subspecies.
Resumo:
The time passed since the infection of a human immunodeficiency virus (HIV)-infected individual (the age of infection) is an important but often only poorly known quantity. We assessed whether the fraction of ambiguous nucleotides obtained from bulk sequencing as done for genotypic resistance testing can serve as a proxy of this parameter.
Resumo:
Epilepsies have a highly heterogeneous background with a strong genetic contribution. The variety of unspecific and overlapping syndromic and nonsyndromic phenotypes often hampers a clear clinical diagnosis and prevents straightforward genetic testing. Knowing the genetic basis of a patient's epilepsy can be valuable not only for diagnosis but also for guiding treatment and estimating recurrence risks.
Resumo:
Arachnomelia is a monogenic recessive defect of skeletal development in cattle. The causative mutation was previously mapped to a approximately 7 Mb interval on chromosome 5. Here we show that array-based sequence capture and massively parallel sequencing technology, combined with the typical family structure in livestock populations, facilitates the identification of the causative mutation. We re-sequenced the entire critical interval in a healthy partially inbred cow carrying one copy of the critical chromosome segment in its ancestral state and one copy of the same segment with the arachnomelia mutation, and we detected a single heterozygous position. The genetic makeup of several partially inbred cattle provides extremely strong support for the causality of this mutation. The mutation represents a single base insertion leading to a premature stop codon in the coding sequence of the SUOX gene and is perfectly associated with the arachnomelia phenotype. Our findings suggest an important role for sulfite oxidase in bone development.
Resumo:
With the advent of cheaper and faster DNA sequencing technologies, assembly methods have greatly changed. Instead of outputting reads that are thousands of base pairs long, new sequencers parallelize the task by producing read lengths between 35 and 400 base pairs. Reconstructing an organism’s genome from these millions of reads is a computationally expensive task. Our algorithm solves this problem by organizing and indexing the reads using n-grams, which are short, fixed-length DNA sequences of length n. These n-grams are used to efficiently locate putative read joins, thereby eliminating the need to perform an exhaustive search over all possible read pairs. Our goal was develop a novel n-gram method for the assembly of genomes from next-generation sequencers. Specifically, a probabilistic, iterative approach was utilized to determine the most likely reads to join through development of a new metric that models the probability of any two arbitrary reads being joined together. Tests were run using simulated short read data based on randomly created genomes ranging in lengths from 10,000 to 100,000 nucleotides with 16 to 20x coverage. We were able to successfully re-assemble entire genomes up to 100,000 nucleotides in length.
Resumo:
With the advent of high through-put sequencing (HTS), the emerging science of metagenomics is transforming our understanding of the relationships of microbial communities with their environments. While metagenomics aims to catalogue the genes present in a sample through assessing which genes are actively expressed, metatranscriptomics can provide a mechanistic understanding of community inter-relationships. To achieve these goals, several challenges need to be addressed from sample preparation to sequence processing, statistical analysis and functional annotation. Here we use an inbred non-obese diabetic (NOD) mouse model in which germ-free animals were colonized with a defined mixture of eight commensal bacteria, to explore methods of RNA extraction and to develop a pipeline for the generation and analysis of metatranscriptomic data. Applying the Illumina HTS platform, we sequenced 12 NOD cecal samples prepared using multiple RNA-extraction protocols. The absence of a complete set of reference genomes necessitated a peptide-based search strategy. Up to 16% of sequence reads could be matched to a known bacterial gene. Phylogenetic analysis of the mapped ORFs revealed a distribution consistent with ribosomal RNA, the majority from Bacteroides or Clostridium species. To place these HTS data within a systems context, we mapped the relative abundance of corresponding Escherichia coli homologs onto metabolic and protein-protein interaction networks. These maps identified bacterial processes with components that were well-represented in the datasets. In summary this study highlights the potential of exploiting the economy of HTS platforms for metatranscriptomics.
Resumo:
Here we determined the analytical sensitivities of broad-range real-time PCR-based assays employing one of three different genomic DNA extraction protocols in combination with one of three different primer pairs targeting the 16S rRNA gene to detect a panel of 22 bacterial species. DNA extraction protocol III, using lysozyme, lysostaphin, and proteinase K, followed by PCR with the primer pair Bak11W/Bak2, giving amplicons of 796 bp in length, showed the best overall sensitivity, detecting DNA of 82% of the strains investigated at concentrations of < or =10(2) CFU in water per reaction. DNA extraction protocols I and II, using less enzyme treatment, combined with other primer pairs giving shorter amplicons of 466 bp and 342 or 346 bp, respectively, were slightly more sensitive for the detection of gram-negative but less sensitive for the detection of gram-positive bacteria. The obstacle of detecting background DNA in blood samples spiked with bacteria was circumvented by introducing a broad-range hybridization probe, and this preserved the minimal detection limits observed in samples devoid of blood. Finally, sequencing of the amplicons generated using the primer pair Bak11W/Bak2 allowed species identification of the detected bacterial DNA. Thus, broad-spectrum PCR targeting the 16S rRNA gene in the quantitative real-time format can achieve an analytical sensitivity of 1 to 10 CFU per reaction in water, avoid detection of background DNA with the introduction of a broad-range probe, and generate amplicons that allow species identification of the detected bacterial DNA by sequencing. These prerequisites are important for its application to blood-containing patient samples.
Resumo:
Ninety strains of a collection of well-identified clinical isolates of gram-negative nonfermentative rods collected over a period of 5 years were evaluated using the new colorimetric VITEK 2 card. The VITEK 2 colorimetric system identified 53 (59%) of the isolates to the species level and 9 (10%) to the genus level; 28 (31%) isolates were misidentified. An algorithm combining the colorimetric VITEK 2 card and 16S rRNA gene sequencing for adequate identification of gram-negative nonfermentative rods was developed. According to this algorithm, any identification by the colorimetric VITEK 2 card other than Achromobacter xylosoxidans, Acinetobacter sp., Burkholderia cepacia complex, Pseudomonas aeruginosa, and Stenotrophomonas maltophilia should be subjected to 16S rRNA gene sequencing when accurate identification of nonfermentative rods is of concern.