18 resultados para Complete genome sequencing


Relevância:

30.00% 30.00%

Publicador:

Resumo:

Current computational methods used to analyze changes in DNA methylation and chromatin modification rely on sequenced genomes. Here we describe a pipeline for the detection of these changes from short-read sequence data that does not require a reference genome. Open source software packages were used for sequence assembly, alignment, and measurement of differential enrichment. The method was evaluated by comparing results with reference-based results showing a strong correlation between chromatin modification and gene expression. We then used our de novo sequence assembly to build the DNA methylation profile for the non-referenced Psammomys obesus genome. The pipeline described uses open source software for fast annotation and visualization of unreferenced genomic regions from short-read data.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

BACKGROUND: Acute Lymphoblastic Leukaemia (ALL) is the most common cancer in children. Over the past four decades, research has advanced the treatment of this cancer from a less than 60% chance of survival to over 85% today. The causal molecular mechanisms remain unclear. Here, we performed sequencing-based genomic DNA methylation profiling of eight paediatric ALL patients using archived bone marrow smear microscope slides. FINDINGS: SOLiD™ sequencing data was collected from Methyl-Binding Domain (MBD) enriched fractions of genomic DNA. The primary tumour and remission bone marrow sample was analysed from eight patients. Four patients relapsed and the relapsed tumour was analysed. Input and MBD-enriched DNA from each sample was sequenced, aligned to the hg19 reference genome and analysed for enrichment peaks using MACS (Model-based Analysis for ChIP-Seq) and HOMER (Hypergeometric Optimization of Motif EnRichment). In total, 3.67 gigabases (Gb) were sequenced, 2.74 Gb were aligned to the reference genome (average 74.66% alignment efficiency). This dataset enables the interrogation of differential DNA methylation associated with paediatric ALL. Preliminary results reveal concordant regions of enrichment indicative of a DNA methylation signature. CONCLUSION: Our dataset represents one of the first SOLiD™MBD-Seq studies performed on paediatric ALL and is the first to utilise archival bone marrow smears. Differential DNA methylation between cancer and equivalent disease-free tissue can be identified and correlated with existing and published genomic studies. Given the rarity of paediatric haematopoietic malignancies, relative to adult counterparts, our demonstration of the utility of archived bone marrow smear samples to high-throughput methylation sequencing approaches offers tremendous potential to explore the role of DNA methylation in the aetiology of cancer.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

BACKGROUND: Strand specific RNAseq data is now more common in RNAseq projects. Visualizing RNAseq data has become an important matter in Analysis of sequencing data. The most widely used visualization tool is the UCSC genome browser that introduced the custom track concept that enabled researchers to simultaneously visualize gene expression at a particular locus from multiple experiments. Our objective of the software tool is to provide friendly interface for visualization of RNAseq datasets.

RESULTS: This paper introduces a visualization tool (RNASeqBrowser) that incorporates and extends the functionality of the UCSC genome browser. For example, RNASeqBrowser simultaneously displays read coverage, SNPs, InDels and raw read tracks with other BED and wiggle tracks -- all being dynamically built from the BAM file. Paired reads are also connected in the browser to enable easier identification of novel exon/intron borders and chimaeric transcripts. Strand specific RNAseq data is also supported by RNASeqBrowser that displays reads above (positive strand transcript) or below (negative strand transcripts) a central line. Finally, RNASeqBrowser was designed for ease of use for users with few bioinformatic skills, and incorporates the features of many genome browsers into one platform.

CONCLUSIONS: The features of RNASeqBrowser: (1) RNASeqBrowser integrates UCSC genome browser and NGS visualization tools such as IGV. It extends the functionality of the UCSC genome browser by adding several new types of tracks to show NGS data such as individual raw reads, SNPs and InDels. (2) RNASeqBrowser can dynamically generate RNA secondary structure. It is useful for identifying non-coding RNA such as miRNA. (3) Overlaying NGS wiggle data is helpful in displaying differential expression and is simple to implement in RNASeqBrowser. (4) NGS data accumulates a lot of raw reads. Thus, RNASeqBrowser collapses exact duplicate reads to reduce visualization space. Normal PC's can show many windows of NGS individual raw reads without much delay. (5) Multiple popup windows of individual raw reads provide users with more viewing space. This avoids existing approaches (such as IGV) which squeeze all raw reads into one window. This will be helpful for visualizing multiple datasets simultaneously. RNASeqBrowser and its manual are freely available at http://www.australianprostatecentre.org/research/software/rnaseqbrowser or http://sourceforge.net/projects/rnaseqbrowser/.