8 resultados para DNA Sequencing
em Cambridge University Engineering Department Publications Database
Resumo:
High-throughput DNA sequencing (HTS) instruments today are capable of generating millions of sequencing reads in a short period of time, and this represents a serious challenge to current bioinformatics pipeline in processing such an enormous amount of data in a fast and economical fashion. Modern graphics cards are powerful processing units that consist of hundreds of scalar processors in parallel in order to handle the rendering of high-definition graphics in real-time. It is this computational capability that we propose to harness in order to accelerate some of the time-consuming steps in analyzing data generated by the HTS instruments. We have developed BarraCUDA, a novel sequence mapping software that utilizes the parallelism of NVIDIA CUDA graphics cards to map sequencing reads to a particular location on a reference genome. While delivering a similar mapping fidelity as other mainstream programs , BarraCUDA is a magnitude faster in mapping throughput compared to its CPU counterparts. The software is also capable of supporting multiple CUDA devices in parallel to further accelerate the mapping throughput. BarraCUDA is designed to take advantage of the parallelism of GPU to accelerate the mapping of millions of sequencing reads generated by HTS instruments. By doing this, we could, at least in part streamline the current bioinformatics pipeline such that the wider scientific community could benefit from the sequencing technology. BarraCUDA is currently available at http://seqbarracuda.sf.net
Resumo:
BACKGROUND: With the maturation of next-generation DNA sequencing (NGS) technologies, the throughput of DNA sequencing reads has soared to over 600 gigabases from a single instrument run. General purpose computing on graphics processing units (GPGPU), extracts the computing power from hundreds of parallel stream processors within graphics processing cores and provides a cost-effective and energy efficient alternative to traditional high-performance computing (HPC) clusters. In this article, we describe the implementation of BarraCUDA, a GPGPU sequence alignment software that is based on BWA, to accelerate the alignment of sequencing reads generated by these instruments to a reference DNA sequence. FINDINGS: Using the NVIDIA Compute Unified Device Architecture (CUDA) software development environment, we ported the most computational-intensive alignment component of BWA to GPU to take advantage of the massive parallelism. As a result, BarraCUDA offers a magnitude of performance boost in alignment throughput when compared to a CPU core while delivering the same level of alignment fidelity. The software is also capable of supporting multiple CUDA devices in parallel to further accelerate the alignment throughput. CONCLUSIONS: BarraCUDA is designed to take advantage of the parallelism of GPU to accelerate the alignment of millions of sequencing reads generated by NGS instruments. By doing this, we could, at least in part streamline the current bioinformatics pipeline such that the wider scientific community could benefit from the sequencing technology.BarraCUDA is currently available from http://seqbarracuda.sf.net.
Resumo:
DNA cytosine methylation is a conserved epigenetic modification frequently correlating with transcriptional silencing in a wide variety of eukaryotic organisms. Sodium bisulfite treatment of DNA converts unmethylated cytosine to uracil, while 5-methylated cytosine is protected. We describe techniques that ensure reliable sequencing data following sodium bisulfite conversion and to avoid common pitfalls such as amplification of unconverted DNA and inclusion of sibling clones.
Resumo:
Small RNAs have several important biological functions. MicroRNAs (miRNAs) and trans-acting small interfering RNAs (tasiRNAs) regulate mRNA stability and translation, and siRNAs cause post-transcriptional gene silencing of transposons, viruses and transgenes and are important in both the establishment and maintenance of cytosine DNA methylation. Here, we study the role of the four Arabidopsis thaliana DICER-LIKE genes (DCL1-DCL4) in these processes. Sequencing of small RNAs from a dcl2 dcl3 dcl4 triple mutant showed markedly reduced tasiRNA and siRNA production and indicated that DCL1, in addition to its role as the major enzyme for processing miRNAs, has a previously unknown role in the production of small RNAs from endogenous inverted repeats. DCL2, DCL3 and DCL4 showed functional redundancy in siRNA and tasiRNA production and in the establishment and maintenance of DNA methylation. Our studies also suggest that asymmetric DNA methylation can be maintained by pathways that do not require siRNAs.