5 resultados para Plant genome mapping
em Cambridge University Engineering Department Publications Database
Resumo:
Sexual eukaryotes reproduce via the meiotic cell division, where ploidy is halved and homologous chromosomes undergo reciprocal genetic exchange, termed crossover (CO). CO frequency has a profound effect on patterns of genetic variation and species evolution. Relative CO rates vary extensively both within and between plant genomes. Plant genome size varies by over 1000-fold, largely due to differential expansion of repetitive sequences, and increased genome size is associated with reduced CO frequency. Gene versus repeat sequences associate with distinct chromatin modifications, and evidence from plant genomes indicates that this epigenetic information influences CO patterns. This is consistent with data from diverse eukaryotes that demonstrate the importance of chromatin structure for control of meiotic recombination. In this review I will discuss CO frequency patterns in plant genomes and recent advances in understanding recombination distributions.
Resumo:
Cytosine methylation is important for transposon silencing and epigenetic regulation of endogenous genes, although the extent to which this DNA modification functions to regulate the genome is still unknown. Here we report the first comprehensive DNA methylation map of an entire genome, at 35 base pair resolution, using the flowering plant Arabidopsis thaliana as a model. We find that pericentromeric heterochromatin, repetitive sequences, and regions producing small interfering RNAs are heavily methylated. Unexpectedly, over one-third of expressed genes contain methylation within transcribed regions, whereas only approximately 5% of genes show methylation within promoter regions. Interestingly, genes methylated in transcribed regions are highly expressed and constitutively active, whereas promoter-methylated genes show a greater degree of tissue-specific expression. Whole-genome tiling-array transcriptional profiling of DNA methyltransferase null mutants identified hundreds of genes and intergenic noncoding RNAs with altered expression levels, many of which may be epigenetically controlled by DNA methylation.
Resumo:
In addition to the three RNA polymerases (RNAP I-III) shared by all eukaryotic organisms, plant genomes encode a fourth RNAP (RNAP IV) that appears to be specialized in the production of siRNAs. Available data support a model in which dsRNAs are generated by RNAP IV and RNA-dependent RNAP 2 (RDR2) and processed by DICER (DCL) enzymes into 21- to 24-nt siRNAs, which are associated with different ARGONAUTE (AGO) proteins for transcriptional or posttranscriptional gene silencing. However, it is not yet clear what fraction of genomic siRNA production is RNAP IV-dependent, and to what extent these siRNAs are preferentially processed by certain DCL(s) or associated with specific AGOs for distinct downstream functions. To address these questions on a genome-wide scale, we sequenced approximately 335,000 siRNAs from wild-type and RNAP IV mutant Arabidopsis plants by using 454 technology. The results show that RNAP IV is required for the production of >90% of all siRNAs, which are faithfully produced from a discrete set of genomic loci. Comparisons of these siRNAs with those accumulated in rdr2 and dcl2 dcl3 dcl4 and those associated with AGO1 and AGO4 provide important information regarding the processing, channeling, and functions of plant siRNAs. We also describe a class of RNAP IV-independent siRNAs produced from endogenous single-stranded hairpin RNA precursors.
Resumo:
The function of plant genomes depends on chromatin marks such as the methylation of DNA and the post-translational modification of histones. Techniques for studying model plants such as Arabidopsis thaliana have enabled researchers to begin to uncover the pathways that establish and maintain chromatin modifications, and genomic studies are allowing the mapping of modifications such as DNA methylation on a genome-wide scale. Small RNAs seem to be important in determining the distribution of chromatin modifications, and RNA might also underlie the complex epigenetic interactions that occur between homologous sequences. Plants use these epigenetic silencing mechanisms extensively to control development and parent-of-origin imprinted gene expression.
Resumo:
High-throughput DNA sequencing (HTS) instruments today are capable of generating millions of sequencing reads in a short period of time, and this represents a serious challenge to current bioinformatics pipeline in processing such an enormous amount of data in a fast and economical fashion. Modern graphics cards are powerful processing units that consist of hundreds of scalar processors in parallel in order to handle the rendering of high-definition graphics in real-time. It is this computational capability that we propose to harness in order to accelerate some of the time-consuming steps in analyzing data generated by the HTS instruments. We have developed BarraCUDA, a novel sequence mapping software that utilizes the parallelism of NVIDIA CUDA graphics cards to map sequencing reads to a particular location on a reference genome. While delivering a similar mapping fidelity as other mainstream programs , BarraCUDA is a magnitude faster in mapping throughput compared to its CPU counterparts. The software is also capable of supporting multiple CUDA devices in parallel to further accelerate the mapping throughput. BarraCUDA is designed to take advantage of the parallelism of GPU to accelerate the mapping of millions of sequencing reads generated by HTS instruments. By doing this, we could, at least in part streamline the current bioinformatics pipeline such that the wider scientific community could benefit from the sequencing technology. BarraCUDA is currently available at http://seqbarracuda.sf.net