494 results for DNase-seq
Abstract:
DNaseI footprinting is an established assay for identifying transcription factor (TF)-DNA interactions with single base pair resolution. High-throughput DNase-seq assays have recently been used to detect in vivo DNase footprints across the genome. Multiple computational approaches have been developed to identify DNase-seq footprints as predictors of TF binding. However, recent studies have pointed to a substantial cleavage bias of DNase and its negative impact on predictive performance of footprinting. To assess the potential for using DNase-seq to identify individual binding sites, we performed DNase-seq on deproteinized genomic DNA and determined sequence cleavage bias. This allowed us to build bias-corrected and TF-specific footprint models. The predictive performance of these models demonstrated that predicted footprints corresponded to high-confidence TF-DNA interactions. DNase-seq footprints were absent under a fraction of ChIP-seq peaks, which we show to be indicative of weaker binding, indirect TF-DNA interactions or possible ChIP artifacts. The modeling approach was also able to detect variation in the consensus motifs that TFs bind to. Finally, cell-type-specific footprints were detected within DNase hypersensitive sites that are present in multiple cell types, further supporting that footprints can identify changes in TF binding that are not detectable using other strategies.
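The sequence cleavage bias mentioned in this abstract can be illustrated with a minimal sketch. It assumes, hypothetically, that bias is summarized as the ratio of each k-mer's share of observed cuts on naked DNA to its share of all k-mers in the sequence; the function names, the default k-mer width, and the convention for centering a cut within its k-mer are illustrative choices, not the authors' method.

```python
from collections import Counter


def kmer_counts(seq, k):
    """Count every k-mer in seq using a sliding window."""
    return Counter(seq[i:i + k] for i in range(len(seq) - k + 1))


def cleavage_bias(seq, cut_sites, k=6):
    """Observed/expected cleavage ratio per k-mer (hypothetical definition).

    cut_sites holds 0-based cut positions. Each cut is attributed to the
    k-mer whose window starts k // 2 bases upstream of the cut; this
    centering convention is an assumption made for illustration.
    """
    half = k // 2
    background = kmer_counts(seq, k)
    observed = Counter()
    for pos in cut_sites:
        start = pos - half
        if 0 <= start <= len(seq) - k:
            observed[seq[start:start + k]] += 1
    total_obs = sum(observed.values())
    total_bg = sum(background.values())
    if total_obs == 0:
        return {}
    # Ratio > 1 means the k-mer is cut more often than its abundance predicts.
    return {
        kmer: (observed[kmer] / total_obs) / (count / total_bg)
        for kmer, count in background.items()
    }


toy_bias = cleavage_bias("ACGTACGTAC", cut_sites=[1, 5, 2], k=2)
```

In a bias-corrected model, ratios of this kind could be used to down-weight cuts at k-mers that are preferentially cleaved regardless of protein occupancy.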
Abstract:
Despite an emerging understanding of the genetic alterations giving rise to various tumors, the mechanisms whereby most oncogenes are overexpressed remain unclear. Here we have utilized an integrated approach of genomewide regulatory element mapping via DNase-seq followed by conventional reporter assays and transcription factor binding site discovery to characterize the transcriptional regulation of the medulloblastoma oncogene Orthodenticle Homeobox 2 (OTX2). Through these studies we have revealed that OTX2 is differentially regulated in medulloblastoma at the level of chromatin accessibility, which is in part mediated by DNA methylation. In cell lines exhibiting chromatin accessibility of OTX2 regulatory regions, we found that autoregulation maintains OTX2 expression. Comparison of medulloblastoma regulatory elements with those of the developing brain reveals that these tumors engage a developmental regulatory program to drive OTX2 transcription. Finally, we have identified a transcriptional regulatory element mediating retinoid-induced OTX2 repression in these tumors. This work characterizes for the first time the mechanisms of OTX2 overexpression in medulloblastoma. Furthermore, this study establishes proof of principle for applying ENCODE datasets towards the characterization of upstream trans-acting factors mediating expression of individual genes.
Abstract:
Transcriptional regulation has been studied intensively in recent decades. One important aspect of this regulation is the interaction between regulatory proteins, such as transcription factors (TF) and nucleosomes, and the genome. Different high-throughput techniques have been invented to map these interactions genome-wide, including ChIP-based methods (ChIP-chip, ChIP-seq, etc.), nuclease digestion methods (DNase-seq, MNase-seq, etc.), and others. However, a single experimental technique often only provides partial and noisy information about the whole picture of protein-DNA interactions. Therefore, the overarching goal of this dissertation is to provide computational developments for jointly modeling different experimental datasets to achieve a holistic inference on the protein-DNA interaction landscape.
We first present a computational framework that can incorporate the protein binding information in MNase-seq data into a thermodynamic model of protein-DNA interaction. We use a correlation-based objective function to model the MNase-seq data and a Markov chain Monte Carlo method to maximize the function. Our results show that the inferred protein-DNA interaction landscape is concordant with the MNase-seq data and provides a mechanistic explanation for the experimentally collected MNase-seq fragments. Our framework is flexible and can easily incorporate other data sources. To demonstrate this flexibility, we use prior distributions to integrate experimentally measured protein concentrations.
We also study the ability of DNase-seq data to position nucleosomes. Traditionally, DNase-seq has only been widely used to identify DNase hypersensitive sites, which tend to be open chromatin regulatory regions devoid of nucleosomes. We reveal for the first time that DNase-seq datasets also contain substantial information about nucleosome translational positioning, and that existing DNase-seq data can be used to infer nucleosome positions with high accuracy. We develop a Bayes-factor-based nucleosome scoring method to position nucleosomes using DNase-seq data. Our approach utilizes several effective strategies to extract nucleosome positioning signals from the noisy DNase-seq data, including jointly modeling data points across the nucleosome body and explicitly modeling the quadratic and oscillatory DNase I digestion pattern on nucleosomes. We show that our DNase-seq-based nucleosome map is highly consistent with previous high-resolution maps. We also show that the oscillatory DNase I digestion pattern is useful in revealing the nucleosome rotational context around TF binding sites.
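As a rough illustration of Bayes-factor scoring, the sketch below compares a position-specific "nucleosome" cut-rate template with a flat background rate under Poisson counts. With fixed rates on both sides the Bayes factor reduces to a likelihood ratio. The Poisson assumption, the template values, and the function names are hypothetical and are not taken from the dissertation.

```python
import math


def log_poisson(count, rate):
    """Log probability of observing `count` events under Poisson(rate)."""
    return count * math.log(rate) - rate - math.lgamma(count + 1)


def log_bayes_factor(counts, nuc_rates, bg_rate):
    """Log Bayes factor of a nucleosome template versus a flat background.

    counts:    DNase I cut counts at consecutive positions
    nuc_rates: expected cut rate per position under the nucleosome model
               (an illustrative template, e.g. oscillating with the helix)
    bg_rate:   single expected cut rate under the background model
    """
    return sum(
        log_poisson(c, r) - log_poisson(c, bg_rate)
        for c, r in zip(counts, nuc_rates)
    )


# Hypothetical template alternating between protected and exposed positions.
template = [0.5, 3.5, 0.5, 3.5]
score_periodic = log_bayes_factor([0, 4, 0, 4], template, bg_rate=2.0)
score_flat = log_bayes_factor([2, 2, 2, 2], template, bg_rate=2.0)
```

Summing over all positions in the window is one simple way to model data points jointly across the nucleosome body; a positive score favors the nucleosome template, a negative one the background.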
Finally, we present a state-space model (SSM) for jointly modeling different kinds of genomic data to provide an accurate view of the protein-DNA interaction landscape. We also provide an efficient expectation-maximization algorithm to learn model parameters from data. We first show in simulation studies that the SSM can effectively recover underlying true protein binding configurations. We then apply the SSM to model real genomic data (both DNase-seq and MNase-seq data). Through incrementally increasing the types of genomic data in the SSM, we show that different data types can contribute complementary information for the inference of protein binding landscape and that the most accurate inference comes from modeling all available datasets.
This dissertation provides a foundation for future research by taking a step toward the genome-wide inference of protein-DNA interaction landscape through data integration.
Abstract:
BACKGROUND: Small molecule inhibitors of histone deacetylases (HDACi) hold promise as anticancer agents for particular malignancies. However, clinical use is often confounded by toxicity, perhaps due to indiscriminate hyperacetylation of cellular proteins. Therefore, elucidating the mechanisms by which HDACi trigger differentiation, cell cycle arrest, or apoptosis of cancer cells could inform development of more targeted therapies. We used the myelogenous leukemia line K562 as a model of HDACi-induced differentiation to investigate chromatin accessibility (DNase-seq) and expression (RNA-seq) changes associated with this process. RESULTS: We identified several thousand specific regulatory elements [~10 % of total DNase I-hypersensitive (DHS) sites] that become significantly more or less accessible with sodium butyrate or suberanilohydroxamic acid treatment. Most of the differential DHS sites display hallmarks of enhancers, including being enriched for non-promoter regions, associating with nearby gene expression changes, and increasing luciferase reporter expression in K562 cells. Differential DHS sites were enriched for key hematopoietic lineage transcription factor motifs, including SPI1 (PU.1), a known pioneer factor. We found PU.1 increases binding at opened DHS sites with HDACi treatment by ChIP-seq, but PU.1 knockdown by shRNA fails to block the chromatin accessibility and expression changes. A machine-learning approach indicates H3K27me3 initially marks PU.1-bound sites that open with HDACi treatment, suggesting these sites are epigenetically poised. CONCLUSIONS: We find HDACi treatment of K562 cells results in site-specific chromatin remodeling at epigenetically poised regulatory elements. PU.1 shows evidence of a pioneer role in this process by marking poised enhancers but is not required for transcriptional activation.
Abstract:
Thesis (Ph.D.)--University of Washington, 2016-06
Abstract:
Thesis (Ph.D.)--University of Washington, 2016-08
Abstract:
This is a preliminary scoping presentation. It outlines some of the very early issues identified in this research topic.
Abstract:
The rationale for providing state subsidised public transport has changed over time from a social obligation to provide transport options for those without access to private transport to an environmental and economic imperative to minimize congestion and greenhouse gas emissions. In many jurisdictions this shift has seen a greater focus on the provision of peak hour commuter services and a shift in the demographic profile of the riding public and a significant increase in the number of commuter passengers relative to others. The scheduling of commuter services is not geared to meet the needs of children and their generally female carers who often need to engage in trip chaining and travel outside peak commuting periods and on weekends. In addition to service scheduling difficulties, transport infrastructure, both on-board and supporting infrastructure such as bus stops, train stations and connecting footpaths often do not support children and their carers to use public transport services. Combined with a negative attitude by passengers and service providers, such as bus drivers, which may see children, babies and young people as out of place and unwelcome on commuter services, these issues conspire to hinder the use of public transport by children and their carers. Overlaying feminist geography analysis and insights and child-friendly cities objectives, this paper proposes some basic criteria for the provision of public transport services and supporting infrastructure which meets the needs of children, babies and their carers and juxtaposes the achievement of these in South East Queensland, Australia and Stockholm, Sweden.
Abstract:
The project investigated the molecular response of Tra catfish (Pangasianodon hypophthalmus) to elevated salinity conditions. We employed a next-generation sequencing platform to evaluate differential expression profiles of key genes under two salinity conditions. Results of the current project can form the basis for further studies to confirm the functional roles of specific genes that influence salinity tolerance in the target species and more broadly in other freshwater teleost fishes. Ultimately, the approach can contribute to developing superior culture stocks of the target species.
Abstract:
A comparison of the DNase I digestion products of the 32P-5’-end-labeled pachytene nucleosome core particles (containing histones H2A, TH2A, X2, H2B, TH2B, H3, and H4) and liver nucleosome core particles (containing somatic histones H2A, H2B, H3, and H4) revealed that the cleavage sites that are 30, 40, and 110 nucleotides away from the 5’-end are significantly more accessible in the pachytene core particles than in the liver core particles. These cleavage sites correspond to the region wherein H2B interacts with the nucleosome core DNA. These results, therefore, suggest that the histone-DNA interaction at these sites in the pachytene core particles is weaker, possibly because of the presence of the histone variant TH2B interacting at similar topological positions in the nucleosome core as that of its somatic counterpart H2B. Such a loosened structure may also be maintained even in the native pachytene chromatin since micrococcal nuclease digestion of pachytene nuclei resulted in a higher ratio of subnucleosomes (SN4 + SN?) to mononucleosomes than that observed in liver chromatin.
Abstract:
Increasing salinity levels in freshwater and coastal environments caused by sea level rise linked to climate change are now recognized as a major factor that can impact fish growth negatively, especially for freshwater teleost species. Striped catfish (Pangasianodon hypophthalmus) is an important freshwater teleost that is now widely farmed across the Mekong River Delta in Vietnam. Understanding the basis for tolerance and adaptation to raised environmental salinity conditions can assist the regional culture industry to mitigate predicted impacts of climate change across this region. Next-generation sequencing on the Ion Proton platform yielded more than 174 million raw reads from three tissue libraries (gill, kidney and intestine). Reads were filtered and de novo assembled using a variety of assemblers and then clustered together to generate a combined reference transcriptome. Downstream analysis resulted in a final reference transcriptome that contained 60,585 transcripts with an N50 of 683 bp. This resource was further annotated using a variety of bioinformatics databases, followed by differential gene expression analysis that identified 3062 transcripts that were differentially expressed in catfish samples raised under two experimental conditions (0 and 15 ppt). A number of transcripts with a potential role in salinity tolerance were then classified into six different functional gene categories based on their gene ontology assignments. These included: energy metabolism, ion transportation, detoxification, signal transduction, structural organization and detoxification. Finally, we combined the data on functional salinity tolerance genes into a hypothetical schematic model that attempted to describe potential relationships and interactions among target genes to explain the molecular pathways that control adaptive salinity responses in P. hypophthalmus. Our results indicate that P. hypophthalmus exhibits predictable plastic regulatory responses to elevated salinity by means of characteristic gene expression patterns, providing numerous candidate genes for future investigations.
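The N50 statistic quoted for the reference transcriptome can be computed as in the sketch below, which uses the common definition: the length of the shortest sequence in the smallest set of longest sequences that together cover at least half of the total assembly length. The function name and the toy lengths are illustrative and do not come from the study's data.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths.

    Sequences are visited from longest to shortest; the N50 is the length
    at which the running total first reaches half of the summed length.
    """
    ordered = sorted(lengths, reverse=True)
    if not ordered:
        raise ValueError("n50() requires at least one length")
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length


toy_n50 = n50([2, 3, 4, 5, 6, 7, 8, 9, 10])
```

For the toy lengths the total is 54, so the running sum 10 + 9 + 8 = 27 reaches the halfway point at the sequence of length 8.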
Abstract:
Preferential cleavage of active genes by DNase I has been correlated with a structurally altered conformation of DNA at the hypersensitive site in chromatin. To better understand the structural requirements for gene activation as probed by DNase I action, we studied the DNase I digestibility of synthetic polynucleotides able to adopt B and non-B conformations (such as the Z-form); the B-form of DNA was markedly more digestible. A left-handed Z-form present within a natural sequence in a supercoiled plasmid also showed marked resistance to DNase I digestion. We show that alternating purine-pyrimidine sequences adopting the Z-conformation exhibit DNase I footprinting even in a protein-free system. The logical deductions from the results indicate that 1) an altered structure such as Z-DNA is not a favourable substrate for DNase I, 2) both ends of the alternating purine-pyrimidine insert show hypersensitivity, 3) the B-form, with a minor groove of 12-13 Å, is a more favourable substrate for DNase I than an altered structure, and 4) any structure of DNA deviating largely from the B-form with a capacity to flip over to the B-form is a potential target for DNase I enzymic probes in naked DNA.
Abstract:
Campylobacter jejuni is the most common bacterial cause of foodborne disease in the developed world. Its general physiology and biochemistry, as well as the mechanisms enabling it to colonize and cause disease in various hosts, are not well understood, and new approaches are required to understand its basic biology. High-throughput sequencing technologies provide unprecedented opportunities for functional genomic research. Recent studies have shown that direct Illumina sequencing of cDNA (RNA-seq) is a useful technique for the quantitative and qualitative examination of transcriptomes. In this study we report RNA-seq analyses of the transcriptomes of C. jejuni (NCTC11168) and its rpoN mutant. This has allowed the identification of hitherto unknown transcriptional units, and further defines the regulon that is dependent on rpoN for expression. The analysis of the NCTC11168 transcriptome was supplemented by additional proteomic analysis using liquid chromatography-MS. The transcriptomic and proteomic datasets represent an important resource for the Campylobacter research community. © 2011 SGM.