33 resultados para Transcription Factors -- chemistry -- genetics -- metabolism


Relevância:

100.00% 100.00%

Publicador:

Resumo:

A human endogenous retrovirus type E (HERV-E) was recently found to be selectively expressed in most renal cell carcinomas (RCCs). Importantly, antigens derived from this provirus are immunogenic, stimulating cytotoxic T cells that kill RCC cells in vitro and in vivo. Here, we show HERV-E expression is restricted to the clear cell subtype of RCC (ccRCC) characterized by an inactivation of the von Hippel-Lindau (VHL) tumor-suppressor gene with subsequent stabilization of hypoxia-inducible transcription factors (HIFs)-1α and -2α. HERV-E expression in ccRCC linearly correlated with HIF-2α levels and could be silenced in tumor cells by either transfection of normal VHL or small interfering RNA inhibition of HIF-2α. Using chromatin immunoprecipitation, we demonstrated that HIF-2α can serve as transcriptional factor for HERV-E by binding with HIF response element (HRE) localized in the proviral 5' long terminal repeat (LTR). Remarkably, the LTR was found to be hypomethylated only in HERV-E-expressing ccRCC while other tumors and normal tissues possessed a hypermethylated LTR preventing proviral expression. Taken altogether, these findings provide the first evidence that inactivation of a tumor suppressor gene can result in aberrant proviral expression in a human tumor and give insights needed for translational research aimed at boosting human immunity against antigenic components of this HERV-E.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The zinc-finger protein Rotund (Rn) plays a critical role in controlling the development of the fly olfactory system. However, little is known about its molecular function in vivo. Here, we added protein tags to the rn locus using CRISPR-Cas9 technology in Drosophila to investigate its subcellular localization and the genes that it regulates . We previously used a reporter construct to show that rn is expressed in a subset of olfactory receptor neuron (ORN) precursors and it is required for the diversification of ORN fates. Here, we show that tagged endogenous Rn protein is functional based on the analysis of ORN phenotypes. Using this method, we also mapped the expression pattern of the endogenous isoform-specific tags in vivo with increased precision. Comparison of the Rn expression pattern from this study with previously published results using GAL4 reporters showed that Rn is mainly present in early steps in antennal disc patterning, but not in pupal stages when ORNs are born. Finally, using chromatin immunoprecipitation, we showed a direct binding of Rotund to a previously identified regulatory element upstream of the bric-a-brac gene locus in the developing antennal disc.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Transcriptional regulation has been studied intensively in recent decades. One important aspect of this regulation is the interaction between regulatory proteins, such as transcription factors (TF) and nucleosomes, and the genome. Different high-throughput techniques have been invented to map these interactions genome-wide, including ChIP-based methods (ChIP-chip, ChIP-seq, etc.), nuclease digestion methods (DNase-seq, MNase-seq, etc.), and others. However, a single experimental technique often only provides partial and noisy information about the whole picture of protein-DNA interactions. Therefore, the overarching goal of this dissertation is to provide computational developments for jointly modeling different experimental datasets to achieve a holistic inference on the protein-DNA interaction landscape.

We first present a computational framework that can incorporate the protein binding information in MNase-seq data into a thermodynamic model of protein-DNA interaction. We use a correlation-based objective function to model the MNase-seq data and a Markov chain Monte Carlo method to maximize the function. Our results show that the inferred protein-DNA interaction landscape is concordant with the MNase-seq data and provides a mechanistic explanation for the experimentally collected MNase-seq fragments. Our framework is flexible and can easily incorporate other data sources. To demonstrate this flexibility, we use prior distributions to integrate experimentally measured protein concentrations.

We also study the ability of DNase-seq data to position nucleosomes. Traditionally, DNase-seq has only been widely used to identify DNase hypersensitive sites, which tend to be open chromatin regulatory regions devoid of nucleosomes. We reveal for the first time that DNase-seq datasets also contain substantial information about nucleosome translational positioning, and that existing DNase-seq data can be used to infer nucleosome positions with high accuracy. We develop a Bayes-factor-based nucleosome scoring method to position nucleosomes using DNase-seq data. Our approach utilizes several effective strategies to extract nucleosome positioning signals from the noisy DNase-seq data, including jointly modeling data points across the nucleosome body and explicitly modeling the quadratic and oscillatory DNase I digestion pattern on nucleosomes. We show that our DNase-seq-based nucleosome map is highly consistent with previous high-resolution maps. We also show that the oscillatory DNase I digestion pattern is useful in revealing the nucleosome rotational context around TF binding sites.

Finally, we present a state-space model (SSM) for jointly modeling different kinds of genomic data to provide an accurate view of the protein-DNA interaction landscape. We also provide an efficient expectation-maximization algorithm to learn model parameters from data. We first show in simulation studies that the SSM can effectively recover underlying true protein binding configurations. We then apply the SSM to model real genomic data (both DNase-seq and MNase-seq data). Through incrementally increasing the types of genomic data in the SSM, we show that different data types can contribute complementary information for the inference of protein binding landscape and that the most accurate inference comes from modeling all available datasets.

This dissertation provides a foundation for future research by taking a step toward the genome-wide inference of protein-DNA interaction landscape through data integration.