10 results for TRANSCRIPTIONAL REGULATION

in CaltechTHESIS


Relevance: 100.00%

Abstract:

The ability to regulate gene expression is of central importance for the adaptability of living organisms to changes in their internal and external environment. At the transcriptional level, binding of transcription factors (TFs) in the vicinity of promoters can modulate the rate at which transcripts are produced, and as such plays an important role in gene regulation. TFs with regulatory action at multiple promoters are the rule rather than the exception, with examples ranging from TFs like the cAMP receptor protein (CRP) in E. coli, which regulates hundreds of different genes, to situations involving multiple copies of the same gene, such as on plasmids or viral DNA. When the number of TFs heavily exceeds the number of binding sites, TF binding to each promoter can be regarded as independent. However, when the number of TF molecules is comparable to the number of binding sites, TF titration will result in coupling ("entanglement") between transcription of different genes. The last few decades have seen rapid advances in our ability to quantitatively measure such effects, which calls for biophysical models to explain these data. Here we develop a statistical mechanical model which takes the TF titration effect into account and use it to predict both the level of gene expression and the resulting correlation in transcription rates for a general set of promoters. To test these predictions experimentally, we create genetic constructs with known TF copy number, binding site affinities, and gene copy number, thus avoiding the need for free fit parameters. Our results clearly demonstrate the TF titration effect and show that the statistical mechanical model can accurately predict the fold change in gene expression for the studied cases. We also generalize these experimental efforts to cover systems with multiple different genes, using the method of mRNA fluorescence in situ hybridization (FISH). Interestingly, we can use the TF titration effect as a tool to measure the plasmid copy number at different points in the cell cycle, as well as the plasmid copy number variance. Finally, we investigate the strategies of transcriptional regulation used in a real organism by analyzing the thousands of known regulatory interactions in E. coli. We introduce a "random promoter architecture model" to identify overrepresented regulatory strategies, such as TF pairs which coregulate the same genes more frequently than would be expected by chance, indicating a related biological function. Furthermore, we investigate whether promoter architecture has a systematic effect on gene expression by linking the regulatory data of E. coli to genome-wide expression censuses.
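
The titration effect described in this abstract can be illustrated with a minimal sketch. It is not the thesis's own model: it assumes simple repression in the weak-promoter limit, R identical repressors shared among N identical specific sites, and a nonspecific background of `N_NS` sites (set here to roughly the E. coli genome length in base pairs, an assumption). The function name and all parameter names are hypothetical.

```python
import math

def fold_change_titration(R, N, beta_eps, N_NS=4.6e6):
    """Fold change in expression for N identical promoter copies sharing R repressors.

    R        : number of repressor molecules (int)
    N        : number of specific binding sites, one per gene copy (int)
    beta_eps : binding energy of the specific site relative to nonspecific DNA, in kT
    N_NS     : number of nonspecific sites (assumed value)
    """
    # Statistical weight of having k repressors on specific sites, relative to
    # the state with all R repressors bound nonspecifically:
    #   w_k = C(N, k) * R! / (R - k)! * N_NS**(-k) * exp(-k * beta_eps)
    log_w = []
    for k in range(0, min(R, N) + 1):
        log_w.append(
            math.lgamma(N + 1) - math.lgamma(k + 1) - math.lgamma(N - k + 1)
            + math.lgamma(R + 1) - math.lgamma(R - k + 1)
            - k * math.log(N_NS) - k * beta_eps
        )
    # Subtract the largest log-weight before exponentiating to avoid overflow.
    shift = max(log_w)
    w = [math.exp(lw - shift) for lw in log_w]
    Z = sum(w)
    mean_bound = sum(k * wk for k, wk in enumerate(w)) / Z
    # Weak-promoter limit: fold change equals the fraction of repressor-free sites.
    return 1.0 - mean_bound / N
```

With a single gene copy (N = 1) the expression reduces to the familiar 1 / (1 + (R / N_NS) exp(-beta_eps)). When N is comparable to or larger than R, the repressors are titrated away and the fold change cannot drop below 1 - R / N, which is the coupling between gene copies that the abstract refers to.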

Relevance: 100.00%

Abstract:

Understanding how transcriptional regulatory sequence maps to regulatory function remains a difficult problem in regulatory biology. Given a particular DNA sequence for a bacterial promoter region, we would like to be able to say which transcription factors bind there, how strongly they bind, and whether they interact with each other and/or RNA polymerase, with the ultimate objective of integrating knowledge of these parameters into a prediction of gene expression levels. Statistical thermodynamics provides a useful framework for doing so, enabling us to predict how gene expression levels depend on transcription factor binding energies and concentrations. We used thermodynamic models, coupled with models of the sequence-dependent binding energies of transcription factors and RNAP, to construct a genotype-to-phenotype map for the level of repression exhibited by the lac promoter, and tested it experimentally using a set of promoter variants from E. coli strains isolated from different natural environments. For this work, we sought to "reverse engineer" naturally occurring promoter sequences to understand how variations in promoter sequence affect gene expression. The natural inverse of this approach is to "forward engineer" promoter sequences to obtain targeted levels of gene expression. We used a high-precision model of RNAP-DNA sequence-dependent binding energy, coupled with a thermodynamic model relating binding energy to gene expression, to predictively design and verify a suite of synthetic E. coli promoters whose expression varied over nearly three orders of magnitude.

However, although thermodynamic models enable predictions of mean levels of gene expression, it has become evident that cell-to-cell variability or "noise" in gene expression can also play a biologically important role. To address this aspect of gene regulation, we developed models based on the chemical master equation framework and used them to explore the noise properties of a number of common E. coli regulatory motifs; these properties included the dependence of the noise on parameters such as transcription factor binding strength and copy number. We then performed experiments in which these parameters were systematically varied and measured the level of variability using mRNA FISH. The results showed a clear dependence of the noise on these parameters, in accord with model predictions.
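
As a rough illustration of the kind of noise prediction a master-equation model yields, the sketch below evaluates the well-known steady-state mean and Fano factor (variance divided by mean) of mRNA number for a two-state promoter that switches between an inactive and an active state. It is not taken from the thesis: the two-state architecture, the function name, and the rate names (`k_act`, `k_inact`, `r`, `gamma`) are assumptions chosen for illustration.

```python
def two_state_moments(k_act, k_inact, r, gamma):
    """Steady-state mRNA mean and Fano factor for a two-state promoter.

    k_act   : rate of switching inactive -> active (e.g. repressor unbinding)
    k_inact : rate of switching active -> inactive (e.g. repressor binding)
    r       : transcription rate while the promoter is active
    gamma   : mRNA degradation rate
    """
    # Fraction of time the promoter spends in the active state.
    p_active = k_act / (k_act + k_inact)
    mean = p_active * r / gamma
    # Promoter switching adds a super-Poissonian term to the Fano factor.
    switching = k_act + k_inact
    fano = 1.0 + r * k_inact / (switching * (switching + gamma))
    return mean, fano
```

If `k_inact` is taken to scale with repressor copy number and `k_act` to fall with stronger binding (both assumptions), sweeping these two rates shows how the predicted noise depends on the parameters named in the abstract. With `k_inact = 0` the promoter is always active and the result is Poissonian (Fano factor 1).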

Finally, one shortcoming of the preceding modeling frameworks is that their applicability is largely limited to systems that are already well-characterized, such as the lac promoter. Motivated by this fact, we used a high throughput promoter mutagenesis assay called Sort-Seq to explore the completely uncharacterized transcriptional regulatory DNA of the E. coli mechanosensitive channel of large conductance (MscL). We identified several candidate transcription factor binding sites, and work is continuing to identify the associated proteins.

Relevance: 70.00%

Abstract:

The main focus of this thesis is the use of high-throughput sequencing technologies in functional genomics (in particular in the form of ChIP-seq, chromatin immunoprecipitation coupled with sequencing, and RNA-seq) and the study of the structure and regulation of transcriptomes. Some parts of it are of a more methodological nature while others describe the application of these functional genomic tools to address various biological problems. A significant part of the research presented here was conducted as part of the ENCODE (ENCyclopedia Of DNA Elements) Project.

The first part of the thesis focuses on the structure and diversity of the human transcriptome. Chapter 1 contains an analysis of the diversity of the human polyadenylated transcriptome based on RNA-seq data generated for the ENCODE Project. Chapter 2 presents a simulation-based examination of the performance of some of the most popular computational tools used to assemble and quantify transcriptomes. Chapter 3 includes a study of variation in gene expression, alternative splicing and allelic expression bias on the single-cell level and on a genome-wide scale in human lymphoblastoid cells; it also raises a number of methodological considerations critical to the practice of single-cell RNA-seq measurements.

The second part presents several studies applying functional genomic tools to the study of the regulatory biology of organellar genomes, primarily in mammals but also in plants. Chapter 5 contains an analysis of the occupancy of the human mitochondrial genome by TFAM, an important structural and regulatory protein in mitochondria, using ChIP-seq. In Chapter 6, the mitochondrial DNA occupancy of the TFB2M transcriptional regulator, the MTERF termination factor, and the mitochondrial RNA and DNA polymerases is characterized. Chapter 7 consists of an investigation into the curious phenomenon of the physical association of nuclear transcription factors with mitochondrial DNA, based on the diverse collections of transcription factor ChIP-seq datasets generated by the ENCODE, mouseENCODE and modENCODE consortia. In Chapter 8 this line of research is further extended to existing publicly available ChIP-seq datasets in plants and their mitochondrial and plastid genomes.

The third part is dedicated to the analytical and experimental practice of ChIP-seq. As part of the ENCODE Project, a set of metrics for assessing the quality of ChIP-seq experiments was developed, and the results of this activity are presented in Chapter 9. These metrics were later used to carry out a global analysis of ChIP-seq quality in the published literature (Chapter 10). In Chapter 11, the development and initial application of an automated robotic ChIP-seq (in which these metrics also played a major role) is presented.

The fourth part presents the results of some additional projects the author has been involved in, including the study of the role of the Piwi protein in the transcriptional regulation of transposon expression in Drosophila (Chapter 12), and the use of single-cell RNA-seq to characterize the heterogeneity of gene expression during cellular reprogramming (Chapter 13).

The last part of the thesis provides a review of the results of the ENCODE Project and an interpretation of the complexity of the biochemical activity of mammalian genomes that they have revealed (Chapters 15 and 16), an overview of the technical developments expected in the near future and their impact on the field of functional genomics (Chapter 14), and a discussion of some research areas that have so far been insufficiently explored, the future study of which will, in the opinion of the author, provide deep insights into many fundamental but not yet completely answered questions about the transcriptional biology of eukaryotes and its regulation.

Relevance: 70.00%

Abstract:

Biological information storage and retrieval is a dynamic process that requires the genome to undergo dramatic structural rearrangements. Recent advances in single-molecule techniques have allowed precise quantification of the nano-mechanical properties of DNA [1, 2], and direct in vivo observation of molecules in action [3]. In this work, we will examine elasticity in protein-mediated DNA looping, whose structural rearrangement is essential for transcriptional regulation in both prokaryotes and eukaryotes. We will look at hydrodynamics in the process of viral DNA ejection, which mediates information transfer and exchange and has prominent implications in evolution. As in the case of Kepler's laws of planetary motion leading to Newton's gravitational theory, and the allometric scaling laws in biology revealing the organizing principles of complex networks [4], experimental data collapse in these biological phenomena has guided many of our studies and prompted us to find the underlying physical principles.

Relevance: 60.00%

Abstract:

Despite the complexity of biological networks, we find that certain common architectures govern network structures. These architectures impose fundamental constraints on system performance and create tradeoffs that the system must balance in the face of uncertainty in the environment. This means that while a system may be optimized for a specific function through evolution, the optimal achievable state must follow these constraints. One such constraining architecture is autocatalysis, as seen in many biological networks including glycolysis and ribosomal protein synthesis. Using a minimal model, we show that ATP autocatalysis in glycolysis imposes stability and performance constraints and that the experimentally well-studied glycolytic oscillations are in fact a consequence of a tradeoff between error minimization and stability. We also show that additional complexity in the network results in increased robustness. Ribosome synthesis is also autocatalytic, since ribosomes must be used to make more ribosomal proteins. When ribosomes have higher protein content, the autocatalysis is increased. We show that this autocatalysis destabilizes the system, slows down response, and also constrains the system’s performance. On a larger scale, transcriptional regulation of whole organisms also follows architectural constraints, and this can be seen in the differences between bacterial and yeast transcription networks. We show that the degree distribution of the bacterial transcription network follows a power law while that of the yeast network follows an exponential distribution. We then explore the evolutionary models that have previously been proposed and show that neither the preferential linking model nor the duplication-divergence model of network evolution generates the power-law, hierarchical structure found in bacteria. However, in real biological systems, the generation of new nodes occurs through both duplication and horizontal gene transfer, and we show that a biologically reasonable combination of the two mechanisms generates the desired network.
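
For readers unfamiliar with the duplication-divergence model mentioned above, the following is a minimal sketch of one common textbook variant, together with a degree histogram for inspecting the resulting distribution. It is not the thesis's model and does not include horizontal gene transfer; the undirected graph, the seed network, the function names, and the `p_keep` parameter are simplifying assumptions.

```python
import random

def duplication_divergence(n_nodes, p_keep=0.4, seed=0):
    """Grow an undirected network by node duplication followed by edge loss.

    Each new node copies a randomly chosen existing node and keeps each of
    the parent's edges independently with probability p_keep.
    """
    rng = random.Random(seed)
    adj = {0: {1}, 1: {0}}  # seed network: a single edge
    while len(adj) < n_nodes:
        new = len(adj)
        parent = rng.randrange(new)
        # Divergence step: inherit only a random subset of the parent's edges.
        inherited = {nb for nb in sorted(adj[parent]) if rng.random() < p_keep}
        adj[new] = set(inherited)
        for nb in inherited:
            adj[nb].add(new)
    return adj

def degree_histogram(adj):
    """Map each degree to the number of nodes that have it."""
    hist = {}
    for neighbours in adj.values():
        d = len(neighbours)
        hist[d] = hist.get(d, 0) + 1
    return hist
```

Plotting `degree_histogram(duplication_divergence(2000))` on log-log and log-linear axes is one simple way to judge whether a generated network looks closer to a power-law or an exponential degree distribution.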

Relevance: 60.00%

Abstract:

With recent advances in high-throughput sequencing, mapping of genome-wide transcription factor occupancy has become feasible. To advance the understanding of skeletal muscle differentiation specifically and transcriptional regulation in general, I determined the genome-wide occupancy map for myogenin in differentiating C2C12 myocyte cells. I then analyzed the myogenin map for underlying sequence content and the association between occupied elements and expression trajectories of adjacent genes. Having determined that myogenin primarily associates with expressed genes, I performed a similar analysis on occupancy maps of other transcription factors active during skeletal muscle differentiation, including an extensive analysis of co-occupancy. This analysis provided strong motif evidence for protein-protein interactions as the primary driving force in the formation of Myogenin / Mef2 and MyoD / AP-1 complexes at jointly-occupied sites. Finally, factor occupancy analysis was extended to include bHLH transcription factors in tissues other than skeletal muscle. The cross-tissue analysis led to the emergence of a motif structure used by bHLH TFs to encode either tissue-specific or "general" (public) access in a variety of lineages.

Relevance: 60.00%

Abstract:

The investigations presented in this thesis use various in vivo techniques to understand how trans-acting factors control gene expression. The first part addresses the transcriptional regulation of muscle creatine kinase (MCK). MCK expression is activated during the course of development and is found only in differentiated muscle. Several in vivo footprints are observed at the enhancer of this gene, but all of these interactions are limited to cell types that express MCK. This is interesting because two of the footprints appear to represent muscle specific use of general transcription factors, while the other two correspond to sites that can bind the myogenic regulator, MyoD1, in vitro. MyoD1 and these general factors are present in myoblasts, but can bind to the enhancer only in myocytes. This suggests that either the factors themselves are post-translationally modified (phosphorylation or protein:protein interactions), or the accessibility of the enhancer to the factors is limited (changes in chromatin structure). The in vivo footprinting study of MCK was performed with a new ligation mediated, single-sided PCR (polymerase chain reaction) technique that I have developed.

The second half of the thesis concerns the regulation of mouse metallothionein (MT). Metallothioneins are a family of highly conserved housekeeping genes whose expression can be induced by heavy metals, steroids, and other stresses. By adapting a primer extension method of genomic sequencing to in vivo footprinting, I've observed both metal inducible and noninducible interactions at the promoter of MT-I. From these results I've been able to limit the possible mechanisms by which metal responsive trans-acting factors induce transcription. These interpretations correlate with a second line of experiments involving the stable titration of positive acting factors necessary for induction of MT. I've amplified the promoter of MT to 10^2-10^3 copies per cell by fusing the 5' and 3' ends of the MT gene to the coding region of DHFR and selecting cells for methotrexate resistance. In these cells, there is a metal-specific titration effect, and although it acts at the level of transcription, it appears to be independent of direct DNA binding factors.

Relevance: 30.00%

Abstract:

The sea urchin embryonic skeleton, or spicule, is deposited by mesenchymal progeny of four precursor cells, the micromeres, which are determined to the skeletogenic pathway by a process known as cytoplasmic localization. A gene encoding one of the major products of the skeletogenic mesenchyme, a prominent 50 kD protein of the spicule matrix, has been characterized in detail. cDNA clones were first isolated by antibody screening of a phage expression library, followed by isolation of homologous genomic clones. The gene, known as SM50, is single copy in the sea urchin genome, is divided into two exons of 213 and 1682 bp, and is expressed only in skeletogenic cells. Transcripts are first detectable at the 120 cell stage, shortly after the segregation of the skeletogenic precursors from the rest of the embryo. The SM50 open reading frame begins within the first exon, is 450 amino acids in length, and contains a loosely repeated 13 amino acid motif rich in acidic residues which accounts for 45% of the protein and which is possibly involved in interaction with the mineral phase of the spicule.

The important cis-acting regions of the SM50 gene necessary for proper regulation of expression were identified by gene transfer experiments. A 562 bp promoter fragment, containing 438 bp of 5' promoter sequence and 124 bp of the SM50 first exon (including the SM50 initiation codon), was both necessary and sufficient to direct high levels of expression of the bacterial chloramphenicol acetyltransferase (CAT) reporter gene specifically in the skeletogenic cells. Removal of promoter sequences between positions -2200 and -438, and of transcribed regions downstream of +124 (including the SM50 intron), had no effect on the spatial or transcriptional activity of the transgenes.

Regulatory proteins that interact with the SM50 promoter were identified by the gel retardation assay, using bulk embryo mesenchyme blastula stage nuclear proteins. Five protein binding sites were identified and mapped to various degrees of resolution. Two sites are homologous, may be enhancer elements, and at least one is required for expression. Two additional sites are also present in the promoter of the aboral ectoderm specific cytoskeletal actin gene CyIIIa; one of these is a CCAAT element, the other a putative repressor element. The fifth site overlaps the binding site of the putative repressor and may function as a positive regulator by interfering with binding of the repressor. All of the proteins are detectable in nuclear extracts prepared from 64 cell stage embryos, a stage just before expression of SM50 is initiated, as well as from blastula and gastrula stage; the putative enhancer binding protein may be maternal as well.

Relevance: 30.00%

Abstract:

The ubiquitin-dependent proteolytic pathway plays an important role in a broad array of cellular processes, including cell cycle control and transcription. Biochemical analysis of the ubiquitination of Sic1, the B-type cyclin-dependent kinase (CDK) inhibitor in budding yeast, helped to define a ubiquitin ligase complex named SCFcdc4 (for Skp1, Cdc53/cullin, F-box protein). We found that besides Sic1, the CDK inhibitor Far1 and the replication initiation protein Cdc6 are also substrates of SCFcdc4 in vitro. A common feature in the ubiquitination of the cell cycle SCFcdc4 substrates is that they must be phosphorylated by the major cell cycle CDK, Cdc28. Gcn4, a transcription activator involved in the general control of amino acid biosynthesis, is rapidly degraded in an SCFcdc4-dependent manner in vivo. We have focused on this substrate to investigate the generality of the SCFcdc4 pathway. Through biochemical fractionations, we found that the Srb10 CDK phosphorylates Gcn4 and thereby marks it for recognition by SCFcdc4 ubiquitin ligase. Srb10 is a physiological regulator of Gcn4 stability because both phosphorylation and turnover of Gcn4 are diminished in srb10 mutants. Furthermore, we found that at least two different CDKs, Pho85 and Srb10, conspire to promote the rapid degradation of Gcn4 in vivo. The multistress response transcriptional regulator Msn2 is also a substrate for Srb10 and is hyperphosphorylated in an Srb10-dependent manner upon heat stress-induced translocation into the nucleus. Whereas Msn2 is cytoplasmic in resting wild-type cells, its nuclear exclusion is partially compromised in srb10 mutant cells. Srb10 has been shown to repress a subset of genes in vivo, and has been proposed to inhibit transcription via phosphorylation of the C-terminal domain of RNA polymerase II. Our results suggest a general theme that Srb10 represses the transcription of specific genes by directly antagonizing the transcriptional activators.

Relevance: 30.00%

Abstract:

Part I. The cellular slime mold Dictyostelium discoideum is a simple eukaryote which undergoes a multi-cellular developmental process. Single cell myxamoebae divide vegetatively in the presence of a food source. When the food is depleted or removed, the cells aggregate, forming a migrating pseudoplasmodium which differentiates into a fruiting body containing stalk and spore cells. I have shown that during the developmental cycle glycogen phosphorylase, aminopeptidase, and alanine transaminase are developmentally regulated; that is, their specific activities increase at a specific time in the developmental cycle. Phosphorylase activity is undetectable in developing cells until mid-aggregation, whereupon it increases and reaches a maximum at mid-culmination. Thereafter the enzyme disappears. Actinomycin D and cycloheximide studies, as well as studies with morphologically aberrant and temporally deranged mutants, indicate that prior RNA and concomitant protein synthesis are necessary for the rise and decrease in activity and support the view that the appearance of the enzyme is regulated at the transcriptional level. Aminopeptidase and alanine transaminase increase 3-fold starting at starvation and reach maximum activity at 18 and 5 hours respectively.

The cellular DNAs of D. discoideum were characterized by CsCl buoyant density gradient centrifugation and by renaturation kinetics. Whole cell DNA exhibits three bands in CsCl: ρ = 1.676 g/cc (nuclear main band), 1.687 (nuclear satellite), and 1.682 (mitochondrial). Reassociation kinetics at a criterion of Tm -23°C indicates that the nuclear reiterated sequences make up 30% of the genome (Cot1/2 (pure) 0.28) and the single-copy DNA 70% (Cot1/2 (pure) 70). The complexity of the nuclear genome is 30 x 10^9 daltons and that of the mitochondrial DNA is 35-40 x 10^6 daltons (Cot1/2 0.15). rRNA cistrons constitute 2.2% of nuclear DNA and have a ρ = 1.682.
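
The Cot1/2 values quoted above can be read through the standard ideal second-order reassociation curve, in which the fraction of DNA remaining single-stranded is 1 / (1 + Cot / Cot1/2). The sketch below is only an illustration under that idealization, not the analysis used in the thesis; the function names are hypothetical, and the mixture function assumes that a component making up a fraction f of the DNA reassociates in the whole-DNA mixture with an observed Cot1/2 equal to its pure value divided by f.

```python
def fraction_single_stranded(cot, cot_half):
    """Ideal second-order reassociation of a single kinetic component."""
    return 1.0 / (1.0 + cot / cot_half)

def mixture_single_stranded(cot, components):
    """Fraction of total DNA still single-stranded for a multi-component genome.

    components : list of (fraction_of_genome, cot_half_pure) pairs
    """
    total = 0.0
    for fraction, cot_half_pure in components:
        # In the mixture each component is diluted, so its observed Cot1/2
        # is larger than the pure value by a factor 1 / fraction.
        observed_half = cot_half_pure / fraction
        total += fraction * fraction_single_stranded(cot, observed_half)
    return total

# Values taken from the abstract: 30% reiterated (Cot1/2 pure 0.28)
# and 70% single-copy (Cot1/2 pure 70).
DICTYOSTELIUM_NUCLEAR = [(0.30, 0.28), (0.70, 70.0)]
```

Evaluating `mixture_single_stranded` over a logarithmic range of Cot values reproduces the two-step shape expected for a genome with a fast-renaturing repeated fraction and a slow single-copy fraction.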

RNA extracted from 4 stages during the developmental cycle of Dictyostelium was hybridized with purified single-copy nuclear DNA. The hybrids had properties indicative of single-copy DNA-RNA hybrids. These studies indicate that there are, during development, qualitative and quantitative changes in the portion of the single-copy genome that is transcribed. Overall, 56% of the genome is represented by transcripts between the amoeba and mid-culmination stages. Some 19% are sequences which are represented at all stages while 37% of the genome consists of stage specific sequences.

Part II. RNA and protein synthesis and polysome formation were studied during early development of embryos of the surf clam Spisula solidissima. The oocyte has a small number of polysomes and a low but measurable rate of protein synthesis (leucine-3H incorporation). After fertilization, there is a continual increase in the percentage of ribosomes sedimenting in the polysome region. Newly synthesized RNA (uridine-5-3H incorporation) was found in polysomes as early as the 2-cell stage. During cleavage, the newly formed RNA is associated mainly with the light polysomes.

RNA extracted from polysomes labeled at the 4-cell stage is polydisperse, nonribosomal, and non-4 S. Actinomycin D causes a reduction of about 30% of the polysomes formed between fertilization and the 16-cell stage.

In the early cleavage stages the light polysomes are mostly affected by actinomycin.