5 resultados para COPY-NUMBER VARIATION
em CaltechTHESIS
Resumo:
Some of the most exciting developments in the field of nucleic acid engineering include the utilization of synthetic nucleic acid molecular devices as gene regulators, as disease marker detectors, and most recently, as therapeutic agents. The common thread between these technologies is their reliance on the detection of specific nucleic acid input markers to generate some desirable output, such as a change in the copy number of an mRNA (for gene regulation), a change in the emitted light intensity (for some diagnostics), and a change in cell state within an organism (for therapeutics). The research presented in this thesis likewise focuses on engineering molecular tools that detect specific nucleic acid inputs, and respond with useful outputs.
Four contributions to the field of nucleic acid engineering are presented: (1) the construction of a single nucleotide polymorphism (SNP) detector based on the mechanism of hybridization chain reaction (HCR); (2) the utilization of a single-stranded oligonucleotide molecular Scavenger as a means of enhancing HCR selectivity; (3) the implementation of Quenched HCR, a technique that facilitates transduction of a nucleic acid chemical input into an optical (light) output, and (4) the engineering of conditional probes that function as sequence transducers, receiving target signal as input and providing a sequence of choice as output. These programmable molecular systems are conceptually well-suited for performing wash-free, highly selective rapid genotyping and expression profiling in vitro, in situ, and potentially in living cells.
Resumo:
The ability to regulate gene expression is of central importance for the adaptability of living organisms to changes in their internal and external environment. At the transcriptional level, binding of transcription factors (TFs) in the vicinity of promoters can modulate the rate at which transcripts are produced, and as such play an important role in gene regulation. TFs with regulatory action at multiple promoters is the rule rather than the exception, with examples ranging from TFs like the cAMP receptor protein (CRP) in E. coli that regulates hundreds of different genes, to situations involving multiple copies of the same gene, such as on plasmids, or viral DNA. When the number of TFs heavily exceeds the number of binding sites, TF binding to each promoter can be regarded as independent. However, when the number of TF molecules is comparable to the number of binding sites, TF titration will result in coupling ("entanglement") between transcription of different genes. The last few decades have seen rapid advances in our ability to quantitatively measure such effects, which calls for biophysical models to explain these data. Here we develop a statistical mechanical model which takes the TF titration effect into account and use it to predict both the level of gene expression and the resulting correlation in transcription rates for a general set of promoters. To test these predictions experimentally, we create genetic constructs with known TF copy number, binding site affinities, and gene copy number; hence avoiding the need to use free fit parameters. Our results clearly prove the TF titration effect and that the statistical mechanical model can accurately predict the fold change in gene expression for the studied cases. We also generalize these experimental efforts to cover systems with multiple different genes, using the method of mRNA fluorescence in situ hybridization (FISH). Interestingly, we can use the TF titration affect as a tool to measure the plasmid copy number at different points in the cell cycle, as well as the plasmid copy number variance. Finally, we investigate the strategies of transcriptional regulation used in a real organism by analyzing the thousands of known regulatory interactions in E. coli. We introduce a "random promoter architecture model" to identify overrepresented regulatory strategies, such as TF pairs which coregulate the same genes more frequently than would be expected by chance, indicating a related biological function. Furthermore, we investigate whether promoter architecture has a systematic effect on gene expression by linking the regulatory data of E. coli to genome-wide expression censuses.
Resumo:
Mitochondria contain a 16.6 kb circular genome encoding 13 proteins as well as mitochondrial tRNAs and rRNAs. Copies of the genome are organized into nucleoids containing both DNA and proteins, including the machinery required for mtDNA replication and transcription. Although mtDNA integrity is essential for cellular and organismal viability, regulation of proliferation of the mitochondrial genome is poorly understood. To elucidate the mechanisms behind this, we chose to study the interplay between mtDNA copy number and the proteins involved in mitochondrial fusion, another required function in cells. Strikingly, we found that mouse embryonic fibroblasts lacking fusion also had a mtDNA copy number deficit. To understand this phenomenon further, we analyzed the binding of mitochondrial transcription factor A, whose role in transcription, replication, and packaging of the genome is well-established and crucial for cellular maintenance. Using ChIP-seq, we were able to detect largely uniform, non-specific binding across the genome, with no occupancy in the known specific binding sites in the regulatory region. We did detect a single binding site directly upstream of a known origin of replication, suggesting that TFAM may play a direct role in replication. Finally, although TFAM has been previously shown to localize to the nuclear genome, we found no evidence for such binding sites in our system.
To further understand the regulation of mtDNA by other proteins, we analyzed publicly available ChIP-seq datasets from ENCODE, modENCODE, and mouseENCODE for evidence of nuclear transcription factor binding to the mitochondrial genome. We identified eight human transcription factors and three mouse transcription factors that demonstrated binding events with the classical strand asymmetrical morphology of classical binding sites. ChIP-seq is a powerful tool for understanding the interactions between proteins and the mitochondrial genome, and future studies promise to further the understanding of how mtDNA is regulated within the nucleoid.
Resumo:
Understanding how transcriptional regulatory sequence maps to regulatory function remains a difficult problem in regulatory biology. Given a particular DNA sequence for a bacterial promoter region, we would like to be able to say which transcription factors bind there, how strongly they bind, and whether they interact with each other and/or RNA polymerase, with the ultimate objective of integrating knowledge of these parameters into a prediction of gene expression levels. The theoretical framework of statistical thermodynamics provides a useful framework for doing so, enabling us to predict how gene expression levels depend on transcription factor binding energies and concentrations. We used thermodynamic models, coupled with models of the sequence-dependent binding energies of transcription factors and RNAP, to construct a genotype to phenotype map for the level of repression exhibited by the lac promoter, and tested it experimentally using a set of promoter variants from E. coli strains isolated from different natural environments. For this work, we sought to ``reverse engineer'' naturally occurring promoter sequences to understand how variations in promoter sequence affects gene expression. The natural inverse of this approach is to ``forward engineer'' promoter sequences to obtain targeted levels of gene expression. We used a high precision model of RNAP-DNA sequence dependent binding energy, coupled with a thermodynamic model relating binding energy to gene expression, to predictively design and verify a suite of synthetic E. coli promoters whose expression varied over nearly three orders of magnitude.
However, although thermodynamic models enable predictions of mean levels of gene expression, it has become evident that cell-to-cell variability or ``noise'' in gene expression can also play a biologically important role. In order to address this aspect of gene regulation, we developed models based on the chemical master equation framework and used them to explore the noise properties of a number of common E. coli regulatory motifs; these properties included the dependence of the noise on parameters such as transcription factor binding strength and copy number. We then performed experiments in which these parameters were systematically varied and measured the level of variability using mRNA FISH. The results showed a clear dependence of the noise on these parameters, in accord with model predictions.
Finally, one shortcoming of the preceding modeling frameworks is that their applicability is largely limited to systems that are already well-characterized, such as the lac promoter. Motivated by this fact, we used a high throughput promoter mutagenesis assay called Sort-Seq to explore the completely uncharacterized transcriptional regulatory DNA of the E. coli mechanosensitive channel of large conductance (MscL). We identified several candidate transcription factor binding sites, and work is continuing to identify the associated proteins.
Resumo:
DNA charge transport (CT) involves the efficient transfer of electrons or electron holes through the DNA π-stack over long molecular distances of at least 100 base-pairs. Despite this shallow distance dependence, DNA CT is sensitive to mismatches or lesions that disrupt π-stacking and is critically dependent on proper electronic coupling of the donor and acceptor moieties into the base stack. Favorable DNA CT is very rapid, occurring on the picosecond timescale. Because of this speed, electron holes equilibrate along the DNA π-stack, forming a characteristic pattern of DNA damage at low oxidation potential guanine multiplets. Furthermore, DNA CT may be used in a biological context. DNA processing enzymes with 4Fe4S clusters can perform DNA-mediated electron transfer (ET) self-exchange reactions with other 4Fe4S cluster proteins, even if the proteins are quite dissimilar, as long as the DNA-bound [4Fe4S]3+/2+ redox potentials are conserved. This mechanism would allow low copy number DNA repair proteins to find their lesions efficiently within the cell. DNA CT may also be used biologically for the long-range, selective activation of redox-active transcription factors. Within this work, we pursue other proteins that may utilize DNA CT within the cell and further elucidate aspects of the DNA-mediated ET self-exchange reaction of 4Fe4S cluster proteins.
Dps proteins, bacterial mini-ferritins that protect DNA from oxidative stress, are implicated in the survival and virulence of pathogenic bacteria. One aspect of their protection involves ferroxidase activity, whereby ferrous iron is bound and oxidized selectively by hydrogen peroxide, thereby preventing formation of damaging hydroxyl radicals via Fenton chemistry. Understanding the specific mechanism by which Dps proteins protect the bacterial genome could inform the development of new antibiotics. We investigate whether DNA-binding E. coli Dps can utilize DNA CT to protect the genome from a distance. An intercalating ruthenium photooxidant was employed to generate oxidative DNA damage via the flash-quench technique, which localizes to a low potential guanine triplet. We find that Dps loaded with ferrous iron, in contrast to Apo-Dps and ferric iron-loaded Dps which lack available reducing equivalents, significantly attenuates the yield of oxidative DNA damage at the guanine triplet. These data demonstrate that ferrous iron-loaded Dps is selectively oxidized to fill guanine radical holes, thereby restoring the integrity of the DNA. Luminescence studies indicate no direct interaction between the ruthenium photooxidant and Dps, supporting the DNA-mediated oxidation of ferrous iron-loaded Dps. Thus DNA CT may be a mechanism by which Dps efficiently protects the genome of pathogenic bacteria from a distance.
Further work focused on spectroscopic characterization of the DNA-mediated oxidation of ferrous iron-loaded Dps. X-band EPR was used to monitor the oxidation of DNA-bound Dps after DNA photooxidation via the flash-quench technique. Upon irradiation with poly(dGdC)2, a signal arises with g = 4.3, consistent with the formation of mononuclear high-spin Fe(III) sites of low symmetry, the expected oxidation product of Dps with one iron bound at each ferroxidase site. When poly(dGdC)2 is substituted with poly(dAdT)2, the yield of Dps oxidation is decreased significantly, indicating that guanine radicals facilitate Dps oxidation. The more favorable oxidation of Dps by guanine radicals supports the feasibility of a long-distance protection mechanism via DNA CT where Dps is oxidized to fill guanine radical holes in the bacterial genome produced by reactive oxygen species.
We have also explored possible electron transfer intermediates in the DNA-mediated oxidation of ferrous iron-loaded Dps. Dps proteins contain a conserved tryptophan residue in close proximity to the ferroxidase site (W52 in E. coli Dps). In comparison to WT Dps, in EPR studies of the oxidation of ferrous iron-loaded Dps following DNA photooxidation, W52Y and W52A mutants were deficient in forming the characteristic EPR signal at g = 4.3, with a larger deficiency for W52A compared to W52Y. In addition to EPR, we also probed the role of W52 Dps in cells using a hydrogen peroxide survival assay. Bacteria containing W52Y Dps survived the hydrogen peroxide challenge more similarly to those containing WT Dps, whereas cells with W52A Dps died off as quickly as cells without Dps. Overall, these results suggest the possibility of W52 as a CT hopping intermediate.
DNA-modified electrodes have become an essential tool for the study of the redox chemistry of DNA processing enzymes with 4Fe4S clusters. In many cases, it is necessary to investigate different complex samples and substrates in parallel in order to elucidate this chemistry. Therefore, we optimized and characterized a multiplexed electrochemical platform with the 4Fe4S cluster base excision repair glycosylase Endonuclease III (EndoIII). Closely packed DNA films, where the protein has limited surface accessibility, produce EndoIII electrochemical signals sensitive to an intervening mismatch, indicating a DNA-mediated process. Multiplexed analysis allowed more robust characterization of the CT-deficient Y82A EndoIII mutant, as well as comparison of a new family of mutations altering the electrostatics surrounding the 4Fe4S cluster in an effort to shift the reduction potential of the cluster. While little change in the DNA-bound midpoint potential was found for this family of mutants, likely indicating the dominant effect of DNA-binding on establishing the protein redox potential, significant variations in the efficiency of DNA-mediated electron transfer were apparent. On the basis of the stability of these proteins, examined by circular dichroism, we proposed that the electron transfer pathway in EndoIII can be perturbed not only by the removal of aromatic residues but also through changes in solvation near the cluster.
While the 4Fe4S cluster of EndoIII is relatively insensitive to oxidation and reduction in solution, we have found that upon DNA binding, the reduction potential of the [4Fe4S]3+/2+ couple shifts negatively by approximately 200 mV, bringing this couple into a physiologically relevant range. Demonstrated using electrochemistry experiments in the presence and absence of DNA, these studies do not provide direct molecular evidence for the species being observed. Sulfur K-edge X-ray absorbance spectroscopy (XAS) can be used to probe directly the covalency of iron-sulfur clusters, which is correlated to their reduction potential. We have shown that the Fe-S covalency of the 4Fe4S cluster of EndoIII increases upon DNA binding, stabilizing the oxidized [4Fe4S]3+ cluster, consistent with a negative shift in reduction potential. The 7% increase in Fe-S covalency corresponds to an approximately 150 mV shift, remarkably similar to DNA electrochemistry results. Therefore we have obtained direct molecular evidence for the shift in 4Fe4S reduction potential of EndoIII upon DNA binding, supporting the feasibility of our model whereby these proteins can utilize DNA CT to cooperate in order to efficiently find DNA lesions inside cells.
In conclusion, in this work we have explored the biological applications of DNA CT. We discovered that the DNA-binding bacterial ferritin Dps can protect the bacterial genome from a distance via DNA CT, perhaps contributing to pathogen survival and virulence. Furthermore, we optimized a multiplexed electrochemical platform for the study of the redox chemistry of DNA-bound 4Fe4S cluster proteins. Finally, we have used sulfur K-edge XAS to obtain direct molecular evidence for the negative shift in 4Fe4S cluster reduction potential of EndoIII upon DNA binding. These studies contribute to the understanding of DNA-mediated protein oxidation within cells.