7 results for Biochemistry, Biophysics, and Structural Biology
at Duke University
Abstract:
Listeria monocytogenes is a gram-positive soil saprophytic bacterium that is capable of causing fatal infection in humans. The main virulence regulator PrfA, a member of the Crp/FNR family of transcriptional regulators, activates the expression of essential proteins required for host cell invasion and cell-to-cell spread. The mechanism of PrfA activation and the identity of its small molecule coactivator have remained a mystery for more than 20 years, but it is hypothesized that PrfA shares mechanistic similarity to the E. coli cAMP binding protein, Crp. Crp activates gene expression by binding cAMP, increasing the DNA binding affinity of the protein and causing a significant DNA bend that facilitates RNA polymerase binding and downstream gene activation. Our data suggest that PrfA activates virulence protein expression through a mechanism that is distinct from the canonical Crp activation mechanism and involves a combination of cysteine residue reduction and glutathione (GSH) binding.
Listeria lacking glutathione synthase (ΔgshF) is avirulent in mice; however, virulence is rescued when the bacterium expresses the constitutively active PrfA mutant G145S. Interestingly, Listeria expressing a PrfA mutant in which all four cysteines are mutated to alanine (Quad PrfA) demonstrates a 30-fold decrease in virulence. The Quad and ΔgshF double mutant strains are avirulent. DNA-binding affinity, measured through fluorescence polarization assays, indicates that reduction of the cysteine side chains is sufficient to allow PrfA to bind its physiological promoters Phly and PactA with low nanomolar affinity. Oxidized PrfA binds the promoters poorly.
Unexpectedly, Quad also binds promoter DNA with nanomolar affinity, suggesting that the cysteines play a role in transcription efficiency in addition to DNA binding. Both PrfA and Quad bind GSH at physiologically relevant and comparable affinities; however, GSH did not affect DNA binding in either case. Thermal denaturation assays suggest that Quad and wild-type PrfA differ structurally upon binding GSH, which supports the in vivo difference in infection between the regulator and its mutant.
Structures of PrfA in complex with cognate DNA, determined through X-ray crystallography, further support the disparity between the PrfA and Crp activation mechanisms: two structures of reduced PrfA bound to Phly (PrfA-Phly30 and PrfA-Phly24) suggest that the DNA adopts a less bent conformation than in Crp-cAMP-DNA. The structure of Quad-Phly30 confirms the DNA-binding data, as the protein-DNA complex adopts the same overall conformation as PrfA-Phly.
From these results, we hypothesize a two-step activation mechanism wherein PrfA, oxidized upon cell entry and unable to bind DNA, is reduced upon its intracellular release and binds DNA, causing a slight bend in the promoter and a small increase in transcription of PrfA-regulated genes. The structures of PrfA-Phly30 and PrfA-Phly24 likely visualize this intermediate complex. Increasing concentrations of GSH shift the protein to a (PrfA-GSH)-DNA complex, which is fully active transcriptionally and is hypothesized to closely resemble the transcriptionally active structure of the cAMP-(Crp)-DNA complex. Thermal denaturation results suggest that Quad PrfA is deficient in this second step, which explains the decrease in virulence and implicates the cysteine residues as critical for transcription efficiency. Further structural and biochemical studies are ongoing to clarify this mechanism of activation.
Abstract:
Transcription factors (TFs) control the temporal and spatial expression of target genes by interacting with DNA in a sequence-specific manner. Recent advances in high-throughput experiments that measure TF-DNA interactions in vitro and in vivo have facilitated the identification of DNA binding sites for thousands of TFs. However, it remains unclear how each individual TF achieves its specificity, especially in the case of paralogous TFs that recognize distinct target genomic sites despite sharing very similar DNA binding motifs. In my work, I used a combination of high-throughput in vitro protein-DNA binding assays and machine-learning algorithms to characterize and model the binding specificity of 11 paralogous TFs from 4 distinct structural families. My work shows that even very closely related paralogous TFs, with indistinguishable DNA binding motifs, oftentimes exhibit differential binding specificity for their genomic target sites, especially for sites with moderate binding affinity. Importantly, the differences I identify in vitro and through computational modeling help explain, at least in part, the differential in vivo genomic targeting by paralogous TFs. Future work will focus on in vivo factors that might also be important for specificity differences between paralogous TFs, such as DNA methylation, interactions with protein cofactors, or the chromatin environment. In this larger context, my work emphasizes the importance of intrinsic DNA binding specificity in targeting of paralogous TFs to the genome.
Abstract:
This dissertation contributes to the rapidly growing empirical research area in the field of operations management. It contains two essays that tackle two different sets of operations management questions, motivated by and built on field data sets from two very different industries: air cargo logistics and retailing.
The first essay, based on a data set obtained from a world-leading third-party logistics company, develops a novel and general Bayesian hierarchical learning framework for estimating customers' spillover learning, that is, customers' learning about the quality of a service (or product) from their previous experiences with similar yet not identical services. We then apply our model to the data set to study how customers' experiences from shipping on a particular route affect their future decisions about shipping not only on that route, but also on other routes serviced by the same logistics company. We find that customers indeed borrow experiences from similar but different services to update their quality beliefs, which determine future purchase decisions. Also, service quality beliefs have a significant impact on their future purchasing decisions. Moreover, customers are risk averse; they are averse not only to experience variability but also to belief uncertainty (i.e., customers' uncertainty about their beliefs). Finally, belief uncertainty affects customers' utilities more than experience variability does.
The second essay is based on a data set obtained from a large Chinese supermarket chain, which contains sales as well as both wholesale and retail prices of unpackaged perishable vegetables. Recognizing the special characteristics of this particular product category, we develop a structural estimation model in a discrete-continuous choice model framework. Building on this framework, we then study an optimization model for joint pricing and inventory management strategies of multiple products, which aims at improving the company's profit from direct sales and at the same time reducing food waste and thus improving social welfare.
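The idea of spillover learning described above can be illustrated with a toy conjugate update. This is only a minimal sketch and not the dissertation's hierarchical model: it assumes a normal prior over a route's service quality, normally distributed experience noise, and a hypothetical `similarity` weight in (0, 1] that discounts experiences from a different route by inflating their noise variance. All names and parameters here are illustrative assumptions.

```python
def update_belief(prior_mean, prior_var, observation, noise_var, similarity=1.0):
    """Normal-normal Bayesian update of a quality belief.

    similarity is a hypothetical weight in (0, 1]: 1.0 means the experience
    came from the same route; smaller values treat an experience on a
    similar route as a noisier signal (noise variance divided by similarity).
    """
    if not 0.0 < similarity <= 1.0:
        raise ValueError("similarity must be in (0, 1]")
    effective_noise_var = noise_var / similarity
    prior_precision = 1.0 / prior_var
    obs_precision = 1.0 / effective_noise_var
    posterior_var = 1.0 / (prior_precision + obs_precision)
    posterior_mean = posterior_var * (
        prior_precision * prior_mean + obs_precision * observation
    )
    return posterior_mean, posterior_var


# Same experience, observed on the route itself versus on a similar route.
own_route = update_belief(0.0, 1.0, observation=1.0, noise_var=1.0)
other_route = update_belief(0.0, 1.0, observation=1.0, noise_var=1.0, similarity=0.5)
```

Under these assumptions, an experience on a similar route still moves the belief and reduces its variance, only by less than an experience on the route itself. The posterior variance plays the role of the belief uncertainty discussed in the abstract.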
Collectively, the studies in this dissertation provide useful modeling ideas, decision tools, insights, and guidance for firms to utilize vast sales and operations data to devise more effective business strategies.
Abstract:
Trehalose is a non-reducing disaccharide essential for pathogenic fungal survival and virulence. The biosynthesis of trehalose requires the trehalose-6-phosphate synthase, Tps1, and trehalose-6-phosphate phosphatase, Tps2. More importantly, the trehalose biosynthetic pathway is absent in mammals, making this pathway an ideal target for antifungal drug design. However, lack of germane biochemical and structural information hinders antifungal drug design against these targets.
In this dissertation, macromolecular X-ray crystallography and biochemical assays were employed to understand the structures and functions of proteins involved in the trehalose biosynthetic pathway. I report here the first eukaryotic Tps1 structures from Candida albicans (C. albicans) and Aspergillus fumigatus (A. fumigatus) with substrates or substrate analogs. These structures reveal the key residues involved in substrate binding and catalysis. Subsequent enzymatic assays and cellular assays highlight the significance of these key Tps1 residues in enzyme function and fungal stress response. The Tps1 structure captured in its transition state with a non-hydrolysable inhibitor demonstrates that Tps1 adopts an “internal return like” mechanism for catalysis. Furthermore, disruption of the trehalose biosynthetic complex formation through abolishing Tps1 dimerization reveals that complex formation has a regulatory function in addition to trehalose production, providing additional targets for antifungal drug intervention.
I also present here the structure of the Tps2 N-terminal domain (Tps2NTD) from C. albicans, which may be involved in the proper formation of the trehalose biosynthetic complex. Deletion of the Tps2NTD results in a temperature-sensitive phenotype. Further, I describe in this dissertation the structures of the Tps2 phosphatase domain (Tps2PD) from C. albicans, A. fumigatus and Cryptococcus neoformans (C. neoformans) in multiple conformational states. The structures of the C. albicans Tps2PD-BeF3-trehalose complex and C. neoformans Tps2PD(D24N)-T6P complex reveal extensive interactions between both glucose moieties of the trehalose, involving all eight hydroxyl groups, and multiple residues of both the cap and core domains of Tps2PD. These structures also reveal that steric hindrance is a key underlying factor for the exquisite substrate specificity of Tps2PD. In addition, the structures of Tps2PD in the open conformation provide direct visualization of the conformational changes of this domain that are effected by substrate binding and product release.
Last, I present the structure of the C. albicans trehalose synthase regulatory protein (Tps3) pseudo-phosphatase domain (Tps3PPD). Tps3PPD adopts a haloacid dehalogenase superfamily (HADSF) phosphatase fold with a core Rossmann-fold domain and an α/β fold cap domain. Despite the lack of phosphatase activity, the cleft between the Tps3PPD core domain and cap domain presents a binding pocket for an as yet uncharacterized ligand. Identification of this ligand could reveal the cellular function of Tps3 and any interconnection of the trehalose biosynthetic pathway with other cellular metabolic pathways.
Together with significant biochemical analyses, these structures advance our understanding of the proteins responsible for trehalose biosynthesis. These structures are ready to be exploited to rationally design or optimize inhibitors of the trehalose biosynthetic pathway enzymes. Hence, the work described in this thesis has laid the groundwork for the design of Tps1 and Tps2 specific inhibitors, which ultimately could lead to novel therapeutics to treat fungal infections.
Abstract:
The central dogma of molecular biology relies on the correct Watson-Crick (WC) geometry of canonical deoxyribonucleic acid (DNA) dG•dC and dA•dT base pairs to replicate and transcribe genetic information with speed and an astonishing level of fidelity. In addition, the Watson-Crick geometry of canonical ribonucleic acid (RNA) rG•rC and rA•rU base pairs is highly conserved to ensure that proteins are translated with high fidelity. However, numerous other potential nucleobase tautomeric and ionic configurations are possible that can give rise to entirely new pairing modes between the nucleotide bases. Very early on, James Watson and Francis Crick recognized their importance and in 1953 postulated that if bases adopted one of their less energetically favored tautomeric forms (and later ionic forms) during replication, it could lead to the formation of a mismatch with a Watson-Crick-like geometry and could give rise to “natural mutations.”
Since this time, numerous studies have provided evidence in support of this hypothesis and have expanded upon it: computational studies have addressed the energetic feasibility of different nucleobases’ tautomeric and ionic forms in silico, and crystallographic studies have trapped different mismatches with WC-like geometries in polymerase or ribosome active sites. However, no direct evidence has been given for (i) the existence of these WC-like mismatches in canonical DNA duplexes, RNA duplexes, or non-coding RNAs, or (ii) which, if any, tautomeric or ionic form stabilizes the WC-like geometry. This thesis utilizes nuclear magnetic resonance (NMR) spectroscopy and rotating frame relaxation dispersion (R1ρ RD) in combination with density functional theory (DFT), biochemical assays, and targeted chemical perturbations to show that (i) dG•dT mismatches in DNA duplexes, as well as rG•rU mismatches in RNA duplexes and non-coding RNAs, transiently adopt a WC-like geometry that is stabilized by (ii) an interconnected network of rapidly interconverting rare tautomers and anionic bases. These results support Watson and Crick’s tautomer hypothesis, but additionally support subsequent hypotheses invoking anionic mismatches and ultimately tie them together. This dissertation shows that a common mismatch can adopt a Watson-Crick-like geometry globally, in both DNA and RNA, and that this geometry is stabilized by a kinetically linked network of rare tautomeric and anionic bases. The studies herein also provide compelling evidence for the involvement of these species in spontaneous replication and translation errors.
Abstract:
Nucleic acids (DNA and RNA) play essential roles in the central dogma of biology for the storage and transfer of genetic information. The unique chemical and conformational structure of nucleic acids, the double helix composed of complementary Watson-Crick base pairs, provides the structural basis for their biological functions. The DNA double helix can dynamically accommodate both Watson-Crick and Hoogsteen base pairing; in the latter, the purine base is flipped by ~180° to adopt a syn conformation rather than the anti conformation found in Watson-Crick base pairs. There is growing evidence that Hoogsteen base pairs play important roles in DNA replication, recognition, damage or mispair accommodation, and repair. Here, we constructed a database of existing Hoogsteen base pairs in DNA duplexes through a structure-based survey of the Protein Data Bank. Structural analyses of the resulting Hoogsteen structures revealed that Hoogsteen base pairs occur in a wide variety of biological contexts and can induce DNA kinking towards the major groove. Because there are documented difficulties in distinguishing Hoogsteen from Watson-Crick base pairs when modeling crystallographic data, we collaborated with the Richardsons’ lab and identified potential Hoogsteen base pairs that were mis-modeled as Watson-Crick base pairs, which suggests that Hoogsteen base pairs may be more prevalent than previously thought. We developed a solution NMR method, combined with site-specific isotope labeling, to characterize the formation of, or conformational exchange with, Hoogsteen base pairs in large DNA-protein complexes under solution conditions, in the absence of crystal packing forces. We showed that there is enhanced chemical exchange, potentially between Watson-Crick and Hoogsteen base pairs, at a sharp kink site in the complex formed by DNA and the Integration Host Factor protein. In stark contrast to B-form DNA, we found that Hoogsteen base pairs are strongly disfavored in A-form RNA duplexes.
The chemical modifications N1-methyl adenosine and N1-methyl guanosine, which block Watson-Crick base pairing, can be absorbed as Hoogsteen base pairs in DNA but potently destabilize A-form RNA and cause helix melting. The intrinsic instability of Hoogsteen base pairs in A-form RNA enables N1-methylation to function as a post-transcriptional modification, one that is known to facilitate RNA folding and translation and that potentially plays roles in the epitranscriptome. On the other hand, the dynamic property of DNA that allows it to accommodate Hoogsteen base pairs could be critical to maintaining genome stability.
Abstract:
Nature is challenged to move charge efficiently over many length scales. From sub-nm to μm distances, electron-transfer proteins orchestrate energy conversion, storage, and release both inside and outside the cell. Uncovering the detailed mechanisms of biological electron-transfer reactions, which are often coupled to bond-breaking and bond-making events, is essential to designing durable, artificial energy conversion systems that mimic the specificity and efficiency of their natural counterparts. Here, we use theoretical modeling of long-distance charge hopping (Chapter 3), synthetic donor-bridge-acceptor molecules (Chapters 4, 5, and 6), and de novo protein design (Chapters 5 and 6) to investigate general principles that govern light-driven and electrochemically driven electron-transfer reactions in biology. We show that fast, μm-distance charge hopping along bacterial nanowires requires closely packed charge carriers with low reorganization energies (Chapter 3); singlet excited-state electronic polarization of supermolecular electron donors can attenuate intersystem crossing yields to lower-energy, oppositely polarized, donor triplet states (Chapter 4); the effective static dielectric constant of a small (~100 residue) de novo designed 4-helical protein bundle can change upon phototriggering an electron transfer event in the protein interior, providing a means to slow the charge-recombination reaction (Chapter 5); and a tightly-packed de novo designed 4-helix protein bundle can drastically alter charge-transfer driving forces of photo-induced amino acid radical formation in the bundle interior, effectively turning off a light-driven oxidation reaction that occurs in organic solvent (Chapter 6). 
This work leverages unique insights gleaned from proteins designed from scratch that bind synthetic donor-bridge-acceptor molecules that can also be studied in organic solvents, opening new avenues of exploration into the factors critical for protein control of charge flow in biology.
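The statement that fast hopping requires closely packed carriers with low reorganization energies can be made concrete with the standard nonadiabatic Marcus rate expression, k = (2π/ħ) |H|² (4πλk_BT)^(-1/2) exp[-(ΔG + λ)² / (4λk_BT)]. The sketch below is an illustration under stated assumptions and not the model used in the dissertation: the exponential distance decay of the electronic coupling, the decay constant `beta_per_angstrom`, and all numerical values are hypothetical choices.

```python
import math

K_B_EV = 8.617333262e-5      # Boltzmann constant in eV/K
HBAR_EV_S = 6.582119569e-16  # reduced Planck constant in eV*s


def coupling_at_distance(h0_ev, distance_angstrom, contact_angstrom=3.5,
                         beta_per_angstrom=1.0):
    """Electronic coupling assumed to decay exponentially with separation.

    The rate scales as coupling squared, so the coupling itself decays
    with beta / 2. All default values are illustrative assumptions.
    """
    return h0_ev * math.exp(
        -0.5 * beta_per_angstrom * (distance_angstrom - contact_angstrom)
    )


def marcus_rate(coupling_ev, reorg_ev, delta_g_ev=0.0, temp_k=298.0):
    """Nonadiabatic Marcus electron-transfer rate in 1/s (energies in eV)."""
    kt = K_B_EV * temp_k
    prefactor = (2.0 * math.pi / HBAR_EV_S) * coupling_ev ** 2
    franck_condon = math.exp(
        -(delta_g_ev + reorg_ev) ** 2 / (4.0 * reorg_ev * kt)
    ) / math.sqrt(4.0 * math.pi * reorg_ev * kt)
    return prefactor * franck_condon


# Compare hops between identical sites (zero driving force).
low_reorg = marcus_rate(coupling_at_distance(0.01, 4.0), reorg_ev=0.2)
high_reorg = marcus_rate(coupling_at_distance(0.01, 4.0), reorg_ev=0.8)
far_apart = marcus_rate(coupling_at_distance(0.01, 10.0), reorg_ev=0.2)
```

With zero driving force, the activation term reduces to exp(-λ / 4k_BT), so a lower reorganization energy raises the hopping rate, and the squared coupling makes the rate fall off steeply as the carriers move apart.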