8 resultados para high-protein plants
em CaltechTHESIS
Resumo:
A unique chloroplast Signal Recognition Particle (SRP) in green plants is primarily dedicated to the post-translational targeting of light harvesting chlorophyll-a/b binding (LHC) proteins. Our study of the thermodynamics and kinetics of the GTPases of the system demonstrates that GTPase complex assembly and activation are highly coupled in the chloroplast GTPases, suggesting they may forego the GTPase activation step as a key regulatory point. This reflects adaptations of the chloroplast SRP to the delivery of their unique substrate protein. Devotion to one highly hydrophobic family of proteins also may have allowed the chloroplast SRP system to evolve an efficient chaperone in the cpSRP43 subunit. To understand the mechanism of disaggregation, we showed that LHC proteins form micellar, disc-shaped aggregates that present a recognition motif (L18) on the aggregate surface. Further molecular genetic and structure-activity analyses reveal that the action of cpSRP43 can be dissected into two steps: (i) initial recognition of L18 on the aggregate surface; and (ii) aggregate remodeling, during which highly adaptable binding interactions of cpSRP43 with hydrophobic transmembrane domains of the substrate protein compete with the packing interactions within the aggregate. We also tested the adaptability of cpSRP43 for alternative substrates, specifically in attempts to improve membrane protein expression and inhibition of amyloid beta fibrillization. These preliminary results attest to cpSRP43’s potential as a molecular chaperone and provides the impetus for further engineering endeavors to address problems that stem from protein aggregation.
Resumo:
This dissertation describes studies of G protein-coupled receptors (GPCRs) and ligand-gated ion channels (LGICs) using unnatural amino acid mutagenesis to gain high precision insights into the function of these important membrane proteins.
Chapter 2 considers the functional role of highly conserved proline residues within the transmembrane helices of the D2 dopamine GPCR. Through mutagenesis employing unnatural α-hydroxy acids, proline analogs, and N-methyl amino acids, we find that lack of backbone hydrogen bond donor ability is important to proline function. At one proline site we additionally find that a substituent on the proline backbone N is important to receptor function.
In Chapter 3, side chain conformation is probed by mutagenesis of GPCRs and the muscle-type nAChR. Specific side chain rearrangements of highly conserved residues have been proposed to accompany activation of these receptors. These rearrangements were probed using conformationally-biased β-substituted analogs of Trp and Phe and unnatural stereoisomers of Thr and Ile. We also modeled the conformational bias of the unnatural Trp and Phe analogs employed.
Chapters 4 and 5 examine details of ligand binding to nAChRs. Chapter 4 describes a study investigating the importance of hydrogen bonds between ligands and the complementary face of muscle-type and α4β4 nAChRs. A hydrogen bond involving the agonist appears to be important for ligand binding in the muscle-type receptor but not the α4β4 receptor.
Chapter 5 describes a study characterizing the binding of varenicline, an actively prescribed smoking cessation therapeutic, to the α7 nAChR. Additionally, binding interactions to the complementary face of the α7 binding site were examined for a small panel of agonists. We identified side chains important for binding large agonists such as varenicline, but dispensable for binding the small agonist ACh.
Chapter 6 describes efforts to image nAChRs site-specifically modified with a fluorophore by unnatural amino acid mutagenesis. While progress was hampered by high levels of fluorescent background, improvements to sample preparation and alternative strategies for fluorophore incorporation are described.
Chapter 7 describes efforts toward a fluorescence assay for G protein association with a GPCR, with the ultimate goal of probing key protein-protein interactions along the G protein/receptor interface. A wide range of fluorescent protein fusions were generated, expressed in Xenopus oocytes, and evaluated for their ability to associate with each other.
Resumo:
The main focus of this thesis is the use of high-throughput sequencing technologies in functional genomics (in particular in the form of ChIP-seq, chromatin immunoprecipitation coupled with sequencing, and RNA-seq) and the study of the structure and regulation of transcriptomes. Some parts of it are of a more methodological nature while others describe the application of these functional genomic tools to address various biological problems. A significant part of the research presented here was conducted as part of the ENCODE (ENCyclopedia Of DNA Elements) Project.
The first part of the thesis focuses on the structure and diversity of the human transcriptome. Chapter 1 contains an analysis of the diversity of the human polyadenylated transcriptome based on RNA-seq data generated for the ENCODE Project. Chapter 2 presents a simulation-based examination of the performance of some of the most popular computational tools used to assemble and quantify transcriptomes. Chapter 3 includes a study of variation in gene expression, alternative splicing and allelic expression bias on the single-cell level and on a genome-wide scale in human lymphoblastoid cells; it also brings forward a number of critical to the practice of single-cell RNA-seq measurements methodological considerations.
The second part presents several studies applying functional genomic tools to the study of the regulatory biology of organellar genomes, primarily in mammals but also in plants. Chapter 5 contains an analysis of the occupancy of the human mitochondrial genome by TFAM, an important structural and regulatory protein in mitochondria, using ChIP-seq. In Chapter 6, the mitochondrial DNA occupancy of the TFB2M transcriptional regulator, the MTERF termination factor, and the mitochondrial RNA and DNA polymerases is characterized. Chapter 7 consists of an investigation into the curious phenomenon of the physical association of nuclear transcription factors with mitochondrial DNA, based on the diverse collections of transcription factor ChIP-seq datasets generated by the ENCODE, mouseENCODE and modENCODE consortia. In Chapter 8 this line of research is further extended to existing publicly available ChIP-seq datasets in plants and their mitochondrial and plastid genomes.
The third part is dedicated to the analytical and experimental practice of ChIP-seq. As part of the ENCODE Project, a set of metrics for assessing the quality of ChIP-seq experiments was developed, and the results of this activity are presented in Chapter 9. These metrics were later used to carry out a global analysis of ChIP-seq quality in the published literature (Chapter 10). In Chapter 11, the development and initial application of an automated robotic ChIP-seq (in which these metrics also played a major role) is presented.
The fourth part presents the results of some additional projects the author has been involved in, including the study of the role of the Piwi protein in the transcriptional regulation of transposon expression in Drosophila (Chapter 12), and the use of single-cell RNA-seq to characterize the heterogeneity of gene expression during cellular reprogramming (Chapter 13).
The last part of the thesis provides a review of the results of the ENCODE Project and the interpretation of the complexity of the biochemical activity exhibited by mammalian genomes that they have revealed (Chapters 15 and 16), an overview of the expected in the near future technical developments and their impact on the field of functional genomics (Chapter 14), and a discussion of some so far insufficiently explored research areas, the future study of which will, in the opinion of the author, provide deep insights into many fundamental but not yet completely answered questions about the transcriptional biology of eukaryotes and its regulation.
Resumo:
Protein structure prediction has remained a major challenge in structural biology for more than half a century. Accelerated and cost efficient sequencing technologies have allowed researchers to sequence new organisms and discover new protein sequences. Novel protein structure prediction technologies will allow researchers to study the structure of proteins and to determine their roles in the underlying biology processes and develop novel therapeutics.
Difficulty of the problem stems from two folds: (a) describing the energy landscape that corresponds to the protein structure, commonly referred to as force field problem; and (b) sampling of the energy landscape, trying to find the lowest energy configuration that is hypothesized to be the native state of the structure in solution. The two problems are interweaved and they have to be solved simultaneously. This thesis is composed of three major contributions. In the first chapter we describe a novel high-resolution protein structure refinement algorithm called GRID. In the second chapter we present REMCGRID, an algorithm for generation of low energy decoy sets. In the third chapter, we present a machine learning approach to ranking decoys by incorporating coarse-grain features of protein structures.
Resumo:
The ability to sense mechanical force is vital to all organisms to interact with and respond to stimuli in their environment. Mechanosensation is critical to many physiological functions such as the senses of hearing and touch in animals, gravitropism in plants and osmoregulation in bacteria. Of these processes, the best understood at the molecular level involve bacterial mechanosensitive channels. Under hypo-osmotic stress, bacteria are able to alleviate turgor pressure through mechanosensitive channels that gate directly in response to tension in the membrane lipid bilayer. A key participant in this response is the mechanosensitive channel of large conductance (MscL), a non-selective channel with a high conductance of ~3 nS that gates at tensions close to the membrane lytic tension.
It has been appreciated since the original discovery by C. Kung that the small subunit size (~130 to 160 residues) and the high conductance necessitate that MscL forms a homo-oligomeric channel. Over the past 20 years of study, the proposed oligomeric state of MscL has ranged from monomer to hexamer. Oligomeric state has been shown to vary between MscL homologues and is influenced by lipid/detergent environment. In this thesis, we report the creation of a chimera library to systematically survey the correlation between MscL sequence and oligomeric state to identify the sequence determinants of oligomeric state. Our results demonstrate that although there is no combination of sequences uniquely associated with a given oligomeric state (or mixture of oligomeric states), there are significant correlations. In the quest to characterize the oligomeric state of MscL, an exciting discovery was made about the dynamic nature of the MscL complex. We found that in detergent solution, under mild heating conditions (37 °C – 60 °C), subunits of MscL can exchange between complexes, and the dynamics of this process are sensitive to the protein sequence.
Extensive efforts were made to produce high diffraction quality crystals of MscL for the determination of a high resolution X-ray crystal structure of a full length channel. The surface entropy reduction strategy was applied to the design of S. aureus MscL variants and while the strategy appears to have improved the crystallizability of S. aureus MscL, unfortunately the diffraction qualities of these crystals were not significantly improved. MscL chimeras were also screened for crystallization in various solubilization detergents, but also failed to yield high quality crystals.
MscL is a fascinating protein and continues to serve as a model system for the study of the structural and functional properties of mechanosensitive channels. Further characterization of the MscL chimera library will offer more insight into the characteristics of the channel. Of particular interest are the functional characterization of the chimeras and the exploration of the physiological relevance of intercomplex subunit exchange.
Resumo:
G-protein coupled receptors (GPCRs) form a large family of proteins and are very important drug targets. They are membrane proteins, which makes computational prediction of their structure challenging. Homology modeling is further complicated by low sequence similarly of the GPCR superfamily.
In this dissertation, we analyze the conserved inter-helical contacts of recently solved crystal structures, and we develop a unified sequence-structural alignment of the GPCR superfamily. We use this method to align 817 human GPCRs, 399 of which are nonolfactory. This alignment can be used to generate high quality homology models for the 817 GPCRs.
To refine the provided GPCR homology models we developed the Trihelix sampling method. We use a multi-scale approach to simplify the problem by treating the transmembrane helices as rigid bodies. In contrast to Monte Carlo structure prediction methods, the Trihelix method does a complete local sampling using discretized coordinates for the transmembrane helices. We validate the method on existing structures and apply it to predict the structure of the lactate receptor, HCAR1. For this receptor, we also build extracellular loops by taking into account constraints from three disulfide bonds. Docking of lactate and 3,5-dihydroxybenzoic acid shows likely involvement of three Arg residues on different transmembrane helices in binding a single ligand molecule.
Protein structure prediction relies on accurate force fields. We next present an effort to improve the quality of charge assignment for large atomic models. In particular, we introduce the formalism of the polarizable charge equilibration scheme (PQEQ) and we describe its implementation in the molecular simulation package Lammps. PQEQ allows fast on the fly charge assignment even for reactive force fields.
Resumo:
DNA charge transport (CT) involves the efficient transfer of electrons or electron holes through the DNA π-stack over long molecular distances of at least 100 base-pairs. Despite this shallow distance dependence, DNA CT is sensitive to mismatches or lesions that disrupt π-stacking and is critically dependent on proper electronic coupling of the donor and acceptor moieties into the base stack. Favorable DNA CT is very rapid, occurring on the picosecond timescale. Because of this speed, electron holes equilibrate along the DNA π-stack, forming a characteristic pattern of DNA damage at low oxidation potential guanine multiplets. Furthermore, DNA CT may be used in a biological context. DNA processing enzymes with 4Fe4S clusters can perform DNA-mediated electron transfer (ET) self-exchange reactions with other 4Fe4S cluster proteins, even if the proteins are quite dissimilar, as long as the DNA-bound [4Fe4S]3+/2+ redox potentials are conserved. This mechanism would allow low copy number DNA repair proteins to find their lesions efficiently within the cell. DNA CT may also be used biologically for the long-range, selective activation of redox-active transcription factors. Within this work, we pursue other proteins that may utilize DNA CT within the cell and further elucidate aspects of the DNA-mediated ET self-exchange reaction of 4Fe4S cluster proteins.
Dps proteins, bacterial mini-ferritins that protect DNA from oxidative stress, are implicated in the survival and virulence of pathogenic bacteria. One aspect of their protection involves ferroxidase activity, whereby ferrous iron is bound and oxidized selectively by hydrogen peroxide, thereby preventing formation of damaging hydroxyl radicals via Fenton chemistry. Understanding the specific mechanism by which Dps proteins protect the bacterial genome could inform the development of new antibiotics. We investigate whether DNA-binding E. coli Dps can utilize DNA CT to protect the genome from a distance. An intercalating ruthenium photooxidant was employed to generate oxidative DNA damage via the flash-quench technique, which localizes to a low potential guanine triplet. We find that Dps loaded with ferrous iron, in contrast to Apo-Dps and ferric iron-loaded Dps which lack available reducing equivalents, significantly attenuates the yield of oxidative DNA damage at the guanine triplet. These data demonstrate that ferrous iron-loaded Dps is selectively oxidized to fill guanine radical holes, thereby restoring the integrity of the DNA. Luminescence studies indicate no direct interaction between the ruthenium photooxidant and Dps, supporting the DNA-mediated oxidation of ferrous iron-loaded Dps. Thus DNA CT may be a mechanism by which Dps efficiently protects the genome of pathogenic bacteria from a distance.
Further work focused on spectroscopic characterization of the DNA-mediated oxidation of ferrous iron-loaded Dps. X-band EPR was used to monitor the oxidation of DNA-bound Dps after DNA photooxidation via the flash-quench technique. Upon irradiation with poly(dGdC)2, a signal arises with g = 4.3, consistent with the formation of mononuclear high-spin Fe(III) sites of low symmetry, the expected oxidation product of Dps with one iron bound at each ferroxidase site. When poly(dGdC)2 is substituted with poly(dAdT)2, the yield of Dps oxidation is decreased significantly, indicating that guanine radicals facilitate Dps oxidation. The more favorable oxidation of Dps by guanine radicals supports the feasibility of a long-distance protection mechanism via DNA CT where Dps is oxidized to fill guanine radical holes in the bacterial genome produced by reactive oxygen species.
We have also explored possible electron transfer intermediates in the DNA-mediated oxidation of ferrous iron-loaded Dps. Dps proteins contain a conserved tryptophan residue in close proximity to the ferroxidase site (W52 in E. coli Dps). In comparison to WT Dps, in EPR studies of the oxidation of ferrous iron-loaded Dps following DNA photooxidation, W52Y and W52A mutants were deficient in forming the characteristic EPR signal at g = 4.3, with a larger deficiency for W52A compared to W52Y. In addition to EPR, we also probed the role of W52 Dps in cells using a hydrogen peroxide survival assay. Bacteria containing W52Y Dps survived the hydrogen peroxide challenge more similarly to those containing WT Dps, whereas cells with W52A Dps died off as quickly as cells without Dps. Overall, these results suggest the possibility of W52 as a CT hopping intermediate.
DNA-modified electrodes have become an essential tool for the study of the redox chemistry of DNA processing enzymes with 4Fe4S clusters. In many cases, it is necessary to investigate different complex samples and substrates in parallel in order to elucidate this chemistry. Therefore, we optimized and characterized a multiplexed electrochemical platform with the 4Fe4S cluster base excision repair glycosylase Endonuclease III (EndoIII). Closely packed DNA films, where the protein has limited surface accessibility, produce EndoIII electrochemical signals sensitive to an intervening mismatch, indicating a DNA-mediated process. Multiplexed analysis allowed more robust characterization of the CT-deficient Y82A EndoIII mutant, as well as comparison of a new family of mutations altering the electrostatics surrounding the 4Fe4S cluster in an effort to shift the reduction potential of the cluster. While little change in the DNA-bound midpoint potential was found for this family of mutants, likely indicating the dominant effect of DNA-binding on establishing the protein redox potential, significant variations in the efficiency of DNA-mediated electron transfer were apparent. On the basis of the stability of these proteins, examined by circular dichroism, we proposed that the electron transfer pathway in EndoIII can be perturbed not only by the removal of aromatic residues but also through changes in solvation near the cluster.
While the 4Fe4S cluster of EndoIII is relatively insensitive to oxidation and reduction in solution, we have found that upon DNA binding, the reduction potential of the [4Fe4S]3+/2+ couple shifts negatively by approximately 200 mV, bringing this couple into a physiologically relevant range. Demonstrated using electrochemistry experiments in the presence and absence of DNA, these studies do not provide direct molecular evidence for the species being observed. Sulfur K-edge X-ray absorbance spectroscopy (XAS) can be used to probe directly the covalency of iron-sulfur clusters, which is correlated to their reduction potential. We have shown that the Fe-S covalency of the 4Fe4S cluster of EndoIII increases upon DNA binding, stabilizing the oxidized [4Fe4S]3+ cluster, consistent with a negative shift in reduction potential. The 7% increase in Fe-S covalency corresponds to an approximately 150 mV shift, remarkably similar to DNA electrochemistry results. Therefore we have obtained direct molecular evidence for the shift in 4Fe4S reduction potential of EndoIII upon DNA binding, supporting the feasibility of our model whereby these proteins can utilize DNA CT to cooperate in order to efficiently find DNA lesions inside cells.
In conclusion, in this work we have explored the biological applications of DNA CT. We discovered that the DNA-binding bacterial ferritin Dps can protect the bacterial genome from a distance via DNA CT, perhaps contributing to pathogen survival and virulence. Furthermore, we optimized a multiplexed electrochemical platform for the study of the redox chemistry of DNA-bound 4Fe4S cluster proteins. Finally, we have used sulfur K-edge XAS to obtain direct molecular evidence for the negative shift in 4Fe4S cluster reduction potential of EndoIII upon DNA binding. These studies contribute to the understanding of DNA-mediated protein oxidation within cells.
Resumo:
I. The 3.7 Å Crystal Structure of Horse Heart Ferricytochrome C.
The crystal structure of horse heart ferricytochrome c has been determined to a resolution of 3.7 Å using the multiple isomorphous replacement technique. Two isomorphous derivatives were used in the analysis, leading to a map with a mean figure of merit of 0.458. The quality of the resulting map was extremely high, even though the derivative data did not appear to be of high quality.
Although it was impossible to fit the known amino acid sequence to the calculated structure in an unambiguous way, many important features of the molecule could still be determined from the 3.7 Å electron density map. Among these was the fact that cytochrome c contains little or no α-helix. The polypeptide chain appears to be wound about the heme group in such a way as to form a loosely packed hydrophobic core in the molecule.
The heme group is located in a cleft on the molecule with one edge exposed to the solvent. The fifth coordinating ligand is His 18 and the sixth coordinating ligand is probably neither His 26 nor His 33.
The high resolution analysis of cytochrome c is now in progress and should be completed within the next year.
II. The Application of the Karle-Hauptman Tangent Formula to Protein Phasing.
The Karle-Hauptman tangent formula has been shown to be applicable to the refinement of previously determined protein phases. Tests were made with both the cytochrome c data from Part I and a theoretical structure based on the myoglobin molecule. The refinement process was found to be highly dependent upon the manner in which the tangent formula was applied. Iterative procedures did not work well, at least at low resolution.
The tangent formula worked very well in selecting the true phase from the two possible phase choices resulting from a single isomorphous replacement phase analysis. The only restriction on this application is that the heavy atoms form a non-centric cluster in the unit cell.
Pages 156 through 284 in this Thesis consist of previously published papers relating to the above two sections. References to these papers can be found on page 155.