16 resultados para PROTEIN-PROTEIN INTERACTIONS
em CaltechTHESIS
Resumo:
This dissertation describes studies of G protein-coupled receptors (GPCRs) and ligand-gated ion channels (LGICs) using unnatural amino acid mutagenesis to gain high precision insights into the function of these important membrane proteins.
Chapter 2 considers the functional role of highly conserved proline residues within the transmembrane helices of the D2 dopamine GPCR. Through mutagenesis employing unnatural α-hydroxy acids, proline analogs, and N-methyl amino acids, we find that lack of backbone hydrogen bond donor ability is important to proline function. At one proline site we additionally find that a substituent on the proline backbone N is important to receptor function.
In Chapter 3, side chain conformation is probed by mutagenesis of GPCRs and the muscle-type nAChR. Specific side chain rearrangements of highly conserved residues have been proposed to accompany activation of these receptors. These rearrangements were probed using conformationally-biased β-substituted analogs of Trp and Phe and unnatural stereoisomers of Thr and Ile. We also modeled the conformational bias of the unnatural Trp and Phe analogs employed.
Chapters 4 and 5 examine details of ligand binding to nAChRs. Chapter 4 describes a study investigating the importance of hydrogen bonds between ligands and the complementary face of muscle-type and α4β4 nAChRs. A hydrogen bond involving the agonist appears to be important for ligand binding in the muscle-type receptor but not the α4β4 receptor.
Chapter 5 describes a study characterizing the binding of varenicline, an actively prescribed smoking cessation therapeutic, to the α7 nAChR. Additionally, binding interactions to the complementary face of the α7 binding site were examined for a small panel of agonists. We identified side chains important for binding large agonists such as varenicline, but dispensable for binding the small agonist ACh.
Chapter 6 describes efforts to image nAChRs site-specifically modified with a fluorophore by unnatural amino acid mutagenesis. While progress was hampered by high levels of fluorescent background, improvements to sample preparation and alternative strategies for fluorophore incorporation are described.
Chapter 7 describes efforts toward a fluorescence assay for G protein association with a GPCR, with the ultimate goal of probing key protein-protein interactions along the G protein/receptor interface. A wide range of fluorescent protein fusions were generated, expressed in Xenopus oocytes, and evaluated for their ability to associate with each other.
Resumo:
Computational protein design (CPD) is a burgeoning field that uses a physical-chemical or knowledge-based scoring function to create protein variants with new or improved properties. This exciting approach has recently been used to generate proteins with entirely new functions, ones that are not observed in naturally occurring proteins. For example, several enzymes were designed to catalyze reactions that are not in the repertoire of any known natural enzyme. In these designs, novel catalytic activity was built de novo (from scratch) into a previously inert protein scaffold. In addition to de novo enzyme design, the computational design of protein-protein interactions can also be used to create novel functionality, such as neutralization of influenza. Our goal here was to design a protein that can self-assemble with DNA into nanowires. We used computational tools to homodimerize a transcription factor that binds a specific sequence of double-stranded DNA. We arranged the protein-protein and protein-DNA binding sites so that the self-assembly could occur in a linear fashion to generate nanowires. Upon mixing our designed protein homodimer with the double-stranded DNA, the molecules immediately self-assembled into nanowires. This nanowire topology was confirmed using atomic force microscopy. Co-crystal structure showed that the nanowire is assembled via the desired interactions. To the best of our knowledge, this is the first example of a protein-DNA self-assembly that does not rely on covalent interactions. We anticipate that this new material will stimulate further interest in the development of advanced biomaterials.
Resumo:
Single-cell functional proteomics assays can connect genomic information to biological function through quantitative and multiplex protein measurements. Tools for single-cell proteomics have developed rapidly over the past 5 years and are providing unique opportunities. This thesis describes an emerging microfluidics-based toolkit for single cell functional proteomics, focusing on the development of the single cell barcode chips (SCBCs) with applications in fundamental and translational cancer research.
The microchip designed to simultaneously quantify a panel of secreted, cytoplasmic and membrane proteins from single cells will be discussed at the beginning, which is the prototype for subsequent proteomic microchips with more sophisticated design in preclinical cancer research or clinical applications. The SCBCs are a highly versatile and information rich tool for single-cell functional proteomics. They are based upon isolating individual cells, or defined number of cells, within microchambers, each of which is equipped with a large antibody microarray (the barcode), with between a few hundred to ten thousand microchambers included within a single microchip. Functional proteomics assays at single-cell resolution yield unique pieces of information that significantly shape the way of thinking on cancer research. An in-depth discussion about analysis and interpretation of the unique information such as functional protein fluctuations and protein-protein correlative interactions will follow.
The SCBC is a powerful tool to resolve the functional heterogeneity of cancer cells. It has the capacity to extract a comprehensive picture of the signal transduction network from single tumor cells and thus provides insight into the effect of targeted therapies on protein signaling networks. We will demonstrate this point through applying the SCBCs to investigate three isogenic cell lines of glioblastoma multiforme (GBM).
The cancer cell population is highly heterogeneous with high-amplitude fluctuation at the single cell level, which in turn grants the robustness of the entire population. The concept that a stable population existing in the presence of random fluctuations is reminiscent of many physical systems that are successfully understood using statistical physics. Thus, tools derived from that field can probably be applied to using fluctuations to determine the nature of signaling networks. In the second part of the thesis, we will focus on such a case to use thermodynamics-motivated principles to understand cancer cell hypoxia, where single cell proteomics assays coupled with a quantitative version of Le Chatelier's principle derived from statistical mechanics yield detailed and surprising predictions, which were found to be correct in both cell line and primary tumor model.
The third part of the thesis demonstrates the application of this technology in the preclinical cancer research to study the GBM cancer cell resistance to molecular targeted therapy. Physical approaches to anticipate therapy resistance and to identify effective therapy combinations will be discussed in detail. Our approach is based upon elucidating the signaling coordination within the phosphoprotein signaling pathways that are hyperactivated in human GBMs, and interrogating how that coordination responds to the perturbation of targeted inhibitor. Strongly coupled protein-protein interactions constitute most signaling cascades. A physical analogy of such a system is the strongly coupled atom-atom interactions in a crystal lattice. Similar to decomposing the atomic interactions into a series of independent normal vibrational modes, a simplified picture of signaling network coordination can also be achieved by diagonalizing protein-protein correlation or covariance matrices to decompose the pairwise correlative interactions into a set of distinct linear combinations of signaling proteins (i.e. independent signaling modes). By doing so, two independent signaling modes – one associated with mTOR signaling and a second associated with ERK/Src signaling have been resolved, which in turn allow us to anticipate resistance, and to design combination therapies that are effective, as well as identify those therapies and therapy combinations that will be ineffective. We validated our predictions in mouse tumor models and all predictions were borne out.
In the last part, some preliminary results about the clinical translation of single-cell proteomics chips will be presented. The successful demonstration of our work on human-derived xenografts provides the rationale to extend our current work into the clinic. It will enable us to interrogate GBM tumor samples in a way that could potentially yield a straightforward, rapid interpretation so that we can give therapeutic guidance to the attending physicians within a clinical relevant time scale. The technical challenges of the clinical translation will be presented and our solutions to address the challenges will be discussed as well. A clinical case study will then follow, where some preliminary data collected from a pediatric GBM patient bearing an EGFR amplified tumor will be presented to demonstrate the general protocol and the workflow of the proposed clinical studies.
Resumo:
This dissertation primarily describes chemical-scale studies of G protein-coupled receptors and Cys-loop ligand-gated ion channels to better understand ligand binding interactions and the mechanism of channel activation using recently published crystal structures as a guide. These studies employ the use of unnatural amino acid mutagenesis and electrophysiology to measure subtle changes in receptor function.
In chapter 2, the role of a conserved aromatic microdomain predicted in the D3 dopamine receptor is probed in the closely related D2 and D4 dopamine receptors. This domain was found to act as a structural unit near the ligand binding site that is important for receptor function. The domain consists of several functionally important noncovalent interactions including hydrogen bond, aromatic-aromatic, and sulfur-π interactions that show strong couplings by mutant cycle analysis. We also assign an alternate interpretation for the linear fluorination plot observed at W6.48, a residue previously thought to participate in a cation-π interaction with dopamine.
Chapter 3 outlines attempts to incorporate chemically synthesized and in vitro acylated unnatural amino acids into mammalian cells. While our attempts were not successful, method optimizations and data for nonsense suppression with an in vivo acylated tRNA are included. This chapter is aimed to aid future researchers attempting unnatural amino acid mutagenesis in mammalian cells.
Chapter 4 identifies a cation-π interaction between glutamate and a tyrosine residue on loop C in the GluClβ receptor. Using the recently published crystal structure of the homologous GluClα receptor, other ligand-binding and protein-protein interactions are probed to determine the similarity between this invertebrate receptor and other more distantly related vertebrate Cys-loop receptors. We find that many of the interactions previously observed are conserved in the GluCl receptors, however care must be taken when extrapolating structural data.
Chapter 5 examines inherent properties of the GluClα receptor that are responsible for the observed glutamate insensitivity of the receptor. Chimera synthesis and mutagenesis reveal the C-terminal portion of the M4 helix and the C-terminus as contributing to formation of the decoupled state, where ligand binding is incapable of triggering channel gating. Receptor mutagenesis was unable to identify single residue mismatches or impaired protein-protein interactions within this domain. We conclude that M4 helix structure and/or membrane dynamics are likely the cause of ligand insensitivity in this receptor and that the M4 helix has an role important in the activation process.
Resumo:
The investigations presented in this thesis use various in vivo techniques to understand how trans-acting factors control gene expression. The first part addresses the transcriptional regulation of muscle creatine kinase (MCK). MCK expression is activated during the course of development and is found only in differentiated muscle. Several in vivo footprints are observed at the enhancer of this gene, but all of these interactions are limited to cell types that express MCK. This is interesting because two of the footprints appear to represent muscle specific use of general transcription factors, while the other two correspond to sites that can bind the myogenic regulator, MyoD1, in vitro. MyoD1 and these general factors are present in myoblasts, but can bind to the enhancer only in myocytes. This suggests that either the factors themselves are post-translationally modified (phosphorylation or protein:protein interactions), or the accessibility of the enhancer to the factors is limited (changes in chromatin structure). The in vivo footprinting study of MCK was performed with a new ligation mediated, single-sided PCR (polymerase chain reaction) technique that I have developed.
The second half of the thesis concerns the regulation of mouse metallothionein (MT). Metallothioneins are a family of highly conserved housekeeping genes whose expression can be induced by heavy metals, steroids, and other stresses. By adapting a primer extension method of genomic sequencing to in vivo footprinting, I've observed both metal inducible and noninducible interactions at the promoter of MT-I. From these results I've been able to limit the possible mechanisms by which metal responsive trans-acting factors induce transcription. These interpretations correlate with a second line of experiments involving the stable titration of positive acting factors necessary for induction of MT. I've amplified the promoter of MT to 10^2-10^3 copies per cell by fusing the 5' and 3' ends of the MT gene to the coding region of DHFR and selecting cells for methotrexate resistance. In these cells, there is a metal-specific titration effect, and although it acts at the level of transcription, it appears to be independent of direct DNA binding factors.
Resumo:
Heparin has been used as an anticoagulant drug for more than 70 years. The global distribution of contaminated heparin in 2007, which resulted in adverse clinical effects and over 100 deaths, emphasizes the necessity for safer alternatives to animal-sourced heparin. The structural complexity and heterogeneity of animal-sourced heparin not only impedes safe access to these biologically active molecules, but also hinders investigations on the significance of structural constituents at a molecular level. Efficient methods for preparing new synthetic heparins with targeted biological activity are necessary not only to ensure clinical safety, but to optimize derivative design to minimize potential side effects. Low molecular weight heparins have become a reliable alternative to heparin, due to their predictable dosages, long half-lives, and reduced side effects. However, heparin oligosaccharide synthesis is a challenging endeavor due to the necessity for complex protecting group manipulation and stereoselective glycosidic linkage chemistry, which often result in lengthy synthetic routes and low yields. Recently, chemoenzymatic syntheses have produced targeted ultralow molecular weight heparins with high-efficiency, but continue to be restricted by the substrate specificities of enzymes.
To address the need for access to homogeneous, complex glycosaminoglycan structures, we have synthesized novel heparan sulfate glycopolymers with well-defined carbohydrate structures and tunable chain length through ring-opening metathesis polymerization chemistry. These polymers recapitulate the key features of anticoagulant heparan sulfate by displaying the sulfation pattern responsible for heparin’s anticoagulant activity. The use of polymerization chemistry greatly simplifies the synthesis of complex glycosaminoglycan structures, providing a facile method to generate homogeneous macromolecules with tunable biological and chemical properties. Through the use of in vitro chromogenic substrate assays and ex vivo clotting assays, we found that the HS glycopolymers exhibited anticoagulant activity in a sulfation pattern and length-dependent manner. Compared to heparin standards, our short polymers did not display any activity. However, our longer polymers were able to incorporate in vitro and ex vivo characteristics of both low-molecular-weight heparin derivatives and heparin, displaying hybrid anticoagulant properties. These studies emphasize the significance of sulfation pattern specificity in specific carbohydrate-protein interactions, and demonstrate the effectiveness of multivalent molecules in recapitulating the activity of natural polysaccharides.
Resumo:
Efficient and accurate localization of membrane proteins is essential to all cells and requires a complex cascade of interactions between protein machineries. This is exemplified in the recently discovered Guided Entry of Tail-anchored protein pathway, in which the central targeting factor Get3 must sequentially interact with three distinct binding partners (Get4, Get1 and Get2) to ensure the targeted delivery of Tail-anchored proteins to the endoplasmic reticulum membrane. To understand the molecular and energetic principles that provide the vectorial driving force of these interactions, we used a quantitative fluorescence approach combined with mechanistic enzymology to monitor the effector interactions of Get3 at each stage of Tail-anchored protein targeting. We show that nucleotide and membrane protein substrate generate a gradient of interaction energies that drive the cyclic and ordered transit of Get3 from Get4 to Get2 and lastly to Get1. These data also define how the Get3/Tail-anchored complex is captured, handed over, and disassembled by the Get1/2 receptor at the membrane, and reveal a novel role for Get4/5 in recycling Get3 from the endoplasmic reticulum membrane at the end of the targeting reaction. These results provide general insights into how complex cascades of protein interactions are coordinated and coupled to energy inputs in biological systems.
Resumo:
Being able to detect a single molecule without the use of labels has been a long standing goal of bioengineers and physicists. This would simplify applications ranging from single molecular binding studies to those involving public health and security, improved drug screening, medical diagnostics, and genome sequencing. One promising technique that has the potential to detect single molecules is the microtoroid optical resonator. The main obstacle to detecting single molecules, however, is decreasing the noise level of the measurements such that a single molecule can be distinguished from background. We have used laser frequency locking in combination with balanced detection and data processing techniques to reduce the noise level of these devices and report the detection of a wide range of nanoscale objects ranging from nanoparticles with radii from 100 to 2.5 nm, to exosomes, ribosomes, and single protein molecules (mouse immunoglobulin G and human interleukin-2). We further extend the exosome results towards creating a non-invasive tumor biopsy assay. Our results, covering several orders of magnitude of particle radius (100 nm to 2 nm), agree with the `reactive' model prediction for the frequency shift of the resonator upon particle binding. In addition, we demonstrate that molecular weight may be estimated from the frequency shift through a simple formula, thus providing a basis for an ``optical mass spectrometer'' in solution. We anticipate that our results will enable many applications, including more sensitive medical diagnostics and fundamental studies of single receptor-ligand and protein-protein interactions in real time. The thesis summarizes what we have achieved thus far and shows that the goal of detecting a single molecule without the use of labels can now be realized.
Resumo:
With recent advances in high-throughput sequencing, mapping of genome-wide transcription factor occupancy has become feasible. To advance the understanding of skeletal muscle differentiation specifically and transcriptional regulation in general, I determined the genome-wide occupancy map for myogenin in differentiating C2C12 myocyte cells. I then analyzed the myogenin map for underlying sequence content and the association between occupied elements and expression trajectories of adjacent genes. Having determined that myogenin primarily associates with expressed genes, I performed a similar analysis on occupancy maps of other transcription factors active during skeletal muscle differentiation, including an extensive analysis of co-occupancy. This analysis provided strong motif evidence for protein-protein interactions as the primary driving force in the formation of Myogenin / Mef2 and MyoD / AP-1 complexes at jointly-occupied sites. Finally, factor occupancy analysis was extended to include bHLH transcription factors in tissues other than skeletal muscle. The cross-tissue analysis led to the emergence of a motif structure used by bHLH TFs to encode either tissue-specific or "general" (public) access in a variety of lineages.
Resumo:
Immunoglobulin G (IgG) is central in mediating host defense due to its ability to target and eliminate invading pathogens. The fragment antigen binding (Fab) regions are responsible for antigen recognition; however the effector responses are encoded on the Fc region of IgG. IgG Fc displays considerable glycan heterogeneity, accounting for its complex effector functions of inflammation, modulation and immune suppression. Intravenous immunoglobulin G (IVIG) is pooled serum IgG from multiple donors and is used to treat individuals with autoimmune and inflammatory disorders such as rheumatoid arthritis and Kawasaki’s disease, respectively. It contains all the subtypes of IgG (IgG1-4) and over 120 glycovariants due to variation of an Asparagine 297-linked glycan on the Fc. The species identified as the activating component of IVIG is sialylated IgG Fc. Comparisons of wild type Fc and sialylated Fc X-ray crystal structures suggests that sialylation causes an increase in conformational flexibility, which may be important for its anti-inflammatory properties.
Although glycan modifications can promote the anti-inflammatory properties of the Fc, there are amino acid substitutions that cause Fcs to initiate an enhanced immune response. Mutations in the Fc can cause up to a 100-fold increase in binding affinity to activating Fc gamma receptors located on immune cells, and have been shown to enhance antibody dependent cell-mediated cytotoxicity. This is important in developing therapeutic antibodies against cancer and infectious diseases. Structural studies of mutant Fcs in complex with activating receptors gave insight into new protein-protein interactions that lead to an enhanced binding affinity.
Together these studies show how dynamic and diverse the Fc region is and how both protein and carbohydrate modifications can alter structure, leading to IgG Fc’s switch from a pro-inflammatory to an anti-inflammatory protein.
Resumo:
Viruses possess very specific methods of targeting and entering cells. These methods would be extremely useful if they could also be applied to drug delivery, but little is known about the molecular mechanisms of the viral entry process. In order to gain further insight into mechanisms of viral entry, chemical and spectroscopic studies in two systems were conducted, examining hydrophobic protein-lipid interactions during Sendai virus membrane fusion, and the kinetics of bacteriophage λ DNA injection.
Sendai virus glycoprotein interactions with target membranes during the early stages of fusion were examined using time-resolved hydrophobic photoaffinity labeling with the lipid-soluble carbene generator3-(trifluoromethyl)-3-(m-^(125 )I] iodophenyl)diazirine (TID). The probe was incorporated in target membranes prior to virus addition and photolysis. During Sendai virus fusion with liposomes composed of cardiolipin (CL) or phosphatidylserine (PS), the viral fusion (F) protein is preferentially labeled at early time points, supporting the hypothesis that hydrophobic interaction of the fusion peptide at the N-terminus of the F_1 subunit with the target membrane is an initiating event in fusion. Correlation of the hydrophobic interactions with independently monitored fusion kinetics further supports this conclusion. Separation of proteins after labeling shows that the F_1 subunit, containing the putative hydrophobic fusion sequence, is exclusively labeled, and that the F_2 subunit does not participate in fusion. Labeling shows temperature and pH dependence consistent with a need for protein conformational mobility and fusion at neutral pH. Higher amounts of labeling during fusion with CL vesicles than during virus-PS vesicle fusion reflects membrane packing regulation of peptide insertion into target membranes. Labeling of the viral hemagglutinin/neuraminidase (HN) at low pH indicates that HN-mediated fusion is triggered by hydrophobic interactions, after titration of acidic amino acids. HN labeling under nonfusogenic conditions reveals that viral binding may involve hydrophobic as well as electrostatic interactions. Controls for diffusional labeling exclude a major contribution from this source. Labeling during reconstituted Sendai virus envelope-liposome fusion shows that functional reconstitution involves protein retention of the ability to undergo hydrophobic interactions.
Examination of Sendai virus fusion with erythrocyte membranes indicates that hydrophobic interactions also trigger fusion between biological membranes, and that HN binding may involve hydrophobic interactions as well. Labeling of the erythrocyte membranes revealed close membrane association of spectrin, which may play a role in regulating membrane fusion. The data show that hydrophobic fusion protein interaction with both artificial and biological membranes is a triggering event in fusion. Correlation of these results with earlier studies of membrane hydration and fusion kinetics provides a more detailed view of the mechanism of fusion.
The kinetics of DNA injection by bacteriophage λ. into liposomes bearing reconstituted receptors were measured using fluorescence spectroscopy. LamB, the bacteriophage receptor, was extracted from bacteria and reconstituted into liposomes by detergent removal dialysis. The DNA binding fluorophore ethidium bromide was encapsulated in the liposomes during dialysis. Enhanced fluorescence of ethidium bromide upon binding to injected DNA was monitored, and showed that injection is a rapid, one-step process. The bimolecular rate law, determined by the method of initial rates, revealed that injection occurs several times faster than indicated by earlier studies employing indirect assays.
It is hoped that these studies will increase the understanding of the mechanisms of virus entry into cells, and to facilitate the development of virus-mimetic drug delivery strategies.
Resumo:
A novel Ca^(2+)-binding protein with Mr of 23 K (designated p23) has been identified in avian erythrocytes and thrombocytes. p23 localizes to the marginal bands (MBs), centrosomes and discrete sites around the nuclear membrane in mature avian erythrocytes. p23 appears to bind Ca^(2+) directly and its interaction with subcellular organelles seems to be modulated by intracellular [Ca^(2+)]. However, its unique protein sequence lacks any known Ca^(2+)-binding motif. Developmental analysis reveals that p23 association to its target structures occurs only at very late stages of bone marrow definitive erythropoeisis. In primitive erythroid cells, p23 distributes diffusely in the cytoplasm and lacks any distinct localization. It is postulated that p23 association to subcellular structures may be induced in part by decreased intracellular [Ca^(2+)]. In vitro and in vivo experiments indicate that p23 does not appear to act as a classical microtubule-associated protein (MAP) but p23 homologues appear to be expressed in MB-containing cells of a variety of species from different vertebrate classes. It has been hypothesized that p23 may play a regulatory role in MB stabilization in a Ca^(2+)-dependent manner.
Binucleated (bnbn) turkey erythrocytes were found to express a truncated p23 variant (designated p21) with identical subcellular localization as p23 except immunostaining reveals the presence of multi-centrosomes in bnbn cells. The p21 sequence has a 62 amino acid deletion at the C-terminus and must therefore have an additional ~40 amino acids at the N-terminus. In addition, p21 seems to have lost the ability to bind Ca^(2+) and its supramolecular interactions are not modulated by intracellular [Ca^(2+)]. These apparent differences between p23 and p21 raised the possibility that the p23/p21 allelism could be the Bn/bn genotype. However, genetic analysis suggested that p23/p21 allelism had no absolute correlation with the Bn/bn genotype.
Resumo:
A unique chloroplast Signal Recognition Particle (SRP) in green plants is primarily dedicated to the post-translational targeting of light harvesting chlorophyll-a/b binding (LHC) proteins. Our study of the thermodynamics and kinetics of the GTPases of the system demonstrates that GTPase complex assembly and activation are highly coupled in the chloroplast GTPases, suggesting they may forego the GTPase activation step as a key regulatory point. This reflects adaptations of the chloroplast SRP to the delivery of their unique substrate protein. Devotion to one highly hydrophobic family of proteins also may have allowed the chloroplast SRP system to evolve an efficient chaperone in the cpSRP43 subunit. To understand the mechanism of disaggregation, we showed that LHC proteins form micellar, disc-shaped aggregates that present a recognition motif (L18) on the aggregate surface. Further molecular genetic and structure-activity analyses reveal that the action of cpSRP43 can be dissected into two steps: (i) initial recognition of L18 on the aggregate surface; and (ii) aggregate remodeling, during which highly adaptable binding interactions of cpSRP43 with hydrophobic transmembrane domains of the substrate protein compete with the packing interactions within the aggregate. We also tested the adaptability of cpSRP43 for alternative substrates, specifically in attempts to improve membrane protein expression and inhibition of amyloid beta fibrillization. These preliminary results attest to cpSRP43’s potential as a molecular chaperone and provides the impetus for further engineering endeavors to address problems that stem from protein aggregation.
Resumo:
G protein-coupled receptors (GPCRs) are the largest family of proteins within the human genome. They consist of seven transmembrane (TM) helices, with a N-terminal region of varying length and structure on the extracellular side, and a C-terminus on the intracellular side. GPCRs are involved in transmitting extracellular signals to cells, and as such are crucial drug targets. Designing pharmaceuticals to target GPCRs is greatly aided by full-atom structural information of the proteins. In particular, the TM region of GPCRs is where small molecule ligands (much more bioavailable than peptide ligands) typically bind to the receptors. In recent years nearly thirty distinct GPCR TM regions have been crystallized. However, there are more than 1,000 GPCRs, leaving the vast majority of GPCRs with limited structural information. Additionally, GPCRs are known to exist in a myriad of conformational states in the body, rendering the static x-ray crystal structures an incomplete reflection of GPCR structures. In order to obtain an ensemble of GPCR structures, we have developed the GEnSeMBLE procedure to rapidly sample a large number of variations of GPCR helix rotations and tilts. The lowest energy GEnSeMBLE structures are then docked to small molecule ligands and optimized. The GPCR family consists of five subfamilies with little to no sequence homology between them: class A, B1, B2, C, and Frizzled/Taste2. Almost all of the GPCR crystal structures have been of class A GPCRs, and much is known about their conserved interactions and binding sites. In this work we particularly focus on class B1 GPCRs, and aim to understand that family’s interactions and binding sites both to small molecules and their native peptide ligands. Specifically, we predict the full atom structure and peptide binding site of the glucagon-like peptide receptor and the TM region and small molecule binding sites for eight other class B1 GPCRs: CALRL, CRFR1, GIPR, GLR, PACR, PTH1R, VIPR1, and VIPR2. Our class B1 work reveals multiple conserved interactions across the B1 subfamily as well as a consistent small molecule binding site centrally located in the TM bundle. Both the interactions and the binding sites are distinct from those seen in the more well-characterized class A GPCRs, and as such our work provides a strong starting point for drug design targeting class B1 proteins. We also predict the full structure of CXCR4 bound to a small molecule, a class A GPCR that was not closely related to any of the class A GPCRs at the time of the work.
Resumo:
The first chapter of this thesis deals with automating data gathering for single cell microfluidic tests. The programs developed saved significant amounts of time with no loss in accuracy. The technology from this chapter was applied to experiments in both Chapters 4 and 5.
The second chapter describes the use of statistical learning to prognose if an anti-angiogenic drug (Bevacizumab) would successfully treat a glioblastoma multiforme tumor. This was conducted by first measuring protein levels from 92 blood samples using the DNA-encoded antibody library platform. This allowed the measure of 35 different proteins per sample, with comparable sensitivity to ELISA. Two statistical learning models were developed in order to predict whether the treatment would succeed. The first, logistic regression, predicted with 85% accuracy and an AUC of 0.901 using a five protein panel. These five proteins were statistically significant predictors and gave insight into the mechanism behind anti-angiogenic success/failure. The second model, an ensemble model of logistic regression, kNN, and random forest, predicted with a slightly higher accuracy of 87%.
The third chapter details the development of a photocleavable conjugate that multiplexed cell surface detection in microfluidic devices. The method successfully detected streptavidin on coated beads with 92% positive predictive rate. Furthermore, chambers with 0, 1, 2, and 3+ beads were statistically distinguishable. The method was then used to detect CD3 on Jurkat T cells, yielding a positive predictive rate of 49% and false positive rate of 0%.
The fourth chapter talks about the use of measuring T cell polyfunctionality in order to predict whether a patient will succeed an adoptive T cells transfer therapy. In 15 patients, we measured 10 proteins from individual T cells (~300 cells per patient). The polyfunctional strength index was calculated, which was then correlated with the patient's progress free survival (PFS) time. 52 other parameters measured in the single cell test were correlated with the PFS. No statistical correlator has been determined, however, and more data is necessary to reach a conclusion.
Finally, the fifth chapter talks about the interactions between T cells and how that affects their protein secretion. It was observed that T cells in direct contact selectively enhance their protein secretion, in some cases by over 5 fold. This occurred for Granzyme B, Perforin, CCL4, TNFa, and IFNg. IL- 10 was shown to decrease slightly upon contact. This phenomenon held true for T cells from all patients tested (n=8). Using single cell data, the theoretical protein secretion frequency was calculated for two cells and then compared to the observed rate of secretion for both two cells not in contact, and two cells in contact. In over 90% of cases, the theoretical protein secretion rate matched that of two cells not in contact.