16 resultados para conserved noncoding sequence
em CaltechTHESIS
Resumo:
DNA recognition is an essential biological process responsible for the regulation of cellular functions including protein synthesis and cell division and is implicated in the mechanism of action of some anticancer drugs. Studies directed towards defining the elements responsible for sequence specific DNA recognition through the study of the interactions of synthetic organic ligands with DNA are described.
DNA recognition by poly-N-methylpyrrolecarboxamides was studied by the synthesis and characterization of a series of molecules where the number of contiguous N-methylpyrrolecarboxamide units was increased from 2 to 9. The effect of this incremental change in structure on DNA recognition has been investigated at base pair resolution using affinity cleaving and MPE•Fe(II) footprinting techniques. These studies led to a quantitative relationship between the number of amides in the molecule and the DNA binding site size. This relationship is called the n + 1 rule and it states that a poly-N methylpyrrolecarboxamide molecule with n amides will bind n + 1 base pairs of DNA. This rule is consistent with a model where the carboxamides of these compounds form three center bridging hydrogen bonds between adjacent base pairs on opposite strands of the helix. The poly-N methylpyrrolecarboxamide recognition element was found to preferentially bind poly dA•poly dT stretches; however, both binding site selection and orientation were found to be affected by flanking sequences. Cleavage of large DNA is also described.
One approach towards the design of molecules that bind large sequences of double helical DNA sequence specifically is to couple DNA binding subunits of similar or diverse base pair specificity. Bis-EDTA-distamycin-fumaramide (BEDF) is an octaamide dimer of two tri-N methylpyrrolecarboxamide subunits linked by fumaramide. DNA recognition by BEDF was compared to P7E, an octaamide molecule containing seven consecutive pyrroles. These two compounds were found to recognize the same sites on pBR322 with approximately the same affinities demonstrating that fumaramide is an effective linking element for Nmethylpyrrolecarboxamide recognition subunits. Further studies involved the synthesis and characterization of a trimer of tetra-N-methylpyrrolecarboxamide subunits linked by β-alanine ((P4)_(3)E). This trimerization produced a molecule which is capable of recognizing 16 base pairs of A•T DNA, more than a turn and a half of the DNA helix.
DNA footprinting is a powerful direct method for determining the binding sites of proteins and small molecules on heterogeneous DNA. It was found that attachment of EDTA•Fe(II) to spermine creates a molecule, SE•Fe(II), which binds and cleaves DNA sequence neutrally. This lack of specificity provides evidence that at the nucleotide level polyamines recognize heterogeneous DNA independent of sequence and allows SE•Fe(II) to be used as a footprinting reagent. SE•Fe(II) was compared with two other small molecule footprinting reagents, EDTA•Fe(II) and MPE•Fe(II).
Resumo:
The author has constructed a synthetic gene for ∝-lytic protease. Since the DNA sequence of the protein is not known, the gene was designed by using the reverse translation of ∝-lytic protease's amino acid sequence. Unique restriction sites are carefully sought in the degenerate DNA sequence to aid in future mutagenesis studies. The unique restriction sites are designed approximately 50 base pairs apart and their appropriate codons used in the DNA sequence. The codons used to construct the DNA sequence of ∝-lytic protease are preferred codons in E-coli or used in the production of β-lactamase. Codon usage is also distributed evenly to ensure that one particular codon is not heavily used. The gene is essentially constructed from the outside in. The gene is built in a stepwise fashion using plasmids as the vehicles for the ∝-lytic oligomers. The use of plasmids allows the replication and isolation of large quantities of the intermediates during gene synthesis. The ∝-lytic DNA is a double-stranded oligomer that has sufficient overhang and sticky ends to anneal correctly in the vector. After six steps of incorporating ∝-lytic DNA, the gene is completed and sequenced to ensure that the correct DNA sequence is present and that no mutations occurred in the structural gene.
β-lactamase is the other serine hydrolase studied in this thesis. The author used the class A RTEM-1 β- lactamase encoded on the plasmid pBR322 to investigate the roll of the conserved threonine residue at position 71. Cassette mutagenesis was previously used to generate all possible amino acid substitutions at position 71. The work presented here describes the purification and kinetic characterization of a T71H mutant previously constructed by S. Schultz. The mutated gene was transferred into plasmid pJN for expression and induced with IPTG. The enzyme is purified by column chromatography and FPLC to homogeneity. Kinetic studies reveal that the mutant has lower k_(cat) values on benzylpenicillin, cephalothin and 6-aminopenicillanic acid but no changes in k_m except for cephalothin which is approximately 4 times higher. The mutant did not change siginificantly in its pH profile compared to the wild-type enzyme. Also, the mutant is more sensitive to thermal denaturation as compared to the wild-type enzyme. However, experimental evidence indicates that the probable generation of a positive charge at position 71 thermally stabilized the mutant.
Resumo:
Vulval differentiation in C. elegans is mediated by an Epidermal growth factor (EGF)- EGF receptor (EGFR) signaling pathway. I have cloned unc-101, a negative regulator of vulval differentiation of the nematode C. elegans. unc-101 encodes a homolog of AP47, the medium chain of the trans-Golgi clathrin-associated protein complex. This identity was confirmed by cloning and comparing sequence of a C. elegans homolog of AP50, the medium chain of the plasma membrane clathrin-associated protein complex. I provided the first genetic evidence that the trans-Golgi clathrin-coated vesicles are involved in regulation of an EGF signaling pathway. Most of the unc-101 alleles are deletions or nonsense mutations, suggesting that these alleles severely reduce the unc-101 activity. A hybrid gene that contains parts of unc-101 and mouse AP4 7 rescued at least two phenotypes of unc-101 mutations, the Unc and the suppression of vulvaless phenotype of let-23(sy1) mutation. Therefore, the functions of AP47 are conserved between nematodes and mammals.
unc-101 mutations can cause a greater than wild-type vulval differentiation in combination with certain mutations in sli-1, another negative regulator of the vulval induction pathway. A mutation in a new gene, rok-1, causes no defect by itself, but causes a greater than wild-type vulval differentiation in the presence of a sli-1 mutation. The unc-101; rok-1; sli-1 triple mutants display a greater extent of vulval differentiation than any double mutant combinations of unc-101, rok-1 and sli-1. Therefore, rok-1 locus defines another negative regulator of the vulval induction pathway.
I analyzed a second gene encoding an AP47 homolog in C. elegans. This gene, CEAP47, encodes a protein 72% identical to both unc-101 and mammalian AP47. A hybrid gene containing parts of unc-101 and CEAP47 sequences can rescue phenotypes of unc-101 mutants, indicating that UNC- 101 and CEAP47 proteins can be redundant if expressed in the same set of cells.
Resumo:
This thesis describes research pursued in two areas, both involving the design and synthesis of sequence specific DNA-cleaving proteins. The first involves the use of sequence-specific DNA-cleaving metalloproteins to probe the structure of a protein-DNA complex, and the second seeks to develop cleaving moieties capable of DNA cleavage through the generation of a non-diffusible oxidant under physiological conditions.
Chapter One provides a brief review of the literature concerning sequence-specific DNA-binding proteins. Chapter Two summarizes the results of affinity cleaving experiments using leucine zipper-basic region (bZip) DNA-binding proteins. Specifically, the NH_2-terminal locations of a dimer containing the DNA binding domain of the yeast transcriptional activator GCN4 were mapped on the binding sites 5'-CTGACTAAT-3' and 5'ATGACTCTT- 3' using affinity cleaving. Analysis of the DNA cleavage patterns from Fe•EDTA-GCN4(222-281) and (226-281) dimers reveals that the NH_2-termini are in the major groove nine to ten base pairs apart and symmetrically displaced four to five base pairs from the central C of the recognition site. These data are consistent with structural models put forward for this class of DNA binding proteins. The results of these experiments are evaluated in light of the recently published crystal structure for the GCN4-DNA complex. Preliminary investigations of affinity cleaving proteins based on the DNA-binding domains of the bZip proteins Jun and Fos are also described.
Chapter Three describes experiments demonstrating the simultaneous binding of GCN4(226-281) and 1-Methylimidazole-2-carboxamide-netropsin (2-ImN), a designed synthetic peptide which binds in the minor groove of DNA at 5'-TGACT-3' sites as an antiparallel, side-by-side dimer. Through the use of Fe•EDTA-GCN4(226-281) as a sequence-specific footprinting agent, it is shown that the dimeric protein GCN4(226-281) and the dimeric peptide 2- ImN can simultaneously occupy their common binding site in the major and minor grooves of DNA, respectively. The association constants for 2-ImN in the presence and in the absence of Fe•EDTA-GCN4(226-281) are found to be similar, suggesting that the binding of the two dimers is not cooperative.
Chapter Four describes the synthesis and characterization of PBA-β-OH-His- Hin(139-190), a hybrid protein containing the DNA-binding domain of Hin recombinase and the putative iron-binding and oxygen-activating domain of the antitumor antibiotic bleomycin. This 54-residue protein, comprising residues 139-190 of Hin recombinase with the dipeptide pyrimidoblamic acid-β-hydroxy-L-histidine (PBA-β-OH-His) at the NH2 terminus, was synthesized by solid phase methods. PBA-β-OH-His-Hin(139- 190) binds specifically to DNA at four distinct Hin binding sites with affinities comparable to those of the unmodified Hin(139-190). In the presence of dithiothreitol (DTT), Fe•PB-β-OH-His-Hin(139-190) cleaves DNA with specificity remarkably similar to that of Fe•EDTA-Hin(139-190), although with lower efficiency. Analysis of the cleavage pattern suggests that DNA cleavage is mediated through a diffusible species, in contrast with cleavage by bleomycin, which occurs through a non-diffusible oxidant.
Resumo:
RNA interference (RNAi) is a powerful biological pathway allowing for sequence-specific knockdown of any gene of interest. While RNAi is a proven tool for probing gene function in biological circuits, it is limited by being constitutively ON and executes the logical operation: silence gene Y. To provide greater control over post-transcriptional gene silencing, we propose engineering a biological logic gate to implement “conditional RNAi.” Such a logic gate would silence gene Y only upon the expression of gene X, a completely unrelated gene, executing the logic: if gene X is transcribed, silence independent gene Y. Silencing of gene Y could be confined to a specific time and/or tissue by appropriately selecting gene X.
To implement the logic of conditional RNAi, we present the design and experimental validation of three nucleic acid self-assembly mechanisms which detect a sub-sequence of mRNA X and produce a Dicer substrate specific to gene Y. We introduce small conditional RNAs (scRNAs) to execute the signal transduction under isothermal conditions. scRNAs are small RNAs which change conformation, leading to both shape and sequence signal transduction, in response to hybridization to an input nucleic acid target. While all three conditional RNAi mechanisms execute the same logical operation, they explore various design alternatives for nucleic acid self-assembly pathways, including the use of duplex and monomer scRNAs, stable versus metastable reactants, multiple methods of nucleation, and 3-way and 4-way branch migration.
We demonstrate the isothermal execution of the conditional RNAi mechanisms in a test tube with recombinant Dicer. These mechanisms execute the logic: if mRNA X is detected, produce a Dicer substrate targeting independent mRNA Y. Only the final Dicer substrate, not the scRNA reactants or intermediates, is efficiently processed by Dicer. Additional work in human whole-cell extracts and a model tissue-culture system delves into both the promise and challenge of implementing conditional RNAi in vivo.
Resumo:
Yeast chromosomes contain sequences called ARSs which function as origins of replication in vitro and in vivo. We have carried out a systematic deletion analysis of ARS1, allowing us to define three functionally distinct domains, designated A, B, and C. Domain A is a sequence of 11 to 19bp, containing the core consensus element that is required for replication. The core consensus sequence, A/TTTTATPuTTTA/T, is conserved at all ARSs sequenced to date. A fragment containing only element A and 8 flanking nucleotides enables autonomous replication of centromeric plasmids. These plasmids replicate very inefficiently, suggesting that flanking sequences must be important for ARS function. Domain B also provides important sequences needed for efficient replication. Deletion of domain B drastically increases the doubling times of transformants and reduces plasmid stability. Domain B contains a potential consensus sequence conserved at some ARSs which overlaps a region of bent DNA. Mutational analysis suggests this bent DNA may be important for ARS function. Deletion of domain C has only a slight effect on replication of plasmids carrying those deletions.
We have identified a protein called ARS binding factor I (ABF-I) that binds to the HMR-E ARS and ARS1. We have purified this protein to homogeneity using conventional and oligonucleotide affinity chromatography. The protein has an apparent molecular weight of 135kDa and is present at about 700 molecules per diploid cell, based on the yield of purified protein and in situ antibody staining. DNaseI footprinting reveals that ABF-I binds sequence-specifically to an approximately 24bp sequence that overlaps element Bat ARSl. This same protein binds to and protects a similar size region at the HMR-E ARS.
We also find evidence for another ARS binding protein, ABF-III, based on DN asei footprint analysis and gel retardation assays. The protein protects approximately 22bp adjacent to the ABF-I site. There appears to be no interaction between ABF-I and ABF-III despite the proximity of their binding sites.
To address the function of ABF-I in DNA replication, we have cloned the ABF-I gene using rabbit polyclonal anti-sera and murine monoclonal antibodies against ABF-I to screen a λgt11 expression library. Four EcoRI restriction fragments were isolated which encoded proteins that were recognized by both polyclonal and monoclonal antibodies. A gene disruption can now be constructed to determine the in vivo function of ABF-I.
Resumo:
Because so little is known about the structure of membrane proteins, an attempt has been made in this work to develop techniques by which to model them in three dimensions. The procedures devised rely heavily upon the availability of several sequences of a given protein. The modelling procedure is composed of two parts. The first identifies transmembrane regions within the protein sequence on the basis of hydrophobicity, β-turn potential, and the presence of certain amino acid types, specifically, proline and basic residues. The second part of the procedure arranges these transmembrane helices within the bilayer based upon the evolutionary conservation of their residues. Conserved residues are oriented toward other helices and variable residues are positioned to face the surrounding lipids. Available structural information concerning the protein's helical arrangement, including the lengths of interhelical loops, is also taken into account. Rhodopsin, band 3, and the nicotinic acetylcholine receptor have all been modelled using this methodology, and mechanisms of action could be proposed based upon the resulting structures.
Specific residues in the rhodopsin and iodopsin sequences were identified, which may regulate the proteins' wavelength selectivities. A hinge-like motion of helices M3, M4, and M5 with respect to the rest of the protein was proposed to result in the activation of transducin, the G-protein associated with rhodopsin. A similar mechanism is also proposed for signal transduction by the muscarinic acetylcholine and β-adrenergic receptors.
The nicotinic acetylcholine receptor was modelled with four trans-membrane helices per subunit and with the five homologous M2 helices forming the cation channel. Putative channel-lining residues were identified and a mechanism of channel-opening based upon the concerted, tangential rotation of the M2 helices was proposed.
Band 3, the anion exchange protein found in the erythrocyte membrane, was modelled with 14 transmembrane helices. In general the pathway of anion transport can be viewed as a channel composed of six helices that contains a single hydrophobic restriction. This hydrophobic region will not allow the passage of charged species, unless they are part of an ion-pair. An arginine residue located near this restriction is proposed to be responsible for anion transport. When ion-paired with a transportable anion it rotates across the barrier and releases the anion on the other side of the membrane. A similar process returns it to its original position. This proposed mechanism, based on the three-dimensional model, can account for the passive, electroneutral, anion exchange observed for band 3. Dianions can be transported through a similar mechanism with the additional participation of a histidine residue. Both residues are located on M10.
Resumo:
A series of eight related analogs of distamycin A has been synthesized. Footprinting and affinity cleaving reveal that only two of the analogs, pyridine-2- car box amide-netropsin (2-Py N) and 1-methylimidazole-2-carboxamide-netrops in (2-ImN), bind to DNA with a specificity different from that of the parent compound. A new class of sites, represented by a TGACT sequence, is a strong site for 2-PyN binding, and the major recognition site for 2-ImN on DNA. Both compounds recognize the G•C bp specifically, although A's and T's in the site may be interchanged without penalty. Additional A•T bp outside the binding site increase the binding affinity. The compounds bind in the minor groove of the DNA sequence, but protect both grooves from dimethylsulfate. The binding evidence suggests that 2-PyN or 2-ImN binding induces a DNA conformational change.
In order to understand this sequence specific complexation better, the Ackers quantitative footprinting method for measuring individual site affinity constants has been extended to small molecules. MPE•Fe(II) cleavage reactions over a 10^5 range of free ligand concentrations are analyzed by gel electrophoresis. The decrease in cleavage is calculated by densitometry of a gel autoradiogram. The apparent fraction of DNA bound is then calculated from the amount of cleavage protection. The data is fitted to a theoretical curve using non-linear least squares techniques. Affinity constants at four individual sites are determined simultaneously. The distamycin A analog binds solely at A•T rich sites. Affinities range from 10^(6)- 10^(7)M^(-1) The data for parent compound D fit closely to a monomeric binding curve. 2-PyN binds both A•T sites and the TGTCA site with an apparent affinity constant of 10^(5) M^(-1). 2-ImN binds A•T sites with affinities less than 5 x 10^(4) M^(-1). The affinity of 2-ImN for the TGTCA site does not change significantly from the 2-PyN value. At the TGTCA site, the experimental data fit a dimeric binding curve better than a monomeric curve. Both 2-PyN and 2-ImN have substantially lower DNA affinities than closely related compounds.
In order to probe the requirements of this new binding site, fourteen other derivatives have been synthesized and tested. All compounds that recognize the TGTCA site have a heterocyclic aromatic nitrogen ortho to the N or C-terminal amide of the netropsin subunit. Specificity is strongly affected by the overall length of the small molecule. Only compounds that consist of at least three aromatic rings linked by amides exhibit TGTCA site binding. Specificity is only weakly altered by substitution on the pyridine ring, which correlates best with steric factors. A model is proposed for TGTCA site binding that has as its key feature hydrogen bonding to both G's by the small molecule. The specificity is determined by the sequence dependence of the distance between G's.
One derivative of 2-PyN exhibits pH dependent sequence specificity. At low pH, 4-dimethylaminopyridine-2-carboxamide-netropsin binds tightly to A•T sites. At high pH, 4-Me_(2)NPyN binds most tightly to the TGTCA site. In aqueous solution, this compound protonates at the pyridine nitrogen at pH 6. Thus presence of the protonated form correlates with A•T specificity.
The binding site of a class of eukaryotic transcriptional activators typified by yeast protein GCN4 and the mammalian oncogene Jun contains a strong 2-ImN binding site. Specificity requirements for the protein and small molecule are similar. GCN4 and 2-lmN bind simultaneously to the same binding site. GCN4 alters the cleavage pattern of 2-ImN-EDTA derivative at only one of its binding sites. The details of the interaction suggest that GCN4 alters the conformation of an AAAAAAA sequence adjacent to its binding site. The presence of a yeast counterpart to Jun partially blocks 2-lmN binding. The differences do not appear to be caused by direct interactions between 2-lmN and the proteins, but by induced conformational changes in the DNA protein complex. It is likely that the observed differences in complexation are involved in the varying sequence specificity of these proteins.
Resumo:
Signal recognition particle (SRP) and signal recognition particle receptor (SR) are evolutionarily conserved GTPases that deliver secretory and membrane proteins to the protein-conducting channel Sec61 complex in the lipid bilayer of the endoplasmic reticulum in eukaryotes or the SecYEG complex in the inner membrane of bacteria. Unlike the canonical Ras-type GTPases, SRP and SR are activated via nucleotide-dependent heterodimerization. Upon formation of the SR•SRP targeting complex, SRP and SR undergo a series of discrete conformational changes that culminate in their reciprocal activation and hydrolysis of GTP. How the SR•SRP GTPase cycle is regulated and coupled to the delivery of the cargo protein to the protein-conducting channel at the target membrane is not well-understood. Here we examine the role of the lipid bilayer and SecYEG in regulation of the SRP-mediated protein targeting pathway and show that they serve as important biological cues that spatially control the targeting reaction.
In the first chapter, we show that anionic phospholipids of the inner membrane activate the bacterial SR, FtsY, and favor the late conformational states of the targeting complex conducive to efficient unloading of the cargo. The results of our studies suggest that the lipid bilayer acts as a spatial cue that weakens the interaction of the cargo protein with SRP and primes the complex for unloading its cargo onto SecYEG.
In the second chapter, we focus on the effect of SecYEG on the conformational states and activity of the targeting complex. While phospholipids prime the complex for unloading its cargo, they are insufficient to trigger hydrolysis of GTP and the release of the cargo from the complex. SecYEG modulates the conformation of the targeting complex and triggers the GTP hydrolysis from the complex, thus driving the targeting reaction to completion. The results of this study suggest that SecYEG is not a passive recipient of the cargo protein; rather, it actively releases the cargo from the targeting complex. Together, anionic phospholipids and SecYEG serve distinct yet complementary roles. They spatially control the targeting reaction in a sequential manner, ensuring efficient delivery and unloading of the cargo protein.
In the third chapter, we reconstitute the transfer reaction in vitro and visualize it in real time. We show that the ribosome-nascent chain complex is transferred to SecYEG via a stepwise mechanism with gradual dissolution and formation of the contacts with SRP and SecYEG, respectively, explaining how the cargo is kept tethered to the membrane during the transfer and how its loss to the cytosol is avoided.
In the fourth chapter, we examine interaction of SecYEG with secretory and membrane proteins and attempt to address the role of a novel insertase YidC in this interaction. We show that detergent-solubilized SecYEG is capable of discriminating between the nascent chains of various lengths and engages a signal sequence in a well-defined conformation in the absence of accessory factors. Further, YidC alters the conformation of the signal peptide bound to SecYEG. The results described in this chapter show that YidC affects the SecYEG-nascent chain interaction at early stages of translocation/insertion and suggest a YidC-facilitated mechanism for lateral exit of transmembrane domains from SecYEG into the lipid bilayer.
Resumo:
Understanding how transcriptional regulatory sequence maps to regulatory function remains a difficult problem in regulatory biology. Given a particular DNA sequence for a bacterial promoter region, we would like to be able to say which transcription factors bind there, how strongly they bind, and whether they interact with each other and/or RNA polymerase, with the ultimate objective of integrating knowledge of these parameters into a prediction of gene expression levels. The theoretical framework of statistical thermodynamics provides a useful framework for doing so, enabling us to predict how gene expression levels depend on transcription factor binding energies and concentrations. We used thermodynamic models, coupled with models of the sequence-dependent binding energies of transcription factors and RNAP, to construct a genotype to phenotype map for the level of repression exhibited by the lac promoter, and tested it experimentally using a set of promoter variants from E. coli strains isolated from different natural environments. For this work, we sought to ``reverse engineer'' naturally occurring promoter sequences to understand how variations in promoter sequence affects gene expression. The natural inverse of this approach is to ``forward engineer'' promoter sequences to obtain targeted levels of gene expression. We used a high precision model of RNAP-DNA sequence dependent binding energy, coupled with a thermodynamic model relating binding energy to gene expression, to predictively design and verify a suite of synthetic E. coli promoters whose expression varied over nearly three orders of magnitude.
However, although thermodynamic models enable predictions of mean levels of gene expression, it has become evident that cell-to-cell variability or ``noise'' in gene expression can also play a biologically important role. In order to address this aspect of gene regulation, we developed models based on the chemical master equation framework and used them to explore the noise properties of a number of common E. coli regulatory motifs; these properties included the dependence of the noise on parameters such as transcription factor binding strength and copy number. We then performed experiments in which these parameters were systematically varied and measured the level of variability using mRNA FISH. The results showed a clear dependence of the noise on these parameters, in accord with model predictions.
Finally, one shortcoming of the preceding modeling frameworks is that their applicability is largely limited to systems that are already well-characterized, such as the lac promoter. Motivated by this fact, we used a high throughput promoter mutagenesis assay called Sort-Seq to explore the completely uncharacterized transcriptional regulatory DNA of the E. coli mechanosensitive channel of large conductance (MscL). We identified several candidate transcription factor binding sites, and work is continuing to identify the associated proteins.
Resumo:
Several different methods have been employed in the study of voltage-gated ion channels. Electrophysiological studies on excitable cells in vertebrates and molluscs have shown that many different voltage-gated potassium (K+) channels and sodium channels may coexist in the same organism. Parallel genetic studies in Drosophila have identified mutations in several genes that alter the properties of specific subsets of physiologically identified ion channels. Chapter 2 describes molecular studies that identify two Drosophila homologs of vertebrate sodium-channel genes. Mutations in one of these Drosophila sodium-channel genes are shown to be responsible for the temperature-dependent paralysis of a behavioural mutant parats. Evolutionary arguments, based on the partial sequences of the two Drosophila genes, suggest that subfamilies of voltage-gated sodium channels in vertebrates remain to be identified.
In Drosophila, diverse voltage-gated K+ channels arise from alternatively spliced mRNAs generated at the Shaker locus. Chapter 3 and the Appendices describe the isolation and characterization of several human K+-channel genes, similar in sequence to Shaker. Each of these human genes has a highly conserved homolog in rodents; thus, this K+-channel gene family probably diversified prior to the mammalian radiation. Functional K+ channels encoded by these genes have been expressed in Xenopus oocytes and their properties have been analyzed by electrophysiological methods. These studies demonstrate that both transient and noninactivating voltage-gated K+ channels may be encoded by mammalian genes closely related to Shaker. In addition, results presented in Appendix 3 clearly demonstrate that independent gene products from two K+-channel genes may efficiently co-assemble into heterooligomeric K+ channels with properties distinct from either homomultimeric channel. This finding suggests yet another molecular mechanism for the generation of K+-channel diversity.
Resumo:
G-protein coupled receptors (GPCRs) form a large family of proteins and are very important drug targets. They are membrane proteins, which makes computational prediction of their structure challenging. Homology modeling is further complicated by low sequence similarly of the GPCR superfamily.
In this dissertation, we analyze the conserved inter-helical contacts of recently solved crystal structures, and we develop a unified sequence-structural alignment of the GPCR superfamily. We use this method to align 817 human GPCRs, 399 of which are nonolfactory. This alignment can be used to generate high quality homology models for the 817 GPCRs.
To refine the provided GPCR homology models we developed the Trihelix sampling method. We use a multi-scale approach to simplify the problem by treating the transmembrane helices as rigid bodies. In contrast to Monte Carlo structure prediction methods, the Trihelix method does a complete local sampling using discretized coordinates for the transmembrane helices. We validate the method on existing structures and apply it to predict the structure of the lactate receptor, HCAR1. For this receptor, we also build extracellular loops by taking into account constraints from three disulfide bonds. Docking of lactate and 3,5-dihydroxybenzoic acid shows likely involvement of three Arg residues on different transmembrane helices in binding a single ligand molecule.
Protein structure prediction relies on accurate force fields. We next present an effort to improve the quality of charge assignment for large atomic models. In particular, we introduce the formalism of the polarizable charge equilibration scheme (PQEQ) and we describe its implementation in the molecular simulation package Lammps. PQEQ allows fast on the fly charge assignment even for reactive force fields.
Resumo:
Transcription factor p53 is the most commonly altered gene in human cancer. As a redox-active protein in direct contact with DNA, p53 can directly sense oxidative stress through DNA-mediated charge transport. Electron hole transport occurs with a shallow distance dependence over long distances through the π-stacked DNA bases, leading to the oxidation and dissociation of DNA-bound p53. The extent of p53 dissociation depends upon the redox potential of the response element DNA in direct contact with each p53 monomer. The DNA sequence dependence of p53 oxidative dissociation was examined by electrophoretic mobility shift assays using radiolabeled oligonucleotides containing both synthetic and human p53 response elements with an appended anthraquinone photooxidant. Greater p53 dissociation is observed from DNA sequences containing low redox potential purine regions, particularly guanine triplets, within the p53 response element. Using denaturing polyacrylamide gel electrophoresis of irradiated anthraquinone-modified DNA, the DNA damage sites, which correspond to locations of preferred electron hole localization, were determined. The resulting DNA damage preferentially localizes to guanine doublets and triplets within the response element. Oxidative DNA damage is inhibited in the presence of p53, however, only at DNA sites within the response element, and therefore in direct contact with p53. From these data, predictions about the sensitivity of human p53-binding sites to oxidative stress, as well as possible biological implications, have been made. On the basis of our data, the guanine pattern within the purine region of each p53-binding site determines the response of p53 to DNA-mediated oxidation, yielding for some sequences the oxidative dissociation of p53 from a distance and thereby providing another potential role for DNA charge transport chemistry within the cell.
To determine whether the change in p53 response element occupancy observed in vitro also correlates in cellulo, chromatin immunoprecipition (ChIP) and quantitative PCR (qPCR) were used to directly quantify p53 binding to certain response elements in HCT116N cells. The HCT116N cells containing a wild type p53 were treated with the photooxidant [Rh(phi)2bpy]3+, Nutlin-3 to upregulate p53, and subsequently irradiated to induce oxidative genomic stress. To covalently tether p53 interacting with DNA, the cells were fixed with disuccinimidyl glutarate and formaldehyde. The nuclei of the harvested cells were isolated, sonicated, and immunoprecipitated using magnetic beads conjugated with a monoclonal p53 antibody. The purified immounoprecipiated DNA was then quantified via qPCR and genomic sequencing. Overall, the ChIP results were significantly varied over ten experimental trials, but one trend is observed overall: greater variation of p53 occupancy is observed in response elements from which oxidative dissociation would be expected, while significantly less change in p53 occupancy occurs for response elements from which oxidative dissociation would not be anticipated.
The chemical oxidation of transcription factor p53 via DNA CT was also investigated with respect to the protein at the amino acid level. Transcription factor p53 plays a critical role in the cellular response to stress stimuli, which may be modulated through the redox modulation of conserved cysteine residues within the DNA-binding domain. Residues within p53 that enable oxidative dissociation are herein investigated. Of the 8 mutants studied by electrophoretic mobility shift assay (EMSA), only the C275S mutation significantly decreased the protein affinity (KD) for the Gadd45 response element. EMSA assays of p53 oxidative dissociation promoted by photoexcitation of anthraquinone-tethered Gadd45 oligonucleotides were used to determine the influence of p53 mutations on oxidative dissociation; mutation to C275S severely attenuates oxidative dissociation while C277S substantially attenuates dissociation. Differential thiol labeling was used to determine the oxidation states of cysteine residues within p53 after DNA-mediated oxidation. Reduced cysteines were iodoacetamide labeled, while oxidized cysteines participating in disulfide bonds were 13C2D2-iodoacetamide labeled. Intensities of respective iodoacetamide-modified peptide fragments were analyzed using a QTRAP 6500 LC-MS/MS system, quantified with Skyline, and directly compared. A distinct shift in peptide labeling toward 13C2D2-iodoacetamide labeled cysteines is observed in oxidized samples as compared to the respective controls. All of the observable cysteine residues trend toward the heavy label under conditions of DNA CT, indicating the formation of multiple disulfide bonds potentially among the C124, C135, C141, C182, C275, and C277. Based on these data it is proposed that disulfide formation involving C275 is critical for inducing oxidative dissociation of p53 from DNA.
Resumo:
Interleukin 2 (IL2) is the primary growth hormone used by mature T cells and this lymphokine plays an important role in the magnification of cell-mediated immune responses. Under normal circumstances its expression is limited to antigen-activated type 1 helper T cells (TH1) and the ability to transcribe this gene is often regarded as evidence for commitment to this developmental lineage. There is, however, abundant evidence than many non-TH1 T cells, under appropriate conditions, possess the ability to express this gene. Of paramount interest in the study of T-cell development is the mechanisms by which differentiating thymocytes are endowed with particular combinations of cell surface proteins and response repertoires. For example, why do most helper T cells express the CD4 differentiation antigen?
As a first step in understanding these developmental processes the gene encoding IL2 was isolated from a mouse genomic library by probing with a conspecific IL2 cDNA. The sequence of the 5' flanking region from + 1 to -2800 was determined and compared to the previously reported human sequence. Extensive identity exists between +1 and -580 (86%) and sites previously shown to be crucial for the proper expression of the human gene are well conserved in both sequence location in the mouse counterpart.
Transient expression assays were used to evaluate the contribution of various genomic sequences to high-level gene expression mediated by a cloned IL2 promoter fragment. Differing lengths of 5' flanking DNA, all terminating in the 5' untranslated region, were linked to a reporter gene, bacterial chloramphenicol acetyltransferase (CAT) and enzyme activity was measured after introduction into IL2-producing cell lines. No CAT was ever detected without stimulation of the recipient cells. A cloned promoter fragment containing only 321 bp of upstream DNA was expressed well in both Jurkat and EL4.El cells. Addition of intragenic or downstream DNA to these 5' IL2-CAT constructs showed that no obvious regulatory regions resided there. However, increasing the extent of 5' DNA from -321 to -2800 revealed several positive and negative regulatory elements. One negative region that was well characterized resided between -750 and -1000 and consisted almost exclusively of alternating purine and pyrimidines. There is no sequence resembling this in the human gene now, but there is evidence that there may have once been.
No region, when deleted, could relax either the stringent induction-dependence on cell-type specificity displayed by this promoter. Reagents that modulated endogenous IL2 expression, such as cAMP, cyclosporin A, and IL1, affected expression of the 5' IL2-CAT constructs also. For a given reagent, expression from all expressible constructs was suppressed or enhanced to the same extent. This suggests that these modulators affect IL2 expression through perturbation of a central inductive signal rather than by summation of the effects of discrete, independently regulated, negative and positive transcription factors.
Resumo:
G protein-coupled receptors (GPCRs) are the largest family of proteins within the human genome. They consist of seven transmembrane (TM) helices, with a N-terminal region of varying length and structure on the extracellular side, and a C-terminus on the intracellular side. GPCRs are involved in transmitting extracellular signals to cells, and as such are crucial drug targets. Designing pharmaceuticals to target GPCRs is greatly aided by full-atom structural information of the proteins. In particular, the TM region of GPCRs is where small molecule ligands (much more bioavailable than peptide ligands) typically bind to the receptors. In recent years nearly thirty distinct GPCR TM regions have been crystallized. However, there are more than 1,000 GPCRs, leaving the vast majority of GPCRs with limited structural information. Additionally, GPCRs are known to exist in a myriad of conformational states in the body, rendering the static x-ray crystal structures an incomplete reflection of GPCR structures. In order to obtain an ensemble of GPCR structures, we have developed the GEnSeMBLE procedure to rapidly sample a large number of variations of GPCR helix rotations and tilts. The lowest energy GEnSeMBLE structures are then docked to small molecule ligands and optimized. The GPCR family consists of five subfamilies with little to no sequence homology between them: class A, B1, B2, C, and Frizzled/Taste2. Almost all of the GPCR crystal structures have been of class A GPCRs, and much is known about their conserved interactions and binding sites. In this work we particularly focus on class B1 GPCRs, and aim to understand that family’s interactions and binding sites both to small molecules and their native peptide ligands. Specifically, we predict the full atom structure and peptide binding site of the glucagon-like peptide receptor and the TM region and small molecule binding sites for eight other class B1 GPCRs: CALRL, CRFR1, GIPR, GLR, PACR, PTH1R, VIPR1, and VIPR2. Our class B1 work reveals multiple conserved interactions across the B1 subfamily as well as a consistent small molecule binding site centrally located in the TM bundle. Both the interactions and the binding sites are distinct from those seen in the more well-characterized class A GPCRs, and as such our work provides a strong starting point for drug design targeting class B1 proteins. We also predict the full structure of CXCR4 bound to a small molecule, a class A GPCR that was not closely related to any of the class A GPCRs at the time of the work.