9 results for cDNA sequence
in CaltechTHESIS
Abstract:
DNA recognition is an essential biological process responsible for the regulation of cellular functions including protein synthesis and cell division and is implicated in the mechanism of action of some anticancer drugs. Studies directed towards defining the elements responsible for sequence specific DNA recognition through the study of the interactions of synthetic organic ligands with DNA are described.
DNA recognition by poly-N-methylpyrrolecarboxamides was studied by the synthesis and characterization of a series of molecules where the number of contiguous N-methylpyrrolecarboxamide units was increased from 2 to 9. The effect of this incremental change in structure on DNA recognition has been investigated at base pair resolution using affinity cleaving and MPE•Fe(II) footprinting techniques. These studies led to a quantitative relationship between the number of amides in the molecule and the DNA binding site size. This relationship is called the n + 1 rule and it states that a poly-N-methylpyrrolecarboxamide molecule with n amides will bind n + 1 base pairs of DNA. This rule is consistent with a model where the carboxamides of these compounds form three-center bridging hydrogen bonds between adjacent base pairs on opposite strands of the helix. The poly-N-methylpyrrolecarboxamide recognition element was found to preferentially bind poly dA•poly dT stretches; however, both binding site selection and orientation were found to be affected by flanking sequences. Cleavage of large DNA is also described.
One approach towards the design of molecules that bind large sequences of double helical DNA sequence specifically is to couple DNA binding subunits of similar or diverse base pair specificity. Bis-EDTA-distamycin-fumaramide (BEDF) is an octaamide dimer of two tri-N-methylpyrrolecarboxamide subunits linked by fumaramide. DNA recognition by BEDF was compared to P7E, an octaamide molecule containing seven consecutive pyrroles. These two compounds were found to recognize the same sites on pBR322 with approximately the same affinities, demonstrating that fumaramide is an effective linking element for N-methylpyrrolecarboxamide recognition subunits. Further studies involved the synthesis and characterization of a trimer of tetra-N-methylpyrrolecarboxamide subunits linked by β-alanine ((P4)_(3)E). This trimerization produced a molecule which is capable of recognizing 16 base pairs of A•T DNA, more than a turn and a half of the DNA helix.
DNA footprinting is a powerful direct method for determining the binding sites of proteins and small molecules on heterogeneous DNA. It was found that attachment of EDTA•Fe(II) to spermine creates a molecule, SE•Fe(II), which binds and cleaves DNA sequence neutrally. This lack of specificity provides evidence that at the nucleotide level polyamines recognize heterogeneous DNA independent of sequence and allows SE•Fe(II) to be used as a footprinting reagent. SE•Fe(II) was compared with two other small molecule footprinting reagents, EDTA•Fe(II) and MPE•Fe(II).
Abstract:
The Drosophila compound eye has provided a genetic approach to understanding the specification of cell fates during differentiation. The eye is made up of some 750 repeated units or ommatidia, arranged in a lattice. The cellular composition of each ommatidium is identical. The arrangement of the lattice and the specification of cell fates in each ommatidium are thought to occur in development through cellular interactions with the local environment. Many mutations have been studied that disrupt the proper patterning and cell fating in the eye. The eyes absent (eya) mutation, the subject of this thesis, was chosen because of its eyeless phenotype. In eya mutants, eye progenitor cells undergo programmed cell death before the onset of patterning has occurred. The molecular genetic analysis of the gene is presented.
The eye arises from the larval eye-antennal imaginal disc. During the third larval instar, a wave of differentiation progresses across the disc, marked by a furrow. Anterior to the furrow, proliferating cells are found in apparent disarray. Posterior to the furrow, clusters of differentiating cells can be discerned, that correspond to the ommatidia of the adult eye. Analysis of an allelic series of eya mutants in comparison to wild type revealed the presence of a selection point: a wave of programmed cell death that normally precedes the furrow. In eya mutants, an excessive number of eye progenitor cells die at this selection point, suggesting the eya gene influences the distribution of cells between fates of death and differentiation.
In addition to its role in the eye, the eya gene has an embryonic function. The eye function is autonomous to the eye progenitor cells. Molecular maps of the eye and embryonic phenotypes are different. Therefore, the function of eya in the eye can be treated independently of the embryonic function. Cloning of the gene reveals two cDNAs that are identical except for the use of an alternatively spliced 5' exon. The predicted protein products differ only at the N-termini. Sequence analysis shows these two proteins to be the first of their kind to be isolated. Transgenic studies using the two cDNAs show that either gene product is able to rescue the eye phenotype of eya mutants.
The eya gene exhibits interallelic complementation. This interaction is an example of an "allelic position effect": an interaction that depends on the relative position in the genome of the two alleles, which is thought to be mediated by chromosomal pairing. The interaction at eya is essentially identical to a phenomenon known as transvection, which is an allelic position effect that is sensitive to certain kinds of chromosomal rearrangements. A current model for the mechanism of transvection is the trans action of gene regulatory regions. The eya locus is particularly well suited for the study of transvection because the mutant phenotypes can be quantified by scoring the size of the eye.
The molecular genetic analysis of eya provides a system for uncovering mechanisms underlying differentiation, developmentally regulated programmed cell death, and gene regulation.
Abstract:
This thesis describes research pursued in two areas, both involving the design and synthesis of sequence specific DNA-cleaving proteins. The first involves the use of sequence-specific DNA-cleaving metalloproteins to probe the structure of a protein-DNA complex, and the second seeks to develop cleaving moieties capable of DNA cleavage through the generation of a non-diffusible oxidant under physiological conditions.
Chapter One provides a brief review of the literature concerning sequence-specific DNA-binding proteins. Chapter Two summarizes the results of affinity cleaving experiments using leucine zipper-basic region (bZip) DNA-binding proteins. Specifically, the NH_2-terminal locations of a dimer containing the DNA binding domain of the yeast transcriptional activator GCN4 were mapped on the binding sites 5'-CTGACTAAT-3' and 5'-ATGACTCTT-3' using affinity cleaving. Analysis of the DNA cleavage patterns from Fe•EDTA-GCN4(222-281) and (226-281) dimers reveals that the NH_2-termini are in the major groove nine to ten base pairs apart and symmetrically displaced four to five base pairs from the central C of the recognition site. These data are consistent with structural models put forward for this class of DNA binding proteins. The results of these experiments are evaluated in light of the recently published crystal structure for the GCN4-DNA complex. Preliminary investigations of affinity cleaving proteins based on the DNA-binding domains of the bZip proteins Jun and Fos are also described.
Chapter Three describes experiments demonstrating the simultaneous binding of GCN4(226-281) and 1-Methylimidazole-2-carboxamide-netropsin (2-ImN), a designed synthetic peptide which binds in the minor groove of DNA at 5'-TGACT-3' sites as an antiparallel, side-by-side dimer. Through the use of Fe•EDTA-GCN4(226-281) as a sequence-specific footprinting agent, it is shown that the dimeric protein GCN4(226-281) and the dimeric peptide 2-ImN can simultaneously occupy their common binding site in the major and minor grooves of DNA, respectively. The association constants for 2-ImN in the presence and in the absence of Fe•EDTA-GCN4(226-281) are found to be similar, suggesting that the binding of the two dimers is not cooperative.
Chapter Four describes the synthesis and characterization of PBA-β-OH-His-Hin(139-190), a hybrid protein containing the DNA-binding domain of Hin recombinase and the putative iron-binding and oxygen-activating domain of the antitumor antibiotic bleomycin. This 54-residue protein, comprising residues 139-190 of Hin recombinase with the dipeptide pyrimidoblamic acid-β-hydroxy-L-histidine (PBA-β-OH-His) at the NH_2 terminus, was synthesized by solid phase methods. PBA-β-OH-His-Hin(139-190) binds specifically to DNA at four distinct Hin binding sites with affinities comparable to those of the unmodified Hin(139-190). In the presence of dithiothreitol (DTT), Fe•PBA-β-OH-His-Hin(139-190) cleaves DNA with specificity remarkably similar to that of Fe•EDTA-Hin(139-190), although with lower efficiency. Analysis of the cleavage pattern suggests that DNA cleavage is mediated through a diffusible species, in contrast with cleavage by bleomycin, which occurs through a non-diffusible oxidant.
Abstract:
RNA interference (RNAi) is a powerful biological pathway allowing for sequence-specific knockdown of any gene of interest. While RNAi is a proven tool for probing gene function in biological circuits, it is limited by being constitutively ON and executes the logical operation: silence gene Y. To provide greater control over post-transcriptional gene silencing, we propose engineering a biological logic gate to implement “conditional RNAi.” Such a logic gate would silence gene Y only upon the expression of gene X, a completely unrelated gene, executing the logic: if gene X is transcribed, silence independent gene Y. Silencing of gene Y could be confined to a specific time and/or tissue by appropriately selecting gene X.
To implement the logic of conditional RNAi, we present the design and experimental validation of three nucleic acid self-assembly mechanisms which detect a sub-sequence of mRNA X and produce a Dicer substrate specific to gene Y. We introduce small conditional RNAs (scRNAs) to execute the signal transduction under isothermal conditions. scRNAs are small RNAs which change conformation, leading to both shape and sequence signal transduction, in response to hybridization to an input nucleic acid target. While all three conditional RNAi mechanisms execute the same logical operation, they explore various design alternatives for nucleic acid self-assembly pathways, including the use of duplex and monomer scRNAs, stable versus metastable reactants, multiple methods of nucleation, and 3-way and 4-way branch migration.
We demonstrate the isothermal execution of the conditional RNAi mechanisms in a test tube with recombinant Dicer. These mechanisms execute the logic: if mRNA X is detected, produce a Dicer substrate targeting independent mRNA Y. Only the final Dicer substrate, not the scRNA reactants or intermediates, is efficiently processed by Dicer. Additional work in human whole-cell extracts and a model tissue-culture system delves into both the promise and challenge of implementing conditional RNAi in vivo.
Abstract:
The sea urchin embryonic skeleton, or spicule, is deposited by mesenchymal progeny of four precursor cells, the micromeres, which are determined to the skeletogenic pathway by a process known as cytoplasmic localization. A gene encoding one of the major products of the skeletogenic mesenchyme, a prominent 50 kD protein of the spicule matrix, has been characterized in detail. cDNA clones were first isolated by antibody screening of a phage expression library, followed by isolation of homologous genomic clones. The gene, known as SM50, is single copy in the sea urchin genome, is divided into two exons of 213 and 1682 bp, and is expressed only in skeletogenic cells. Transcripts are first detectable at the 120 cell stage, shortly after the segregation of the skeletogenic precursors from the rest of the embryo. The SM50 open reading frame begins within the first exon, is 450 amino acids in length, and contains a loosely repeated 13 amino acid motif rich in acidic residues which accounts for 45% of the protein and which is possibly involved in interaction with the mineral phase of the spicule.
The important cis-acting regions of the SM50 gene necessary for proper regulation of expression were identified by gene transfer experiments. A 562 bp promoter fragment, containing 438 bp of 5' promoter sequence and 124 bp of the SM50 first exon (including the SM50 initiation codon), was both necessary and sufficient to direct high levels of expression of the bacterial chloramphenicol acetyltransferase (CAT) reporter gene specifically in the skeletogenic cells. Removal of promoter sequences between positions -2200 and -438, and of transcribed regions downstream of +124 (including the SM50 intron), had no effect on the spatial or transcriptional activity of the transgenes.
Regulatory proteins that interact with the SM50 promoter were identified by the gel retardation assay, using bulk embryo mesenchyme blastula stage nuclear proteins. Five protein binding sites were identified and mapped to various degrees of resolution. Two sites are homologous, may be enhancer elements, and at least one is required for expression. Two additional sites are also present in the promoter of the aboral ectoderm-specific cytoskeletal actin gene CyIIIa; one of these is a CCAAT element, the other a putative repressor element. The fifth site overlaps the binding site of the putative repressor and may function as a positive regulator by interfering with binding of the repressor. All of the proteins are detectable in nuclear extracts prepared from 64 cell stage embryos, a stage just before expression of SM50 is initiated, as well as from blastula and gastrula stage; the putative enhancer binding protein may be maternal as well.
Abstract:
A series of eight related analogs of distamycin A has been synthesized. Footprinting and affinity cleaving reveal that only two of the analogs, pyridine-2-carboxamide-netropsin (2-PyN) and 1-methylimidazole-2-carboxamide-netropsin (2-ImN), bind to DNA with a specificity different from that of the parent compound. A new class of sites, represented by a TGACT sequence, is a strong site for 2-PyN binding, and the major recognition site for 2-ImN on DNA. Both compounds recognize the G•C bp specifically, although A's and T's in the site may be interchanged without penalty. Additional A•T bp outside the binding site increase the binding affinity. The compounds bind in the minor groove of the DNA sequence, but protect both grooves from dimethylsulfate. The binding evidence suggests that 2-PyN or 2-ImN binding induces a DNA conformational change.
In order to understand this sequence specific complexation better, the Ackers quantitative footprinting method for measuring individual site affinity constants has been extended to small molecules. MPE•Fe(II) cleavage reactions over a 10^5 range of free ligand concentrations are analyzed by gel electrophoresis. The decrease in cleavage is calculated by densitometry of a gel autoradiogram. The apparent fraction of DNA bound is then calculated from the amount of cleavage protection. The data are fitted to a theoretical curve using non-linear least squares techniques. Affinity constants at four individual sites are determined simultaneously. The distamycin A analog binds solely at A•T rich sites. Affinities range from 10^(6) to 10^(7) M^(-1). The data for parent compound D fit closely to a monomeric binding curve. 2-PyN binds both A•T sites and the TGTCA site with an apparent affinity constant of 10^(5) M^(-1). 2-ImN binds A•T sites with affinities less than 5 x 10^(4) M^(-1). The affinity of 2-ImN for the TGTCA site does not change significantly from the 2-PyN value. At the TGTCA site, the experimental data fit a dimeric binding curve better than a monomeric curve. Both 2-PyN and 2-ImN have substantially lower DNA affinities than closely related compounds.
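The comparison of monomeric and dimeric binding curves described above can be sketched in a few lines. This is only an illustration under stated assumptions: the Hill-type form θ = (K·L)^n / (1 + (K·L)^n), with n = 1 for a monomeric curve and n = 2 for a cooperative dimeric curve, is assumed here and is not taken from the thesis; the function names are hypothetical, and a crude grid search over log10(K) stands in for a real non-linear least squares routine.

```python
import math

def fraction_bound(ligand_conc, k_assoc, n=1):
    """Assumed Hill-type isotherm: fraction of sites occupied at a free
    ligand concentration (M) for an apparent association constant (M^-1)."""
    x = (k_assoc * ligand_conc) ** n
    return x / (1.0 + x)

def sum_sq_residuals(concs, observed, k_assoc, n):
    """Sum of squared differences between model and observed occupancy."""
    return sum((fraction_bound(c, k_assoc, n) - y) ** 2
               for c, y in zip(concs, observed))

def fit_k(concs, observed, n, log10_k_min=3.0, log10_k_max=9.0, steps=601):
    """Grid search over log10(K); returns (best K, its residual)."""
    best_k, best_err = None, math.inf
    for i in range(steps):
        log_k = log10_k_min + (log10_k_max - log10_k_min) * i / (steps - 1)
        k = 10.0 ** log_k
        err = sum_sq_residuals(concs, observed, k, n)
        if err < best_err:
            best_k, best_err = k, err
    return best_k, best_err
```

Fitting the same occupancy data once with `n=1` and once with `n=2` and comparing the two residuals is one simple way to ask which curve describes a site better.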
In order to probe the requirements of this new binding site, fourteen other derivatives have been synthesized and tested. All compounds that recognize the TGTCA site have a heterocyclic aromatic nitrogen ortho to the N or C-terminal amide of the netropsin subunit. Specificity is strongly affected by the overall length of the small molecule. Only compounds that consist of at least three aromatic rings linked by amides exhibit TGTCA site binding. Specificity is only weakly altered by substitution on the pyridine ring, which correlates best with steric factors. A model is proposed for TGTCA site binding that has as its key feature hydrogen bonding to both G's by the small molecule. The specificity is determined by the sequence dependence of the distance between G's.
One derivative of 2-PyN exhibits pH dependent sequence specificity. At low pH, 4-dimethylaminopyridine-2-carboxamide-netropsin binds tightly to A•T sites. At high pH, 4-Me_(2)NPyN binds most tightly to the TGTCA site. In aqueous solution, this compound protonates at the pyridine nitrogen at pH 6. Thus presence of the protonated form correlates with A•T specificity.
The binding site of a class of eukaryotic transcriptional activators typified by yeast protein GCN4 and the mammalian oncogene Jun contains a strong 2-ImN binding site. Specificity requirements for the protein and small molecule are similar. GCN4 and 2-ImN bind simultaneously to the same binding site. GCN4 alters the cleavage pattern of the 2-ImN-EDTA derivative at only one of its binding sites. The details of the interaction suggest that GCN4 alters the conformation of an AAAAAAA sequence adjacent to its binding site. The presence of a yeast counterpart to Jun partially blocks 2-ImN binding. The differences do not appear to be caused by direct interactions between 2-ImN and the proteins, but by induced conformational changes in the DNA-protein complex. It is likely that the observed differences in complexation are involved in the varying sequence specificity of these proteins.
Abstract:
Understanding how transcriptional regulatory sequence maps to regulatory function remains a difficult problem in regulatory biology. Given a particular DNA sequence for a bacterial promoter region, we would like to be able to say which transcription factors bind there, how strongly they bind, and whether they interact with each other and/or RNA polymerase, with the ultimate objective of integrating knowledge of these parameters into a prediction of gene expression levels. Statistical thermodynamics provides a useful theoretical framework for doing so, enabling us to predict how gene expression levels depend on transcription factor binding energies and concentrations. We used thermodynamic models, coupled with models of the sequence-dependent binding energies of transcription factors and RNAP, to construct a genotype to phenotype map for the level of repression exhibited by the lac promoter, and tested it experimentally using a set of promoter variants from E. coli strains isolated from different natural environments. For this work, we sought to "reverse engineer" naturally occurring promoter sequences to understand how variations in promoter sequence affect gene expression. The natural inverse of this approach is to "forward engineer" promoter sequences to obtain targeted levels of gene expression. We used a high precision model of RNAP-DNA sequence dependent binding energy, coupled with a thermodynamic model relating binding energy to gene expression, to predictively design and verify a suite of synthetic E. coli promoters whose expression varied over nearly three orders of magnitude.
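As a rough illustration of how a thermodynamic model ties binding energy and copy number to expression, the sketch below uses the commonly quoted simple-repression expression, fold-change = 1 / (1 + (R / N_NS)·exp(−Δε / k_BT)), in the weak-promoter limit. The choice of this particular formula, the function name, and the value used for the number of nonspecific sites are assumptions made for illustration; the abstract does not state which expressions the thesis used.

```python
import math

# Assumed number of nonspecific binding sites, taken as roughly the
# E. coli genome length in base pairs (an illustrative choice).
N_NS = 4.6e6

def fold_change(repressors, binding_energy_kT, n_ns=N_NS):
    """Simple-repression fold-change in the weak-promoter limit.

    repressors: repressor copy number per cell.
    binding_energy_kT: repressor-operator binding energy relative to
        nonspecific DNA, in units of k_B*T (more negative = tighter).
    """
    weight = (repressors / n_ns) * math.exp(-binding_energy_kT)
    return 1.0 / (1.0 + weight)
```

In this picture, either raising the repressor copy number or strengthening the operator (a more negative binding energy) lowers the predicted fold-change.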
However, although thermodynamic models enable predictions of mean levels of gene expression, it has become evident that cell-to-cell variability or "noise" in gene expression can also play a biologically important role. In order to address this aspect of gene regulation, we developed models based on the chemical master equation framework and used them to explore the noise properties of a number of common E. coli regulatory motifs; these properties included the dependence of the noise on parameters such as transcription factor binding strength and copy number. We then performed experiments in which these parameters were systematically varied and measured the level of variability using mRNA FISH. The results showed a clear dependence of the noise on these parameters, in accord with model predictions.
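One standard master-equation result of this kind is the steady-state noise of a two-state ("telegraph") promoter, sketched below. The model choice and the parameter names are assumptions for illustration rather than the specific motifs analyzed in the thesis: the promoter switches between an inactive and an active state, transcribes at rate `r` only when active, and each mRNA decays at rate `gamma`.

```python
def telegraph_moments(k_on, k_off, r, gamma):
    """Steady-state mean and Fano factor (variance/mean) of mRNA copy
    number for a two-state promoter.

    k_on:  rate of switching inactive -> active
    k_off: rate of switching active -> inactive
    r:     transcription rate in the active state
    gamma: mRNA degradation rate
    """
    switching = k_on + k_off
    p_active = k_on / switching
    mean = p_active * r / gamma
    fano = 1.0 + r * k_off / (switching * (switching + gamma))
    return mean, fano
```

If the active-to-inactive switch is read as repressor binding, `k_off` would scale with repressor copy number and `k_on` with the unbinding rate (that mapping is itself an assumption). With `k_off = 0` the Fano factor reduces to 1, the Poisson value for constitutive expression.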
Finally, one shortcoming of the preceding modeling frameworks is that their applicability is largely limited to systems that are already well-characterized, such as the lac promoter. Motivated by this fact, we used a high throughput promoter mutagenesis assay called Sort-Seq to explore the completely uncharacterized transcriptional regulatory DNA of the E. coli mechanosensitive channel of large conductance (MscL). We identified several candidate transcription factor binding sites, and work is continuing to identify the associated proteins.
Abstract:
The genomes of many positive stranded RNA viruses and of all retroviruses are translated as large polyproteins which are proteolytically processed by cellular and viral proteases. Viral proteases are structurally related to two families of cellular proteases, the pepsin-like and trypsin-like proteases. This thesis describes the proteolytic processing of several nonstructural proteins of dengue 2 virus, a representative member of the Flaviviridae, and describes methods for transcribing full-length genomic RNA of dengue 2 virus. Chapter 1 describes the in vitro processing of the nonstructural proteins NS2A, NS2B and NS3. Chapter 2 describes a system that allows identification of residues within the protease that are directly or indirectly involved with substrate recognition. Chapter 3 describes methods to produce genome length dengue 2 RNA from cDNA templates.
The nonstructural protein NS3 is structurally related to viral trypsin-like proteases from the alpha-, picorna-, poty-, and pestiviruses. The hypothesis that the flavivirus nonstructural protein NS3 is a viral proteinase that generates the termini of several nonstructural proteins was tested using an efficient in vitro expression system and antisera specific for the nonstructural proteins NS2B and NS3. A series of cDNA constructs was transcribed using T7 RNA polymerase and the RNA translated in reticulocyte lysates. Proteolytic processing occurred in vitro to generate NS2B and NS3. The amino termini of NS2B and NS3 produced in vitro were found to be the same as the termini of NS2B and NS3 isolated from infected cells. Deletion analysis of cDNA constructs localized the protease domain necessary and sufficient for correct cleavage to the first 184 amino acids of NS3. Kinetic analysis of processing events in vitro and experiments to examine the sensitivity of processing to dilution suggested that an intramolecular cleavage between NS2A and NS2B preceded an intramolecular cleavage between NS2B and NS3. The data from these expression experiments confirm that NS3 is the viral proteinase responsible for cleavage events generating the amino termini of NS2B and NS3 and presumably for cleavages generating the termini of NS4A and NS5 as well.
Biochemical and genetic experiments using viral proteinases have defined the sequence requirements for cleavage site recognition, but have not identified residues within proteinases that interact with substrates. A biochemical assay was developed that could identify residues which were important for substrate recognition. Chimeric proteases between yellow fever and dengue 2 were constructed that allowed mapping of regions involved in substrate recognition, and site directed mutagenesis was used to modulate processing efficiency.
Expression in vitro revealed that the dengue protease domain efficiently processes the yellow fever polyprotein between NS2A and NS2B and between NS2B and NS3, but that the reciprocal construct is inactive. The dengue protease processes yellow fever cleavage sites more efficiently than dengue cleavage sites, suggesting that suboptimal cleavage efficiency may be used to increase levels of processing intermediates in vivo. By mutagenizing the putative substrate binding pocket it was possible to change the substrate specificity of the yellow fever protease; changing a minimum of three amino acids in the yellow fever protease enabled it to recognize dengue cleavage sites. This system allows identification of residues which are directly or indirectly involved with enzyme-substrate interaction, does not require a crystal structure, and can define the substrate preferences of individual members of a viral proteinase family.
Full-length cDNA clones, from which infectious RNA can be transcribed, have been developed for a number of positive strand RNA viruses, including the flavivirus type virus, yellow fever. The technology necessary to transcribe genomic RNA of dengue 2 virus was developed in order to better understand the molecular biology of the dengue subgroup. A 5' structural region clone was engineered to transcribe authentic dengue RNA that contains an additional 1 or 2 residues at the 5' end. A 3' nonstructural region clone was engineered to allow production of run off transcripts, and to allow directional ligation with the 5' structural region clone. In vitro ligation and transcription produces full-length genomic RNA which is noninfectious when transfected into mammalian tissue culture cells. Alternative methods for constructing cDNA clones and recovering live dengue virus are discussed.
Abstract:
Interleukin 2 (IL2) is the primary growth hormone used by mature T cells and this lymphokine plays an important role in the magnification of cell-mediated immune responses. Under normal circumstances its expression is limited to antigen-activated type 1 helper T cells (TH1) and the ability to transcribe this gene is often regarded as evidence for commitment to this developmental lineage. There is, however, abundant evidence that many non-TH1 T cells, under appropriate conditions, possess the ability to express this gene. Of paramount interest in the study of T-cell development are the mechanisms by which differentiating thymocytes are endowed with particular combinations of cell surface proteins and response repertoires. For example, why do most helper T cells express the CD4 differentiation antigen?
As a first step in understanding these developmental processes the gene encoding IL2 was isolated from a mouse genomic library by probing with a conspecific IL2 cDNA. The sequence of the 5' flanking region from +1 to -2800 was determined and compared to the previously reported human sequence. Extensive identity exists between +1 and -580 (86%) and sites previously shown to be crucial for the proper expression of the human gene are well conserved in both sequence and location in the mouse counterpart.
Transient expression assays were used to evaluate the contribution of various genomic sequences to high-level gene expression mediated by a cloned IL2 promoter fragment. Differing lengths of 5' flanking DNA, all terminating in the 5' untranslated region, were linked to a reporter gene, bacterial chloramphenicol acetyltransferase (CAT), and enzyme activity was measured after introduction into IL2-producing cell lines. No CAT was ever detected without stimulation of the recipient cells. A cloned promoter fragment containing only 321 bp of upstream DNA was expressed well in both Jurkat and EL4.E1 cells. Addition of intragenic or downstream DNA to these 5' IL2-CAT constructs showed that no obvious regulatory regions resided there. However, increasing the extent of 5' DNA from -321 to -2800 revealed several positive and negative regulatory elements. One negative region that was well characterized resided between -750 and -1000 and consisted almost exclusively of alternating purine and pyrimidines. There is no sequence resembling this in the human gene now, but there is evidence that there may have once been.
No region, when deleted, could relax either the stringent induction-dependence or cell-type specificity displayed by this promoter. Reagents that modulated endogenous IL2 expression, such as cAMP, cyclosporin A, and IL1, affected expression of the 5' IL2-CAT constructs also. For a given reagent, expression from all expressible constructs was suppressed or enhanced to the same extent. This suggests that these modulators affect IL2 expression through perturbation of a central inductive signal rather than by summation of the effects of discrete, independently regulated, negative and positive transcription factors.