15 resultados para RNA Dynamic Structure
em DigitalCommons@The Texas Medical Center
Resumo:
Eukaryotic genomes exist within a dynamic structure named chromatin in which DNA is wrapped around an octamer of histones forming the nucleosome. Histones are modified by a range of posttranslational modifications including methylation, phosphorylation, and ubiquitination, which are integral to a range of DNA-templated processes including transcriptional regulation. A hallmark for transcriptional activity is methylation of histone H3 on lysine (K) 4 within active gene promoters. In S. cerevisiae, H3K4 methylation is mediated by Set1 within the COMPASS complex. Methylation requires prior ubiquitination of histone H2BK123 by the E2-E3 ligases Rad6 and Bre1, as well as the Paf1 transcriptional elongation complex. This regulatory pathway exemplifies cross-talk in trans between posttranslational modifications on distinct histone molecules. Set1 has an additional substrate in the kinetochore protein Dam1, which is methylated on K233. This methylation antagonizes phosphorylation of adjacent serines by the Ipl1 Aurora kinase. The discovery of a second Set1 substrate raised the question of how Set1 function is regulated at the kinetochore. I hypothesized that transcriptional regulatory factors essential for H3K4 methylation at gene promoters might also regulate Set1-mediated methylation of Dam1K233. Here I show that the regulatory factors essential for COMPASS activity at gene promoters is also indispensable for the methylation of Dam1K233. Deletion of members of the COMPASS complex leads to loss of Dam1K233 methylation. In addition, deletion of Rad6, Bre1, or members of the Paf1 complex abolishes Dam1 methylation. The role of Rad6 and Bre1 in Dam1 methylation is dependent on H2BK123 ubiquitination, as mutation of K123 within H2B results in complete loss of Dam1 methylation. Importantly, methylation of Dam1K233 is independent of transcription and occurs at the kinetochore. My results demonstrate that Set1-mediated methylation is regulated by a general pathway regardless of substrate that is composed of transcriptional regulatory factors functioning independently of transcription at the kinetochore. My data provide the first example of cross-talk in trans between modifications on a histone and a non-histone protein. Additionally, my results indicate that several factors previously thought to be required for Set1 function at gene promoters are more generally required for the catalytic activity of the COMPASS complex regardless of substrate or cellular process.
Resumo:
Kaposi's sarcoma-associated herpesvirus (KSHV) is a recently discovered DNA tumor virus that belongs to the gamma-herpesvirus subfamily. Though numerous studies on KSHV and other herpesviruses, in general, have revealed much about their multilayered organization and capsid structure, the herpesvirus capsid assembly and maturation pathway remains poorly understood. Structural variability or irregularity of the capsid internal scaffolding core and the lack of adequate tools to study such structures have presented major hurdles to earlier investigations employing more traditional cryo-electron microscopy (cryoEM) single particle reconstruction. In this study, we used cryo-electron tomography (cryoET) to obtain 3D reconstructions of individual KSHV capsids, allowing direct visualization of the capsid internal structures and systematic comparison of the scaffolding cores for the first time. We show that B-capsids are not a structurally homogenous group; rather, they represent an ensemble of "B-capsid-like" particles whose inner scaffolding is highly variable, possibly representing different intermediates existing during the KSHV capsid assembly and maturation. This information, taken together with previous observations, has allowed us to propose a detailed pathway of herpesvirus capsid assembly and maturation.
Resumo:
Cytoplasmic polyhedrosis virus (CPV) is unique within the Reoviridae family in having a turreted single-layer capsid contained within polyhedrin inclusion bodies, yet being fully capable of cell entry and endogenous RNA transcription. Biochemical data have shown that the amino-terminal 79 residues of the CPV turret protein (TP) is sufficient to bring CPV or engineered proteins into the polyhedrin matrix for micro-encapsulation. Here we report the three-dimensional structure of CPV at 3.88 A resolution using single-particle cryo-electron microscopy. Our map clearly shows the turns and deep grooves of alpha-helices, the strand separation in beta-sheets, and densities for loops and many bulky side chains; thus permitting atomic model-building effort from cryo-electron microscopy maps. We observed a helix-to-beta-hairpin conformational change between the two conformational states of the capsid shell protein in the region directly interacting with genomic RNA. We have also discovered a messenger RNA release hole coupled with the mRNA capping machinery unique to CPV. Furthermore, we have identified the polyhedrin-binding domain, a structure that has potential in nanobiotechnology applications.
Resumo:
Ca$\sp{++}$/calmodulin-dependent protein kinase II (CaM-KII) is highly concentrated in mammalian brain, comprising as much as 2% of the total protein in some regions. In forebrain, CaM-KII has been shown to be enriched in postsynaptic structures where it has been implicated in maintaining cytoskeletal structure, and more recently in signal transduction mechanisms and processes underlying learning and memory. CaM-KII appears to exist as a holoenzyme composed of two related yet distinct subunits, alpha and beta. The ratio of the subunits in the holoenzyme varies with different brain regions and to some degree with subcellular fractions. The two subunits also display distinct developmental profiles. Levels of alpha subunit are not evident at birth but increase dramatically during postnatal development, while levels of beta subunit are readily detected at birth and only gradual increase postnatally. The distinct regional, subcellular and developmental distribution of the two subunits of CaM-KII have prompted us to examine factors involved in regulating the synthesis of the subunit proteins.^ This dissertation addresses the regional and developmental expression of the mRNAs for the individual subunits using in situ hybridization histochemistry and northern slot-blot analysis. By comparing the developmental profile of each mRNA with that of its respective protein, we have determined that initiation of gene transcription is likely the primary site for regulating CaM-KII protein levels. Furthermore, the distinct cytoarchitecture of the hippocampus has allowed us to demonstrate that the alpha, but not beta subunit mRNA is localized in dendrites of certain forebrain neurons. The localization of alpha subunit mRNA at postsynaptic structures, in concert with the accumulation of subunit protein, suggests that dendritic synthesis of CaM-KII alpha subunit may be important for maintaining postsynaptic structure and/or function. ^
Resumo:
Involvement of E. coli 23S ribosomal RNA (rRNA) in decoding of termination codons was first indicated by the characterization of a 23S rRNA mutant that causes UGA-specific nonsense suppression. The work described here was begun to test the hypothesis that more 23S rRNA suppressors of specific nonsense mutations can be isolated and that they would occur non-randomly in the rRNA genes and be clustered in specific, functionally significant regions of rRNA.^ Approximately 2 kilobases of the gene for 23S rRNA were subjected to PCR random mutagenesis and the amplified products screened for suppression of nonsense mutations in trpA. All of the suppressor mutations obtained were located in a thirty-nucleotide part of the GTPase center, a conserved rRNA sequence and structure, and they and others made in that region by site-directed mutagenesis were shown to be UGA-specific in their suppression of termination codon mutations. These results proved the initial hypothesis and demonstrated that a group of nucleotides in this region are involved in decoding of the UGA termination codon. Further, it was shown that limitation of cellular availability or synthesis of L11, a ribosomal protein that binds to the GTPase center rRNA, resulted in suppression of termination codon mutations, suggesting the direct involvement of L11 in termination in vivo.^ Finally, in vivo analysis of certain site-specific mutations made in the GTPase center RNA demonstrated that (a) the G$\cdot$A base pair closing the hexanucleotide hairpin loop was not essential for normal termination, (b) the "U-turn" structure in the 1093 to 1098 hexaloop is critical for normal termination, (c) nucleotides A1095 and A1067, necessary for the binding to ribosomes of thiostrepton, an antibiotic that inhibits polypeptide release factor binding to ribosomes in vitro, are also necessary for normal peptide chain termination in vivo, and (d) involvement of this region of rRNA in termination is determined by some unique subset structure that includes particular nucleotides rather than merely by a general structural feature of the GTPase center.^ This work advances the understanding of peptide chain termination by demonstrating that the GTPase region of 23S rRNA participates in recognition of termination codons, through an associated ribosomal protein and specific conserved nucleotides and structural motifs in its RNA. ^
Resumo:
Two regions in the 3$\prime$ domain of 16S rRNA (the RNA of the small ribosomal subunit) have been implicated in decoding of termination codons. Using segment-directed PCR random mutagenesis, I isolated 33 translational suppressor mutations in the 3$\prime$ domain of 16S rRNA. Characterization of the mutations by both genetic and biochemical methods indicated that some of the mutations are defective in UGA-specific peptide chain termination and that others may be defective in peptide chain termination at all termination codons. The studies of the mutations at an internal loop in the non-conserved region of helix 44 also indicated that this structure, in a non-conserved region of 16S rRNA, is involved in both peptide chain termination and assembly of 16S rRNA.^ With a suppressible trpA UAG nonsense mutation, a spontaneously arising translational suppressor mutation was isolated in the rrnB operon cloned into a pBR322-derived plasmid. The mutation caused suppression of UAG at two codon positions in trpA but did not suppress UAA or UGA mutations at the same trpA positions. The specificity of the rRNA suppressor mutation suggests that it may cause a defect in UAG-specific peptide chain termination. The mutation is a single nucleotide deletion (G2484$\Delta$) in helix 89 of 23S rRNA (the large RNA of the large ribosomal subunit). The result indicates a functional interaction between two regions of 23S rRNA. Furthermore, it provides suggestive in vivo evidence for the involvement of the peptidyl-transferase center of 23S rRNA in peptide chain termination. The $\Delta$2484 and A1093/$\Delta$2484 (double) mutations were also observed to alter the decoding specificity of the suppressor tRNA lysT(U70), which has a mutation in its acceptor stem. That result suggests that there is an interaction between the stem-loop region of helix 89 of 23S rRNA and the acceptor stem of tRNA during decoding and that the interaction is important for the decoding specificity of tRNA.^ Using gene manipulation procedures, I have constructed a new expression vector to express and purify the cellular protein factors required for a recently developed, realistic in vitro termination assay. The gene for each protein was cloned into the newly constructed vector in such a way that expression yielded a protein with an N-terminal affinity tag, for specific, rapid purification. The amino terminus was engineered so that, after purification, the unwanted N-terminal tag can be completely removed from the protein by thrombin cleavage, yielding a natural amino acid sequence for each protein. I have cloned the genes for EF-G and all three release factors into this new expression vector and the genes for all the other protein factors into a pCAL-n expression vector. These constructs will allow our laboratory group to quickly and inexpensively purify all the protein factors needed for the new in vitro termination assay. (Abstract shortened by UMI.) ^
Resumo:
In many organisms, polarity of the oocyte is established post-transcriptionally via subcellular RNA localization. Many RNAs are localized during oogenesis in Xenopus laevis, including Xlsirts ( Xenopus laevis short interspersed repeat transcripts) [Kloc, 1993]. Xlsirts constitute a large family defined by highly homologous repeat units 79–81 nucleotides in length. Endogenous Xlsirt RNAs use the METRO (Message Transport Organizer) pathway of localization, where RNAs are transported from the nucleus to the mitochondrial cloud in stage I oocytes. Secondly, RNAs anchor at the vegetal pole in stage II oocytes. Exogenous Xlsirt RNAs can also utilize the Late pathway of localization, which involves localization to the vegetal cortex during stage III of oogenesis and results in RNAs anchored in the cortex of the entire vegetal hemisphere. ^ The Xlsirts localization signal is contained within the repeat region. This study was designed to test the hypothesis that there are cis -acting localization elements in Xlsirts, and that higher order structure plays a role. Results of experiments on Xlsirt P11, a 1700 basepair (bp) family member, led to the conclusion that a 137-bp fragment of the repetitive region is necessary and sufficient for METRO and Late pathway localization. This analysis definitively demonstrates that the Xlsirt localization signal for the METRO and Late pathways reside within the repetitive region and not within the flanking regions. Analysis of Xlsirt linker scanning mutations revealed two METRO-pathway specific subelements, and one Late-pathway specific subelement. Functional, computer, and biochemical evidence relates the higher order structure of this element to its ability to function as a localization element. ^ Xlsirt 137 is 99% identical to the Xlsirt consensus sequence identified in this study, suggesting that it is the localization element for all localized Xlsirt family members. The repeat unit was reframed based on function, rather than arbitrarily based on sequence. This work supports the hypothesis presented in 1981 by George Spohr, who originally isolated the Xlsirts, which stated that the highly conserved repetitive elements must be constrained from variability due to some unknown function of the repeats themselves. These studies shed light on the mechanism of RNA localization, linking structure and function. ^
Resumo:
mRNA 3′ polyadenylation is central to mRNA biogenesis in prokaryotes and eukaryotes, and is implicated in numerous aspects of mRNA metabolism, including efficiency of mRNA export from the nucleus, message stability, and initiation of translation. However, due to the great complexity of the eukaryotic polyadenylation apparatus, the mechanisms of RNA 3 ′ end processing have remained elusive. Although the RNA processing reactions leading to polyadenylated messenger RNA have been studied in many systems, and much progress has been made, a complete understanding of the biochemistry of the poly(A) polymerase enzyme is still lacking. My research uses Vaccinia virus as a model system to gain a better understanding of this complicated polyadenylation process, which consist of RNA binding, catalysis and polymerase translocation. ^ Vaccinia virus replicates in the cytoplasm of its host cell, so it must employ its own poly(A) polymerase (PAP), a heterodimer of two virus encoded proteins, VP55 and VP39. VP55 is the catalytic subunit, adding 30 adenylates to a non-polyadenylated RNA in a rapid processive manner before abruptly changing to a slow, non-processive mode of adenylate addition and dissociating from the RNA. VP39 is the stimulatory subunit. It has no polyadenylation catalytic activity by itself, but when associated with VP55 it facilitates the semi-processive synthesis of tails several hundred adenylates in length. ^ Oligonucleotide selection and competition studies have shown that the heterodimer binds a minimal motif of (rU)2 (N)25 U, the “heterodimer binding motif”, within an oligonucleotide, and its primer selection for polyadenylation is base-type specific. ^ Crosslinking studies using photosensitive uridylate analogs show that within a VP55-VP39-primer ternary complex, VP55 comes into contact with all three required uridylates, while VP39 only contacts the downstream uridylate. Further studies, using a backbone-anchored photosensitive crosslinker show that both PAP subunits are in close proximity to the downstream −10 to −21 region of 50mer model primers containing the heterodimer binding motif. This equal crosslinking to both subunits suggests that the dimerization of VP55 and VP39 creates either a cleft or a channel between the two subunits through which this region of RNA passes. ^ Peptide mapping studies of VP39 covalently crosslinked to the oligonucleotide have identified residue R107 as the amino acid in close proximity to the −10 uridylate. This helps us project a conceptual model onto the known physical surface of this subunit. In the absence of any tertiary structural data for VP55, we have used a series of oligonucleotide selection assays, as well as crosslinking, nucleotide transfer assays, and gel shift assays to gain insight into the requirements for binding, polyadenylation and translocation. Collectively, these data allow us to put together a comprehensive model of the structure and function of the polyadenylation ternary complex consisting of VP39, VP55 and RNA. ^
Resumo:
Carboxypeptidase N (CPN) is a plasma zinc metalloprotease, which consists of two enzymatically active small subunits and two large subunits that protect the protein from degradation. CPN cleaves carboxy-terminal arginines and lysines from peptides found in the bloodstream such as complement anaphylatoxins, kinins, and creatine kinase MM. In this study, the mouse CPN small subunit (CPN1) coding region, gene structure, and chromosomal location were characterized and the expression of CPN1 was investigated in mouse embryos at different stages of development. The CPN1 gene, which was approximately 29 kb in length, contained nine exons and localized to mouse chromosome 19D2. The fifth and sixth exons of CPN1 encoded the amino acids necessary for substrate binding and catalytic activity. CPN1 RNA was expressed predominately in adult liver and contained a 1371 bp open reading frame encoding 457 amino acids. In the mouse embryo, CPN1 RNA was observed at 8.5 days post coitus (dpc), while its protein was detected at 10.5 dpc. In situ hybridization of the fetal liver detected CPN1 RNA in erythroid progenitor cells at 10.5, 13.5, and 16.5 dpc and in hepatocytes at 16.5 dpc. This was compared to the expression of the complement component C3, the parent molecule of complement anaphylatoxin C3a. Consistently throughout the experiments, CPN1 message and protein preceded the expression of C3. To obtain a better understanding of the biological significance of CPN1 in vivo, studies were initiated to produce a genetically engineered mouse in which the CPN1 gene was ablated. To facilitate this project a targeting vector was constructed by removing the functionally important fifth and sixth exons of the CPN1 gene. Collectively, these studies have: (1) provided important detailed information regarding the structure and organization of the murine CPN1 gene, (2) yielded insights into the developmental expression of mouse CPN1 in relationship to C3 expression, and (3) set the stage for the generation of a CPN1 “knock-out” mouse, which can be used to determine the biological significance of CPN1 in both normal and diseased conditions. ^
Resumo:
Essential biological processes are governed by organized, dynamic interactions between multiple biomolecular systems. Complexes are thus formed to enable the biological function and get dissembled as the process is completed. Examples of such processes include the translation of the messenger RNA into protein by the ribosome, the folding of proteins by chaperonins or the entry of viruses in host cells. Understanding these fundamental processes by characterizing the molecular mechanisms that enable then, would allow the (better) design of therapies and drugs. Such molecular mechanisms may be revealed trough the structural elucidation of the biomolecular assemblies at the core of these processes. Various experimental techniques may be applied to investigate the molecular architecture of biomolecular assemblies. High-resolution techniques, such as X-ray crystallography, may solve the atomic structure of the system, but are typically constrained to biomolecules of reduced flexibility and dimensions. In particular, X-ray crystallography requires the sample to form a three dimensional (3D) crystal lattice which is technically di‑cult, if not impossible, to obtain, especially for large, dynamic systems. Often these techniques solve the structure of the different constituent components within the assembly, but encounter difficulties when investigating the entire system. On the other hand, imaging techniques, such as cryo-electron microscopy (cryo-EM), are able to depict large systems in near-native environment, without requiring the formation of crystals. The structures solved by cryo-EM cover a wide range of resolutions, from very low level of detail where only the overall shape of the system is visible, to high-resolution that approach, but not yet reach, atomic level of detail. In this dissertation, several modeling methods are introduced to either integrate cryo-EM datasets with structural data from X-ray crystallography, or to directly interpret the cryo-EM reconstruction. Such computational techniques were developed with the goal of creating an atomic model for the cryo-EM data. The low-resolution reconstructions lack the level of detail to permit a direct atomic interpretation, i.e. one cannot reliably locate the atoms or amino-acid residues within the structure obtained by cryo-EM. Thereby one needs to consider additional information, for example, structural data from other sources such as X-ray crystallography, in order to enable such a high-resolution interpretation. Modeling techniques are thus developed to integrate the structural data from the different biophysical sources, examples including the work described in the manuscript I and II of this dissertation. At intermediate and high-resolution, cryo-EM reconstructions depict consistent 3D folds such as tubular features which in general correspond to alpha-helices. Such features can be annotated and later on used to build the atomic model of the system, see manuscript III as alternative. Three manuscripts are presented as part of the PhD dissertation, each introducing a computational technique that facilitates the interpretation of cryo-EM reconstructions. The first manuscript is an application paper that describes a heuristics to generate the atomic model for the protein envelope of the Rift Valley fever virus. The second manuscript introduces the evolutionary tabu search strategies to enable the integration of multiple component atomic structures with the cryo-EM map of their assembly. Finally, the third manuscript develops further the latter technique and apply it to annotate consistent 3D patterns in intermediate-resolution cryo-EM reconstructions. The first manuscript, titled An assembly model for Rift Valley fever virus, was submitted for publication in the Journal of Molecular Biology. The cryo-EM structure of the Rift Valley fever virus was previously solved at 27Å-resolution by Dr. Freiberg and collaborators. Such reconstruction shows the overall shape of the virus envelope, yet the reduced level of detail prevents the direct atomic interpretation. High-resolution structures are not yet available for the entire virus nor for the two different component glycoproteins that form its envelope. However, homology models may be generated for these glycoproteins based on similar structures that are available at atomic resolutions. The manuscript presents the steps required to identify an atomic model of the entire virus envelope, based on the low-resolution cryo-EM map of the envelope and the homology models of the two glycoproteins. Starting with the results of the exhaustive search to place the two glycoproteins, the model is built iterative by running multiple multi-body refinements to hierarchically generate models for the different regions of the envelope. The generated atomic model is supported by prior knowledge regarding virus biology and contains valuable information about the molecular architecture of the system. It provides the basis for further investigations seeking to reveal different processes in which the virus is involved such as assembly or fusion. The second manuscript was recently published in the of Journal of Structural Biology (doi:10.1016/j.jsb.2009.12.028) under the title Evolutionary tabu search strategies for the simultaneous registration of multiple atomic structures in cryo-EM reconstructions. This manuscript introduces the evolutionary tabu search strategies applied to enable a multi-body registration. This technique is a hybrid approach that combines a genetic algorithm with a tabu search strategy to promote the proper exploration of the high-dimensional search space. Similar to the Rift Valley fever virus, it is common that the structure of a large multi-component assembly is available at low-resolution from cryo-EM, while high-resolution structures are solved for the different components but lack for the entire system. Evolutionary tabu search strategies enable the building of an atomic model for the entire system by considering simultaneously the different components. Such registration indirectly introduces spatial constrains as all components need to be placed within the assembly, enabling the proper docked in the low-resolution map of the entire assembly. Along with the method description, the manuscript covers the validation, presenting the benefit of the technique in both synthetic and experimental test cases. Such approach successfully docked multiple components up to resolutions of 40Å. The third manuscript is entitled Evolutionary Bidirectional Expansion for the Annotation of Alpha Helices in Electron Cryo-Microscopy Reconstructions and was submitted for publication in the Journal of Structural Biology. The modeling approach described in this manuscript applies the evolutionary tabu search strategies in combination with the bidirectional expansion to annotate secondary structure elements in intermediate resolution cryo-EM reconstructions. In particular, secondary structure elements such as alpha helices show consistent patterns in cryo-EM data, and are visible as rod-like patterns of high density. The evolutionary tabu search strategy is applied to identify the placement of the different alpha helices, while the bidirectional expansion characterizes their length and curvature. The manuscript presents the validation of the approach at resolutions ranging between 6 and 14Å, a level of detail where alpha helices are visible. Up to resolution of 12 Å, the method measures sensitivities between 70-100% as estimated in experimental test cases, i.e. 70-100% of the alpha-helices were correctly predicted in an automatic manner in the experimental data. The three manuscripts presented in this PhD dissertation cover different computation methods for the integration and interpretation of cryo-EM reconstructions. The methods were developed in the molecular modeling software Sculptor (http://sculptor.biomachina.org) and are available for the scientific community interested in the multi-resolution modeling of cryo-EM data. The work spans a wide range of resolution covering multi-body refinement and registration at low-resolution along with annotation of consistent patterns at high-resolution. Such methods are essential for the modeling of cryo-EM data, and may be applied in other fields where similar spatial problems are encountered, such as medical imaging.
Resumo:
Structure-function analysis of human Integrator subunit 4 Anupama Sataluri Advisor: Eric. J. Wagner, Ph.D. Uridine-rich small nuclear RNAs (U snRNA) are RNA Polymerase-II (RNAPII) transcripts that are ubiquitously expressed and are known to be essential for gene expression. snRNAs play a key role in mRNA splicing and in histone mRNA expression. Inaccurate snRNA biosynthesis can lead to diseases related to defective splicing and histone mRNA expression. Although the 3′ end formation mechanism and processing machinery of other RNAPII transcripts such as mRNA has been well studied, the mechanism of snRNA 3′ end processing has remained a mystery until the recent discovery of the machinery that mediates this process. In 2005, a complex of 14 subunits (the Integrator complex) associated with RNA Polymerase-II was discovered. The 14subunits were annotated Integrator 1-14 based on their size. The subunits of this complex together were found to facilitate 3′ end processing of snRNA. Identification of the Integrator complex propelled research in the direction of understanding the events of snRNA 3’end processing. Recent studies from our lab confirmed that Integrator subunit (IntS) 9 and 11 together perform the endonucleolytic cleavage of the nascent snRNA 3′ end to generate mature snRNA. However, the role of other members of the Integrator complex remains elusive. Current research in our lab is focused on deciphering the role of each subunit within the Integrator complex This work specifically focuses on elucidating the role of human Integrator subunit 4 (IntS4) and understanding how it facilitates the overall function of the complex. IntS4 has structural similarity with a protein called “Symplekin”, which is part of the mRNA 3’end processing machinery. Symplekin has been thoroughly researched in recent years and structure-function correlation studies in the context of mRNA 3’end processing have reported a scaffold function for Symplekin due to the presence of HEAT repeat motifs in its N-terminus. Based upon the structural similarity between IntS4 and Symplekin, we hypothesized that Integrator subunit 4 may be behaving as a Symplekin-like scaffold molecule that facilitates the interaction between other members of the Integrator Complex. To answer this question, the two important goals of this study were to: 1) identify the region of IntS4, which is important for snRNA 3′ end processing and 2) determine binding partners of IntS4 which promote its function as a scaffold. IntS4 structurally consists of a highly conserved N-terminus with 8 HEAT repeats, followed by a nonconserved C- terminus. A series of siRNA resistant N and C-terminus deletion constructs as well as specific point mutants within its N-terminal HEAT repeats were generated for human IntS4 and, utilizing a snRNA transcriptional readthrough GFP-reporter assay, we tested their ability to rescue misprocessing. This assay revealed a possible scaffold like property of IntS4. To probe IntS4 for interaction partners, we performed co-immunoprecipitation on nuclear extracts of IntS4 expressing stable cell lines and identified IntS3 and IntS5 among other Integrator subunits to be binding partners which facilitate the scaffold like function of hIntS4. These findings have established a critical role for IntS4 in snRNA 3′ end processing, identified that both its N and C termini are essential for its function, and mapped putative interaction domains with other Integrator subunits.
Resumo:
My dissertation focuses on two aspects of RNA sequencing technology. The first is the methodology for modeling the overdispersion inherent in RNA-seq data for differential expression analysis. This aspect is addressed in three sections. The second aspect is the application of RNA-seq data to identify the CpG island methylator phenotype (CIMP) by integrating datasets of mRNA expression level and DNA methylation status. Section 1: The cost of DNA sequencing has reduced dramatically in the past decade. Consequently, genomic research increasingly depends on sequencing technology. However it remains elusive how the sequencing capacity influences the accuracy of mRNA expression measurement. We observe that accuracy improves along with the increasing sequencing depth. To model the overdispersion, we use the beta-binomial distribution with a new parameter indicating the dependency between overdispersion and sequencing depth. Our modified beta-binomial model performs better than the binomial or the pure beta-binomial model with a lower false discovery rate. Section 2: Although a number of methods have been proposed in order to accurately analyze differential RNA expression on the gene level, modeling on the base pair level is required. Here, we find that the overdispersion rate decreases as the sequencing depth increases on the base pair level. Also, we propose four models and compare them with each other. As expected, our beta binomial model with a dynamic overdispersion rate is shown to be superior. Section 3: We investigate biases in RNA-seq by exploring the measurement of the external control, spike-in RNA. This study is based on two datasets with spike-in controls obtained from a recent study. We observe an undiscovered bias in the measurement of the spike-in transcripts that arises from the influence of the sample transcripts in RNA-seq. Also, we find that this influence is related to the local sequence of the random hexamer that is used in priming. We suggest a model of the inequality between samples and to correct this type of bias. Section 4: The expression of a gene can be turned off when its promoter is highly methylated. Several studies have reported that a clear threshold effect exists in gene silencing that is mediated by DNA methylation. It is reasonable to assume the thresholds are specific for each gene. It is also intriguing to investigate genes that are largely controlled by DNA methylation. These genes are called “L-shaped” genes. We develop a method to determine the DNA methylation threshold and identify a new CIMP of BRCA. In conclusion, we provide a detailed understanding of the relationship between the overdispersion rate and sequencing depth. And we reveal a new bias in RNA-seq and provide a detailed understanding of the relationship between this new bias and the local sequence. Also we develop a powerful method to dichotomize methylation status and consequently we identify a new CIMP of breast cancer with a distinct classification of molecular characteristics and clinical features.
Resumo:
Phosphatidylserine decarboxylase of E. coli, a cytoplasmic membrane protein, catalyzes the formation of phosphatidylethanolamine, the principal phospholipid of the organism. The activity of the enzyme is dependent on a covalently bound pyruvate (Satre and Kennedy (1978) J. Biol. Chem. 253, 479-483). This study shows that the enzyme consists of two nonidentical subunits, $\alpha$ (Mr = 7,332) and $\beta$ (Mr = 28,579), with the pyruvate prosthetic group in amide linkage to the amino-terminus of the $\alpha$ subunit. Partial protein sequence and DNA sequence analysis reveal that the two subunits are derived from a proenzyme ($\pi$ subunit, Mr = 35,893) through a post-translational event. During the conversion of the proenzyme to the $\alpha$ and $\beta$ subunits, the peptide bond between Gly253-Ser254 is cleaved, and Ser254 is converted to the pyruvate prosthetic group at the amino-terminus of the $\alpha$ subunit (Li and Dowhan (1988) J. Biol. Chem. 263, 11516-11522).^ The proenzyme cannot be detected in cells carrying either single or multiple copies of the gene (psd), but can be observed in a T7 RNA polymerase/promoter and transcription-translation system. The cleavage of the wild-type proenzyme occurs rapidly with a half-time on the order of 2 min. Changing of the Ser254 to cysteine (S254C) or threonine (S254T) slows the cleavage rate dramatically and results in mutants with a half-time for processing of around 2-4 h. Change of the Ser254 to alanine (S254A) blocks the cleavage of the proenzyme. The reduced processing rate with the mutations of the proenzyme is consistent with less of the functional enzyme being made. Mutants S254C and S254T produce $\sim$15% and $\sim$1%, respectively, of the activity of the wild-type allele, but can still complement a temperature-sensitive mutant of the psd locus. Neither detectable activity nor complementation is observed by mutant S254A. These results are consistent with the hydroxyl-group of the Ser254 playing a critical role in the cleavage of the peptide bond Gly253-Ser254 of the pro-phosphatidylserine decarboxylase, and support the mechanism proposed by Snell and co-workers (Recsei and Snell (1984) Annu. Rev. Biochem. 53, 357-387) for the formation of the prosthetic group of pyruvate-dependent decarboxylases. ^
Resumo:
The purpose of this work was to examine the possible mechanisms for the regulation of cytochrome c gene expression in response to increased contractile activity in rat skeletal muscle. The working hypothesis was that increased contractile activity enhances cytochrome c gene expression through a cis-element. A 110% increase in cytochrome c mRNA concentration was observed in tibialis anterior (TA) muscle after 9 days of chronic stimulation. Similar difference (120%) exists between soleus (SO) muscle of higher contractile activity and white vastus lateralis (WV) muscle of lower contractile activity. These results suggest that the endogenous cytochrome c gene expression is regulated by contractile activity. Cytochrome c-reporter genes were injected into skeletal muscles to identify the cis-element that is responsible for the regulation. Although the data was inconclusive, part of it suggested the importance of the 3$\sp\prime$-untranslated region (3$\sp\prime$-UTR) in mediating the response to increased contractile activity.^ RNA gel mobility shift (GMSA) and ultraviolet (UV) cross-linking assays revealed specific RNA-protein interaction in a 50-nucleotide region of the 3$\sp\prime$-UTR in unstimulated TA muscle. Computer analysis predicted a stem-loop structure of 17 nucleotides, which provides a structural basis for RNA-protein interaction. These 17 nucleotides are 100% conserved among rat, mouse and human cytochrome c genes and their 13 pseudogenes, suggesting a functional role for this region. The RNA-protein interaction was significantly less in highly active SO muscle than in inactive WV muscle and was dramatically decreased in stimulated TA muscle due to a protein inhibitor(s) associated with ribosome. It is possible that cytochrome c mRNAs undergoing translation are subject to a compartmentalized regulatory influence.^ The conclusion from these results is that increases in contractile activity induce or activate a protein inhibitor(s) associated with ribosome in rat skeletal muscle. The inhibitor decreases RNA-protein interaction in the 3$\sp\prime$-UTR of cytochrome c mRNA, which may result in increased mRNA stability and/or translation. ^
Resumo:
Like other simple retroviruses the murine sarcoma virus ts110 (MuSVts110) displays an inefficient mode of genome splicing. But, unlike the splicing phenotypic of other retroviruses, the splicing event effected upon the transcript of MuSVts110 is temperature sensitive. Previous work in this laboratory has established that the conditionally defective nature of MuSVts110 RNA splicing is mediated in cis by features in the viral transcript. Here we show that the 5$\sp\prime$ splice site of the MuSVts110 transcript acts as a point of control of the overall splicing efficiency at both permissive and nonpermissive temperatures for splicing. We strengthened and simultaneously weakened the nucleotide structure of the 5$\sp\prime$ splice site in an attempt to elucidate the differential effects each of the two known critical splicing components which interact with the 5$\sp\prime$ splice site have on the overall efficiency of intron excision. We found that a transversion of the sixth nucleotide, resulting in the formation of a near-consensus 5$\sp\prime$ splice site, dramatically increased the overall efficiency of MuSVts110 RNA splicing and abrogated the thermosensitive nature of this splicing event. Various secondary mutations within this original transversion mutant, designed to selectively decrease specific splicing component interactions, lead to recovery of inefficient and thermosensitive splicing. We have further shown that a sequence of 415 nucleotides lying in the downstream exon of the viral RNA and hypothesized to act as an element in the temperature-dependent inhibition of splicing displays a functional redundancy throughout its length; loss and/or replacement of any one sequence of 100 nucleotides within this sequence does not, with one exception detailed below, diminish the degree to which MuSVts110 RNA is inhibited to splice at the restrictive temperature. One specific deletion, though, fortuitously juxtaposed and activated cryptic consensus splicing signals for the excision of a cryptic intron within the downstream exon and markedly potentiated--across a newly defined cryptic exon--the splicing event effected upon the upstream, native intron. We have exploited this mutant of MuSVts110 to further an understanding of the process of exon definition and intron definition and show that the polypyrimidine tract and consensus 3$\sp\prime$ splice site, as well as the 5$\sp\prime$ splice site, within the intron at the 3$\sp\prime$ flank of the defined exon are required for the exon's definition; implying that definition of the downstream intron is required for the in vivo definition of the proximal, upstream exon. Finally; we have shown, through the construction of heterologous mutants of MuSVts110 employing a foreign 3$\sp\prime$ end-forming sequence, that efficiency of transcript splicing can be increased--to a degree which abrogates its thermosensitive nature--in direct proportion to increasing proximity of the 3$\sp\prime$ end-forming signal to the terminal 3$\sp\prime$ splice site. ^