932 results for RNA Transcripts
Abstract:
Alternative RNA splicing is a critical process that diversifies protein function and thereby controls cell differentiation and normal development. Although it is known that most eukaryotic genes produce multiple transcripts in which splice site selection is regulated, how RNA-binding proteins cooperate to activate and repress specific splice sites is still poorly understood. In addition, how the regulation of alternative splicing affects germ cell development is not well known. In this study, Drosophila Transformer 2 (Tra2) was used as a model to explore both the mechanism of its repressive function on its own pre-mRNA splicing and the effect of this splicing regulation on spermatogenesis in the testis. Half-pint (Hfp), a protein known as a splicing activator, was identified in an S2 cell-based RNAi screen as a co-repressor that functions in combination with Tra2 in the splicing repression of the M1 intron. Its repressive splicing function was found to be sequence specific and dependent on both the weak 3′ splice site and an intronic splicing silencer within the M1 intron. In addition, we found that in vivo, two forms of Hfp are expressed in a cell type-specific manner. These alternative forms differ at their amino terminus, affecting the presence of a region with four RS dipeptides. Using assays in Drosophila S2 cells, we determined that the alternative N-terminal domain is necessary for repression. This difference is probably due to differential localization of the two isoforms in the nucleus and cytoplasm. Our in vivo studies show that both Hfp and Tra2 are required for normal spermatogenesis and cooperate in repression of M1 splicing in spermatocytes. Interestingly, however, Tra2 and Hfp antagonize each other’s function in regulating germline-specific alternative splicing of Taf1 (TBP-associated factor 1). Genetic and cytological studies showed that mutants of Hfp and Taf1 cause similar defects in meiosis and spermatogenesis.
These results suggest that Hfp regulates normal spermatogenesis partly through the regulation of taf1 splicing. These observations indicate that Hfp regulates tra2 and taf1 activity and plays an important role in germ cell differentiation in male flies.
Abstract:
My dissertation focuses on two aspects of RNA sequencing technology. The first is the methodology for modeling the overdispersion inherent in RNA-seq data for differential expression analysis. This aspect is addressed in three sections. The second aspect is the application of RNA-seq data to identify the CpG island methylator phenotype (CIMP) by integrating datasets of mRNA expression level and DNA methylation status. Section 1: The cost of DNA sequencing has fallen dramatically in the past decade. Consequently, genomic research increasingly depends on sequencing technology. However, it remains unclear how sequencing capacity influences the accuracy of mRNA expression measurement. We observe that accuracy improves as sequencing depth increases. To model the overdispersion, we use the beta-binomial distribution with a new parameter indicating the dependency between overdispersion and sequencing depth. Our modified beta-binomial model performs better than the binomial or the pure beta-binomial model, with a lower false discovery rate. Section 2: Although a number of methods have been proposed to accurately analyze differential RNA expression at the gene level, modeling at the base pair level is required. Here, we find that the overdispersion rate decreases as the sequencing depth increases at the base pair level. We also propose four models and compare them with each other. As expected, our beta-binomial model with a dynamic overdispersion rate is shown to be superior. Section 3: We investigate biases in RNA-seq by exploring the measurement of the external control, spike-in RNA. This study is based on two datasets with spike-in controls obtained from a recent study. We observe a previously undescribed bias in the measurement of the spike-in transcripts that arises from the influence of the sample transcripts in RNA-seq. We also find that this influence is related to the local sequence of the random hexamer that is used in priming.
We propose a model of the inequality between samples to correct this type of bias. Section 4: The expression of a gene can be turned off when its promoter is highly methylated. Several studies have reported that a clear threshold effect exists in gene silencing that is mediated by DNA methylation. It is reasonable to assume the thresholds are specific for each gene. It is also intriguing to investigate genes that are largely controlled by DNA methylation. These genes are called “L-shaped” genes. We develop a method to determine the DNA methylation threshold and identify a new CIMP of breast cancer (BRCA). In conclusion, we provide a detailed understanding of the relationship between the overdispersion rate and sequencing depth. We also reveal a new bias in RNA-seq and provide a detailed understanding of the relationship between this new bias and the local sequence. Finally, we develop a powerful method to dichotomize methylation status and consequently identify a new CIMP of breast cancer with a distinct classification of molecular characteristics and clinical features.
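As a rough illustration of the overdispersion modeling described in Section 1 of this abstract, the sketch below compares binomial and beta-binomial variances for a read count and lets the overdispersion shrink with sequencing depth. The variance formulas are the standard ones. The function `depth_dependent_rho` and its parameters `rho0` and `gamma` are hypothetical stand-ins: the abstract does not state the functional form of its depth-dependent parameter, so a power law is assumed here purely for demonstration.

```python
def binomial_variance(n, p):
    """Variance of a binomial count: n reads, success probability p."""
    return n * p * (1.0 - p)


def beta_binomial_variance(n, p, rho):
    """Variance of a beta-binomial count with mean n*p.

    rho = 1 / (alpha + beta + 1) is the overdispersion (intra-class
    correlation) of the underlying Beta(alpha, beta); rho = 0 recovers
    the binomial variance.
    """
    return n * p * (1.0 - p) * (1.0 + (n - 1) * rho)


def depth_dependent_rho(n, rho0, gamma):
    """HYPOTHETICAL depth dependence (not taken from the dissertation):
    rho shrinks as a power of the depth n; gamma = 0 gives a constant rho."""
    return rho0 / (n ** gamma)


if __name__ == "__main__":
    # Variance inflation relative to the binomial at several depths.
    for depth in (10, 100, 1000, 10000):
        rho = depth_dependent_rho(depth, rho0=0.2, gamma=0.5)
        ratio = beta_binomial_variance(depth, 0.3, rho) / binomial_variance(depth, 0.3)
        print(f"depth={depth:>6}  rho={rho:.4f}  variance inflation={ratio:.2f}")
```

Setting `gamma = 0` gives what the abstract calls the pure beta-binomial model with a fixed overdispersion, so the two variants can be compared on the same inputs.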
Abstract:
Viral systems have contributed tremendously to the understanding of eukaryotic molecular biology. The proportional pattern of retroviral RNA expression offers many clues into the alternative splicing of cellular transcripts. The MuSVts110 virus presents an unusual expression system, where the mechanistic combination of RNA splicing and cellular transformation can be physiologically manipulated. Splicing of MuSVts110 pre-mRNA occurs inefficiently (30%-50%) at 33°C or below and is subdued at 39°C (<5%). Like most alternatively spliced cellular and retroviral transcripts, the MuSVts110 pre-mRNA contains cis-acting intron and exon sequences that attenuate splicing. These include a splicing inhibitory sequence at the 3′ end of the MuSVts110 v-mos exon, called the E2 Distal Element (E2DE), and a sub-optimal 3′ splice site. The E2DE directly inhibits MuSVts110 RNA splicing in a sequence-specific fashion at 39°C but not at 28°C, potentially through the association of cellular factors. Inefficient MuSVts110 splicing is predominantly attributed to the utilization of multiple weak branchpoint sequences located between −113 and −34 nucleotides upstream of the 3′ splice site. The molecular control of MuSVts110 splicing, represented primarily by scattered multiple inefficient branchpoint sequences that are conditionally modulated by the E2DE at higher growth temperatures, is discussed.
Abstract:
Nrd1 is an essential yeast protein of unknown function that has an RNA recognition motif (RRM) in its carboxyl half and a putative RNA polymerase II-binding domain, the CTD-binding motif, at its amino terminus. Nrd1 mediates a severe reduction in pre-mRNA production from a reporter gene bearing an exogenous sequence element in its intron. The effect of the inserted element is highly sequence-specific and is accompanied by the appearance of 3′-truncated transcripts. We have proposed that Nrd1 binds to the exogenous sequence element in the nascent pre-mRNA during transcription, aided by the CTD-binding motif, and directs 3′-end formation a short distance downstream. Here we show that highly purified Nrd1 carboxyl half binds tightly to the RNA element in vitro with sequence specificity that correlates with the efficiency of cis-element-directed down-regulation in vivo. A large deletion in the CTD-binding motif blocks down-regulation but does not affect the essential function of Nrd1. Furthermore, a nonsense mutant allele that produces truncated Nrd1 protein lacking the RRM has a dominant-negative effect on down-regulation but not on cell growth. Viability of this and several other nonsense alleles of Nrd1 appears to require translational readthrough, which in one case is extremely efficient. Thus the CTD-binding motif of Nrd1 is important for pre-mRNA down-regulation but is not required for the essential function of Nrd1. In contrast, the RNA-binding activity of Nrd1 appears to be required both for down-regulation and for its essential function.
Abstract:
Mammalian capping enzymes are bifunctional proteins with both RNA 5′-triphosphatase and guanylyltransferase activities. The N-terminal 237-aa triphosphatase domain contains (I/V)HCXXGXXR(S/T)G, a sequence corresponding to the conserved active-site motif in protein tyrosine phosphatases (PTPs). Analysis of point mutants of mouse RNA 5′-triphosphatase identified the motif Cys and Arg residues and an upstream Asp as required for activity. Like PTPs, this enzyme was inhibited by iodoacetate and VO₄³⁻ and independent of Mg²⁺, providing additional evidence for phosphate removal from RNA 5′ ends by a PTP-like mechanism. The full-length, 597-aa mouse capping enzyme and the C-terminal guanylyltransferase fragment (residues 211–597), unlike the triphosphatase domain, bound poly(U) and were nuclear in transfected cells. RNA binding was increased by GTP, and a guanylylation-defective, active-site mutant was not affected. Ala substitution at positions required for the formation of the enzyme-GMP capping intermediate (R315, R530, K533, or N537) also eliminated poly(U) binding, while proteins with conservative substitutions at these sites retained binding but not guanylyltransferase activity. These results demonstrate that the guanylyltransferase domain of mammalian capping enzyme specifies nuclear localization and RNA binding. Association of capping enzyme with nascent transcripts may act in synergy with RNA polymerase II binding to ensure 5′ cap formation.
Abstract:
The endogenous clock that drives circadian rhythms is thought to communicate temporal information within the cell via cycling downstream transcripts. A transcript encoding a glycine-rich RNA-binding protein, Atgrp7, in Arabidopsis thaliana undergoes circadian oscillations with peak levels in the evening. The AtGRP7 protein also cycles with a time delay so that Atgrp7 transcript levels decline when the AtGRP7 protein accumulates to high levels. After AtGRP7 protein concentration has fallen to trough levels, Atgrp7 transcript starts to reaccumulate. Overexpression of AtGRP7 in transgenic Arabidopsis plants severely depresses cycling of the endogenous Atgrp7 transcript. These data establish both transcript and protein as components of a negative feedback circuit capable of generating a stable oscillation. AtGRP7 overexpression also depresses the oscillation of the circadian-regulated transcript encoding the related RNA-binding protein AtGRP8 but does not affect the oscillation of transcripts such as cab or catalase mRNAs. We propose that the AtGRP7 autoregulatory loop represents a “slave” oscillator in Arabidopsis that receives temporal information from a central “master” oscillator, conserves the rhythmicity by negative feedback, and transduces it to the output pathway by regulating a subset of clock-controlled transcripts.
Abstract:
TFIIH is a multifunctional RNA polymerase II transcription factor that possesses DNA-dependent ATPase, DNA helicase, and protein kinase activities. Previous studies have established that TFIIH enters the preinitiation complex and fulfills a critical role in initiation by catalyzing ATP-dependent formation of the open complex prior to synthesis of the first phosphodiester bond of nascent transcripts. In this report, we present direct evidence that TFIIH also controls RNA polymerase II activity at a postinitiation stage of transcription, by preventing premature arrest by very early elongation complexes just prior to their transition to stably elongating complexes. Unexpectedly, we observe that TFIIH is capable of entering the transcription cycle not only during assembly of the preinitiation complex but also after initiation and synthesis of as many as four to six phosphodiester bonds. These findings shed new light on the role of TFIIH in initiation and promoter escape and reveal an unanticipated flexibility in the ability of TFIIH to interact with RNA polymerase II transcription intermediates prior to, during, and immediately after initiation.
Abstract:
RNA editing and cytoplasmic male sterility are two important phenomena in higher plant mitochondria. To determine whether correlations might exist between the two, RNA editing in different tissues of Sorghum bicolor was compared employing reverse transcription–PCR and subsequent sequence analysis. In etiolated shoots, RNA editing of transcripts of plant mitochondrial atp6, atp9, nad3, nad4, and rps12 genes was identical among fertile or cytoplasmic male sterile plants. We then established a protocol for mitochondrial RNA isolation from plant anthers and pollen to include in these studies. Whereas RNA editing of atp9, nad3, nad4, and rps12 transcripts in anthers was similar to that in etiolated shoots, mitochondrial atp6 RNA editing was strongly reduced in anthers of the A3Tx398 male sterile line of S. bicolor. atp6 transcripts of wheat and selected plastid transcripts in S. bicolor showed normal RNA editing, indicating that loss of atp6 RNA editing is specific to cytoplasmic male sterile S. bicolor mitochondria. Restoration of fertility in F1 and F2 lines correlated with an increase in RNA editing of atp6 transcripts. Our data suggest that loss of atp6 RNA editing contributes to or causes cytoplasmic male sterility in S. bicolor. Further analysis of the mechanism of cell type-specific loss of atp6 RNA editing activity may advance our understanding of the mechanism of RNA editing.
Abstract:
Infection of vertebrate cells with alphaviruses normally leads to prodigious expression of virus-encoded genes and a dramatic inhibition of host protein synthesis. Recombinant Sindbis viruses and replicons have been useful as vectors for high level foreign gene expression, but the cytopathic effects of viral replication have limited their use to transient studies. We recently selected Sindbis replicons capable of persistent, noncytopathic growth in BHK cells and describe here a new generation of Sindbis vectors useful for long-term foreign gene expression based on such replicons. Foreign genes of interest as well as the dominant selectable marker puromycin N-acetyltransferase, which confers resistance to the drug puromycin, were expressed as subgenomic transcripts of noncytopathic replicons or defective-interfering genomes complemented in trans by a replicon. Based on these strategies, we developed vectors that can be initiated via either RNA or DNA transfection and analyzed them for their level and stability of foreign gene expression. Noncytopathic Sindbis vectors express reasonably high levels of protein in nearly every cell. These vectors should prove to be flexible tools for the rapid expression of heterologous genes under conditions in which cellular metabolism is not perturbed, and we illustrate their utility with a number of foreign proteins.
Abstract:
Small molecules that bind their biological receptors with high affinity and selectivity can be isolated from randomized pools of combinatorial libraries. RNA-protein interactions are important in many cellular functions, including transcription, RNA splicing, and translation. One example of such interactions is the mechanism of trans-activation of HIV-1 gene expression that requires the interaction of Tat protein with the trans-activation responsive region (TAR) RNA, a 59-base stem-loop structure located at the 5′ end of all nascent HIV-1 transcripts. Here we demonstrate the isolation of small TAR RNA-binding molecules from an encoded combinatorial library. We have made an encoded combinatorial tripeptide library of 24,389 possible members from d- and l-alpha amino acids on TentaGel resin. Using on-bead screening, we have identified a small family of mostly heterochiral tripeptides capable of structure-specific binding to the bulge loop of TAR RNA. In vitro binding studies reveal stereospecific discrimination when the best tripeptide ligand is compared with diastereomeric peptide sequences. In addition, the most strongly binding tripeptide was shown to suppress transcriptional activation by Tat protein in human cells with an IC50 of ≈50 nM. Our results indicate that tripeptide RNA ligands are cell permeable, nontoxic to cells, and capable of inhibiting expression of specific genes by interfering with RNA-protein interactions.
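The library size quoted in this abstract can be checked with simple arithmetic: 24,389 equals 29 cubed, which is consistent with 29 monomers being allowed at each of the three tripeptide positions. The sketch below reproduces that count. The abstract does not list the monomers, so the names in `MONOMERS` are hypothetical placeholders, and the assumption that every position draws from the same set is an inference from the arithmetic, not a statement from the source.

```python
from itertools import product


def library_size(n_monomers, length=3):
    """Number of distinct sequences when each position draws from n_monomers."""
    return n_monomers ** length


# HYPOTHETICAL placeholder names; the actual d- and l-amino acids used
# in the library are not given in the abstract.
MONOMERS = [f"aa{i:02d}" for i in range(1, 30)]


def enumerate_library(monomers, length=3):
    """Lazily yield every sequence of the given length as a tuple."""
    return product(monomers, repeat=length)


if __name__ == "__main__":
    print(library_size(len(MONOMERS)))
    print(sum(1 for _ in enumerate_library(MONOMERS)))
```

Enumerating lazily with `itertools.product` avoids holding all sequences in memory at once, which matters little at this size but scales better for longer peptides.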
Abstract:
5′-Capping is an early mRNA modification that has important consequences for downstream events in gene expression. We have isolated mammalian cDNAs encoding capping enzyme. They contain the sequence motifs characteristic of the nucleotidyl transferase superfamily. The predicted mouse and human enzymes consist of 597 amino acids and are 95% identical. Mouse cDNA directed synthesis of a guanylylated 68-kDa polypeptide that also contained RNA 5′-triphosphatase activity and catalyzed formation of RNA 5′-terminal GpppG. A haploid strain of Saccharomyces cerevisiae lacking mRNA guanylyltransferase was complemented for growth by the mouse cDNA. Conversion of Lys-294 in the KXDG-conserved motif eliminated both guanylylation and complementation, identifying it as the active site. The K294A mutant retained RNA 5′-triphosphatase activity, which was eliminated by N-terminal truncation. Full-length capping enzyme and an active C-terminal fragment bound to the elongating form and not to the initiating form of polymerase. The results document functional conservation of eukaryotic mRNA guanylyltransferases from yeast to mammals and indicate that the phosphorylated C-terminal domain of RNA polymerase II couples capping to transcription elongation. These results also explain the selective capping of RNA polymerase II transcripts.
Abstract:
Many examples of extreme virus resistance and posttranscriptional gene silencing of endogenous or reporter genes have been described in transgenic plants containing sense or antisense transgenes. In these cases of either cosuppression or antisense suppression, there appears to be induction of a surveillance system within the plant that specifically degrades both the transgene and target RNAs. We show that transforming plants with virus or reporter gene constructs that produce RNAs capable of duplex formation confers virus immunity or gene silencing on the plants. This was accomplished by using transcripts from one sense gene and one antisense gene colocated in the plant genome, a single transcript that has self-complementarity, or sense and antisense transcripts from genes brought together by crossing. A model is presented that is consistent with our data and those of other workers, describing the processes of induction and execution of posttranscriptional gene silencing.
Abstract:
Genes for σ-like factors of bacterial-type RNA polymerase have not been characterized from any multicellular eukaryotes, although they probably play a crucial role in the expression of plastid photosynthesis genes. We have cloned three distinct cDNAs, designated SIG1, SIG2, and SIG3, for polypeptides possessing amino acid sequences for domains conserved in σ70 factors of bacterial RNA polymerases from the higher plant Arabidopsis thaliana. Each gene is present as one copy per haploid genome without any additional sequences hybridized in the genome. Transient expression assays using green fluorescent protein demonstrated that N-terminal regions of the SIG2 and SIG3 ORFs could function as transit peptides for import into chloroplasts. Transcripts for all three SIG genes were detected in leaves but not in roots, and were induced in leaves of dark-adapted plants in rapid response to light illumination. Together with results of our previous analysis of tissue-specific regulation of transcription of plastid photosynthesis genes, these results indicate that expressed levels of the genes may influence transcription by regulating RNA polymerase activity in a green tissue-specific manner.
Abstract:
We have characterized two Saccharomyces cerevisiae proteins, Sro9p and Slf1p, which contain a highly conserved motif found in all known La proteins. Originally described as an autoantigen in patients with rheumatic disease, the La protein binds to newly synthesized RNA polymerase III transcripts. In yeast, the La protein homologue Lhp1p is required for the normal pathway of tRNA maturation and also stabilizes newly synthesized U6 RNA. We show that deletions in both SRO9 and SLF1 are not synthetically lethal with a deletion in LHP1, indicating that the three proteins do not function in a single essential process. Indirect immunofluorescence microscopy reveals that although Lhp1p is primarily localized to the nucleus, Sro9p is cytoplasmic. We demonstrate that Sro9p and Slf1p are RNA-binding proteins that associate preferentially with translating ribosomes. Consistent with a role in translation, strains lacking either Sro9p or Slf1p are less sensitive than wild-type strains to certain protein synthesis inhibitors. Thus, Sro9p and Slf1p define a new and possibly evolutionarily conserved class of La motif-containing proteins that may function in the cytoplasm to modulate mRNA translation.
Abstract:
The nucleolar localization elements (NoLEs) of U17 small nucleolar RNA (snoRNA), which is essential for rRNA processing and belongs to the box H/ACA snoRNA family, were analyzed by fluorescence microscopy. Injection of mutant U17 transcripts into Xenopus laevis oocyte nuclei revealed that deletion of stems 1, 2, and 4 of U17 snoRNA reduced but did not prevent nucleolar localization. The deletion of stem 3 had no adverse effect. Therefore, the hairpins of the hairpin–hinge–hairpin–tail structure formed by these stems are not absolutely critical for nucleolar localization of U17, nor are sequences within stems 1, 3, and 4, which may tether U17 to the rRNA precursor by base pairing. In contrast, box H and box ACA are major NoLEs; their combined substitution or deletion abolished nucleolar localization of U17 snoRNA. Mutation of just box H or just the box ACA region alone did not fully abolish the nucleolar localization of U17. This indicates that the NoLEs of the box H/ACA snoRNA family function differently from the bipartite NoLEs (conserved boxes C and D) of box C/D snoRNAs, where mutation of either box alone prevents nucleolar localization.