8 resultados para RNA-seq data
em DigitalCommons@The Texas Medical Center
Resumo:
My dissertation focuses on two aspects of RNA sequencing technology. The first is the methodology for modeling the overdispersion inherent in RNA-seq data for differential expression analysis. This aspect is addressed in three sections. The second aspect is the application of RNA-seq data to identify the CpG island methylator phenotype (CIMP) by integrating datasets of mRNA expression level and DNA methylation status. Section 1: The cost of DNA sequencing has reduced dramatically in the past decade. Consequently, genomic research increasingly depends on sequencing technology. However it remains elusive how the sequencing capacity influences the accuracy of mRNA expression measurement. We observe that accuracy improves along with the increasing sequencing depth. To model the overdispersion, we use the beta-binomial distribution with a new parameter indicating the dependency between overdispersion and sequencing depth. Our modified beta-binomial model performs better than the binomial or the pure beta-binomial model with a lower false discovery rate. Section 2: Although a number of methods have been proposed in order to accurately analyze differential RNA expression on the gene level, modeling on the base pair level is required. Here, we find that the overdispersion rate decreases as the sequencing depth increases on the base pair level. Also, we propose four models and compare them with each other. As expected, our beta binomial model with a dynamic overdispersion rate is shown to be superior. Section 3: We investigate biases in RNA-seq by exploring the measurement of the external control, spike-in RNA. This study is based on two datasets with spike-in controls obtained from a recent study. We observe an undiscovered bias in the measurement of the spike-in transcripts that arises from the influence of the sample transcripts in RNA-seq. Also, we find that this influence is related to the local sequence of the random hexamer that is used in priming. We suggest a model of the inequality between samples and to correct this type of bias. Section 4: The expression of a gene can be turned off when its promoter is highly methylated. Several studies have reported that a clear threshold effect exists in gene silencing that is mediated by DNA methylation. It is reasonable to assume the thresholds are specific for each gene. It is also intriguing to investigate genes that are largely controlled by DNA methylation. These genes are called “L-shaped” genes. We develop a method to determine the DNA methylation threshold and identify a new CIMP of BRCA. In conclusion, we provide a detailed understanding of the relationship between the overdispersion rate and sequencing depth. And we reveal a new bias in RNA-seq and provide a detailed understanding of the relationship between this new bias and the local sequence. Also we develop a powerful method to dichotomize methylation status and consequently we identify a new CIMP of breast cancer with a distinct classification of molecular characteristics and clinical features.
Resumo:
Cells must rapidly sense and respond to a wide variety of potentially cytotoxic external stressors to survive in a constantly changing environment. In a search for novel genes required for stress tolerance in Saccharomyces cerevisiae, we identified the uncharacterized open reading frame YER139C as a gene required for growth at 37 degrees C in the presence of the heat shock mimetic formamide. YER139C encodes the closest yeast homolog of the human RPAP2 protein, recently identified as a novel RNA polymerase II (RNAPII)-associated factor. Multiple lines of evidence support a role for this gene family in transcription, prompting us to rename YER139C RTR1 (regulator of transcription). The core RNAPII subunits RPB5, RPB7, and RPB9 were isolated as potent high-copy-number suppressors of the rtr1Delta temperature-sensitive growth phenotype, and deletion of the nonessential subunits RPB4 and RPB9 hypersensitized cells to RTR1 overexpression. Disruption of RTR1 resulted in mycophenolic acid sensitivity and synthetic genetic interactions with a number of genes involved in multiple phases of transcription. Consistently, rtr1Delta cells are defective in inducible transcription from the GAL1 promoter. Rtr1 constitutively shuttles between the cytoplasm and nucleus, where it physically associates with an active RNAPII transcriptional complex. Taken together, our data reveal a role for members of the RTR1/RPAP2 family as regulators of core RNAPII function.
Resumo:
MuSVts110 is a conditionally defective mutant of Moloney murine sarcoma virus which undergoes a novel tmperature-dependent splice event at growth temperatures of 33$\sp\circ$C or lower. Relative to wild-type MuSV-124, MuSVts110 contains a 1487 base deletion spanning from the 3$\sp\prime$ end of the p30 gag coding region to just downstream of the first v-mos initiation codon. As a result, the gag and mos genes are fused out of frame and no v-mos protein is expressed. However, upon a shift to 33$\sp\circ$C or lower, a splice event occurs which removes 431 bases, realigns the gag and mos genes, and allows read-through translation of a P85gag-mos transforming protein. Interestingly, while the cryptic splice sites utilized in MuSVts110 are present and unaltered in MuSV-124, they are never used. Due to the 1487 base deletion, the MuSV-124 intron was reduced from 1919 to 431 bases suggesting that intron size might be involved in the activation of these cryptic splice sites in MuSVts110. Since the splicing phenotype of the MuSVts110 equivalent (TS32 DNA) which contains the identical 1487 base deletion introduced into otherwise wild-type MuSV-124 DNA, was indistinguishable from authentic MuSVts110, it was concluded that this deletion alone is responsible for activation of the cryptic splice sites used in MuSVts110. These results also confirmed that thermodependent splicing is an intrinsic property of the viral RNA and not due to some cellular defect. Furthermore, analysis of gag gene deletion and frameshift MuSVts110 mutants demonstrated that viral gag gene proteins do not play a role in regulation of MuSVts110 splicing. Instead, cis-acting viral sequences appear to mediate regulation of the splice event.^ Our initial observation that truncation of the MuSVts110 transcript, leaving only residual amounts of the flanking exon sequences, completely abolished splicing activity argued that exon sequences might participate in the regulation of the splice event.^ Analysis of exon sequence involvement has also identified cis-acting sequences important in the thermodependence of the splice event. Data suggest that regulation of the MuSVts110 splice event involves multiple interactions between specific intron and exon sequences and spliceosome components which together limit splicing activity to temperatures of 33$\sp\circ$C or lower while simultaneously restricting splicing to a maximum of 50% efficiency. (Abstract shortened with permission of author.) ^
Resumo:
Untreated AKR mice develop spontaneous thymic lymphomas by 6-12 months of age. Lymphoma development is accelerated when young mice are injected with the carcinogen N-methyl-N-nitrosourea (MNU). Selected molecular and cellular events were compared during the latent period preceding "spontaneous" (retrovirally-induced) and MNU-induced thymic lymphoma development in AKR mice. These studies were undertaken to test the hypothesis that thymic lymphomas induced in the same inbred mouse strain by endogenous retroviruses and by a chemical carcinogen develop by different mechanisms.^ Immunofluorescence analysis of differentiation antigens showed that most MNU-induced lymphomas express an immature CD4-8+ profile. In contrast, spontaneous lymphomas represent each of the major lymphocyte subsets. These data suggest involvement of different target populations in MNU-induced and spontaneous lymphomas. Analyses at intervals after MNU treatment revealed selective expansion of the CD4-8+ J11d+ thymocyte subset at 8-10 weeks post-MNU in 68% of the animals examined, suggesting that these cells are targets for MNU-induced lymphomagenesis. Untreated age-matched animals showed no selective expansion of thymocyte subsets.^ Previous data have shown that both spontaneous and MNU-induced lymphomas are monoclonal or oligoclonal. Distinct rearrangement patterns of the J$\sb2$ region of the T-cell receptor $\beta$-chain showed emergence of clonal thymocyte populations beginning at 6-7 weeks after MNU treatment. However, lymphocytes from untreated animals showed no evidence of clonal expansion at the time intervals investigated.^ Activation of c-myc frequently occurs during development of B- and T- cell lymphomas. Both spontaneous and MNU-induced lymphomas showed increased c-myc transcript levels. Increased c-myc transcription was first detected at 6 weeks post-MNU, and persisted throughout the latent period. However, untreated animals showed no increases in c-myc transcripts at the time intervals examined. Another nuclear oncogene, c-fos, did not display a similar change in RNA transcription during the latent period.^ These results supports the hypothesis that MNU-induced and spontaneous tumors develop by multi-step pathways which are distinct with respect to the target cell population affected. Clonal emergence and c-myc deregulation are important steps in the development of both MNU-induced and spontaneous tumors, but the onset of these events is later in spontaneous tumor development. ^
Resumo:
Wilms tumor (WT) is an embryonal renal tumor with a heterogeneous genetic etiology that serves as a valuable model for studying tumorigenesis. Biallelic inactivation of the tumor suppressor gene WT1, a zinc-finger transcriptional regulator located at 11p13, is critical for the development of some Wilms tumors. Interestingly, WT1 genomic analysis has demonstrated mutations in less than 20% of WT cases. This suggests either other genes play a more major role in Wilms tumorigenesis or WT1 is functionally altered by mechanisms other than DNA mutation. Previous observations in rat and in WT xenograft cell lines have suggested that abnormal WT1 RNA processing (exon 6 RNA editing and aberrant exon 2 splicing, respectively) is a potential mechanism of altering WT1 function in the absence of a WT1 DNA mutation. However, the role of this abnormal RNA processing has not previously been assessed in primary Wilms tumors. ^ To test the hypothesis that abnormal WT1 RNA processing is a mechanism of WT1alteration during tumor development, WT1 RNA from 85 primary tumors was analyzed using reverse transcription and polymerase chain reaction amplification (RT-PCR). Although no evidence for WT1 RNA editing was observed, variable levels (5% to 50%) of aberrant WT1 exon 2 splicing were detected for 11 tumors in the absence of a detectable WT1 DNA mutation. Also, alteration of normal WT1 alternative splicing, observed as RNA isoform loss, was detected in five tumors with no apparent WT1 genomic alteration, although no consistent pattern of RNA isoform loss was detected. This abnormal WT1 splicing, detected by either loss of exon 2 from some of the transcripts or loss of RNA isoforms, is statistically correlated with relapse (p = 0.005). These studies demonstrate that abnormal WT1 RNA processing is not a common mechanism of abrogating normal WT1 function in primary tumors. However, in those cases in which abnormal WTI splicing is present, these data indicate that it may serve as a useful prognostic marker for relapse in WT patients. ^
Resumo:
mRNA 3′ polyadenylation is central to mRNA biogenesis in prokaryotes and eukaryotes, and is implicated in numerous aspects of mRNA metabolism, including efficiency of mRNA export from the nucleus, message stability, and initiation of translation. However, due to the great complexity of the eukaryotic polyadenylation apparatus, the mechanisms of RNA 3 ′ end processing have remained elusive. Although the RNA processing reactions leading to polyadenylated messenger RNA have been studied in many systems, and much progress has been made, a complete understanding of the biochemistry of the poly(A) polymerase enzyme is still lacking. My research uses Vaccinia virus as a model system to gain a better understanding of this complicated polyadenylation process, which consist of RNA binding, catalysis and polymerase translocation. ^ Vaccinia virus replicates in the cytoplasm of its host cell, so it must employ its own poly(A) polymerase (PAP), a heterodimer of two virus encoded proteins, VP55 and VP39. VP55 is the catalytic subunit, adding 30 adenylates to a non-polyadenylated RNA in a rapid processive manner before abruptly changing to a slow, non-processive mode of adenylate addition and dissociating from the RNA. VP39 is the stimulatory subunit. It has no polyadenylation catalytic activity by itself, but when associated with VP55 it facilitates the semi-processive synthesis of tails several hundred adenylates in length. ^ Oligonucleotide selection and competition studies have shown that the heterodimer binds a minimal motif of (rU)2 (N)25 U, the “heterodimer binding motif”, within an oligonucleotide, and its primer selection for polyadenylation is base-type specific. ^ Crosslinking studies using photosensitive uridylate analogs show that within a VP55-VP39-primer ternary complex, VP55 comes into contact with all three required uridylates, while VP39 only contacts the downstream uridylate. Further studies, using a backbone-anchored photosensitive crosslinker show that both PAP subunits are in close proximity to the downstream −10 to −21 region of 50mer model primers containing the heterodimer binding motif. This equal crosslinking to both subunits suggests that the dimerization of VP55 and VP39 creates either a cleft or a channel between the two subunits through which this region of RNA passes. ^ Peptide mapping studies of VP39 covalently crosslinked to the oligonucleotide have identified residue R107 as the amino acid in close proximity to the −10 uridylate. This helps us project a conceptual model onto the known physical surface of this subunit. In the absence of any tertiary structural data for VP55, we have used a series of oligonucleotide selection assays, as well as crosslinking, nucleotide transfer assays, and gel shift assays to gain insight into the requirements for binding, polyadenylation and translocation. Collectively, these data allow us to put together a comprehensive model of the structure and function of the polyadenylation ternary complex consisting of VP39, VP55 and RNA. ^
Resumo:
Ecteinascidin 743 (Et-743), which is a novel DNA minor groove alkylator with a unique spectrum of antitumor activity, is currently being evaluated in phase II/III clinical trials. Although the precise molecular mechanisms responsible for the observed antitumor activity are poorly understood, recent data suggests that post-translational modifications of RNA polymerase II Large Subunit (RNAPII LS) may play a central role in the cellular response to this promising anticancer agent. The stalling of an actively transcribing RNAPII LS at Et-743-DNA adducts is the initial cellular signal for transcription-coupled nucleotide excision repair (TC-NER). In this manner, Et-743 poisons TC-NER and produces DNA single strand breaks. Et-743 also inhibits the transcription and RNAPII LS-mediated expression of selected genes. Because the poisoning of TC-NER and transcription inhibition are critical components of the molecular response to Et-743 treatment, we have investigated if changes in RNAPII LS contribute to the disruption of these two cellular pathways. In addition, we have studied changes in RNAPII LS in two tumors for which clinical responses were reported in phase I/II clinical trials: renal cell carcinoma and Ewing's sarcoma. Our results demonstrate that Et-743 induces degradation of the RNAPII LS that is dependent on active transcription, a functional 26S proteasome, and requires functional TC-NER, but not global genome repair. Additionally, we have provided the first experimental data indicating that degradation of RNAPII LS might lead to the inhibition of activated gene transcription. A set of studies performed in isogenic renal carcinoma cells deficient in von Hippel-Lindau protein, which is a ubiquitin-E3-ligase for RNAPII LS, confirmed the central role of RNAPII LS degradation in the sensitivity to Et-743. Finally, we have shown that RNAPII LS is also degraded in Ewing's sarcoma tumors following Et-743 treatment and provide data to suggest that this event plays a role in decreased expression of the Ewing's sarcoma oncoprotein, EWS-Fli1. Altogether, these data implicate degradation of RNAPII LS as a critical event following Et-743 exposure and suggest that the clinical activity observed in renal carcinoma and Ewing's sarcoma may be mediated by disruption of molecular pathways requiring a fully functional RNAPII LS. ^
Resumo:
The purpose of this work was to examine the possible mechanisms for the regulation of cytochrome c gene expression in response to increased contractile activity in rat skeletal muscle. The working hypothesis was that increased contractile activity enhances cytochrome c gene expression through a cis-element. A 110% increase in cytochrome c mRNA concentration was observed in tibialis anterior (TA) muscle after 9 days of chronic stimulation. Similar difference (120%) exists between soleus (SO) muscle of higher contractile activity and white vastus lateralis (WV) muscle of lower contractile activity. These results suggest that the endogenous cytochrome c gene expression is regulated by contractile activity. Cytochrome c-reporter genes were injected into skeletal muscles to identify the cis-element that is responsible for the regulation. Although the data was inconclusive, part of it suggested the importance of the 3$\sp\prime$-untranslated region (3$\sp\prime$-UTR) in mediating the response to increased contractile activity.^ RNA gel mobility shift (GMSA) and ultraviolet (UV) cross-linking assays revealed specific RNA-protein interaction in a 50-nucleotide region of the 3$\sp\prime$-UTR in unstimulated TA muscle. Computer analysis predicted a stem-loop structure of 17 nucleotides, which provides a structural basis for RNA-protein interaction. These 17 nucleotides are 100% conserved among rat, mouse and human cytochrome c genes and their 13 pseudogenes, suggesting a functional role for this region. The RNA-protein interaction was significantly less in highly active SO muscle than in inactive WV muscle and was dramatically decreased in stimulated TA muscle due to a protein inhibitor(s) associated with ribosome. It is possible that cytochrome c mRNAs undergoing translation are subject to a compartmentalized regulatory influence.^ The conclusion from these results is that increases in contractile activity induce or activate a protein inhibitor(s) associated with ribosome in rat skeletal muscle. The inhibitor decreases RNA-protein interaction in the 3$\sp\prime$-UTR of cytochrome c mRNA, which may result in increased mRNA stability and/or translation. ^