958 resultados para PROTEIN-CODING GENES
Resumo:
The discovery of long non-coding RNA (lncRNA) has dramatically altered our understanding of cancer. Here, we describe a comprehensive analysis of lncRNA alterations at transcriptional, genomic, and epigenetic levels in 5,037 human tumor specimens across 13 cancer types from The Cancer Genome Atlas. Our results suggest that the expression and dysregulation of lncRNAs are highly cancer type specific compared with protein-coding genes. Using the integrative data generated by this analysis, we present a clinically guided small interfering RNA screening strategy and a co-expression analysis approach to identify cancer driver lncRNAs and predict their functions. This provides a resource for investigating lncRNAs in cancer and lays the groundwork for the development of new diagnostics and treatments.
Resumo:
The human genome comprises roughly 20 000 protein coding genes. Proteins are the building material for cells and tissues, and proteins are functional compounds having an important role in many cellular responses, such as cell signalling. In multicellular organisms such as humans, cells need to communicate with each other in order to maintain a normal function of the tissues within the body. This complex signalling between and within cells is transferred by proteins and their post-translational modifications, one of the most important being phosphorylation. The work presented here concerns the development and use of tools for phosphorylation analysis. Mass spectrometers have become essential tools to study proteins and proteomes. In mass spectrometry oriented proteomics, proteins can be identified and their post-translational modifications can be studied. In this Ph.D. thesis the objectives were to improve the robustness of sample handling methods prior to mass spectrometry analysis for peptides and their phosphorylation status. The focus was to develop strategies that enable acquisition of more MS measurements per sample, higher quality MS spectra and simplified and rapid enrichment procedures for phosphopeptides. Furthermore, an objective was to apply these methods to characterize phosphorylation sites of phosphopeptides. In these studies a new MALDI matrix was developed which allowed more homogenous, intense and durable signals to be acquired when compared to traditional CHCA matrix. This new matrix along with other matrices was subsequently used to develop a new method that combines multiple spectra from different matrises from identical peptides. With this approach it was possible to identify more phosphopeptides than with conventional LC/ESI-MS/MS methods, and to use 5 times less sample. Also, phosphopeptide affinity MALDI target was prepared to capture and immobilise phosphopeptides from a standard peptide mixture while maintaining their spatial orientation. In addition a new protocol utilizing commercially available conductive glass slides was developed that enabled fast and sensitive phosphopeptide purification. This protocol was applied to characterize the in vivo phosphorylation of a signalling protein, NFATc1. Evidence for 12 phosphorylation sites were found, and many of those were found in multiply phosphorylated peptides
Resumo:
Sugarcane is the most important crop for sugar industry and raw material for bioethanol. Here we present a quantitative analysis of the gene content from publicly available sugarcane ESTs. The current sugarcane EST collection sampled orthologs for ~58 % of the closely-related sorghum proteome, suggesting that more than 10,000 sugarcane coding-genes remain undiscovered. Moreover the existence of more than 2,000 ncRNAs conserved between sugarcane and sorghum was revealed, among which over 500 are also detected in rice, supporting the existence of hundreds of conserved ncRNAs in grasses. New efforts towards sugarcane transcriptome sequencing were needed to sample the missing coding-genes as well as to expand the catalog of ncRNAs. © 2012 Springer Science+Business Media, LLC.
Resumo:
Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES)
Resumo:
Pós-graduação em Microbiologia Agropecuária - FCAV
Resumo:
Abstract Background The mitochondrial DNA of kinetoplastid flagellates is distinctive in the eukaryotic world due to its massive size, complex form and large sequence content. Comprised of catenated maxicircles that contain rRNA and protein-coding genes and thousands of heterogeneous minicircles encoding small guide RNAs, the kinetoplast network has evolved along with an extreme form of mRNA processing in the form of uridine insertion and deletion RNA editing. Many maxicircle-encoded mRNAs cannot be translated without this post-transcriptional sequence modification. Results We present the complete sequence and annotation of the Trypanosoma cruzi maxicircles for the CL Brener and Esmeraldo strains. Gene order is syntenic with Trypanosoma brucei and Leishmania tarentolae maxicircles. The non-coding components have strain-specific repetitive regions and a variable region that is unique for each strain with the exception of a conserved sequence element that may serve as an origin of replication, but shows no sequence identity with L. tarentolae or T. brucei. Alternative assemblies of the variable region demonstrate intra-strain heterogeneity of the maxicircle population. The extent of mRNA editing required for particular genes approximates that seen in T. brucei. Extensively edited genes were more divergent among the genera than non-edited and rRNA genes. Esmeraldo contains a unique 236-bp deletion that removes the 5'-ends of ND4 and CR4 and the intergenic region. Esmeraldo shows additional insertions and deletions outside of areas edited in other species in ND5, MURF1, and MURF2, while CL Brener has a distinct insertion in MURF2. Conclusion The CL Brener and Esmeraldo maxicircles represent two of three previously defined maxicircle clades and promise utility as taxonomic markers. Restoration of the disrupted reading frames might be accomplished by strain-specific RNA editing. Elements in the non-coding region may be important for replication, transcription, and anchoring of the maxicircle within the kinetoplast network.
Resumo:
Abstract Background RNAs transcribed from intronic regions of genes are involved in a number of processes related to post-transcriptional control of gene expression. However, the complement of human genes in which introns are transcribed, and the number of intronic transcriptional units and their tissue expression patterns are not known. Results A survey of mRNA and EST public databases revealed more than 55,000 totally intronic noncoding (TIN) RNAs transcribed from the introns of 74% of all unique RefSeq genes. Guided by this information, we designed an oligoarray platform containing sense and antisense probes for each of 7,135 randomly selected TIN transcripts plus the corresponding protein-coding genes. We identified exonic and intronic tissue-specific expression signatures for human liver, prostate and kidney. The most highly expressed antisense TIN RNAs were transcribed from introns of protein-coding genes significantly enriched (p = 0.002 to 0.022) in the 'Regulation of transcription' Gene Ontology category. RNA polymerase II inhibition resulted in increased expression of a fraction of intronic RNAs in cell cultures, suggesting that other RNA polymerases may be involved in their biosynthesis. Members of a subset of intronic and protein-coding signatures transcribed from the same genomic loci have correlated expression patterns, suggesting that intronic RNAs regulate the abundance or the pattern of exon usage in protein-coding messages. Conclusion We have identified diverse intronic RNA expression patterns, pointing to distinct regulatory roles. This gene-oriented approach, using a combined intron-exon oligoarray, should permit further comparative analysis of intronic transcription under various physiological and pathological conditions, thus advancing current knowledge about the biological functions of these noncoding RNAs.
Resumo:
TbRRM1 of Trypanosoma brucei is a nucleoprotein that was previously identified in a search for splicing factors in T. brucei. We show that TbRRM1 associates with mRNAs and with the auxiliary splicing factor polypyrimidine tract-binding protein 2, but not with components of the core spliceosome. TbRRM1 also interacts with several retrotransposon hot spot (RHS) proteins and histones. RNA immunoprecipitation of a tagged form of TbRRM1 from procyclic (insect) form trypanosomes identified ca. 1,500 transcripts that were enriched and 3,000 transcripts that were underrepresented compared to cellular mRNA. Enriched transcripts encoded RNA-binding proteins, including TbRRM1 itself, several RHS transcripts, mRNAs with long coding regions, and a high proportion of stage-regulated mRNAs that are more highly expressed in bloodstream forms. Transcripts encoding ribosomal proteins, other factors involved in translation, and procyclic-specific transcripts were underrepresented. Knockdown of TbRRM1 by RNA interference caused widespread changes in mRNA abundance, but these changes did not correlate with the binding of the protein to transcripts, and most splice sites were unchanged, negating a general role for TbRRM1 in splice site selection. When changes in mRNA abundance were mapped across the genome, regions with many downregulated mRNAs were identified. Two regions were analyzed by chromatin immunoprecipitation, both of which exhibited increases in nucleosome occupancy upon TbRRM1 depletion. In addition, subjecting cells to heat shock resulted in translocation of TbRRM1 to the cytoplasm and compaction of chromatin, consistent with a second role for TbRRM1 in modulating chromatin structure. IMPORTANCE: Trypanosoma brucei, the parasite that causes human sleeping sickness, is transmitted by tsetse flies. The parasite progresses through different life cycle stages in its two hosts, altering its pattern of gene expression in the process. In trypanosomes, protein-coding genes are organized as polycistronic units that are processed into monocistronic mRNAs. Since genes in the same unit can be regulated independently of each other, it is believed that gene regulation is essentially posttranscriptional. In this study, we investigated the role of a nuclear RNA-binding protein, TbRRM1, in the insect stage of the parasite. We found that TbRRM1 binds nuclear mRNAs and also affects chromatin status. Reduction of nuclear TbRRM1 by RNA interference or heat shock resulted in chromatin compaction. We propose that TbRRM1 regulates RNA polymerase II-driven gene expression both cotranscriptionally, by facilitating transcription and efficient splicing, and posttranscriptionally, via its interaction with nuclear mRNAs.
Resumo:
Naturally occurring genetic variants confer susceptibility to disease in the human population, including in testicular germ cell tumor development. Disease susceptibility loci for testicular germ cell tumors have been identified by genetic mapping in humans and mice. However, the identity of many of the susceptibility genes remains unclear. My study utilized a chromosome substitution strain, the 129.MOLF-Chr 19 (or M19 strain), to identify candidate testicular germ cell tumor susceptibility genes. Males of this strain have a high incidence of germ cell tumors in the testes. By forward genetic approaches, five susceptibility loci were fine-mapped and the genetic interactions were dissected. In addition, I identified three protein-coding genes and one micro-RNA as testicular tumor susceptibility genes by genomic screening. Using reverse genetic approaches, I verified one of the candidates, Splicing factor 1, as a modifier of testicular tumor. Deficiency of SF1 significantly reduces the incidence of testicular tumors in mice. This study highlights the advantage of the 129.MOLF-Chr 19 consomic strain in disease gene identification and validation. It also sets the stage to elucidate the molecular mechanisms of tumorigenesis in the testis. ^
Resumo:
Intracellular transport is essential for morphogenesis and functioning of the cell. The kinesin superfamily proteins (KIFs) have been shown to transport membranous organelles and protein complexes in a microtubule- and ATP-dependent manner. More than 30 KIFs have been reported in mice. However, the nomenclature of KIFs has not been clearly established, resulting in various designations and redundant names for a single KIF. Here, we report the identification and classification of all KIFs in mouse and human genome transcripts. Previously unidentified murine KIFs were found by a PCR-based search. The identification of all KIFs was confirmed by a database search of the total human genome. As a result, there are a total of 45 KIFs. The nomenclature of all KIFs is presented. To understand the function of KIFs in intracellular transport in a single tissue, we focused on the brain. The expression of 38 KIFs was detected in brain tissue by Northern blotting or PCR using cDNA. The brain, mainly composed of highly differentiated and polarized cells such as neurons and glia, requires a highly complex intracellular transport system as indicated by the increased number of KIFs for their sophisticated functions. It is becoming increasingly clear that the cell uses a number of KIFs and tightly controls the direction, destination, and velocity of transportation of various important functional molecules, including mRNA. This report will set the foundation of KIF and intracellular transport research.
Resumo:
The insulin-like growth factor 2 antisense (Igf2as) gene is part of the Ins-Igf2-H19 imprinted gene cluster. The function of the paternally expressed Igf2as is still elusive. In our previous work, we showed that Igf2as transcripts were located in the cytoplasm of C2C12 mouse myoblast cells, associated with polysomes and polyadenylated suggesting that Igf2as is protein coding. In the present work, the protein coding capacity of Igf2as was investigated. We demonstrate for the first time the existence of a polypeptide translated from an Igf2as construct. Furthermore, an RNA-Seq analysis was performed using RNA prepared from skeletal muscles of newborn wild-type and ∆ DMR1-U2 mice to further elucidate the function of Igf2as transcripts. We found no evidence for a regulatory role of Igf2as in the imprinted gene cluster. Interestingly, the RNA-Seq analysis indicated that Igf2as plays a role in the energy metabolism, the cell cycle, histone acetylation and muscle contraction pathways. Our Igf2as investigations further elucidated that there are two distinct Igf2as transcripts corresponding to two putative ORFs.
Resumo:
Sequence diversity in the coat protein coding region of Australian strains of Johnsongrass mosaic virus (JGMV) was investigated. Field isolates were sampled during a seven year period from Johnsongrass, sorghum and corn across the northern grain growing region. The 23 isolates were found to have greater than 94% nucleotide and amino acid sequence identity. The Australian isolates and two strains from the U.S.A. had about 90% nucleotide sequence identity and were between 19 and 30% different in the N-terminus of the coat protein. Two amino acid residues were found in the core region of the coat protein in isolates obtained from sorghum having the Krish gene for JGMV resistance that differed from those found in isolates from other hosts which did not have this single dominant resistance gene. These amino acid changes may have been responsible for overcoming the resistance conferred by the Krish gene for JGMV resistance in sorghum. The identification of these variable regions was essential for the development of durable pathogen-derived resistance to JGMV in sorghum.
Resumo:
At present a complete mtDNA sequence has been reported for only two hymenopterans, the Old World honey bee, Apis mellifera and the sawfly Perga condei. Among the bee group, the tribe Meliponini (stingless bees) has some distinction due to its Pantropical distribution, great number of species and large importance as main pollinators in several ecosystems, including the Brazilian rain forest. However few molecular studies have been conducted on this group of bees and few sequence data from mitochondrial genomes have been described. In this project, we PCR amplified and sequenced 78% of the mitochondrial genome of the stingless bee Melipona bicolor (Apidae, Meliponini). The sequenced region contains all of the 13 mitochondrial protein-coding genes, 18 of 22 tRNA genes, and both rRNA genes (one of them was partially sequenced). We also report the genome organization (gene content and order), gene translation, genetic code, and other molecular features, such as base frequencies, codon usage, gene initiation and termination. We compare these characteristics of M. bicolor to those of the mitochondrial genome of A. mellifera and other insects. A highly biased A+T content is a typical characteristic of the A. mellifera mitochondrial genome and it was even more extreme in that of M. bicolor. Length and compositional differences between M. bicolor and A. mellifera genes were detected and the gene order was compared. Eleven tRNA gene translocations were observed between these two species. This latter finding was surprising, considering the taxonomic proximity of these two bee tribes. The tRNA Lys gene translocation was investigated within Meliponini and showed high conservation across the Pantropical range of the tribe.
Resumo:
Cryptosporidium spp. are important cause of enteric disease in humans, but may also infect animals. This study describes the relative frequency of several Cryptosporidium species found in human specimens from HIV infected patients in the São Paulo municipality obtained from January to July 2007. Sequence analysis of the products of nested-PCR based on small subunit rRNA and Cryptosporidium oocyst wall protein coding genes revealed 17 (63.0%) isolates of C. hominis, four (14.8%) C. parvum, five (18.5%) C. felis and one (3.7%) C. canis. These findings suggest that, in urban environments of Brazil, the cat adapted C. felis may play a potential role in the zoonotic transmission of cryptosporidiosis whereas the anthroponotic transmission of cryptosporidiosis caused by C. hominis seems to predominate.