919 resultados para Hsp70 transcript
Resumo:
The goals of the human genome project did not include sequencing of the heterochromatic regions. We describe here an initial sequence of 1.1 Mb of the short arm of human chromosome 21 (HSA21p), estimated to be 10% of 21p. This region contains extensive euchromatic-like sequence and includes on average one transcript every 100 kb. These transcripts show multiple inter- and intrachromosomal copies, and extensive copy number and sequence variability. The sequencing of the "heterochromatic" regions of the human genome is likely to reveal many additional functional elements and provide important evolutionary information.
Resumo:
This report presents systematic empirical annotation of transcript products from 399 annotated protein-coding loci across the 1% of the human genome targeted by the Encyclopedia of DNA elements (ENCODE) pilot project using a combination of 5' rapid amplification of cDNA ends (RACE) and high-density resolution tiling arrays. We identified previously unannotated and often tissue- or cell-line-specific transcribed fragments (RACEfrags), both 5' distal to the annotated 5' terminus and internal to the annotated gene bounds for the vast majority (81.5%) of the tested genes. Half of the distal RACEfrags span large segments of genomic sequences away from the main portion of the coding transcript and often overlap with the upstream-annotated gene(s). Notably, at least 20% of the resultant novel transcripts have changes in their open reading frames (ORFs), most of them fusing ORFs of adjacent transcripts. A significant fraction of distal RACEfrags show expression levels comparable to those of known exons of the same locus, suggesting that they are not part of very minority splice forms. These results have significant implications concerning (1) our current understanding of the architecture of protein-coding genes; (2) our views on locations of regulatory regions in the genome; and (3) the interpretation of sequence polymorphisms mapping to regions hitherto considered to be "noncoding," ultimately relating to the identification of disease-related sequence alterations.
Resumo:
Understanding the molecular mechanisms responsible for the regulation of the transcriptome present in eukaryotic cells isone of the most challenging tasks in the postgenomic era. In this regard, alternative splicing (AS) is a key phenomenoncontributing to the production of different mature transcripts from the same primary RNA sequence. As a plethora ofdifferent transcript forms is available in databases, a first step to uncover the biology that drives AS is to identify thedifferent types of reflected splicing variation. In this work, we present a general definition of the AS event along with anotation system that involves the relative positions of the splice sites. This nomenclature univocally and dynamically assignsa specific ‘‘AS code’’ to every possible pattern of splicing variation. On the basis of this definition and the correspondingcodes, we have developed a computational tool (AStalavista) that automatically characterizes the complete landscape of ASevents in a given transcript annotation of a genome, thus providing a platform to investigate the transcriptome diversityacross genes, chromosomes, and species. Our analysis reveals that a substantial part—in human more than a quarter—ofthe observed splicing variations are ignored in common classification pipelines. We have used AStalavista to investigate andto compare the AS landscape of different reference annotation sets in human and in other metazoan species and found thatproportions of AS events change substantially depending on the annotation protocol, species-specific attributes, andcoding constraints acting on the transcripts. The AStalavista system therefore provides a general framework to conductspecific studies investigating the occurrence, impact, and regulation of AS.
Resumo:
Background: We present the results of EGASP, a community experiment to assess the state-ofthe-art in genome annotation within the ENCODE regions, which span 1% of the human genomesequence. The experiment had two major goals: the assessment of the accuracy of computationalmethods to predict protein coding genes; and the overall assessment of the completeness of thecurrent human genome annotations as represented in the ENCODE regions. For thecomputational prediction assessment, eighteen groups contributed gene predictions. Weevaluated these submissions against each other based on a ‘reference set’ of annotationsgenerated as part of the GENCODE project. These annotations were not available to theprediction groups prior to the submission deadline, so that their predictions were blind and anexternal advisory committee could perform a fair assessment.Results: The best methods had at least one gene transcript correctly predicted for close to 70%of the annotated genes. Nevertheless, the multiple transcript accuracy, taking into accountalternative splicing, reached only approximately 40% to 50% accuracy. At the coding nucleotidelevel, the best programs reached an accuracy of 90% in both sensitivity and specificity. Programsrelying on mRNA and protein sequences were the most accurate in reproducing the manuallycurated annotations. Experimental validation shows that only a very small percentage (3.2%) of the selected 221 computationally predicted exons outside of the existing annotation could beverified.Conclusions: This is the first such experiment in human DNA, and we have followed thestandards established in a similar experiment, GASP1, in Drosophila melanogaster. We believe theresults presented here contribute to the value of ongoing large-scale annotation projects and shouldguide further experimental methods when being scaled up to the entire human genome sequence.
Resumo:
Background: Alternatively spliced exons play an important role in the diversification of gene function in most metazoans and are highly regulated by conserved motifs in exons and introns. Two contradicting properties have been associated to evolutionary conserved alternative exons: higher sequence conservation and higher rate of non-synonymous substitutions, relative to constitutive exons. In order to clarify this issue, we have performed an analysis of the evolution of alternative and constitutive exons, using a large set of protein coding exons conserved between human and mouse and taking into account the conservation of the transcript exonic structure. Further, we have also defined a measure of the variation of the arrangement of exonic splicing enhancers (ESE-conservation score) to study the evolution of splicing regulatory sequences. We have used this measure to correlate the changes in the arrangement of ESEs with the divergence of exon and intron sequences. Results: We find evidence for a relation between the lack of conservation of the exonic structure and the weakening of the sequence evolutionary constraints in alternative and constitutive exons. Exons in transcripts with non-conserved exonic structures have higher synonymous (dS) and non-synonymous (dN) substitution rates than exons in conserved structures. Moreover, alternative exons in transcripts with non-conserved exonic structure are the least constrained in sequence evolution, and at high EST-inclusion levels they are found to be very similar to constitutive exons, whereas alternative exons in transcripts with conserved exonic structure have a dS significantly lower than average at all EST-inclusion levels. We also find higher conservation in the arrangement of ESEs in constitutive exons compared to alternative ones. Additionally, the sequence conservation at flanking introns remains constant for constitutive exons at all ESE-conservation values, but increases for alternative exons at high ESE-conservation values. Conclusion: We conclude that most of the differences in dN observed between alternative and constitutive exons can be explained by the conservation of the transcript exonic structure. Low dS values are more characteristic of alternative exons with conserved exonic structure, but not of those with non-conserved exonic structure. Additionally, constitutive exons are characterized by a higher conservation in the arrangement of ESEs, and alternative exons with an ESE-conservation similar to that of constitutive exons are characterized by a conservation of the flanking intron sequences higher than average, indicating the presence of more intronic regulatory signals.
Resumo:
Background: The understanding of whole genome sequences in higher eukaryotes depends to a large degree on the reliable definition of transcription units including exon/intron structures, translated open reading frames (ORFs) and flanking untranslated regions. The best currently available chicken transcript catalog is the Ensembl build based on the mappings of a relatively small number of full length cDNAs and ESTs to the genome as well as genome sequence derived in silico gene predictions.Results: We use Long Serial Analysis of Gene Expression (LongSAGE) in bursal lymphocytes and the DT40 cell line to verify the quality and completeness of the annotated transcripts. 53.6% of the more than 38,000 unique SAGE tags (unitags) match to full length bursal cDNAs, the Ensembl transcript build or the genome sequence. The majority of all matching unitags show single matches to the genome, but no matches to the genome derived Ensembl transcript build. Nevertheless, most of these tags map close to the 3' boundaries of annotated Ensembl transcripts.Conclusions: These results suggests that rather few genes are missing in the current Ensembl chicken transcript build, but that the 3' ends of many transcripts may not have been accurately predicted. The tags with no match in the transcript sequences can now be used to improve gene predictions, pinpoint the genomic location of entirely missed transcripts and optimize the accuracy of gene finder software.
Resumo:
ABSTRACT Upregulation of the Major Facilitator transporter gene MDR1 (Multi_drug Resistance 1) is one of the mechanisms observed in Candida albicans clinical isolates developing resistance to azole antifungal agents. To better understand this phenomenon, the cis-acting regulatory elements present in a modulatable reporter system under the control of the MDR1 promoter were characterized. In an azole-susceptible strain, transcription of this reporter is transiently upregulated in response to either benomyl or H2O2, whereas its expression is constitutively high in an azole-resistant strain (FR2). Two cis-acting regulatory elements, that are necessary and sufficient to convey the same transcriptional responses to a heterologous promoter (CDR2), were identified within the MDR1promoter. The first element, called BRE (for Benomyl Response Element, -296 to -260 with respect to the ATG start codon), is required for benomyl-dependent MDR1 upregulation and for constitutive high expression of MDR1 in FR2. The second element, termed HRE (for H2O2 Response Element, -561 to -520), is required for H2O2-dependent MDR1 upregulation, but is dispensable for constitutive high expression. Two potential binding sites (TTAG/CTAA) for the blip transcription factor Cap1p lie within the HRE. Moreover, inactivation of CAP1 abolished the transient response to H2O2 and diminished significantly the transient response to benomyl. Cap1p, which has been previously implicated in cellular responses to oxidative stress, may thus play a transacting and positive regulatory role in benomyl- and H2O2-dependent transcription of MDR1. However, it is not the only transcription factor involved in the response of MDR1 to benomyl. A minimal BRE element (-290 to -273) that is sufficient to detect in vitro sequence-specific binding of protein complexes in crude extracts prepared from C. albicans was also delimited. Genome-wide transcript profiling analyses undertaken with a matched pair of clinical isolates, one of which being azole-resistant and upregulating MDR1, and with an azole-susceptible strain exposed to benomyl, revealed that genes specifically upregulated by benomyl harbour in their promoters Cap1p binding site(s). This strengthened the idea that Cap1p plays a role in benomyl-dependent upregulation of MDR1. BRE-like sequences were also identified in several genes co-regulated with MDR1 in both conditions, which was consistent with the involvement of the BRE in both processes. A set of 147 mutants lacking a single transcription factor gene was next screened for loss of MDR1response to benomyl. Unfortunately, none of the tested mutants showed a loss of benomyl-dependent MDR1 upregulation. Nevertheless, a significant diminution of the response was observed in the mutants in which the MADS-box transcription factor Mcm1p and the C2H2 zinc finger transcription factor orf19.13374p were inactivated, suggesting that Mcm1p and orf19.13374p are involved in MDR1response to benomyl. Interestingly, the BRE contains a perfect match to the binding consensus of Mcm1p, raising the possibility that MDR1may be a direct target of this transcriptional activator. In conclusion, while the identity of the trans-acting factors that bind to the BRE and HRE remains to be confirmed, the tools we have developed during characterization of the cis-acting elements of the MDR1promoter should now serve to elucidate the nature of the components that modulate its activity. RESUME La surexpression du gène MDR1 (pour Résistance Multidrogue 1), qui code pour un transporteur de la famille des Major Facilitators, est l'un des mécanismes observés dans les isolats cliniques de la levure Candida albicans développant une résistance aux agents antifongiques appelés azoles. Pour mieux comprendre ce phénomène, les éléments de régulation agissant en cis dans un système rapporteur modulable sous le contrôle du promoteur MDR1 ont été caractérisés. Dans une souche sensible aux azoles, la transcription de ce rapporteur est transitoirement surélevée en réponse soit au bénomyl soit à l'agent oxydant H2O2, alors que son expression est constitutivement élevée dans une souche résistante aux azoles (souche FR2). Deux éléments de régulation agissant en cis, nécessaires et suffisants pour transmettre les mêmes réponses transcriptionnelles à un promoteur hétérologue (CDR2), ont été identifiés dans le promoteur MDR1. Le premier élément, appelé BRE (pour Elément de Réponse au Bénomyl, de -296 à -260 par rapport au codon d'initiation ATG) est requis pour la surexpression de MDR1dépendante du bénomyl et pour l'expression constitutive de MDR1 dans FR2. Le deuxième élément, appelé HRE (pour Elément de Réponse à l'H2O2, de -561 à -520), est requis pour la surexpression de MDR1 dépendante de l'H2O2, mais n'est pas impliqué dans l'expression constitutive du gène MDR1. Deux sites de fixation potentiels (TTAG/CTAA) pour le facteur de transcription Cap1p ont été identifiés dans l'élément HRE. De plus, l'inactivation de CAP1 abolit la réponse transitoire à l'H2O2 et diminua significativement la réponse transitoire au bénomyl. Cap1p, qui est impliqué dans les réponses de la cellule au stress oxydatif, doit donc jouer un rôle positif en trans dans la surexpression de MDR1 dépendante du bénomyl et de l'H2O2. Cependant, ce n'est pas le seul facteur de transcription impliqué dans la réponse au bénomyl. Un élément BRE d'une longueur minimale (de -290 à -273) a également été défini et est suffisant pour détecter une interaction spécifique in vitro avec des protéines provenant d'extraits bruts de C. albicans. L'analyse du profil de transcription d'une paire d'isolats cliniques comprenant une souche résistante aux azoles surexprimant MDR1, et d'une souche sensible aux azoles exposée au bénomyl, a révélé que les gènes spécifiquement surexprimés par le bénomyl contiennent dans leurs promoteurs un ou plusieurs sites de fixation pour Cap1p. Ceci renforce l'idée que Cap1p joue un rôle dans la surexpression de MDR1dépendante du bénomyl. Une ou deux séquences ressemblant à l'élément BRE ont également été identifiées dans la plupart des gènes corégulés avec MDR1 dans ces deux conditions, ce qui était attendu compte-tenu du rôle joué par cet élément dans les deux processus. Une collection de 147 mutants dans lesquels un seul facteur de transcription est inactivé a été testée pour la perte de réponse au bénomyl de MDR1. Malheureusement, la surexpression de MDR1 dépendante du bénomyl n'a été perdue dans aucun des mutants testés. Néanmoins, une diminution significative de la réponse a été observée chez des mutants dans lesquels le facteur de transcription à MADS-box Mcm1p et le facteur de transcription à doigts de zinc de type C2H2 orf19.13374p ont été inactivés, suggérant que Mcm1p et orf19.13374p sont impliqués dans la réponse de MDR1au bénomyl. Il est intéressant de noter que la BRE contient une séquence qui s'aligne parfaitement avec la séquence consensus du site de fixation de Mcm1p, ce qui soulève la possibilité que MDR1 pourrait être une cible directe de cet activateur transcriptionnel. En conclusion, alors que l'identité des facteurs agissant en trans en se fixant à la BRE et à la HRE reste à être confirmée, les outils que nous avons développés au cours de la caractérisation des éléments agissant en cis sur le promoteur MDR1 peut maintenant servir à élucider la nature des composants modulant son activité.
Resumo:
The GENCODE Consortium aims to identify all gene features in the human genome using a combination of computational analysis, manual annotation, and experimental validation. Since the first public release of this annotation data set, few new protein-coding loci have been added, yet the number of alternative splicing transcripts annotated has steadily increased. The GENCODE 7 release contains 20,687 protein-coding and 9640 long noncoding RNA loci and has 33,977 coding transcripts not represented in UCSC genes and RefSeq. It also has the most comprehensive annotation of long noncoding RNA (lncRNA) loci publicly available with the predominant transcript form consisting of two exons. We have examined the completeness of the transcript annotation and found that 35% of transcriptional start sites are supported by CAGE clusters and 62% of protein-coding genes have annotated polyA sites. Over one-third of GENCODE protein-coding genes are supported by peptide hits derived from mass spectrometry spectra submitted to Peptide Atlas. New models derived from the Illumina Body Map 2.0 RNA-seq data identify 3689 new loci not currently in GENCODE, of which 3127 consist of two exon models indicating that they are possibly unannotated long noncoding loci. GENCODE 7 is publicly available from gencodegenes.org and via the Ensembl and UCSC Genome Browsers.
Resumo:
The transmembrane protein HER2 is over-expressed in approximately 15% of invasive breast cancers as a result of HER2 gene amplification. HER2 proteolytic cleavage (HER2 shedding) generates soluble truncated HER2 molecules that include only the extracellular domain and the concentration of which can be measured in the serum fraction of blood. HER2 shedding also generates a constitutively active truncated intracellular receptor of 95kDa (p95(HER2)). Another soluble truncated HER2 protein (Herstatin), which can also be found in serum, is the product of an alternatively spliced HER2 transcript. Recent preclinical findings may provide crucial insights into the biological and clinical relevance of increased sHER2 concentrations for the outcome of HER2-positive breast cancer and sensitivity to trastuzumab and lapatinib treatment. We present here the most recent findings about the role and biology of sHER2 based on data obtained using a standardized test, which has been cleared by FDA in 2000, for measuring sHER2. This test includes quality control assessments and has been already widely used to evaluate the clinical utility of sHER2 as a biomarker in breast cancer. We will describe in detail data concerning the assessment of sHER2 as a surrogate maker to optimize the evaluation of the HER2 status of a primary tumor and as a prognosis and predictive marker of response to therapies, both in early and metastatic breast cancer.
Resumo:
The high Km glucose transporter GLUT2 is a membrane protein expressed in tissues involved in maintaining glucose homeostasis, and in cells where glucose-sensing is necessary. In many experimental models of diabetes, GLUT2 gene expression is decreased in pancreatic beta-cells, which could lead to a loss of glucose-induced insulin secretion. In order to identify factors involved in pancreatic beta-cell specific expression of GLUT2, we have recently cloned the murine GLUT2 promoter and identified cis-elements within the 338-bp of the proximal promoter capable of binding islet-specific trans-acting factors. Furthermore, in transient transfection studies, this 338-bp fragment could efficiently drive the expression of the chloramphenicol acetyl transferase (CAT) gene in cell lines derived from the endocrine pancreas, but displayed no promoter activity in non-pancreatic cells. In this report, we tested the cell-specific expression of a CAT reporter gene driven by a short (338 bp) and a larger (1311 bp) fragment of the GLUT2 promoter in transgenic mice. We generated ten transgenic lines that integrated one of the constructs. CAT mRNA expression in transgenic tissues was assessed using the RNAse protection assay and the quantitative reverse transcribed polymerase chain reaction (RT-PCR). Overall CAT mRNA expression for both constructs was low compared to endogenous GLUT2 mRNA levels but the reporter transcript could be detected in all animals in the pancreatic islets and the liver, and in a few transgenic lines in the kidney and the small intestine. The CAT protein was also present in Langerhans islets and in the liver for both constructs by immunocytochemistry. These findings suggest that the proximal 338 bp of the murine GLUT2 promoter contain cis-elements required for the islet-specific expression of GLUT2.
Resumo:
Sexual reproduction is extremely widespread in spite of its presumed costs relative to asexual reproduction, indicating that it must provide significant advantages. One postulated benefit of sex and recombination is that they facilitate the purging of mildly deleterious mutations, which would accumulate in asexual lineages and contribute to their short evolutionary life span. To test this prediction, we estimated the accumulation rate of coding (nonsynonymous) mutations, which are expected to be deleterious, in parts of one mitochondrial (COI) and two nuclear (Actin and Hsp70) genes in six independently derived asexual lineages and related sexual species of Timema stick insects. We found signatures of increased coding mutation accumulation in all six asexual Timema and for each of the three analyzed genes, with 3.6- to 13.4-fold higher rates in the asexuals as compared with the sexuals. In addition, because coding mutations in the asexuals often resulted in considerable hydrophobicity changes at the concerned amino acid positions, coding mutations in the asexuals are likely associated with more strongly deleterious effects than in the sexuals. Our results demonstrate that deleterious mutation accumulation can differentially affect sexual and asexual lineages and support the idea that deleterious mutation accumulation plays an important role in limiting the long-term persistence of all-female lineages.
Resumo:
Inorganic phosphate (Pi) is one of the main nutrients limiting plant growth anddevelopment in many agro-ecosystems. In plants, phosphate is acquired from the soil by theroots, and is then transferred to the shoot via the xylem. In the model plant Arabidopsisthaliana, PHO1 was previously identified as being involved in loading Pi into the xylem ofroots. AtPHO1, belongs to a multigenic family composed of 10 additional members, namelyAtPHO1;H1 to AtPHO1;10. In this study, we aimed at further investigating the role of thePHO1 gene family in Pi homeostasis in plants, and to this end we isolated and characterizedthe PHO1 members of two main model plants, the moss Physcomitrella patens and the riceOryza sativa.In the bryophyte P. patens, bioinformatic analyses revealed the presence of seven AtPHO1homologues, highly similar to AtPHO1. The seven moss PHO1 genes, namely PpPHO1;1 toPpPHO1;7 appeared to be differentially regulated, both at the tissue level and in response toPi status. However only PpPHO1;1 and PpPHO1;7 were specifically up-regulated upon Pistarvation, suggesting a potential role in Pi homeostasis. We also characterized the responseof P. patens to Pi starvation, showing that higher and lower plants share some commonstrategies to adapt to Pi-deficiency.In the second part, focusing on the monocotyledon rice, we showed the existence of threePHO1 homologues OsPHO1;1 to OsPHO1;3, with the unique particularity of each havingNatural Antisense Transcripts (NATs). Molecular analyses revealed that both the sense andthe antisense OsPHO1;2 transcripts were by far the most abundantly expressed transcripts ofthe family, preferentially expressed in the roots. The stable expression of OsPHO1;2 in allconditions tested, in opposition with the highly induced antisense transcript upon Pistarvation, suggest a putative role for the antisense in regulating the sense transcript.Moreover, mutant analyses revealed that OsPHO1;2 plays a key role in Pi homeostasis, intransferring Pi from the root to the shoot. Finally, complementing the pho1 mutant inArabidopsis, characterized by low Pi in the shoot and reduced growth, with the riceOsPHO1;2 gene revealed a new role for PHO1 in Pi signaling. Indeed, the complementedplants showed normal growth, with however low Pi content.
Resumo:
In the plant-beneficial soil bacterium and biocontrol model organism Pseudomonas fluorescens CHA0, the GacS/GacA two-component system upregulates the production of biocontrol factors, i.e. antifungal secondary metabolites and extracellular enzymes, under conditions of slow, non-exponential growth. When activated, the GacS/GacA system promotes the transcription of a small regulatory RNA (RsmZ), which sequesters the small RNA-binding protein RsmA, a translational regulator of genes involved in biocontrol. The gene for a second GacA-regulated small RNA (RsmY) was detected in silico in various pseudomonads, and was cloned from strain CHA0. RsmY, like RsmZ, contains several characteristic GGA motifs. The rsmY gene was expressed in strain CHA0 as a 118 nt transcript which was most abundant in stationary phase, as revealed by Northern blot and transcriptional fusion analysis. Transcription of rsmY was enhanced by the addition of the strain's own supernatant extract containing a quorum-sensing signal and was abolished in gacS or gacA mutants. An rsmA mutation led to reduced rsmY expression, via a gacA-independent mechanism. Overexpression of rsmY restored the expression of target genes (hcnA, aprA) to gacS or gacA mutants. Whereas mutants deleted for either the rsmY or the rsmZ structural gene were not significantly altered in the synthesis of extracellular products (hydrogen cyanide, 2,4-diacetylphloroglucinol, exoprotease), an rsmY rsmZ double mutant was strongly impaired in this production and in its biocontrol properties in a cucumber-Pythium ultimum microcosm. Mobility shift assays demonstrated that multiple molecules of RsmA bound specifically to RsmY and RsmZ RNAs. In conclusion, two small, untranslated RNAs, RsmY and RsmZ, are key factors that relieve RsmA-mediated regulation of secondary metabolism and biocontrol traits in the GacS/GacA cascade of strain CHA0.
Resumo:
Narcolepsy is a sleep disorder characterized by excessive daytime sleepiness and attacks of muscle atonia triggered by strong emotions (cataplexy). Narcolepsy is caused by hypocretin (orexin) deficiency, paralleled by a dramatic loss in hypothalamic hypocretin-producing neurons. It is believed that narcolepsy is an autoimmune disorder, although definitive proof of this, such as the presence of autoantibodies, is still lacking. We engineered a transgenic mouse model to identify peptides enriched within hypocretin-producing neurons that could serve as potential autoimmune targets. Initial analysis indicated that the transcript encoding Tribbles homolog 2 (Trib2), previously identified as an autoantigen in autoimmune uveitis, was enriched in hypocretin neurons in these mice. ELISA analysis showed that sera from narcolepsy patients with cataplexy had higher Trib2-specific antibody titers compared with either normal controls or patients with idiopathic hypersomnia, multiple sclerosis, or other inflammatory neurological disorders. Trib2-specific antibody titers were highest early after narcolepsy onset, sharply decreased within 2-3 years, and then stabilized at levels substantially higher than that of controls for up to 30 years. High Trib2-specific antibody titers correlated with the severity of cataplexy. Serum of a patient showed specific immunoreactivity with over 86% of hypocretin neurons in the mouse hypothalamus. Thus, we have identified reactive autoantibodies in human narcolepsy, providing evidence that narcolepsy is an autoimmune disorder.