Biblioteca Digital

853 resultados para ALTERNATIVE SPLICING

Improving gene annotation using peptide mass spectrometry

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Annotation of protein-coding genes is a key goal of genome sequencing projects. In spite of tremendous recent advances in computational gene finding, comprehensive annotation remains a challenge. Peptide mass spectrometry is a powerful tool for researching the dynamic proteome and suggests an attractive approach to discover and validate protein-coding genes. We present algorithms to construct and efficiently search spectra against a genomic database, with no prior knowledge of encoded proteins. By searching a corpus of 18.5 million tandem mass spectra (MS/MS) from human proteomic samples, we validate 39,000 exons and 11,000 introns at the level of translation. We present translation-level evidence for novel or extended exons in 16 genes, confirm translation of 224 hypothetical proteins, and discover or confirm over 40 alternative splicing events. Polymorphisms are efficiently encoded in our database, allowing us to observe variant alleles for 308 coding SNPs. Finally, we demonstrate the use of mass spectrometry to improve automated gene prediction, adding 800 correct exons to our predictions using a simple rescoring strategy. Our results demonstrate that proteomic profiling should play a role in any genome sequencing project.

EGASP: the human ENCODE Genome Annotation Assessment Project

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Background: We present the results of EGASP, a community experiment to assess the state-ofthe-art in genome annotation within the ENCODE regions, which span 1% of the human genomesequence. The experiment had two major goals: the assessment of the accuracy of computationalmethods to predict protein coding genes; and the overall assessment of the completeness of thecurrent human genome annotations as represented in the ENCODE regions. For thecomputational prediction assessment, eighteen groups contributed gene predictions. Weevaluated these submissions against each other based on a ‘reference set’ of annotationsgenerated as part of the GENCODE project. These annotations were not available to theprediction groups prior to the submission deadline, so that their predictions were blind and anexternal advisory committee could perform a fair assessment.Results: The best methods had at least one gene transcript correctly predicted for close to 70%of the annotated genes. Nevertheless, the multiple transcript accuracy, taking into accountalternative splicing, reached only approximately 40% to 50% accuracy. At the coding nucleotidelevel, the best programs reached an accuracy of 90% in both sensitivity and specificity. Programsrelying on mRNA and protein sequences were the most accurate in reproducing the manuallycurated annotations. Experimental validation shows that only a very small percentage (3.2%) of the selected 221 computationally predicted exons outside of the existing annotation could beverified.Conclusions: This is the first such experiment in human DNA, and we have followed thestandards established in a similar experiment, GASP1, in Drosophila melanogaster. We believe theresults presented here contribute to the value of ongoing large-scale annotation projects and shouldguide further experimental methods when being scaled up to the entire human genome sequence.

GENCODE: the reference human genome annotation for The ENCODE Project.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

The GENCODE Consortium aims to identify all gene features in the human genome using a combination of computational analysis, manual annotation, and experimental validation. Since the first public release of this annotation data set, few new protein-coding loci have been added, yet the number of alternative splicing transcripts annotated has steadily increased. The GENCODE 7 release contains 20,687 protein-coding and 9640 long noncoding RNA loci and has 33,977 coding transcripts not represented in UCSC genes and RefSeq. It also has the most comprehensive annotation of long noncoding RNA (lncRNA) loci publicly available with the predominant transcript form consisting of two exons. We have examined the completeness of the transcript annotation and found that 35% of transcriptional start sites are supported by CAGE clusters and 62% of protein-coding genes have annotated polyA sites. Over one-third of GENCODE protein-coding genes are supported by peptide hits derived from mass spectrometry spectra submitted to Peptide Atlas. New models derived from the Illumina Body Map 2.0 RNA-seq data identify 3689 new loci not currently in GENCODE, of which 3127 consist of two exon models indicating that they are possibly unannotated long noncoding loci. GENCODE 7 is publicly available from gencodegenes.org and via the Ensembl and UCSC Genome Browsers.

Mutations leading to X-linked hypohidrotic ectodermal dysplasia affect three major functional domains in the tumor necrosis factor family member ectodysplasin-A.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Mutations in the epithelial morphogen ectodysplasin-A (EDA), a member of the tumor necrosis factor (TNF) family, are responsible for the human disorder X-linked hypohidrotic ectodermal dysplasia (XLHED) characterized by impaired development of hair, eccrine sweat glands, and teeth. EDA-A1 and EDA-A2 are two splice variants of EDA, which bind distinct EDA-A1 and X-linked EDA-A2 receptors. We identified a series of novel EDA mutations in families with XLHED, allowing the identification of the following three functionally important regions in EDA: a C-terminal TNF homology domain, a collagen domain, and a furin protease recognition sequence. Mutations in the TNF homology domain impair binding of both splice variants to their receptors. Mutations in the collagen domain can inhibit multimerization of the TNF homology region, whereas those in the consensus furin recognition sequence prevent proteolytic cleavage of EDA. Finally, a mutation affecting an intron splice donor site is predicted to eliminate specifically the EDA-A1 but not the EDA-A2 splice variant. Thus a proteolytically processed, oligomeric form of EDA-A1 is required in vivo for proper morphogenesis.

The alternatively spliced domain TnFnIII A1A2 of the extracellular matrix protein tenascin-C suppresses activation-induced T lymphocyte proliferation and cytokine production.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Several lines of evidences have suggested that T cell activation could be impaired in the tumor environment, a condition referred to as tumor-induced immunosuppression. We have previously shown that tenascin-C, an extracellular matrix protein highly expressed in the tumor stroma, inhibits T lymphocyte activation in vitro, raising the possibility that this molecule might contribute to tumor-induced immunosuppression in vivo. However, the region of the protein mediating this effect has remained elusive. Here we report the identification of the minimal region of tenascin-C that can inhibit T cell activation. Recombinant fragments corresponding to defined regions of the molecule were tested for their ability to inhibit in vitro activation of human peripheral blood T cells induced by anti-CD3 mAbs in combination with fibronectin or IL-2. A recombinant protein encompassing the alternatively spliced fibronectin type III domains of tenascin-C (TnFnIII A-D) vigorously inhibited both early and late lymphocyte activation events including activation-induced TCR/CD8 down-modulation, cytokine production, and DNA synthesis. In agreement with this, full length recombinant tenascin-C containing the alternatively spliced region suppressed T cell activation, whereas tenascin-C lacking this region did not. Using a series of smaller fragments and deletion mutants issued from this region, we have identified the TnFnIII A1A2 domain as the minimal region suppressing T cell activation. Single TnFnIII A1 or A2 domains were no longer inhibitory, while maximal inhibition required the presence of the TnFnIII A3 domain. Altogether, these data demonstrate that the TnFnIII A1A2 domain mediate the ability of tenascin-C to inhibit in vitro T cell activation and provide insights into the immunosuppressive activity of tenascin-C in vivo.

hShroom1 links a membrane bound protein to the actin cytoskeleton.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

hShroom1 (hShrm1) is a member of the Apx/Shroom (Shrm) protein family and was identified from a yeast two-hybrid screen as a protein that interacts with the cytoplasmic domain of melanoma cell adhesion molecule (MCAM). The characteristic signature of the Shrm family is the presence of a unique domain, ASD2 (Apx/Shroom domain 2). mRNA analysis suggests that hShrm1 is expressed in brain, heart, skeletal muscle, colon, small intestine, kidney, placenta and lung tissue, as well a variety of melanoma and other cell lines. Co-immunoprecipitation and bioluminescence resonance energy transfer (BRET) experiments indicate that hShrm1 and MCAM interact in vivo and by immunofluorescence microscopy some co-localization of these proteins is observed. hShrm1 partly co-localises with beta-actin and is found in the Triton X-100 insoluble fraction of melanoma cell extracts. We propose that hShrm1 is involved in linking MCAM to the cytoskeleton.

BAFF, APRIL and their receptors: structure, function and signaling.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

BAFF, APRIL and their receptors play important immunological roles, especially in the B cell arm of the immune system. A number of splice isoforms have been described for both ligands and receptors in this subfamily, some of which are conserved between mouse and human, while others are species-specific. Structural and mutational analyses have revealed key determinants of receptor-ligand specificity. BAFF-R has a strong selectivity for BAFF; BCMA has a higher affinity for APRIL than for BAFF, while TACI binds both ligands equally well. The molecular signaling events downstream of BAFF-R, BCMA and TACI are still incompletely characterized. Survival appears to be mediated by upregulation of Bcl-2 family members through NF-kappaB activation, degradation of the pro-apototic Bim protein, and control of subcellular localization of PCKdelta. Very little is known about other signaling events associated with receptor engagement by BAFF and APRIL that lead for example to B cell activation or to CD40L-independent Ig switch.

EGASP: the human ENCODE Genome Annotation Assessment Project.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

BACKGROUND: We present the results of EGASP, a community experiment to assess the state-of-the-art in genome annotation within the ENCODE regions, which span 1% of the human genome sequence. The experiment had two major goals: the assessment of the accuracy of computational methods to predict protein coding genes; and the overall assessment of the completeness of the current human genome annotations as represented in the ENCODE regions. For the computational prediction assessment, eighteen groups contributed gene predictions. We evaluated these submissions against each other based on a 'reference set' of annotations generated as part of the GENCODE project. These annotations were not available to the prediction groups prior to the submission deadline, so that their predictions were blind and an external advisory committee could perform a fair assessment. RESULTS: The best methods had at least one gene transcript correctly predicted for close to 70% of the annotated genes. Nevertheless, the multiple transcript accuracy, taking into account alternative splicing, reached only approximately 40% to 50% accuracy. At the coding nucleotide level, the best programs reached an accuracy of 90% in both sensitivity and specificity. Programs relying on mRNA and protein sequences were the most accurate in reproducing the manually curated annotations. Experimental validation shows that only a very small percentage (3.2%) of the selected 221 computationally predicted exons outside of the existing annotation could be verified. CONCLUSION: This is the first such experiment in human DNA, and we have followed the standards established in a similar experiment, GASP1, in Drosophila melanogaster. We believe the results presented here contribute to the value of ongoing large-scale annotation projects and should guide further experimental methods when being scaled up to the entire human genome sequence.

Large scale assessment of regulatory evolution and transcriptome complexity in mammals

Relevância:

60.00% 60.00%

Publicador:

Resumo:

AbstractIn addition to genetic changes affecting the function of gene products, changes in gene expression have been suggested to underlie many or even most of the phenotypic differences among mammals. However, detailed gene expression comparisons were, until recently, restricted to closely related species, owing to technological limitations. Thus, we took advantage of the latest technologies (RNA-Seq) to generate extensive qualitative and quantitative transcriptome data for a unique collection of somatic and germline tissues from representatives of all major mammalian lineages (placental mammals, marsupials and monotremes) and birds, the evolutionary outgroup.In the first major project of my thesis, we performed global comparative analyses of gene expression levels based on these data. Our analyses provided fundamental insights into the dynamics of transcriptome change during mammalian evolution (e.g., the rate of expression change across species, tissues and chromosomes) and allowed the exploration of the functional relevance and phenotypic implications of transcription changes at a genome-wide scale (e.g., we identified numerous potentially selectively driven expression switches).In a second project of my thesis, which was also based on the unique transcriptome data generated in the context of the first project we focused on the evolution of alternative splicing in mammals. Alternative splicing contributes to transcriptome complexity by generating several transcript isoforms from a single gene, which can, thus, perform various functions. To complete the global comparative analysis of gene expression changes, we explored patterns of alternative splicing evolution. This work uncovered several general and unexpected patterns of alternative splicing evolution (e.g., we found that alternative splicing evolves extremely rapidly) as well as a large number of conserved alternative isoforms that may be crucial for the functioning of mammalian organs.Finally, the third and final project of my PhD consisted in analyzing in detail the unique functional and evolutionary properties of the testis by exploring the extent of its transcriptome complexity. This organ was previously shown to evolve rapidly both at the phenotypic and molecular level, apparently because of the specific pressures that act on this organ and are associated with its reproductive function. Moreover, my analyses of the amniote tissue transcriptome data described above, revealed strikingly widespread transcriptional activity of both functional and nonfunctional genomic elements in the testis compared to the other organs. To elucidate the cellular source and mechanisms underlying this promiscuous transcription in the testis, we generated deep coverage RNA-Seq data for all major testis cell types as well as epigenetic data (DNA and histone methylation) using the mouse as model system. The integration of these complete dataset revealed that meiotic and especially post-meiotic germ cells are the major contributors to the widespread functional and nonfunctional transcriptome complexity of the testis, and that this "promiscuous" spermatogenic transcription is resulting, at least partially, from an overall transcriptionally permissive chromatin state. We hypothesize that this particular open state of the chromatin results from the extensive chromatin remodeling that occurs during spermatogenesis which ultimately leads to the replacement of histones by protamines in the mature spermatozoa. Our results have important functional and evolutionary implications (e.g., regarding new gene birth and testicular gene expression evolution).Generally, these three large-scale projects of my thesis provide complete and massive datasets that constitute valuables resources for further functional and evolutionary analyses of mammalian genomes.

Membrane mu poly(A) signal and 3' flanking sequences function as a transcription terminator for immunoglobulin-encoding genes.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Developmentally regulated mechanisms involving alternative RNA splicing and/or polyadenylation, as well as transcription termination, are implicated in controlling the levels of secreted mu (mu s), membrane mu (mu m) and delta immunoglobulin (Ig) heavy chain mRNAs during B cell differentiation (mu gene encodes the mu heavy chain). Using expression vectors constructed with genomic DNA segments composed of the mu m polyadenylation signal region, we analyzed poly(A) site utilization and termination of transcription in stably transfected myeloma cells and in murine fibroblast L cells. We found that the gene segment containing the mu m poly(A) signals, along with 536 bp of downstream flanking sequence, acted as a transcription terminator in both myeloma cells and L cell fibroblasts. Neither a 141-bp DNA fragment (which directed efficient polyadenylation at the mu m site), nor the 536-bp flanking nucleotide sequence alone, were sufficient to obtain a similar regulation. This shows that the mu m poly(A) region plays a central role in controlling developmentally regulated transcription termination by blocking downstream delta gene expression. Because this gene segment exhibited the same RNA processing and termination activities in fibroblasts, it appears that these processes are not tissue-specific.

PFKFB3 (6-phosphofructo-2-kinase/fructose-2,6-biphosphatase 3)

Relevância:

60.00% 60.00%

Publicador:

Resumo:

The human PFKFB3 is composed of 19 exons spanning genomic region about 90,6 Kb (GenBank). Alternative splicing variants have been reported. The main variants corresponding to mRNAs of 4453 bp and 4224 bp for the variant 1 u-PFK2 (NM_004566.3) and variant 2 i-PFK2 (NM_001145443.1), respectively...

Do protein–protein interaction databases identify moonlighting proteins?

Relevância:

60.00% 60.00%

Publicador:

Resumo:

One of the most striking results of the human (and mammalian) genomes is the low number of protein-coding genes. To-date, the main molecular mechanism to increase the number of different protein isoforms and functions is alternative splicing. However, a less-known way to increase the number of protein functions is the existence of multifunctional, multitask, or ‘‘moonlighting’’, proteins. By and large, moonlighting proteins are experimentally disclosed by serendipity. Proteomics is becoming one of the very active areas of biomedical research, which permits researchers to identify previously unseen connections among proteins and pathways. In principle, protein–protein interaction (PPI) databases should contain information on moonlighting proteins and could provide suggestions to further analysis in order to prove the multifunctionality. As far as we know, nobody has verified whether PPI databases actually disclose moonlighting proteins. In the present work we check whether well-established moonlighting proteins present in PPI databases connect with their known partners and, therefore, a careful inspection of these databases could help to suggest their different functions. The results of our research suggest that PPI databases could be a valuable tool to suggest multifunctionality.

The evolution of the thyroid hormone distributor protein transthyretin in the order insectivora, class mammalia.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Thyroid hormones are involved in the regulation of growth and metabolism in all vertebrates. Transthyretin is one of the extracellular proteins with high affinity for thyroid hormones which determine the partitioning of these hormones between extracellular compartments and intracellular lipids. During vertebrate evolution, both the tissue pattern of expression and the structure of the gene for transthyretin underwent characteristic changes. The purpose of this study was to characterize the position of Insectivora in the evolution of transthyretin in eutherians, a subclass of Mammalia. Transthyretin was identified by thyroxine binding and Western analysis in the blood of adult shrews, hedgehogs, and moles. Transthyretin is synthesized in the liver and secreted into the bloodstream, similar to the situation for other adult eutherians, birds, and diprotodont marsupials, but different from that for adult fish, amphibians, reptiles, monotremes, and Australian polyprotodont marsupials. For the characterization of the structure of the gene and the processing of mRNA for transthyretin, cDNA libraries were prepared from RNA from hedgehog and shrew livers, and full-length cDNA clones were isolated and sequenced. Sections of genomic DNA in the regions coding for the splice sites between exons 1 and 2 were synthesized by polymerase chain reaction and sequenced. The location of splicing was deduced from comparison of genomic with cDNA nucleotide sequences. Changes in the nucleotide sequence of the transthyretin gene during evolution are most pronounced in the region coding for the N-terminal region of the protein. Both the derived overall amino sequences and the N-terminal regions of the transthyretins in Insectivora were found to be very similar to those in other eutherians but differed from those found in marsupials, birds, reptiles, amphibians, and fish. Also, the pattern of transthyretin precursor mRNA splicing in Insectivora was more similar to that in other eutherians than to that in marsupials, reptiles, and birds. Thus, in contrast to the marsupials, with a different pattern of transthyretin gene expression in the evolutionarily "older" polyprotodonts compared with the evolutionarily "younger" diprotodonts, no separate lineages of transthyretin evolution could be identified in eutherians. We conclude that transthyretin gene expression in the liver of adult eutherians probably appeared before the branching of the lineages leading to modern eutherian species.

A hyperactive quantitative trait locus allele of Arabidopsis BRX contributes to natural variation in root growth vigor.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Quantitative trait loci analysis of natural Arabidopsis thaliana accessions is increasingly exploited for gene isolation. However, to date this has mostly revealed deleterious mutations. Among them, a loss-of-function allele identified the root growth regulator BREVIS RADIX (BRX). Here we present evidence that BRX and the paralogous BRX-LIKE (BRXL) genes are under selective constraint in monocotyledons as well as dicotyledons. Unexpectedly, however, whereas none of the Arabidopsis orthologs except AtBRXL1 could complement brx null mutants when expressed constitutively, nearly all monocotyledon BRXLs tested could. Thus, BRXL proteins seem to be more diversified in dicotyledons than in monocotyledons. This functional diversification was correlated with accelerated rates of sequence divergence in the N-terminal regions. Population genetic analyses of 30 haplotypes are suggestive of an adaptive role of AtBRX and AtBRXL1. In two accessions, Lc-0 and Lov-5, seven amino acids are deleted in the variable region between the highly conserved C-terminal, so-called BRX domains. Genotyping of 42 additional accessions also found this deletion in Kz-1, Pu2-7, and Ws-0. In segregating recombinant inbred lines, the Lc-0 allele (AtBRX(Lc-0)) conferred significantly enhanced root growth. Moreover, when constitutively expressed in the same regulatory context, AtBRX(Lc-0) complemented brx mutants more efficiently than an allele without deletion. The same was observed for AtBRXL1, which compared with AtBRX carries a 13 amino acid deletion that encompasses the deletion found in AtBRX(Lc-0). Thus, the AtBRX(Lc-0) allele seems to contribute to natural variation in root growth vigor and provides a rare example of an experimentally confirmed, hyperactive allelic variant.

Efficient targeted transcript discovery via array-based normalization of RACE libraries.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Rapid amplification of cDNA ends (RACE) is a widely used approach for transcript identification. Random clone selection from the RACE mixture, however, is an ineffective sampling strategy if the dynamic range of transcript abundances is large. To improve sampling efficiency of human transcripts, we hybridized the products of the RACE reaction onto tiling arrays and used the detected exons to delineate a series of reverse-transcriptase (RT)-PCRs, through which the original RACE transcript population was segregated into simpler transcript populations. We independently cloned the products and sequenced randomly selected clones. This approach, RACEarray, is superior to direct cloning and sequencing of RACE products because it specifically targets new transcripts and often results in overall normalization of transcript abundance. We show theoretically and experimentally that this strategy leads indeed to efficient sampling of new transcripts, and we investigated multiplexing the strategy by pooling RACE reactions from multiple interrogated loci before hybridization.

«
1
2
...
5
6
7
8
9
10
11
...
56
57
»