199 resultados para expressed sequences tag
em Université de Lausanne, Switzerland
Resumo:
High throughput genome (HTG) and expressed sequence tag (EST) sequences are currently the most abundant nucleotide sequence classes in the public database. The large volume, high degree of fragmentation and lack of gene structure annotations prevent efficient and effective searches of HTG and EST data for protein sequence homologies by standard search methods. Here, we briefly describe three newly developed resources that should make discovery of interesting genes in these sequence classes easier in the future, especially to biologists not having access to a powerful local bioinformatics environment. trEST and trGEN are regularly regenerated databases of hypothetical protein sequences predicted from EST and HTG sequences, respectively. Hits is a web-based data retrieval and analysis system providing access to precomputed matches between protein sequences (including sequences from trEST and trGEN) and patterns and profiles from Prosite and Pfam. The three resources can be accessed via the Hits home page (http://hits. isb-sib.ch).
Resumo:
The publication of a draft of the human genome and of large collections of transcribed sequences has made it possible to study the complex relationship between the transcriptome and the genome. In the work presented here, we have focused on mapping mRNA 3' ends onto the genome by use of the raw data generated by the expressed sequence tag (EST) sequencing projects. We find that at least half of the human genes encode multiple transcripts whose polyadenylation is driven by multiple signals. The corresponding transcript 3' ends are spread over distances in the kilobase range. This finding has profound implications for our understanding of gene expression regulation and of the diversity of human transcripts, for the design of cDNA microarray probes, and for the interpretation of gene expression profiling experiments.
Resumo:
The goals of the human genome project did not include sequencing of the heterochromatic regions. We describe here an initial sequence of 1.1 Mb of the short arm of human chromosome 21 (HSA21p), estimated to be 10% of 21p. This region contains extensive euchromatic-like sequence and includes on average one transcript every 100 kb. These transcripts show multiple inter- and intrachromosomal copies, and extensive copy number and sequence variability. The sequencing of the "heterochromatic" regions of the human genome is likely to reveal many additional functional elements and provide important evolutionary information.
Resumo:
Parkinson's disease (PD) is a chronic neurodegenerative disorder characterized by progressive loss of dopaminergic (DA) neurons of the substantia nigra pars compacta with unknown aetiology. 6-Hydroxydopamine (6-OHDA) treatment of neuronal cells is an established in vivo model for mimicking the effect of oxidative stress found in PD brains. We examined the effects of 6-OHDA treatment on human neuroblastoma cells (SH-SY5Y) and primary mesencephalic cultures. Using a reverse arbitrarily primed polymerase chain reaction (RAP-PCR) approach we generated reproducible genetic fingerprints of differential expression levels in cell cultures treated with 6-OHDA. Of the resulting sequences, 23 showed considerable homology to known human coding sequences. The results of the RAP-PCR were validated by reverse transcription PCR, real-time PCR and, for selected genes, by Western blot analysis and immunofluorescence. In four cases, [tomoregulin-1 (TMEFF-1), collapsin response mediator protein 1 (CRMP-1), neurexin-1, and phosphoribosylaminoimidazole synthetase (GART)], a down-regulation of mRNA and protein levels was detected. Further studies will be necessary on the physiological role of the identified proteins and their impact on pathways leading to neurodegeneration in PD.
Resumo:
Résumé Les télomères sont les structures ADN-protéines des extrémités des chromosomes des eucaryotes. L'ADN télomérique est constitué de courtes séquences répétitives. L'intégrité des télomères est essentielle pour protéger les extrémités des chromosomes contre les systèmes de dégradations et pour les distinguer des cassures de l'ADN double brin. Parce que la machinerie de la réplication de l'ADN n'est pas capable de répliquer l'extrémité des chromosomes, les télomères raccourcissent au fur et à mesure des cycles de réplication. Dès que les télomères atteignent une longueur critique, leur structure protectrice est perdue. Cela induit un signal de dommage de l'ADN et l'arrêt du cycle cellulaire. Pour contrebalancer le raccourcissement des télomères, les cellules qui s'auto régénèrent, dont les cellules de la moelle osseuse, les lymphocytes activés et 80-90% des cellules cancéreuses, expriment la télomérase. C'est une ribonucléoprotéine qui a la capacité de synthétiser des séquences télomériques par transcription inverse d'une courte séquence contenue dans sa propre sous-unité ARN avec laquelle elle est associée. La télomérase humaine est une enzyme processive au niveau de l'addition des nucléotides et aussi des répétitions télomériques. La télomérase de levure et la télomérase humaine sont toutes deux dimériques et il a été montré que la télomérase humaine recombinante contient deux ARN qui coopèrent pour fonctionner ainsi que deux sous-unités catalytiques. Cependant, il n'a pas encore été montré quel est le rôle de la dimérisation dans l'activité de la télomérase. Afin d'élucider ce rôle, nous avons exprimé, reconstitué et purifié la télomérase humaine dimérique recombinante. Et pour étudier l'effet d'ARN mutants sur l'activité de la télomérase, nous avons développé une méthode pour reconstituer et enrichir en hétérodimères de télomérase. Les hétérodimères contiennent une sous-unité ARN sauvage et une sous-unité ARN mutée au niveau de la séquence de la matrice. Sur l'ARN muté nous avons introduit une étiquette aptamer ARN-S1 puis nous avons purifié la télomérase via l'etiquette Si. Nous avons montré que la dimérisation est essentielle pour l'activité de la télomérase. Nos données indiquent que chaque télomérase du dimère allonge leur substrat, l'ADN télomérique, indépendamment l'une de l'autre à chaque cycle d'élongation mais que l'addition itérative de répétitions télomériques nécessite une coopération entre les deux télomérases du dimère. Nous proposons donc un modèle dans lequel les deux télomérases du dimères se lient et allongent deux substrats télomères et que pendant l'élongation processive les deux enzymes subissent un changement de conformation de manière coordonnée, ce changement va permettre le repositionnement des substrats pour d'autres cycles d'additions de répétitions télomériques. Dyskeratosis congenita est une maladie mortelle due majoritairement au disfonctionnement de la moelle osseuse. Dans la forme autosomale de la maladie, l'ARN de la télomérase contient des mutations. En utilisant notre système de reconstitution, nous avons montré que ces ARN mutés, qui ont perdu leur activité enzymatique dans le cas d'un homodimère de mutants, sont dominant négatifs quand ils sont présents dans les hétérodimères sauvage/mutant. Cet effet trans-dominant négatif pourrait contribuer à la progression de la maladie. Abstract Telomeres are protein-DNA structures at the ends of linear eukaryotic chromosomes. The telomeric DNA consists of tandemly repeated sequences. Telomeric integrity is essential to protect chromosomal ends from nucleolytic degradation and to prevent their recognition as DNA double strand breaks. Due to the inability of the conventional DNA replication machinery to replicate terminal DNA stretches, telomeres shorten with continuous rounds of DNA replication. As soon as telomeres reach a critical length, their protective structure is lost and the deprotected telomeres will induce a DNA damage response leading to cell cycle arrest. To counteract telomere shortening, self-renewing cells, including bone marrow cells, activated lymphocytes and 80-90% of cancer cells express the cellular reverse transcriptase telomerase, which has the capacity to synthesize telomeric repeats by reverse transcription of a short template sequence encoded by its stably associated RNA subunit. Human telomerase is a processive enzyme for nucleotide as well as repeat addition. Both yeast and human telomerase are dimeric enzymes and recombinant human telomerase has been shown to contain two functionally cooperating RNAs and most probably also two protein subunits. However, it has remained unclear how dimerization may contribute to telomerase activity. To study the role of dimerization, we expressed, reconstituted and purified recombinant human telomerase. We also developed a new method to reconstitute and enrich for telomerase heterodimers containing wild-type (wt) and mutant telomerase RNA subunits. To this end we introduced an S1-RNA-aptamer tag into telomerase RNA and purified telomerase reconstituted with a mixture of untagged and tagged RNA via the S1-tag. Using this experimental system, we introduced template mutations in the tagged RNA subunit and examined the effect of mutant RNAs on wt telomerase activity in wt/mutant heterodimers. We obtained evidence that dimerization is essential for telomerase activity. Our data indicate that the two subunits elongate telomere substrates independently of each other during single rounds of elongation, but that iterative addition of telomeric repeats requires cooperation between the two subunits. We suggest a model, in which dimeric telomerases bind and elongate two telomere substrates and that the two subunits undergo coordinated conformational changes during processive elongation that enable repositioning the substrates for subsequent rounds of repeat addition. Dyskeratosis congenita is a multisystemic disease with bone marrow failure as the major cause of death. The autosomal form of this disease was found to harbor mutations in the telomerase RNA. Using our reconstitution system, we tested whether mutant dyskeratosis telomerase RNAs behaved in a dominant negative manner. We observed that dyskeratosis telomerase RNA mutants, which lacked enzymatic activity were dominant negative, when present in wt/ mutant heterodimers. The transdominant negative effect of these mutants may contribute to disease progression.
Resumo:
BACKGROUND: The Complete Arabidopsis Transcript MicroArray (CATMA) initiative combines the efforts of laboratories in eight European countries 1 to deliver gene-specific sequence tags (GSTs) for the Arabidopsis research community. The CATMA initiative offers the power and flexibility to regularly update the GST collection according to evolving knowledge about the gene repertoire. These GST amplicons can easily be reamplified and shared, subsets can be picked at will to print dedicated arrays, and the GSTs can be cloned and used for other functional studies. This ongoing initiative has already produced approximately 24,000 GSTs that have been made publicly available for spotted microarray printing and RNA interference. RESULTS: GSTs from the CATMA version 2 repertoire (CATMAv2, created in 2002) were mapped onto the gene models from two independent Arabidopsis nuclear genome annotation efforts, TIGR5 and PSB-EuGène, to consolidate a list of genes that were targeted by previously designed CATMA tags. A total of 9,027 gene models were not tagged by any amplified CATMAv2 GST, and 2,533 amplified GSTs were no longer predicted to tag an updated gene model. To validate the efficacy of GST mapping criteria and design rules, the predicted and experimentally observed hybridization characteristics associated to GST features were correlated in transcript profiling datasets obtained with the CATMAv2 microarray, confirming the reliability of this platform. To complete the CATMA repertoire, all 9,027 gene models for which no GST had yet been designed were processed with an adjusted version of the Specific Primer and Amplicon Design Software (SPADS). A total of 5,756 novel GSTs were designed and amplified by PCR from genomic DNA. Together with the pre-existing GST collection, this new addition constitutes the CATMAv3 repertoire. It comprises 30,343 unique amplified sequences that tag 24,202 and 23,009 protein-encoding nuclear gene models in the TAIR6 and EuGène genome annotations, respectively. To cover the remaining untagged genes, we identified 543 additional GSTs using less stringent design criteria and designed 990 sequence tags matching multiple members of gene families (Gene Family Tags or GFTs) to cover any remaining untagged genes. These latter 1,533 features constitute the CATMAv4 addition. CONCLUSION: To update the CATMA GST repertoire, we designed 7,289 additional sequence tags, bringing the total number of tagged TAIR6-annotated Arabidopsis nuclear protein-coding genes to 26,173. This resource is used both for the production of spotted microarrays and the large-scale cloning of hairpin RNA silencing vectors. All information about the resulting updated CATMA repertoire is available through the CATMA database http://www.catma.org.
Resumo:
The efficient removal of a N- or C-terminal purification tag from a fusion protein is necessary to obtain a protein in a pure and active form, ready for use in human or animal medicine. Current techniques based on enzymatic cleavage are expensive and result in the presence of additional amino acids at either end of the proteins, as well as contaminating proteases in the preparation. Here we evaluate an alternative method to the one-step affinity/protease purification process for large-scale purification. It is based upon the cyanogen bromide (CNBr) cleavage at a single methionine placed in between a histidine tag and a Plasmodium falciparum antigen. The C-terminal segment of the circumsporozoite polypeptide was expressed as a fusion protein with a histidine tag in Escherichia coli purified by Ni-NAT agarose column chromatography and subsequently cleaved by CNBr to obtain a polypeptide without any extraneous amino acids derived from the cleavage site or from the affinity purification tag. Thus, a recombinant protein is produced without the need for further purification, demonstrating that CNBr cleavage is a precise, efficient, and low-cost alternative to enzymatic digestion, and can be applied to large-scale preparations of recombinant proteins.
Resumo:
MHC-peptide tetramers have become essential tools for T-cell analysis, but few MHC class II tetramers incorporating peptides from human tumor and self-antigens have been developed. Among limiting factors are the high polymorphism of class II molecules and the low binding capacity of the peptides. Here, we report the generation of molecularly defined tetramers using His-tagged peptides and isolation of folded MHC/peptide monomers by affinity purification. Using this strategy we generated tetramers of DR52b (DRB3*0202), an allele expressed by approximately half of Caucasians, incorporating an epitope from the tumor antigen NY-ESO-1. Molecularly defined tetramers avidly and stably bound to specific CD4(+) T cells with negligible background on nonspecific cells. Using molecularly defined DR52b/NY-ESO-1 tetramers, we could demonstrate that in DR52b(+) cancer patients immunized with a recombinant NY-ESO-1 vaccine, vaccine-induced tetramer-positive cells represent ex vivo in average 1:5,000 circulating CD4(+) T cells, include central and transitional memory polyfunctional populations, and do not include CD4(+)CD25(+)CD127(-) regulatory T cells. This approach may significantly accelerate the development of reliable MHC class II tetramers to monitor immune responses to tumor and self-antigens.
Resumo:
We previously introduced two new protein databases (trEST and trGEN) of hypothetical protein sequences predicted from EST and HTG sequences, respectively. Here, we present the updates made on these two databases plus a new database (trome), which uses alignments of EST data to HTG or full genomes to generate virtual transcripts and coding sequences. This new database is of higher quality and since it contains the information in a much denser format it is of much smaller size. These new databases are in a Swiss-Prot-like format and are updated on a weekly basis (trEST and trGEN) or every 3 months (trome). They can be downloaded by anonymous ftp from ftp://ftp.isrec.isb-sib.ch/pub/databases.
Resumo:
A recent phase 1 trial has demonstrated that the generation of tumor-reactive T lymphocytes by transfer of specific T-cell receptor (TCR) genes into autologous lymphocytes is feasible. However, compared with results obtained by infusion of tumor-infiltrating lymphocytes, the response rate observed in this first TCR gene therapy trial is low. One strategy that is likely to enhance the success rate of TCR gene therapy is the use of tumor-reactive TCRs with a higher capacity for tumor cell recognition. We therefore sought to develop standardized procedures for the selection of well-expressed, high-affinity, and safe human TCRs. Here we show that TCR surface expression can be improved by modification of TCR alpha and beta sequences and that such improvement has a marked effect on the in vivo function of TCR gene-modified T cells. From a panel of human, melanoma-reactive TCRs we subsequently selected the TCR with the highest affinity. Furthermore, a generally applicable assay was used to assess the lack of alloreactivity of this TCR against a large series of common human leukocyte antigen alleles. The procedures described in this study should be of general value for the selection of well- and stably expressed, high-affinity, and safe human TCRs for subsequent clinical testing.
Resumo:
Among the largest resources for biological sequence data is the large amount of expressed sequence tags (ESTs) available in public and proprietary databases. ESTs provide information on transcripts but for technical reasons they often contain sequencing errors. Therefore, when analyzing EST sequences computationally, such errors must be taken into account. Earlier attempts to model error prone coding regions have shown good performance in detecting and predicting these while correcting sequencing errors using codon usage frequencies. In the research presented here, we improve the detection of translation start and stop sites by integrating a more complex mRNA model with codon usage bias based error correction into one hidden Markov model (HMM), thus generalizing this error correction approach to more complex HMMs. We show that our method maintains the performance in detecting coding sequences.
Resumo:
The long term goal of this research is to develop a program able to produce an automatic segmentation and categorization of textual sequences into discourse types. In this preliminary contribution, we present the construction of an algorithm which takes a segmented text as input and attempts to produce a categorization of sequences, such as narrative, argumentative, descriptive and so on. Also, this work aims at investigating a possible convergence between the typological approach developed in particular in the field of text and discourse analysis in French by Adam (2008) and Bronckart (1997) and unsupervised statistical learning.
Resumo:
The Miocene PX1 gabbro-pyroxenite pluton, Fuerteventura, Canary Islands, is a 3.5 x 5.5 km shallow-level intrusion (0.15-0.2 GPa and 1100-1120 degrees C), interpreted as the feeder-zone to an ocean-island volcano. It displays a vertical magmatic banding expressed in five 50 to 100 metre-wide NNE-SSW trending alkaline gabbro sequences alternating with pyroxenites. This emplacement geometry was controlled by brittle to ductile shear zones, generated by a regional E-W extensional tectonic setting that affected Fuerteventura during the Miocene. At a smaller scale, the PX1 gabbro and pyroxenite bands consist of metre-thick differentiation units, which suggest emplacement by periodic injection of magma pulses as vertical dykes that amalgamated, similarly to a sub-volcanic sheeted dyke complex. Individual dykes underwent internal differentiation following a solidification front parallel to the dyke edges. This solidification front may have been favoured by a significant lateral/horizontal thermal gradient, expressed by the vertical banding in the gabbros, the fractionation asymmetry within individual dykes and the migmatisation of the wall rocks. Pyroxenitic layers result from the fractionation and accumulation of clinopyroxene +/- olivine +/- plagioclase crystals from a mildly alkaline basaltic liquid. They are interpreted as truncated differentiation sequences, from which residual melts were extracted at various stages of their chemical evolution by subsequent dyke intrusions, either next to or within the crystallising unit. Compaction and squeezing of the crystal mush is ascribed to the incoming and inflating magma pulses. The expelled interstitial liquid was likely collected and erupted along with the magma flowing through the newly injected dykes. Clinopyroxene mineral orientation - as evidenced by EBSD and micro X-ray tomography investigations - displays a marked pure-shear component, supporting the interpretation of the role of compaction in the generation of the pyroxenites. Conversely, gabbro sequences underwent minor melt extraction and are believed to represent crystallised coalesced magma batches emplaced at lower rates at the end of eruptive cycles. Clinopyroxene orientations in gabbros record a simple shear component suggesting syn-magmatic deformation parallel to observed NNE-SSW trending shear zones induced by the regional tensional stress field. This emplacement model implies a crystallisation time of 1 to 5 years for individual dykes, consistent with PX1 emplacement over less than 0.5 My. A minimum amount of approximately 150 km(3) of magma is needed to generate the pluton, part of it having been erupted through the Central Volcanic Centre of Fuerteventura. If the regional extensional tectonic regime controls the PX1 feeder-zone initiation and overall geometry, rates and volumes of magma depend on other, source-related factors. High injection rates are likely to induce intrusion growth rates larger than could be accommodated by the regional extension. In this case, dyke intrusion by propagation of a weak tip, combined with the inability of magma to circulate through previously emplaced and crystallised dykes could result in an increase of non-lithostatic pressure on previously emplaced mushy dyke walls; thus generating strong pure-shear compaction within the pluton feeder-zone and interstitial melt expulsion. These compaction-dominated processes are recorded by the cumulitic pyroxenite bands. (C) 2010 Elsevier B.V. All rights reserved.
Resumo:
The lymphatic vasculature is important for the regulation of tissue fluid homeostasis, immune response, and lipid absorption, and the development of in vitro models should allow for a better understanding of the mechanisms regulating lymphatic vascular growth, repair, and function. Here we report isolation and characterization of lymphatic endothelial cells from human intestine and show that intestinal lymphatic endothelial cells have a related but distinct gene expression profile from human dermal lymphatic endothelial cells. Furthermore, we identify liprin beta1, a member of the family of LAR transmembrane tyrosine phosphatase-interacting proteins, as highly expressed in intestinal lymphatic endothelial cells in vitro and lymphatic vasculature in vivo, and show that it plays an important role in the maintenance of lymphatic vessel integrity in Xenopus tadpoles.