22 resultados para PROTO-SPLICE SITES

em Consorci de Serveis Universitaris de Catalunya (CSUC), Spain


Relevância:

100.00% 100.00%

Publicador:

Resumo:

We have carried out an initial analysis of the dynamics of the recent evolution of the splice-sites sequences on a large collection of human, rodent (mouse and rat), and chicken introns. Our results indicate that the sequences of splice sites are largely homogeneous within tetrapoda. We have also found that orthologous splice signals between human and rodents and within rodents are more conserved than unrelated splice sites, but the additional conservation can be explained mostly by background intron conservation. In contrast, additional conservation over background is detectable in orthologous mammalian and chicken splice sites. Our results also indicate that the U2 and U12 intron classes seem to have evolved independently since the split of mammals and birds; we have not been able to find a convincing case of interconversion between these two classes in our collections of orthologous introns. Similarly, we have not found a single case of switching between AT-AC and GT-AG subtypes within U12 introns, suggesting that this event has been a rare occurrence in recent evolutionary times. Switching between GT-AG and the noncanonical GC-AG U2 subtypes, on the contrary, does not appear to be unusual; in particular, T to C mutations appear to be relatively well tolerated in GT-AG introns with very strong donor sites.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

GeneID is a program to predict genes in anonymous genomic sequences designed with a hierarchical structure. In the first step, splice sites, and start and stop codons are predicted and scored along the sequence using position weight matrices (PWMs). In the second step, exons are built from the sites. Exons are scored as the sum of the scores of the defining sites, plus the log-likelihood ratio of a Markov model for coding DNA. In the last step, from the set of predicted exons, the gene structure is assembled, maximizing the sum of the scores of the assembled exons. In this paper we describe the obtention of PWMs for sites, and the Markov model of coding DNA in Drosophila melanogaster. We also compare other models of coding DNA with the Markov model. Finally, we present and discuss the results obtained when GeneID is used to predict genes in the Adh region. These results show that the accuracy of GeneID predictions compares currently with that of other existing tools but that GeneID is likely to be more efficient in terms of speed and memory usage.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Understanding the molecular mechanisms responsible for the regulation of the transcriptome present in eukaryotic cells isone of the most challenging tasks in the postgenomic era. In this regard, alternative splicing (AS) is a key phenomenoncontributing to the production of different mature transcripts from the same primary RNA sequence. As a plethora ofdifferent transcript forms is available in databases, a first step to uncover the biology that drives AS is to identify thedifferent types of reflected splicing variation. In this work, we present a general definition of the AS event along with anotation system that involves the relative positions of the splice sites. This nomenclature univocally and dynamically assignsa specific ‘‘AS code’’ to every possible pattern of splicing variation. On the basis of this definition and the correspondingcodes, we have developed a computational tool (AStalavista) that automatically characterizes the complete landscape of ASevents in a given transcript annotation of a genome, thus providing a platform to investigate the transcriptome diversityacross genes, chromosomes, and species. Our analysis reveals that a substantial part—in human more than a quarter—ofthe observed splicing variations are ignored in common classification pipelines. We have used AStalavista to investigate andto compare the AS landscape of different reference annotation sets in human and in other metazoan species and found thatproportions of AS events change substantially depending on the annotation protocol, species-specific attributes, andcoding constraints acting on the transcripts. The AStalavista system therefore provides a general framework to conductspecific studies investigating the occurrence, impact, and regulation of AS.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Background: Freshwater planarians are an attractive model for regeneration and stem cell research and have become a promising tool in the field of regenerative medicine. With the availability of a sequenced planarian genome, the recent application of modern genetic and high-throughput tools has resulted in revitalized interest in these animals, long known for their amazing regenerative capabilities, which enable them to regrow even a new head after decapitation. However, a detailed description of the planarian transcriptome is essential for future investigation into regenerative processes using planarians as a model system. Results: In order to complement and improve existing gene annotations, we used a 454 pyrosequencing approach to analyze the transcriptome of the planarian species Schmidtea mediterranea Altogether, 598,435 454-sequencing reads, with an average length of 327 bp, were assembled together with the ~10,000 sequences of the S. mediterranea UniGene set using different similarity cutoffs. The assembly was then mapped onto the current genome data. Remarkably, our Smed454 dataset contains more than 3 million novel transcribed nucleotides sequenced for the first time. A descriptive analysis of planarian splice sites was conducted on those Smed454 contigs that mapped univocally to the current genome assembly. Sequence analysis allowed us to identify genes encoding putative proteins with defined structural properties, such as transmembrane domains. Moreover, we annotated the Smed454 dataset using Gene Ontology, and identified putative homologues of several gene families that may play a key role during regeneration, such as neurotransmitter and hormone receptors, homeobox-containing genes, and genes related to eye function. Conclusions: We report the first planarian transcript dataset, Smed454, as an open resource tool that can be accessed via a web interface. Smed454 contains significant novel sequence information about most expressed genes of S. mediterranea. Analysis of the annotated data promises to contribute to identification of gene families poorly characterized at a functional level. The Smed454 transcriptome data will assist in the molecular characterization of S. mediterranea as a model organism, which will be useful to a broad scientific community.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Background Freshwater planarians are an attractive model for regeneration and stem cell research and have become a promising tool in the field of regenerative medicine. With the availability of a sequenced planarian genome, the recent application of modern genetic and high-throughput tools has resulted in revitalized interest in these animals, long known for their amazing regenerative capabilities, which enable them to regrow even a new head after decapitation. However, a detailed description of the planarian transcriptome is essential for future investigation into regenerative processes using planarians as a model system. Results In order to complement and improve existing gene annotations, we used a 454 pyrosequencing approach to analyze the transcriptome of the planarian species Schmidtea mediterranea Altogether, 598,435 454-sequencing reads, with an average length of 327 bp, were assembled together with the ~10,000 sequences of the S. mediterranea UniGene set using different similarity cutoffs. The assembly was then mapped onto the current genome data. Remarkably, our Smed454 dataset contains more than 3 million novel transcribed nucleotides sequenced for the first time. A descriptive analysis of planarian splice sites was conducted on those Smed454 contigs that mapped univocally to the current genome assembly. Sequence analysis allowed us to identify genes encoding putative proteins with defined structural properties, such as transmembrane domains. Moreover, we annotated the Smed454 dataset using Gene Ontology, and identified putative homologues of several gene families that may play a key role during regeneration, such as neurotransmitter and hormone receptors, homeobox-containing genes, and genes related to eye function. Conclusions We report the first planarian transcript dataset, Smed454, as an open resource tool that can be accessed via a web interface. Smed454 contains significant novel sequence information about most expressed genes of S. mediterranea. Analysis of the annotated data promises to contribute to identification of gene families poorly characterized at a functional level. The Smed454 transcriptome data will assist in the molecular characterization of S. mediterranea as a model organism, which will be useful to a broad scientific community.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

This report presents systematic empirical annotation of transcript products from 399 annotated protein-coding loci across the 1% of the human genome targeted by the Encyclopedia of DNA elements (ENCODE) pilot project using a combination of 5' rapid amplification of cDNA ends (RACE) and high-density resolution tiling arrays. We identified previously unannotated and often tissue- or cell-line-specific transcribed fragments (RACEfrags), both 5' distal to the annotated 5' terminus and internal to the annotated gene bounds for the vast majority (81.5%) of the tested genes. Half of the distal RACEfrags span large segments of genomic sequences away from the main portion of the coding transcript and often overlap with the upstream-annotated gene(s). Notably, at least 20% of the resultant novel transcripts have changes in their open reading frames (ORFs), most of them fusing ORFs of adjacent transcripts. A significant fraction of distal RACEfrags show expression levels comparable to those of known exons of the same locus, suggesting that they are not part of very minority splice forms. These results have significant implications concerning (1) our current understanding of the architecture of protein-coding genes; (2) our views on locations of regulatory regions in the genome; and (3) the interpretation of sequence polymorphisms mapping to regions hitherto considered to be "noncoding," ultimately relating to the identification of disease-related sequence alterations.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

La llegada de las tecnologías de la información, y su uso cada vez mayor y generalizado que se está haciendo de ellas, ha permitido para que una nueva situación aparezca que, a su vez, ha originado - de hecho, que ha precipitado - una serie de cambios de gran importancia en todos los niveles, pero especialmente en los niveles sociales, económicos y culturales. Este nueva escena ha tenido una gran influencia en el entorno pedagógico. De hecho, la aparición de nuevos modelos educativos como resultado de este cambio ha estado sucediendo de una manera continuada e ininterrumpida durante la década pasada. Estos cambios recientes en los sistemas actuales de enseñanza y de aprendizaje han significado un aumento y un cambio en el tipo de demandas hechas desde las bibliotecas y los centros de documentación.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The purpose of this work is determine the extent of closure between measurements and models of UV irradiances at diverse sites using state of the art instruments, models, and the best available data as inputs to the models. These include information about aerosol optical depth (unfortunately not extending down as far into the UVB region as desirable because such information is not generally available), ozone column amounts, as well as vertical profiles of temperature. We concentrate on clear-sky irradiances, and report the results in terms of UV Index (UVI)

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Genetic and functional data indicate that variation in the expression of the neurotrophin-3 receptor gene (NTRK3) may have an impact on neuronal plasticity, suggesting a role for NTRK3 in the pathophysiology of anxiety disorders. MicroRNA (miRNA) posttranscriptional gene regulators act by base-pairing to specific sequence sites, usually at the 3'UTR of the target mRNA. Variants at these sites might result in gene expression changes contributing to disease susceptibility. We investigated genetic variation in two different isoforms of NTRK3 as candidate susceptibility factors for anxiety by resequencing their 3'UTRs in patients with panic disorder (PD), obsessive-compulsive disorder (OCD), and in controls. We have found the C allele of rs28521337, located in a functional target site for miR-485-3p in the truncated isoform of NTRK3, to be significantly associated with the hoarding phenotype of OCD. We have also identified two new rare variants in the 3'UTR of NTRK3, ss102661458 and ss102661460, each present only in one chromosome of a patient with PD. The ss102661458 variant is located in a functional target site for miR-765, and the ss102661460 in functional target sites for two miRNAs, miR-509 and miR-128, the latter being a brain-enriched miRNA involved in neuronal differentiation and synaptic processing. Interestingly, these two variants significantly alter the miRNA-mediated regulation of NTRK3, resulting in recovery of gene expression. These data implicate miRNAs as key posttranscriptional regulators of NTRK3 and provide a framework for allele-specific miRNA regulation of NTRK3 in anxiety disorders.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

White adipose tissue (WAT) is a disperse organ acting as energy storage depot and endocrine/paracrine controlling factor in the management of energy availability and inflammation. WAT sites response under energy-related stress is not uniform. In the present study we have analyzed how different WAT sites respond to limited food restriction as a way to better understand the role of WAT in the pathogenesis of the metabolic syndrome.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

En internet encontramos gran cantidad de información científico-técnica cuya validez no suele estar controlada por comités correctores. Para aprovechar estos recursos es necesario filtrar y facilitar el acceso del usuario a la información. En este artículo se expone la experiencia práctica en el desarrollo de una página WEB centrada en las actividades del grupo de investigación «Calidad Nutricional y Tecnología de los Lípidos». Los objetivos de esta página WEB fueron los siguientes: difusión de las actividades del grupo de investigación, aprovechar los recursos que ofrece internet y fomentar y facilitar su uso. Esta experiencia permitió presentar una metodología de trabajo eficaz para conseguir estos objetivos. Finalmente, se presentan un gran número de direcciones WEB agrupadas por apartados en el ámbito de los lípidos. Estas direcciones han sido rigurosamente seleccionadas, entre un gran número de referencias consultadas, siguiendo una serie de criterios que se discuten en este trabajo, para ofrecer aquellas que presentan un mayor interés práctico.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Information about the genomic coordinates and the sequence of experimentally identified transcription factor binding sites is found scattered under a variety of diverse formats. The availability of standard collections of such high-quality data is important to design, evaluate and improve novel computational approaches to identify binding motifs on promoter sequences from related genes. ABS (http://genome.imim.es/datasets/abs2005/index.html) is a public database of known binding sites identified in promoters of orthologous vertebrate genes that have been manually curated from bibliography. We have annotated 650 experimental binding sites from 68 transcription factors and 100 orthologous target genes in human, mouse, rat or chicken genome sequences. Computational predictions and promoter alignment information are also provided for each entry. A simple and easy-to-use web interface facilitates data retrieval allowing different views of the information. In addition, the release 1.0 of ABS includes a customizable generator of artificial datasets based on the known sites contained in the collection and an evaluation tool to aid during the training and the assessment of motif-finding programs.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The gibbon genome exhibits extensive karyotypic diversity with an increased rate of chromosomal rearrangements during evolution. In an effort to understand the mechanistic origin and implications of these rearrangement events, we sequenced 24 synteny breakpoint regions in the white-cheeked gibbon (Nomascus leucogenys, NLE) in the form of high-quality BAC insert sequences (4.2 Mbp). While there is a significant deficit of breakpoints in genes, we identified seven human gene structures involved in signaling pathways (DEPDC4, GNG10), phospholipid metabolism (ENPP5, PLSCR2), beta-oxidation (ECH1), cellular structure and transport (HEATR4), and transcription (ZNF461), that have been disrupted in the NLE gibbon lineage. Notably, only three of these genes show the expected evolutionary signatures of pseudogenization. Sequence analysis of the breakpoints suggested both nonclassical nonhomologous end-joining (NHEJ) and replication-based mechanisms of rearrangement. A substantial number (11/24) of human-NLE gibbon breakpoints showed new insertions of gibbon-specific repeats and mosaic structures formed from disparate sequences including segmental duplications, LINE, SINE, and LTR elements. Analysis of these sites provides a model for a replication-dependent repair mechanism for double-strand breaks (DSBs) at rearrangement sites and insights into the structure and formation of primate segmental duplications at sites of genomic rearrangements during evolution.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Forensic archaeology has been of inestimable help in the location and excavation of clandestine burials (Hunter 1996a:16-17, 1996b; Killam 2004; Levine et al. 1984). Landscape archaeology techniques have been adapted and widely applied for such purposes. In this article, an adaptation of one of the most common applications of archaeological GIS, that is predictive site location, is applied to the detection of body dump sites and clandestine burial sites, bringing together in the process the fields of landscape archaeology, forensic sciences, and GIS.