34 resultados para rna sequence

em Consorci de Serveis Universitaris de Catalunya (CSUC), Spain


Relevância:

60.00% 60.00%

Publicador:

Resumo:

The “one-gene, one-protein” rule, coined by Beadle and Tatum, has been fundamental to molecular biology. The rule implies that the genetic complexity of an organism depends essentially on its gene number. The discovery, however, that alternative gene splicing and transcription are widespread phenomena dramatically altered our understanding of the genetic complexity of higher eukaryotic organisms; in these, a limited number of genes may potentially encode a much larger number of proteins. Here we investigate yet another phenomenon that may contribute to generate additional protein diversity. Indeed, by relying on both computational and experimental analysis, we estimate that at least 4%–5% of the tandem gene pairs in the human genome can be eventually transcribed into a single RNA sequence encoding a putative chimeric protein. While the functional significance of most of these chimeric transcripts remains to be determined, we provide strong evidence that this phenomenon does not correspond to mere technical artifacts and that it is a common mechanism with the potential of generating hundreds of additional proteins in the human genome.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Understanding the molecular mechanisms responsible for the regulation of the transcriptome present in eukaryotic cells isone of the most challenging tasks in the postgenomic era. In this regard, alternative splicing (AS) is a key phenomenoncontributing to the production of different mature transcripts from the same primary RNA sequence. As a plethora ofdifferent transcript forms is available in databases, a first step to uncover the biology that drives AS is to identify thedifferent types of reflected splicing variation. In this work, we present a general definition of the AS event along with anotation system that involves the relative positions of the splice sites. This nomenclature univocally and dynamically assignsa specific ‘‘AS code’’ to every possible pattern of splicing variation. On the basis of this definition and the correspondingcodes, we have developed a computational tool (AStalavista) that automatically characterizes the complete landscape of ASevents in a given transcript annotation of a genome, thus providing a platform to investigate the transcriptome diversityacross genes, chromosomes, and species. Our analysis reveals that a substantial part—in human more than a quarter—ofthe observed splicing variations are ignored in common classification pipelines. We have used AStalavista to investigate andto compare the AS landscape of different reference annotation sets in human and in other metazoan species and found thatproportions of AS events change substantially depending on the annotation protocol, species-specific attributes, andcoding constraints acting on the transcripts. The AStalavista system therefore provides a general framework to conductspecific studies investigating the occurrence, impact, and regulation of AS.

Relevância:

60.00% 60.00%

Publicador:

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Background: We present the results of EGASP, a community experiment to assess the state-ofthe-art in genome annotation within the ENCODE regions, which span 1% of the human genomesequence. The experiment had two major goals: the assessment of the accuracy of computationalmethods to predict protein coding genes; and the overall assessment of the completeness of thecurrent human genome annotations as represented in the ENCODE regions. For thecomputational prediction assessment, eighteen groups contributed gene predictions. Weevaluated these submissions against each other based on a ‘reference set’ of annotationsgenerated as part of the GENCODE project. These annotations were not available to theprediction groups prior to the submission deadline, so that their predictions were blind and anexternal advisory committee could perform a fair assessment.Results: The best methods had at least one gene transcript correctly predicted for close to 70%of the annotated genes. Nevertheless, the multiple transcript accuracy, taking into accountalternative splicing, reached only approximately 40% to 50% accuracy. At the coding nucleotidelevel, the best programs reached an accuracy of 90% in both sensitivity and specificity. Programsrelying on mRNA and protein sequences were the most accurate in reproducing the manuallycurated annotations. Experimental validation shows that only a very small percentage (3.2%) of the selected 221 computationally predicted exons outside of the existing annotation could beverified.Conclusions: This is the first such experiment in human DNA, and we have followed thestandards established in a similar experiment, GASP1, in Drosophila melanogaster. We believe theresults presented here contribute to the value of ongoing large-scale annotation projects and shouldguide further experimental methods when being scaled up to the entire human genome sequence.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Selenoproteins are a diverse group of proteinsusually misidentified and misannotated in sequencedatabases. The presence of an in-frame UGA (stop)codon in the coding sequence of selenoproteingenes precludes their identification and correctannotation. The in-frame UGA codons are recodedto cotranslationally incorporate selenocysteine,a rare selenium-containing amino acid. The developmentof ad hoc experimental and, more recently,computational approaches have allowed the efficientidentification and characterization of theselenoproteomes of a growing number of species.Today, dozens of selenoprotein families have beendescribed and more are being discovered in recentlysequenced species, but the correct genomic annotationis not available for the majority of thesegenes. SelenoDB is a long-term project that aims toprovide, through the collaborative effort of experimentaland computational researchers, automaticand manually curated annotations of selenoproteingenes, proteins and SECIS elements. Version 1.0 ofthe database includes an initial set of eukaryoticgenomic annotations, with special emphasis on thehuman selenoproteome, for immediate inspectionby selenium researchers or incorporation into moregeneral databases. SelenoDB is freely available athttp://www.selenodb.org.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

The design and synthesis of two Janus-type heterocycles with the capacity to simultaneously recognize guanine and uracyl in G-U mismatched pairs through complementary hydrogen bond pairing is described. Both compounds were conveniently functionalized with a carboxylic function and efficiently attached to a tripeptide sequence by using solid-phase methodologies. Ligands based on the derivatization of such Janus compounds with a small aminoglycoside, neamine, and its guanidinylated analogue have been synthesized, and their interaction with Tau RNA has been investigated by using several biophysical techniques, including UV-monitored melting curves, fluorescence titration experiments, and 1H NMR. The overall results indicated that Janus-neamine/guanidinoneamine showed some preference for the +3 mutated RNA sequence associated with the development of some tauopathies, although preliminary NMR studies have not confirmed binding to G-U pairs. Moreover, a good correlation has been found between the RNA binding affinity of such Janus-containing ligands and their ability to stabilize this secondary structure upon complexation.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

This article introduces a new interface for T-Coffee, a consistency-based multiple sequence alignment program. This interface provides an easy and intuitive access to the most popular functionality of the package. These include the default T-Coffee mode for protein and nucleic acid sequences, the M-Coffee mode that allows combining the output of any other aligners, and template-based modes of T-Coffee that deliver high accuracy alignments while using structural or homology derived templates. These three available template modes are Expresso for the alignment of protein with a known 3D-Structure, R-Coffee to align RNA sequences with conserved secondary structures and PSI-Coffee to accurately align distantly related sequences using homology extension. The new server benefits from recent improvements of the T-Coffee algorithm and can align up to 150 sequences as long as 10 000 residues and is available from both http://www.tcoffee.org and its main mirror http://tcoffee.crg.cat.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Selenoproteins contain the amino acid selenocysteine which is encoded by a UGA Sec codon. Recoding UGA Sec requires a complex mechanism, comprising the cis-acting SECIS RNA hairpin in the 3′UTR of selenoprotein mRNAs, and trans-acting factors. Among these, the SECIS Binding Protein 2 (SBP2) is central to the mechanism. SBP2 has been so far functionally characterized only in rats and humans. In this work, we report the characterization of the Drosophila melanogaster SBP2 (dSBP2). Despite its shorter length, it retained the same selenoprotein synthesis-promoting capabilities as the mammalian counterpart. However, a major difference resides in the SECIS recognition pattern: while human SBP2 (hSBP2) binds the distinct form 1 and 2 SECIS RNAs with similar affinities, dSBP2 exhibits high affinity toward form 2 only. In addition, we report the identification of a K (lysine)-rich domain in all SBP2s, essential for SECIS and 60S ribosomal subunit binding, differing from the well-characterized L7Ae RNA-binding domain. Swapping only five amino acids between dSBP2 and hSBP2 in the K-rich domain conferred reversed SECIS-binding properties to the proteins, thus unveiling an important sequence for form 1 binding.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Gene set enrichment (GSE) analysis is a popular framework for condensing information from gene expression profiles into a pathway or signature summary. The strengths of this approach over single gene analysis include noise and dimension reduction, as well as greater biological interpretability. As molecular profiling experiments move beyond simple case-control studies, robust and flexible GSE methodologies are needed that can model pathway activity within highly heterogeneous data sets. To address this challenge, we introduce Gene Set Variation Analysis (GSVA), a GSE method that estimates variation of pathway activity over a sample population in an unsupervised manner. We demonstrate the robustness of GSVA in a comparison with current state of the art sample-wise enrichment methods. Further, we provide examples of its utility in differential pathway activity and survival analysis. Lastly, we show how GSVA works analogously with data from both microarray and RNA-seq experiments. GSVA provides increased power to detect subtle pathway activity changes over a sample population in comparison to corresponding methods. While GSE methods are generally regarded as end points of a bioinformatic analysis, GSVA constitutes a starting point to build pathway-centric models of biology. Moreover, GSVA contributes to the current need of GSE methods for RNA-seq data. GSVA is an open source software package for R which forms part of the Bioconductor project and can be downloaded at http://www.bioconductor.org.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Background MicroRNAs (miRNAs) are short non-coding regulatory RNAs that control gene expression usually producing translational repression and gene silencing. High-throughput sequencing technologies have revealed heterogeneity at length and sequence level for the majority of mature miRNAs (IsomiRs). Most isomiRs can be explained by variability in either Dicer1 or Drosha cleavage during miRNA biogenesis at 5" or 3" of the miRNA (trimming variants). Although isomiRs have been described in different tissues and organisms, their functional validation as modulators of gene expression remains elusive. Here we have characterized the expression and function of a highly abundant miR-101 5"-trimming variant (5"-isomiR-101). Results The analysis of small RNA sequencing data in several human tissues and cell lines indicates that 5"-isomiR-101 is ubiquitously detected and a highly abundant, especially in the brain. 5"- isomiR-101 was found in Ago-2 immunocomplexes and complementary approaches showed that 5"-isomiR-101 interacted with different members of the silencing (RISC) complex. In addition, 5"-isomiR-101 decreased the expression of five validated miR-101 targets, suggesting that it is a functional variant. Both the binding to RISC members and the degree of silencing were less efficient for 5"-isomiR-101 compared with miR-101. For some targets, both miR-101 and 5"-isomiR-101 significantly decreased protein expression with no changes in the respective mRNA levels. Although a high number of overlapping predicted targets suggest similar targeted biological pathways, a correlation analysis of the expression profiles of miR-101 variants and predicted mRNA targets in human brains at different ages, suggest specific functions for miR-101- and 5"-isomiR-101. Conclusions These results suggest that isomiRs are functional variants and further indicate that for a given miRNA, the different isomiRs may contribute to the overall effect as quantitative and qualitative fine-tuners of gene expression.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Els dominis d’activació (ADs) de les procarboxipeptidases de la subfamília A/B sempre han sorprès ja que representen una quarta part del proenzim. S’han realitzat alguns estudis per intentar descobrir-ne alguna possible funció alternativa, però no han estat fructífers. El descobriment de l’elevada velocitat de plegament del domini d’activació de la procarboxipeptidasa A2 humana, (ADA2h), emperò, va portar a proposar la possibilitat de que realitzessin una funció d’assistència al plegament del domini enzimàtic. Posteriorment, l’anàlisi del plegament d’ADA2h a pH baix va revelar la capacitat d’aquest domini per formar fibres amiloides, a més de demostrar que un increment de l’estabilitat proteica podia prevenir la formació d’aquests agregats. La profunda caracterització del plegament d’ADA2h va fer que aquesta proteïna fos un bon model amiloidogènic, de manera que es van proposar un seguit d’experiments que s’han desenvolupat en el present treball per tal de conèixer millor aquest procés. S’han dut a terme estudis cinètics d’agregació per tal de valorar la contribució dels diferents aminoàcids de la seqüència polipeptídica, utilitzant 29 variants puntuals d’ADA2h. Es va eliminar la contribució de l’estabilitat mitjançant la utilització d’urea, i per dicroïsme circular conjuntament amb un aparell de flux detingut, es van obtenir dues velocitats diferents, v1 i v2, que corresponen a la formació d’un intermediari i a la seva reorganització, respectivament. Experiments complementaris utilitzant espectroscòpia d’infraroig (IR) revelaren la reorganització de l’estat natiu (en aquest cas) per a donar la forma agregada. Les cinètiques d’IR van mostrar que ADA2h forma l’estructura _ típica de les fibres amiloides, previ desplegament les seves hèlixs-_. Finalment, s’han realitzat estudis de biocomputació per tal d’esbrinar possibles funcions alternatives dels ADs. Les superposicions estructurals semblen mostrar similaritat dels ADs amb dominis de reconeixement d’RNA (RRM). Aquesta hipòtesi s’ha comprovat experimentalment amb ADA4h, mostrant una dèbil, però existent, unió a RNA.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

El GB virus C (GBV-C) o virus de l'hepatitis G (HGV) es un virus format per una única cadena de RNA que pertany a la familia Flaviviridae. En els últims anys, s'han publicat nombrosos treballs en els quals s'associa la coinfecció del GBV-C i del virus de la immunodeficiència humana (VIH) amb una menor progressió de l'esmentada malaltia així com amb una major supervivència dels pacients una vegada que la SIDA s'ha desenvolupat. El mecanisme pel qual el virus GBV-C/HGV exerceix un “efecte protector” en els pacients amb VIH encara no està descrit. L’estudi de la interacció entre els virus GBVC/HGV i VIH podria donar lloc al desenvolupament de nous agents terapèutics per al tractament de la SIDA.Treballs recents mostren com la capacitat inhibitòria del virus del GBV-C/HGV és deguda a la seva glicoproteina estructural E2. S’ha vist que aquesta proteina seria capaç d’inhibir la primera fase de replicació de VIH, així com la unió i la fusió amb les membranes cel•lulars. Sobre la base d’aquests estudis, l’objectiu d’aquest treball ha estat seleccionar inhibidors del pèptid de fusió del VIH utilitzant pèptids sintètics de la proteina E2 del GBV-C/HGV. El treball realitzat ha consistit en estudiar, utilitzant assajos biofísics de leakage i de lipid mixing, la capacitat dels pèptids de la proteina estructural del virus del GBV-C/HGV per inhibir la interacció i el procés de desestabilització de membranes induïdes pel pèptid de fusió de la glicoproteina de l’embolcall, GP41, del VIH. Aquests assajos, com es descriu en treballs anteriors, han resultat útils per a la selecció i la identificació de compostos amb activitat específica anti-GP41. Es pot afirmar que efectivament els pèptids seleccionats de la proteina E2 del virus del GBV-C/HGV inhibeixen l’activitat del pèptid de fusió del VIH probablement com a consequència d’un canvi conformacional en aquest darrer.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The genetic diversity of three temperate fruit tree phytoplasmas ‘Candidatus Phytoplasma prunorum’, ‘Ca. P. mali’ and ‘Ca. P. pyri’ has been established by multilocus sequence analysis. Among the four genetic loci used, the genes imp and aceF distinguished 30 and 24 genotypes, respectively, and showed the highest variability. Percentage of substitution for imp ranged from 50 to 68% according to species. Percentage of substitution varied between 9 and 12% for aceF, whereas it was between 5 and 6% for pnp and secY. In the case of ‘Ca P. prunorum’ the three most prevalent aceF genotypes were detected in both plants and insect vectors, confirming that the prevalent isolates are propagated by insects. The four isolates known to be hypo-virulent had the same aceF sequence, indicating a possible monophyletic origin. Haplotype network reconstructed by eBURST revealed that among the 34 haplotypes of ‘Ca. P. prunorum’, the four hypo-virulent isolates also grouped together in the same clade. Genotyping of some Spanish and Azerbaijanese ‘Ca. P. pyri’ isolates showed that they shared some alleles with ‘Ca. P. prunorum’, supporting for the first time to our knowledge, the existence of inter-species recombination between these two species.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

In order to search for novel genes involved in cell proliferation, the hypothesis was that by infecting primary cells with a cDNA library of immortal cells would render immortalizing genes. Consequently it has been discovered CIRP (Cold inducible RNA-binding protein). Mammalian cells exposed to mild hypothermia show a general inhibition of protein synthesis and a concomitant increase in the expression of a small number of cold-shock mRNAs and proteins. Rbm3, another RNA binding protein belonging to the same family, has been postulated to facilitate protein synthesis at mild cold shock. To investigate if the same occurs for CIRP, CIRP was overexpressed in primary cells and protein sintesis was measured. Interestingly, CIRP increased protein synthesis, however, such increase did not involve an increase in the polysome fraction or affected the ribosome profile. In addition, the effect caused by CIRP inhibition or knockdown was also analyzed. Different siRNAs against CIRP were tested. Once checked their efficiency by decreasing CIRP at mRNA and protein levels, proliferation was tested by BrdU, cell number (DAPI) and proliferation curves were performed. Interestingly, CIRP provoke a decreased proliferation in primary cells: MEFs, HMEC; and cancer cells: TERA2 and HeLa. In conclusion, we describe for the first time that CIRP bypasses replicative senescence when over-expressed at physiological temperature (37ºC) by increasing a general protein synthesis. This effect is achieved through ERK1/2 activation in MEFs.The decrease in growth rate found in mammalian cells treated with mild cold stress is not entirely attributable to arrested metabolism. This decrease may also involve an active process in which CIRP and other stress-responsive proteins play a fundamental role in stimulating proliferation. Although most cell proteins are down-regulated or inhibited with cold stress, CIRP is activated to maintain cells in an active proliferative status and its overexpression at 37°C might be potentially oncogenic.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

When underwater vehicles perform navigation close to the ocean floor, computer vision techniques can be applied to obtain quite accurate motion estimates. The most crucial step in the vision-based estimation of the vehicle motion consists on detecting matchings between image pairs. Here we propose the extensive use of texture analysis as a tool to ameliorate the correspondence problem in underwater images. Once a robust set of correspondences has been found, the three-dimensional motion of the vehicle can be computed with respect to the bed of the sea. Finally, motion estimates allow the construction of a map that could aid to the navigation of the robot