13 resultados para Splice Variants
em Consorci de Serveis Universitaris de Catalunya (CSUC), Spain
Resumo:
Background: The GENCODE consortium was formed to identify and map all protein-coding genes within the ENCODE regions. This was achieved by a combination of initial manualannotation by the HAVANA team, experimental validation by the GENCODE consortium and a refinement of the annotation based on these experimental results.Results: The GENCODE gene features are divided into eight different categories of which onlythe first two (known and novel coding sequence) are confidently predicted to be protein-codinggenes. 5’ rapid amplification of cDNA ends (RACE) and RT-PCR were used to experimentallyverify the initial annotation. Of the 420 coding loci tested, 229 RACE products have beensequenced. They supported 5’ extensions of 30 loci and new splice variants in 50 loci. In addition,46 loci without evidence for a coding sequence were validated, consisting of 31 novel and 15putative transcripts. We assessed the comprehensiveness of the GENCODE annotation byattempting to validate all the predicted exon boundaries outside the GENCODE annotation. Outof 1,215 tested in a subset of the ENCODE regions, 14 novel exon pairs were validated, only twoof them in intergenic regions.Conclusions: In total, 487 loci, of which 434 are coding, have been annotated as part of theGENCODE reference set available from the UCSC browser. Comparison of GENCODEannotation with RefSeq and ENSEMBL show only 40% of GENCODE exons are contained withinthe two sets, which is a reflection of the high number of alternative splice forms with uniqueexons annotated. Over 50% of coding loci have been experimentally verified by 5’ RACE forEGASP and the GENCODE collaboration is continuing to refine its annotation of 1% humangenome with the aid of experimental validation.
Resumo:
Aquest treball ofereix, en primer lloc, una descripció lingüística de les formes que presenta la preposició a en quatre varietats del català (barceloní, varietats occidentals del català central, balear i tortosí). En segon lloc, presenta una anàlisi de les dades en què es demostra que la llengua resol els contactes vocàlics on intervé la preposició a mitjançant mecanismes que s'escapen del comportament fonològic general, és a dir processos d'elisió o fusió, això és mitjançant les formes /an/ o /am(b)/. En aquest sentit, es tracten aquestes formes com un cas d'al·lomorfia externa amb ordenació lèxica dels al·lomorfs, en el qual tenen un paper rellevant els conceptes d'especificitat i de pressió paradigmàtica. Finalment, es fa un tractament del fenomen en termes de la teoria de l'optimitat
Resumo:
El Mieloma múltiple és una patología hematològica maligna que cursa amb la presència d’una proteïna monoclonal responsable del deteriorament del pacient. Existeixen múltiples factors que afavoreixen la progressió de la malaltia d’entre els quals destaca la interleukina 6 (IL-6), una citoquina que actua com a factor de creixement de les cèl•lules malignes i com a inhibidor de la seva apoptosi. En aquest estudi ens hem plantejat si les variants genètiques d’aquesta IL-6 també poden causar diferències en l’evolució del mieloma múltiple. En concret hem estudiat la presència de guanina o lisina en la posició 174 de la regió promotora del gen de la IL-6.
Resumo:
We have carried out an initial analysis of the dynamics of the recent evolution of the splice-sites sequences on a large collection of human, rodent (mouse and rat), and chicken introns. Our results indicate that the sequences of splice sites are largely homogeneous within tetrapoda. We have also found that orthologous splice signals between human and rodents and within rodents are more conserved than unrelated splice sites, but the additional conservation can be explained mostly by background intron conservation. In contrast, additional conservation over background is detectable in orthologous mammalian and chicken splice sites. Our results also indicate that the U2 and U12 intron classes seem to have evolved independently since the split of mammals and birds; we have not been able to find a convincing case of interconversion between these two classes in our collections of orthologous introns. Similarly, we have not found a single case of switching between AT-AC and GT-AG subtypes within U12 introns, suggesting that this event has been a rare occurrence in recent evolutionary times. Switching between GT-AG and the noncanonical GC-AG U2 subtypes, on the contrary, does not appear to be unusual; in particular, T to C mutations appear to be relatively well tolerated in GT-AG introns with very strong donor sites.
Resumo:
Genetic and functional data indicate that variation in the expression of the neurotrophin-3 receptor gene (NTRK3) may have an impact on neuronal plasticity, suggesting a role for NTRK3 in the pathophysiology of anxiety disorders. MicroRNA (miRNA) posttranscriptional gene regulators act by base-pairing to specific sequence sites, usually at the 3'UTR of the target mRNA. Variants at these sites might result in gene expression changes contributing to disease susceptibility. We investigated genetic variation in two different isoforms of NTRK3 as candidate susceptibility factors for anxiety by resequencing their 3'UTRs in patients with panic disorder (PD), obsessive-compulsive disorder (OCD), and in controls. We have found the C allele of rs28521337, located in a functional target site for miR-485-3p in the truncated isoform of NTRK3, to be significantly associated with the hoarding phenotype of OCD. We have also identified two new rare variants in the 3'UTR of NTRK3, ss102661458 and ss102661460, each present only in one chromosome of a patient with PD. The ss102661458 variant is located in a functional target site for miR-765, and the ss102661460 in functional target sites for two miRNAs, miR-509 and miR-128, the latter being a brain-enriched miRNA involved in neuronal differentiation and synaptic processing. Interestingly, these two variants significantly alter the miRNA-mediated regulation of NTRK3, resulting in recovery of gene expression. These data implicate miRNAs as key posttranscriptional regulators of NTRK3 and provide a framework for allele-specific miRNA regulation of NTRK3 in anxiety disorders.
Resumo:
Background: Aproximately 5–10% of cases of mental retardation in males are due to copy number variations (CNV) on the X chromosome. Novel technologies, such as array comparative genomic hybridization (aCGH), may help to uncover cryptic rearrangements in X-linked mental retardation (XLMR) patients. We have constructed an X-chromosome tiling path array using bacterial artificial chromosomes (BACs) and validated it using samples with cytogenetically defined copy number changes. We have studied 54 patients with idiopathic mental retardation and 20 controls subjects. Results: Known genomic aberrations were reliably detected on the array and eight novel submicroscopic imbalances, likely causative for the mental retardation (MR) phenotype, were detected. Putatively pathogenic rearrangements included three deletions and five duplications (ranging between 82 kb to one Mb), all but two affecting genes previously known to be responsible for XLMR. Additionally, we describe different CNV regions with significant different frequencies in XLMR and control subjects (44% vs. 20%). Conclusion:This tiling path array of the human X chromosome has proven successful for the detection and characterization of known rearrangements and novel CNVs in XLMR patients.
Resumo:
Background: Single Nucleotide Polymorphisms, among other type of sequence variants, constitute key elements in genetic epidemiology and pharmacogenomics. While sequence data about genetic variation is found at databases such as dbSNP, clues about the functional and phenotypic consequences of the variations are generally found in biomedical literature. The identification of the relevant documents and the extraction of the information from them are hampered by the large size of literature databases and the lack of widely accepted standard notation for biomedical entities. Thus, automatic systems for the identification of citations of allelic variants of genes in biomedical texts are required. Results: Our group has previously reported the development of OSIRIS, a system aimed at the retrieval of literature about allelic variants of genes http://ibi.imim.es/osirisform.html. Here we describe the development of a new version of OSIRIS (OSIRISv1.2, http://ibi.imim.es/OSIRISv1.2.html webcite) which incorporates a new entity recognition module and is built on top of a local mirror of the MEDLINE collection and HgenetInfoDB: a database that collects data on human gene sequence variations. The new entity recognition module is based on a pattern-based search algorithm for the identification of variation terms in the texts and their mapping to dbSNP identifiers. The performance of OSIRISv1.2 was evaluated on a manually annotated corpus, resulting in 99% precision, 82% recall, and an F-score of 0.89. As an example, the application of the system for collecting literature citations for the allelic variants of genes related to the diseases intracranial aneurysm and breast cancer is presented. Conclusion: OSIRISv1.2 can be used to link literature references to dbSNP database entries with high accuracy, and therefore is suitable for collecting current knowledge on gene sequence variations and supporting the functional annotation of variation databases. The application of OSIRISv1.2 in combination with controlled vocabularies like MeSH provides a way to identify associations of biomedical interest, such as those that relate SNPs with diseases.
Resumo:
Introduction: Breastfeeding effects on cognition are attributed to long-chain polyunsaturated fatty acids (LC-PUFAs), but controversy persists. Genetic variation in fatty acid desaturase (FADS) and elongase (ELOVL) enzymes has been overlooked when studying the effects of LC-PUFAs supply on cognition. We aimed to: 1) to determine whether maternal genetic variants in the FADS cluster and ELOVL genes contribute to differences in LC-PUFA levels in colostrum; 2) to analyze whether these maternal variants are related to child cognition; and 3) to assess whether children's variants modify breastfeeding effects on cognition. Methods: Data come from two population-based birth cohorts (n = 400 mother-child pairs from INMA-Sabadell; and n = 340 children from INMA-Menorca). LC-PUFAs were measured in 270 colostrum samples from INMA-Sabadell. Tag SNPs were genotyped both in mothers and children (13 in the FADS cluster, 6 in ELOVL2, and 7 in ELOVL5). Child cognition was assessed at 14 mo and 4 y using the Bayley Scales of Infant Development and the McCarthy Scales of Children"s Abilities, respectively. Results: Children of mothers carrying genetic variants associated with lower FADS1 activity (regulating AA and EPA synthesis), higher FADS2 activity (regulating DHA synthesis), and with higher EPA/AA and DHA/AA ratios in colostrum showed a significant advantage in cognition at 14 mo (3.5 to 5.3 points). Not being breastfed conferred an 8- to 9-point disadvantage in cognition among children GG homozygote for rs174468 (low FADS1 activity) but not among those with the A allele. Moreover, not being breastfed resulted in a disadvantage in cognition (5 to 8 points) among children CC homozygote for rs2397142 (low ELOVL5 activity), but not among those carrying the G allele. Conclusion: Genetically determined maternal supplies of LC-PUFAs during pregnancy and lactation appear to be crucial for child cognition. Breastfeeding effects on cognition are modified by child genetic variation in fatty acid desaturase and elongase enzymes.
Resumo:
Next-generation sequencing techniques such as exome sequencing can successfully detect all genetic variants in a human exome and it has been useful together with the implementation of variant filters to identify causing-disease mutations. Two filters aremainly used for the mutations identification: low allele frequency and the computational annotation of the genetic variant. Bioinformatic tools to predict the effect of a givenvariant may have errors due to the existing bias in databases and sometimes show a limited coincidence among them. Advances in functional and comparative genomics are needed in order to properly annotate these variants.The goal of this study is to: first, functionally annotate Common Variable Immunodeficiency disease (CVID) variants with the available bioinformatic methods in order to assess the reliability of these strategies. Sencondly, as the development of new methods to reduce the number of candidate genetic variants is an active and necessary field of research, we are exploring the utility of gene function information at organism level as a filter for rare disease genes identification. Recently, it has been proposed that only 10-15% of human genes are essential and therefore we would expect that severe rare diseases are mostly caused by mutations on them. Our goal is to determine whether or not these rare and severe diseases are caused by deleterious mutations in these essential genes. If this hypothesis were true, taking into account essential genes as a filter would be an interesting parameter to identify causingdisease mutations.
Resumo:
En aquest Treball de Final de Grau s’exposen els resultats de l’anàlisi de les dades genètiques del projecte EurGast2 "Genetic susceptibility, environmental exposure and gastric cancer risk in an European population”, estudi cas‐control niat a la cohort europea EPIC “European Prospective lnvestigation into Cancer and Nutrition”, que té per objectiu l’estudi dels factors genètics i ambientals associats amb el risc de desenvolupar càncer gàstric (CG). A partir de les dades resultants de l’estudi EurGast2, en el què es van analitzar 1.294 SNPs en 365 casos de càncer gàstric i 1.284 controls en l’anàlisi Single SNP previ, la hipòtesi de partida del present Treball de Final de Grau és que algunes variants amb un efecte marginal molt feble, però que conjuntament amb altres variants estarien associades al risc de CG, podrien no haver‐se detectat. Així doncs, l’objectiu principal del projecte és la identificació d’interaccions de segon ordre entre variants genètiques de gens candidats implicades en la carcinogènesi de càncer gàstric. L’anàlisi de les interaccions s’ha dut a terme aplicant el mètode estadístic Model‐based Multifactor Dimensionality Reduction Method (MB‐MDR), desenvolupat per Calle et al. l’any 2008 i s’han aplicat dues metodologies de filtratge per seleccionar les interaccions que s’exploraran: 1) filtratge d’interaccions amb un SNP significatiu en el Single SNP analysis i 2) filtratge d’interaccions segons la mesura Sinèrgia. Els resultats del projecte han identificat 5 interaccions de segon ordre entre SNPs associades significativament amb un major risc de desenvolupar càncer gàstric, amb p‐valor inferior a 10‐4. Les interaccions identificades corresponen a interaccions entre els gens MPO i CDH1, XRCC1 i GAS6, ADH1B i NR5A2 i IL4R i IL1RN (que s’ha validat en les dues metodologies de filtratge). Excepte CDH1, cap altre d’aquests gens s’havia associat significativament amb el CG o prioritzat en les anàlisis prèvies, el que confirma l’interès d’analitzar les interaccions genètiques de segon ordre. Aquestes poden ser un punt de partida per altres anàlisis destinades a confirmar gens putatius i a estudiar a nivell biològic i molecular els mecanismes de carcinogènesi, i orientades a la recerca de noves dianes terapèutiques i mètodes de diagnosi i pronòstic més eficients.
Resumo:
The relationship between inflammation and cancer is well established in several tumor types, including bladder cancer. We performed an association study between 886 inflammatory-gene variants and bladder cancer risk in 1,047 cases and 988 controls from the Spanish Bladder Cancer (SBC)/EPICURO Study. A preliminary exploration with the widely used univariate logistic regression approach did not identify any significant SNP after correcting for multiple testing. We further applied two more comprehensive methods to capture the complexity of bladder cancer genetic susceptibility: Bayesian Threshold LASSO (BTL), a regularized regression method, and AUC-Random Forest, a machine-learning algorithm. Both approaches explore the joint effect of markers. BTL analysis identified a signature of 37 SNPs in 34 genes showing an association with bladder cancer. AUC-RF detected an optimal predictive subset of 56 SNPs. 13 SNPs were identified by both methods in the total population. Using resources from the Texas Bladder Cancer study we were able to replicate 30% of the SNPs assessed. The associations between inflammatory SNPs and bladder cancer were reexamined among non-smokers to eliminate the effect of tobacco, one of the strongest and most prevalent environmental risk factor for this tumor. A 9 SNP-signature was detected by BTL. Here we report, for the first time, a set of SNP in inflammatory genes jointly associated with bladder cancer risk. These results highlight the importance of the complex structure of genetic susceptibility associated with cancer risk.
Resumo:
Background: Toll-like receptors (TLRs) are critical components for host pathogen recognition and variants in genes participating in this response influence susceptibility to infections. Recently, TLR1 gene polymorphisms have been found correlated with whole blood hyper-inflammatory responses to pathogen-associated molecules and associated with sepsis-associated multiorgan dysfunction and acute lung injury (ALI). We examined the association of common variants of TLR1 gene with sepsis-derived complications in an independent study and with serum levels for four inflammatory biomarker among septic patients. Methodology/Principal Findings: Seven tagging single nucleotide polymorphisms of the TLR1 gene were genotyped in samples from a prospective multicenter case-only study of patients with severe sepsis admitted into a network of intensive care units followed for disease severity. Interleukin (IL)-1 b, IL-6, IL-10, and C-reactive protein (CRP) serum levels were measured at study entry, at 48 h and at 7th day. Alleles -7202G and 248Ser, and the 248Ser-602Ile haplotype were associated with circulatory dysfunction among severe septic patients (0.001<=p <= 0.022), and with reduced IL-10 (0.012<= p <=0.047) and elevated CRP (0.011<= p <=0.036) serum levels during the first week of sepsis development. Additionally, the -7202GG genotype was found to be associated with hospital mortality (p =0.017) and ALI (p =0.050) in a combined analysis with European Americans, suggesting common risk effects among studies Conclusions/Significance: These results partially replicate and extend previous findings, supporting that variants of TLR1 gene are determinants of severe complications during sepsis.
Resumo:
We investigated two siblings with granulomatous histiocytosis prominent in the nasal area, mimicking rhinoscleroma and Rosai-Dorfman syndrome. Genome-wide linkage analysis and whole-exome sequencing identified a homozygous frameshift deletion in SLC29A3, which encodes human equilibrative nucleoside transporter-3 (hENT3). Germline mutations in SLC29A3 have been reported in rare patients with a wide range of overlapping clinical features and inherited disorders including H syndrome, pigmented hypertrichosis with insulin-dependent diabetes, and Faisalabad histiocytosis. With the exception of insulin-dependent diabetes and mild finger and toe contractures in one sibling, the two patients with nasal granulomatous histiocytosis studied here displayed none of the many SLC29A3-associated phenotypes. This mild clinical phenotype probably results from a remarkable genetic mechanism. The SLC29A3 frameshift deletion prevents the expression of the normally coding transcripts. It instead leads to the translation, expression, and function of an otherwise noncoding, out-of-frame mRNA splice variant lacking exon 3 that is eliminated by nonsense-mediated mRNA decay (NMD) in healthy individuals. The mutated isoform differs from the wild-type hENT3 by the modification of 20 residues in exon 2 and the removal of another 28 amino acids in exon 3, which include the second transmembrane domain. As a result, this new isoform displays some functional activity. This mechanism probably accounts for the narrow and mild clinical phenotype of the patients. This study highlights the"rescue" role played by a normally noncoding mRNA splice variant of SLC29A3, uncovering a new mechanism by which frameshift mutations can be hypomorphic.