88 results for 2 splice variants
Abstract:
Summary: The automation of genome sequencing and annotation, together with the large-scale application of methods for measuring gene expression, generates a phenomenal amount of data for model organisms such as human or mouse. In this deluge of data, it becomes very difficult to obtain information specific to an organism or a gene, and such a search frequently yields fragmented or even incomplete answers. Creating a database able to manage and integrate both genomic and transcriptomic data can greatly improve search speed as well as the quality of the results obtained, by allowing direct comparison of gene expression measurements from experiments performed with different techniques. The main objective of this project, called CleanEx, is to provide direct access to public expression data through official gene names, and to represent expression data produced under different protocols in a way that facilitates general analysis and comparison across several datasets. Consistent and regular updating of gene nomenclature is ensured by associating each gene expression experiment with a permanent identifier of the target sequence, which gives a physical description of the RNA population targeted by the experiment. These identifiers are then associated at regular intervals with the constantly evolving catalogues of genes from model organisms. This automatic tracking procedure relies partly on external genome information resources such as UniGene and RefSeq. The central part of CleanEx is a gene index built weekly, which contains links to all public expression data already incorporated into the system. 
In addition, the target sequence database provides a link to the corresponding gene as well as a quality control of that link for different types of experimental resources, such as clones or Affymetrix probes. The CleanEx online search system offers access to individual entries as well as cross-dataset analysis tools. These tools have proven very effective for comparing gene expression and also, to some extent, for detecting variation in that expression linked to alternative splicing. The CleanEx files and tools are accessible online (http://www.cleanex.isb-sib.ch/). Abstract: Automated genome sequencing and annotation, as well as large-scale gene expression measurement methods, generate a massive amount of data for model organisms. Searching for gene-specific or organism-specific information throughout all the different databases has become a very difficult task, and often results in fragmented and unrelated answers. The generation of a database which will federate and integrate genomic and transcriptomic data together will greatly improve the search speed as well as the quality of the results by allowing a direct comparison of expression results obtained by different techniques. The main goal of this project, called the CleanEx database, is thus to provide access to public gene expression data via unique gene names and to represent heterogeneous expression data produced by different technologies in a way that facilitates joint analysis and cross-dataset comparisons. A consistent and up-to-date gene nomenclature is achieved by associating each single gene expression experiment with a permanent target identifier consisting of a physical description of the targeted RNA population or the hybridization reagent used. 
These targets are then mapped at regular intervals to the growing and evolving catalogues of genes from model organisms, such as human and mouse. The completely automatic mapping procedure relies partly on external genome information resources such as UniGene and RefSeq. The central part of CleanEx is a weekly built gene index containing cross-references to all public expression data already incorporated into the system. In addition, the expression target database of CleanEx provides gene mapping and quality control information for various types of experimental resources, such as cDNA clones or Affymetrix probe sets. The Affymetrix mapping files are accessible as text files, for further use in external applications, and as individual entries, via the web-based interfaces. The CleanEx web-based query interfaces offer access to individual entries via text string searches or quantitative expression criteria, as well as cross-dataset analysis tools and cross-chip gene comparison. These tools have proven to be very efficient in expression data comparison and even, to a certain extent, in detection of differentially expressed splice variants. The CleanEx flat files and tools are available online at: http://www.cleanex.isb-sib.ch/.
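The indirection described in this abstract (measurements tied to permanent target identifiers, which are periodically re-mapped to gene names) could be sketched roughly as follows. This is a minimal illustration only: the record layout, the identifiers (`clone:0001`, `GENE1`, and so on) and the function `build_gene_index` are hypothetical and are not taken from CleanEx itself.

```python
from collections import defaultdict

# Hypothetical records: each measurement references a permanent target ID,
# never a gene name directly.
measurements = [
    {"dataset": "dataset_A", "target": "clone:0001", "value": 2.4},
    {"dataset": "dataset_B", "target": "probeset:1000_at", "value": 7.1},
    {"dataset": "dataset_B", "target": "probeset:2000_at", "value": 3.3},
]

# Hypothetical target-to-gene mapping; in this sketch it is the only part
# that changes when the gene catalogue is revised.
target_to_gene = {
    "clone:0001": "GENE1",
    "probeset:1000_at": "GENE1",
    "probeset:2000_at": "GENE2",
}


def build_gene_index(measurements, target_to_gene):
    """Group (dataset, target) cross-references under the current gene name."""
    index = defaultdict(list)
    for m in measurements:
        gene = target_to_gene.get(m["target"])
        if gene is not None:  # unmapped targets are left out of the index
            index[gene].append((m["dataset"], m["target"]))
    return dict(index)


gene_index = build_gene_index(measurements, target_to_gene)

# Simulate a nomenclature update: only the mapping changes, the stored
# measurements stay untouched, and the index is rebuilt.
updated_mapping = dict(target_to_gene, **{"probeset:2000_at": "GENE2B"})
rebuilt_index = build_gene_index(measurements, updated_mapping)
```

Because the measurements never store a gene name, rebuilding the index after a mapping update is enough to keep gene-level lookups current in this sketch.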
Abstract:
OBJECTIVE: Proinsulin is a precursor of mature insulin and C-peptide. Higher circulating proinsulin levels are associated with impaired β-cell function, raised glucose levels, insulin resistance, and type 2 diabetes (T2D). Studies of the insulin processing pathway could provide new insights about T2D pathophysiology. RESEARCH DESIGN AND METHODS: We have conducted a meta-analysis of genome-wide association tests of ∼2.5 million genotyped or imputed single nucleotide polymorphisms (SNPs) and fasting proinsulin levels in 10,701 nondiabetic adults of European ancestry, with follow-up of 23 loci in up to 16,378 individuals, using additive genetic models adjusted for age, sex, fasting insulin, and study-specific covariates. RESULTS: Nine SNPs at eight loci were associated with proinsulin levels (P < 5 × 10(-8)). Two loci (LARP6 and SGSM2) have not been previously related to metabolic traits, one (MADD) has been associated with fasting glucose, one (PCSK1) has been implicated in obesity, and four (TCF7L2, SLC30A8, VPS13C/C2CD4A/B, and ARAP1, formerly CENTD2) increase T2D risk. The proinsulin-raising allele of ARAP1 was associated with a lower fasting glucose (P = 1.7 × 10(-4)), improved β-cell function (P = 1.1 × 10(-5)), and lower risk of T2D (odds ratio 0.88; P = 7.8 × 10(-6)). Notably, PCSK1 encodes the protein prohormone convertase 1/3, the first enzyme in the insulin processing pathway. A genotype score composed of the nine proinsulin-raising alleles was not associated with coronary disease in two large case-control datasets. CONCLUSIONS: We have identified nine genetic variants associated with fasting proinsulin. Our findings illuminate the biology underlying glucose homeostasis and T2D development in humans and argue against a direct role of proinsulin in coronary artery disease pathogenesis.
Abstract:
Gene-lifestyle interactions have been suggested to contribute to the development of type 2 diabetes. Glucose levels 2 h after a standard 75-g glucose challenge are used to diagnose diabetes and are associated with both genetic and lifestyle factors. However, whether these factors interact to determine 2-h glucose levels is unknown. We meta-analyzed single nucleotide polymorphism (SNP) × BMI and SNP × physical activity (PA) interaction regression models for five SNPs previously associated with 2-h glucose levels from up to 22 studies comprising 54,884 individuals without diabetes. PA levels were dichotomized, with individuals below the first quintile classified as inactive (20%) and the remainder as active (80%). BMI was considered a continuous trait. Inactive individuals had higher 2-h glucose levels than active individuals (β = 0.22 mmol/L [95% CI 0.13-0.31], P = 1.63 × 10(-6)). All SNPs were associated with 2-h glucose (β = 0.06-0.12 mmol/allele, P ≤ 1.53 × 10(-7)), but no significant interactions were found with PA (P > 0.18) or BMI (P ≥ 0.04). In this large study of gene-lifestyle interaction, we observed no interactions between genetic and lifestyle factors, both of which were associated with 2-h glucose. It is perhaps unlikely that top loci from genome-wide association studies will exhibit strong subgroup-specific effects, and may not, therefore, make the best candidates for the study of interactions.
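The dichotomization of physical activity at the first quintile could be illustrated as below. This is a sketch under stated assumptions: the scores are made-up integers, the function name `dichotomize_pa` is hypothetical, and the quantile method is Python's default, which may differ from what the participating studies used.

```python
import statistics


def dichotomize_pa(pa_scores):
    """Label scores below the first-quintile cut point as inactive, the rest as active."""
    # statistics.quantiles with n=5 returns four cut points; the first one
    # is the boundary of the lowest quintile.
    cutoff = statistics.quantiles(pa_scores, n=5)[0]
    return ["inactive" if s < cutoff else "active" for s in pa_scores]


# Hypothetical activity scores for 100 individuals.
scores = list(range(1, 101))
labels = dichotomize_pa(scores)
inactive_share = labels.count("inactive") / len(labels)
print(inactive_share)
```

With evenly spread scores the inactive group holds one fifth of the sample, matching the 20%/80% split reported in the abstract.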
Abstract:
L-2-Hydroxyglutaric aciduria (L2HGA) is a rare, neurometabolic disorder with an autosomal recessive mode of inheritance. Affected individuals only have neurological manifestations, including psychomotor retardation, cerebellar ataxia, and more variably macrocephaly, or epilepsy. The diagnosis of L2HGA can be made based on magnetic resonance imaging (MRI), biochemical analysis, and mutational analysis of L2HGDH. About 200 patients with elevated concentrations of 2-hydroxyglutarate (2HG) in the urine were referred for chiral determination of 2HG and L2HGDH mutational analysis. All patients with increased L2HG (n=106; 83 families) were included. Clinical information on 61 patients was obtained via questionnaires. In 82 families the mutations were detected by direct sequence analysis and/or multiplex ligation dependent probe amplification (MLPA), including one case where MLPA was essential to detect the second allele. In another case RT-PCR followed by deep intronic sequencing was needed to detect the mutation. Thirty-five novel mutations as well as 35 reported mutations and 14 nondisease-related variants are reviewed and included in a novel Leiden Open source Variation Database (LOVD) for L2HGDH variants (http://www.LOVD.nl/L2HGDH). Every user can access the database and submit variants/patients. Furthermore, we report on the phenotype, including neurological manifestations and urinary levels of L2HG, and we evaluate the phenotype-genotype relationship.
Abstract:
Background: Treatment of chronic hepatitis C is evolving, and direct acting antivirals (DAAs) are now added to pegylated interferon-α (Peg-INF-α) and ribavirin (RBV) for the treatment of hepatitis C virus (HCV) genotype 1 infection. DAAs cause different side effects and can even worsen RBV induced hemolytic anemia. Therefore, identifying host genetic determinants of RBV bioavailability and therapeutic efficacy will remain crucial for individualized treatment. Recent data showed associations between RBV induced hemolytic anemia and genetic polymorphisms of concentrative nucleoside transporters such as CNT3 (SLC28A3) and inosine triphosphatase (ITPA). We aimed to analyze the association of genetic variants of SLC28 transporters and ITPA with RBV induced hemolytic anemia and treatment outcome. Methods: In our study, 173 patients from the Swiss Hepatitis C Cohort Study and 22 patients from Swiss Association for the Study of the Liver study 24 (61% HCV genotype 1, 39% genotypes 2 or 3) were analyzed for SLC28A2 single nucleotide polymorphism (SNP) rs11854484, SLC28A3 rs56350726 and SLC28A3 rs10868138 as well as ITPA SNPs rs1127354 and rs7270101. RBV serum levels during treatment were measured in 49 patients. Results: SLC28A2 rs11854484 genotype TT was associated with significantly higher dosage- and body weight-adjusted RBV levels as compared to genotypes TC and CC (p=0.04 and p=0.02 at weeks 4 and 8, respectively). ITPA SNPs rs1127354 and rs7270101 were associated with hemolytic anemia both in genotype as well as in allelic analyses. SLC28A3 rs56350726 genotype TT (vs. AT/AA, RR=2.1; 95% CI 1.1-4.1) as well as the T allele (vs. A; RR=1.8, 95% CI 1.1-3.2) were associated with increased sustained virological response (SVR) rates. The combined analysis of overall ITPA activity and SLC28 variants together revealed no significant additive effects on either treatment-related anemia or SVR. 
Conclusions: The newly identified association between RBV serum levels and SLC28A2 rs11854484 genotype as well as the replicated association of ITPA and SLC28A3 genetic polymorphisms with RBV induced hemolytic anemia and treatment response underpin the need for further studies on host genetic determinants of RBV bioavailability and therapeutic efficacy for individualized treatment of chronic hepatitis C.
Abstract:
The limited ability of common variants to account for the genetic contribution to complex disease has prompted searches for rare variants of large effect, to partly explain the 'missing heritability'. Analyses of genome-wide genotyping data have identified genomic structural variants (GSVs) as a source of such rare causal variants. Recent studies have reported multiple GSV loci associated with risk of obesity. We attempted to replicate these associations by similar analysis of two familial-obesity case-control cohorts and a population cohort, and detected GSVs at 11 out of 18 loci, at frequencies similar to those previously reported. Based on their reported frequencies and effect sizes (OR≥25), we had sufficient statistical power to detect the large majority (80%) of genuine associations at these loci. However, only one obesity association was replicated. Deletion of a 220 kb region on chromosome 16p11.2 has a carrier population frequency of 2×10(-4) (95% confidence interval [9.6×10(-5)-3.1×10(-4)]); accounts overall for 0.5% [0.19%-0.82%] of severe childhood obesity cases (P = 3.8×10(-10); odds ratio = 25.0 [9.9-60.6]); and results in a mean body mass index (BMI) increase of 5.8 kg.m(-2) [1.8-10.3] in adults from the general population. We also attempted replication using BMI as a quantitative trait in our population cohort; associations with BMI at or near nominal significance were detected at two further loci near KIF2B and within FOXP2, but these did not survive correction for multiple testing. These findings emphasise several issues of importance when conducting rare GSV association, including the need for careful cohort selection and replication strategy, accurate GSV identification, and appropriate correction for multiple testing and/or control of false discovery rate. Moreover, they highlight the potential difficulty in replicating rare CNV associations across different populations. 
Nevertheless, we show that such studies are potentially valuable for the identification of variants making an appreciable contribution to complex disease.
Abstract:
Anatomical structures and mechanisms linking genes to neuropsychiatric disorders are not deciphered. Reciprocal copy number variants at the 16p11.2 BP4-BP5 locus offer a unique opportunity to study the intermediate phenotypes in carriers at high risk for autism spectrum disorder (ASD) or schizophrenia (SZ). We investigated the variation in brain anatomy in 16p11.2 deletion and duplication carriers. Beyond gene dosage effects on global brain metrics, we show that the number of genomic copies negatively correlated to the gray matter volume and white matter tissue properties in cortico-subcortical regions implicated in reward, language and social cognition. Despite the near absence of ASD or SZ diagnoses in our 16p11.2 cohort, the pattern of brain anatomy changes in carriers spatially overlaps with the well-established structural abnormalities in ASD and SZ. Using measures of peripheral mRNA levels, we confirm our genomic copy number findings. This combined molecular, neuroimaging and clinical approach, applied to larger datasets, will help interpret the relative contributions of genes to neuropsychiatric conditions by measuring their effect on local brain anatomy. Molecular Psychiatry advance online publication, 25 November 2014; doi:10.1038/mp.2014.145.
Abstract:
BACKGROUND: The FTO gene harbors the strongest known susceptibility locus for obesity. While many individual studies have suggested that physical activity (PA) may attenuate the effect of FTO on obesity risk, other studies have not been able to confirm this interaction. To confirm or refute unambiguously whether PA attenuates the association of FTO with obesity risk, we meta-analyzed data from 45 studies of adults (n = 218,166) and nine studies of children and adolescents (n = 19,268). METHODS AND FINDINGS: All studies identified to have data on the FTO rs9939609 variant (or any proxy [r(2)>0.8]) and PA were invited to participate, regardless of ethnicity or age of the participants. PA was standardized by categorizing it into a dichotomous variable (physically inactive versus active) in each study. Overall, 25% of adults and 13% of children were categorized as inactive. Interaction analyses were performed within each study by including the FTO×PA interaction term in an additive model, adjusting for age and sex. Subsequently, random effects meta-analysis was used to pool the interaction terms. In adults, the minor (A-) allele of rs9939609 increased the odds of obesity by 1.23-fold/allele (95% CI 1.20-1.26), but PA attenuated this effect (p(interaction) = 0.001). More specifically, the minor allele of rs9939609 increased the odds of obesity less in the physically active group (odds ratio = 1.22/allele, 95% CI 1.19-1.25) than in the inactive group (odds ratio = 1.30/allele, 95% CI 1.24-1.36). No such interaction was found in children and adolescents. CONCLUSIONS: The association of the FTO risk allele with the odds of obesity is attenuated by 27% in physically active adults, highlighting the importance of PA in particular in those genetically predisposed to obesity.
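The abstract does not state how the 27% attenuation was derived. One reading that is consistent with the reported odds ratios is the relative reduction in excess odds (odds ratio minus 1) between the inactive and active groups. The sketch below works through that arithmetic; the function name `attenuation` and the choice of scale are assumptions, not the authors' stated method.

```python
def attenuation(or_inactive: float, or_active: float) -> float:
    """Relative reduction in excess odds (OR - 1) from the inactive to the active group."""
    excess_inactive = or_inactive - 1.0
    excess_active = or_active - 1.0
    return (excess_inactive - excess_active) / excess_inactive


# Per-allele odds ratios reported in the abstract for adults.
pct = 100 * attenuation(or_inactive=1.30, or_active=1.22)
print(f"{pct:.0f}%")
```

On this scale the calculation is (0.30 - 0.22) / 0.30, which rounds to the 27% quoted in the conclusions.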
Abstract:
Candidaemia is the fourth most common cause of bloodstream infection, with a high mortality rate of up to 40%. Identification of host genetic factors that confer susceptibility to candidaemia may aid in designing adjunctive immunotherapeutic strategies. Here we hypothesize that variation in immune genes may predispose to candidaemia. We analyse 118,989 single-nucleotide polymorphisms (SNPs) across 186 loci known to be associated with immune-mediated diseases in the largest candidaemia cohort to date of 217 patients of European ancestry and a group of 11,920 controls. We validate the significant associations by comparison with a disease-matched control group. We observe significant association between candidaemia and SNPs in the CD58 (P = 1.97 × 10(-11); odds ratio (OR) = 4.68), LCE4A-C1orf68 (P = 1.98 × 10(-10); OR = 4.25) and TAGAP (P = 1.84 × 10(-8); OR = 2.96) loci. Individuals carrying two or more risk alleles have an increased risk for candidaemia of 19.4-fold compared with individuals carrying no risk allele. We identify three novel genetic risk factors for candidaemia, which we subsequently validate for their role in antifungal host defence.
Abstract:
Cryptic exons or pseudoexons are typically activated by point mutations that create GT or AG dinucleotides of new 5' or 3' splice sites in introns, often in repetitive elements. Here we describe two cases of tetrahydrobiopterin deficiency caused by mutations improving the branch point sequence and polypyrimidine tracts of repeat-containing pseudoexons in the PTS gene. In the first case, we demonstrate a novel pathway of antisense Alu exonization, resulting from an intronic deletion that removed the poly(T)-tail of antisense AluSq. The deletion brought a favorable branch point sequence within proximity of the pseudoexon 3' splice site and removed an upstream AG dinucleotide required for the 3' splice site repression on normal alleles. New Alu exons can thus arise in the absence of poly(T)-tails that facilitated inclusion of most transposed elements in mRNAs by serving as polypyrimidine tracts, highlighting extraordinary flexibility of Alu repeats in shaping intron-exon structure. In the other case, a PTS pseudoexon was activated by an A>T substitution 9 nt upstream of its 3' splice site in a LINE-2 sequence, providing the first example of a disease-causing exonization of the most ancient interspersed repeat. These observations expand the spectrum of mutational mechanisms that introduce repetitive sequences in mature transcripts and illustrate the importance of intronic mutations in alternative splicing and phenotypic variability of hereditary disorders.
Abstract:
The 15q24.1 locus, including CYP1A2, is associated with blood pressure (BP). The CYP1A2 rs762551 C allele is associated with lower CYP1A2 enzyme activity. CYP1A2 metabolizes caffeine and is induced by smoking. The association of caffeine consumption with hypertension remains controversial. We explored the effects of CYP1A2 variants and CYP1A2 enzyme activity on BP, focusing on caffeine as the potential mediator of CYP1A2 effects. Four observational studies (n = 16 719) and one quasi-experimental study (n = 106) including European adults were conducted. Outcome measures were BP, caffeine intake, CYP1A2 activity and polymorphisms rs762551, rs1133323 and rs1378942. CYP1A2 variants were associated with hypertension in non-smokers, but not in smokers (CYP1A2-smoking interaction P = 0.01). Odds ratios (95% CIs) for hypertension for rs762551 CC, CA and AA genotypes were 1 (reference), 0.78 (0.59-1.02) and 0.66 (0.50-0.86), respectively, P = 0.004. Results were similar for the other variants. Higher CYP1A2 activity was linearly associated with lower BP after quitting smoking (P = 0.049 and P = 0.02 for systolic and diastolic BP, respectively), but not while smoking. In non-smokers, the CYP1A2 variants were associated with higher reported caffeine intake, which in turn was associated with lower odds of hypertension and lower BP (P = 0.01). In Mendelian randomization analyses using rs1133323 as instrument, each cup of caffeinated beverage was negatively associated with systolic BP [-9.57 (-16.22, -2.91) mmHg]. The associations of CYP1A2 variants with BP were modified by reported caffeine intake. These observational and quasi-experimental results strongly support a causal role of CYP1A2 in BP control via caffeine intake.
Abstract:
Δ3,Δ2-enoyl CoA isomerase (ECI) is an enzyme that participates in the degradation of unsaturated fatty acids through the beta-oxidation cycle. Three genes encoding Δ3,Δ2-enoyl CoA isomerases and named AtECI1, AtECI2 and AtECI3 have been identified in Arabidopsis thaliana. When expressed heterologously in Saccharomyces cerevisiae, all three ECI proteins were targeted to the peroxisomes and enabled the yeast Δeci1 mutant to degrade 10Z-heptadecenoic acid, demonstrating Δ3,Δ2-enoyl CoA isomerase activity in vivo. Fusion proteins between yellow fluorescent protein and AtECI1 or AtECI2 were targeted to the peroxisomes in onion epidermal cells and Arabidopsis root cells, but a similar fusion protein with AtECI3 remained in the cytosol for both tissues. AtECI3 targeting to peroxisomes in S. cerevisiae was dependent on yeast PEX5, while expression of Arabidopsis PEX5 in yeast failed to target AtECI3 to peroxisomes. AtECI2 and AtECI3 are tandem duplicated genes and show a high level of amino acid conservation, except at the C-terminus; AtECI2 ends with the well conserved peroxisome targeting signal 1 (PTS1) terminal tripeptide PKL, while AtECI3 possesses a divergent HNL terminal tripeptide. Evolutionary analysis of ECI genes in plants revealed several independent duplication events, with duplications occurring in rice and Medicago truncatula, generating homologues with divergent C-termini and no recognizable PTS1. All plant ECI genes analyzed, including AtECI3, are under negative purifying selection, implying functionality of the cytosolic AtECI3. Analysis of the mammalian and fungal genomes failed to identify cytosolic variants of the Δ3,Δ2-enoyl CoA isomerase, indicating that evolution of cytosolic Δ3,Δ2-enoyl CoA isomerases is restricted to the plant kingdom.
Abstract:
There are two forms of orosomucoid (ORM) in the sera of most individuals. They are encoded by two separate but closely linked loci, ORM1 and ORM2. A number of variants have been identified in various populations. Duplication and nonexpression are also observed in some populations. Thus, the ORM system is very complicated and its nomenclature is very confusing. In order to propose a new nomenclature, ORM variants detected by several laboratories have been compared and characterized by isoelectric focusing (IEF) followed by immunoprinting. A total of 57 different alleles including 17 new ones were identified. Of these, 27 alleles were assigned to the ORM1 locus, and the others to the ORM2 locus. The designations ORM1*F1, ORM1*F2, ORM1*S and ORM2*M were adopted for the four common alleles instead of ORM1*1, ORM1*3, ORM1*2 and ORM2*1 (ORM2*A), respectively. The variants were designated alphanumerically according to their relative mobilities after IEF in a pH gradient of 4.5-5.4 with Triton X-100 and glycerol. For the duplicated genes a prefix is added to a combined name of two alleles, e.g. ORM1*dB9S. Silent alleles were named ORM1*Q0 and ORM2*Q0 conventionally. In addition, the effects of diseases on ORM band patterns after IEF are also discussed.
Abstract:
TNFRSF13B encodes transmembrane activator and calcium modulator and cyclophilin ligand interactor (TACI), a B cell-specific tumor necrosis factor (TNF) receptor superfamily member. Both biallelic and monoallelic TNFRSF13B mutations were identified in patients with common variable immunodeficiency disorders. The genetic complexity and variable clinical presentation of TACI deficiency prompted us to evaluate the genetic, immunologic, and clinical condition in 50 individuals with TNFRSF13B alterations, following screening of 564 unrelated patients with hypogammaglobulinemia. We identified 13 new sequence variants. The most frequent TNFRSF13B variants (C104R and A181E; n=39; 6.9%) were also present in a heterozygous state in 2% of 675 controls. All patients with biallelic mutations had hypogammaglobulinemia and nearly all showed impaired binding to a proliferation-inducing ligand (APRIL). However, the majority (n=41; 82%) of the patients carried monoallelic changes in TNFRSF13B. Presence of a heterozygous mutation was associated with antibody deficiency (P< .001, relative risk 3.6). Heterozygosity for the most common mutation, C104R, was associated with disease (P< .001, relative risk 4.2). Furthermore, heterozygosity for C104R was associated with low numbers of IgD(-)CD27(+) B cells (P= .019), benign lymphoproliferation (P< .001), and autoimmune complications (P= .001). These associations indicate that C104R heterozygosity increases the risk for common variable immunodeficiency disorders and influences clinical presentation.