885 resultados para Genomewide association studies
Resumo:
The heritability of attention deficit hyperactivity disorder (ADHD) is approximately 0.8. Despite several larger scale attempts, genome-wide association studies (GWAS) have not led to the identification of significant results. We performed a GWAS based on 495 German young patients with ADHD (according to DSM-IV criteria; Human660W-Quadv1; Illumina, San Diego, CA) and on 1,300 population-based adult controls (HumanHap550v3; Illumina). Some genes neighboring the single nucleotide polymorphisms (SNPs) with the lowest P-values (best P-value: 8.38 × 10(-7)) have potential relevance for ADHD (e.g., glutamate receptor, metabotropic 5 gene, GRM5). After quality control, the 30 independent SNPs with the lowest P-values (P-values ≤ 7.57 × 10(-5) ) were chosen for confirmation. Genotyping of these SNPs in up to 320 independent German families comprising at least one child with ADHD revealed directionally consistent effect-size point estimates for 19 (10 not consistent) of the SNPs. In silico analyses of the 30 SNPs in the largest meta-analysis so far (2,064 trios, 896 cases, and 2,455 controls) revealed directionally consistent effect-size point estimates for 16 SNPs (11 not consistent). None of the combined analyses revealed a genome-wide significant result. SNPs in previously described autosomal candidate genes did not show significantly lower P-values compared to SNPs within random sets of genes of the same size. We did not find genome-wide significant results in a GWAS of German children with ADHD compared to controls. The second best SNP is located in an intron of GRM5, a gene located within a recently described region with an infrequent copy number variation in patients with ADHD.
Resumo:
Background Three non-synonymous single nucleotide polymorphisms (Q223R, K109R and K656N) of the leptin receptor gene (LEPR) have been tested for association with obesity-related outcomes in multiple studies, showing inconclusive results. We performed a systematic review and meta-analysis on the association of the three LEPR variants with BMI. In addition, we analysed 15 SNPs within the LEPR gene in the CoLaus study, assessing the interaction of the variants with sex. Methodology/Principal Findings We searched electronic databases, including population-based studies that investigated the association between LEPR variants Q223R, K109R and K656N and obesity- related phenotypes in healthy, unrelated subjects. We furthermore performed meta-analyses of the genotype and allele frequencies in case-control studies. Results were stratified by SNP and by potential effect modifiers. CoLaus data were analysed by logistic and linear regressions and tested for interaction with sex. The meta-analysis of published data did not show an overall association between any of the tested LEPR variants and overweight. However, the choice of a BMI cut-off value to distinguish cases from controls was crucial to explain heterogeneity in Q223R. Differences in allele frequencies across ethnic groups are compatible with natural selection of derived alleles in Q223R and K109R and of the ancient allele in K656N in Asians. In CoLaus, the rs10128072, rs3790438 and rs3790437 variants showed interaction with sex for their association with overweight, waist circumference and fat mass in linear regressions. Conclusions Our systematic review and analysis of primary data from the CoLaus study did not show an overall association between LEPR SNPs and overweight. Most studies were underpowered to detect small effect sizes. A potential effect modification by sex, population stratification, as well as the role of natural selection should be addressed in future genetic association studies.
Resumo:
The level of body iron storage and the erythropoietic need for iron are indicated by the serum levels of ferritin and soluble transferrin receptor (sTfR), respectively. A meta-analysis of five genome-wide association studies on sTfR and ferritin revealed novel association to the PCSK7 and TMPRSS6 loci for sTfR and the HFE locus for both parameters. The PCSK7 association was the most significant (rs236918, P = 1.1 × 10E-27) suggesting that proprotein convertase 7, the gene product of PCSK7, may be involved in sTfR generation and/or iron homeostasis. Conditioning the sTfR analyses on transferrin saturation abolished the HFE signal and substantially diminished the TMPRSS6 signal while the PCSK7 association was unaffected, suggesting that the former may be mediated by transferrin saturation whereas the PCSK7-associated effect on sTfR generation appears to be more direct.
Resumo:
We identified a bipolar disorder (BPD) susceptibility region on chromosome 3q29 in a genome-wide linkage scan (Bailer et al. 2002 (Biol Psychiatry 52: 40), NPL-score 4.09) and follow-up linkage analysis (Schosser et al. 2004 (J Psychiatr Res 38(3): 357), NPL-scores >3 with five markers). These findings were supported by further fine-mapping of this region (Schosser et al. 2007 (Eur Neuropsychopharmacol 17(6-7): 501)), finding NPL-scores >3.9 with SNPs (single nucleotide polymorphisms) spanning a region of 3.46 Mbp in BPD families. Since genetic association studies are more powerful than linkage studies for detecting susceptibility genes of small effect size, we aimed to replicate these findings in an independent case-control sample collected in London (UK) and Vienna (Austria).
Resumo:
Lung function measures are heritable, predict mortality and are relevant in diagnosis of chronic obstructive pulmonary disease (COPD). COPD and asthma are diseases of the airways with major public health impacts and each have a heritable component. Genome-wide association studies of SNPs have revealed novel genetic associations with both diseases but only account for a small proportion of the heritability. Complex copy number variation may account for some of the missing heritability. A well-characterised genomic region of complex copy number variation contains beta-defensin genes (DEFB103, DEFB104 and DEFB4), which have a role in the innate immune response. Previous studies have implicated these and related genes as being associated with asthma or COPD. We hypothesised that copy number variation of these genes may play a role in lung function in the general population and in COPD and asthma risk. We undertook copy number typing of this locus in 1149 adult and 689 children using a paralogue ratio test and investigated association with COPD, asthma and lung function. Replication of findings was assessed in a larger independent sample of COPD cases and smoking controls. We found evidence for an association of beta-defensin copy number with COPD in the adult cohort (OR = 1.4, 95%CI:1.02-1.92, P = 0.039) but this finding, and findings from a previous study, were not replicated in a larger follow-up sample(OR = 0.89, 95%CI:0.72-1.07, P = 0.217). No robust evidence of association with asthma in children was observed. We found no evidence for association between beta-defensin copy number and lung function in the general populations. Our findings suggest that previous reports of association of beta-defensin copy number with COPD should be viewed with caution. Suboptimal measurement of copy number can lead to spurious associations. Further beta-defensin copy number measurement in larger sample sizes of COPD cases and children with asthma are needed.
Resumo:
BACKGROUND: Neural tube defects (NTDs) occur in as many as 0.5-2 per 1000 live births in the United States. One of the most common and severe neural tube defects is meningomyelocele (MM) resulting from failed closure of the caudal end of the neural tube. MM has been induced by retinoic acid teratogenicity in rodent models. We hypothesized that genetic variants influencing retinoic acid (RA) induction via retinoic acid receptors (RARs) may be associated with risk for MM. METHODS: We analyzed 47 single nucleotide polymorphisms (SNPs) that span across the three retinoic acid receptor genes using the SNPlex genotyping platform. Our cohort consisted of 610 MM families. RESULTS: One variant in the RARA gene (rs12051734), three variants in the RARB gene (rs6799734, rs12630816, rs17016462), and a single variant in the RARG gene (rs3741434) were found to be statistically significant at p < 0.05. CONCLUSION: RAR genes were associated with risk for MM. For all associated SNPs, the rare allele conferred a protective effect for MM susceptibility.
Resumo:
BACKGROUND: Meningomyelocele (MM) results from lack of closure of the neural tube during embryologic development. Periconceptional folic acid supplementation is a modifier of MM risk in humans, leading toan interest in the folate transport genes as potential candidates for association to MM. METHODS: This study used the SNPlex Genotyping (ABI, Foster City, CA) platform to genotype 20 single polymorphic variants across the folate receptor genes (FOLR1, FOLR2, FOLR3) and the folate carrier gene (SLC19A1) to assess their association to MM. The study population included 329 trio and 281 duo families. Only cases with MM were included. Genetic association was assessed using the transmission disequilibrium test in PLINK. RESULTS: A variant in the FOLR2 gene (rs13908), three linked variants in the FOLR3 gene (rs7925545, rs7926875, rs7926987), and two variants in the SLC19A1 gene (rs1888530 and rs3788200) were statistically significant for association to MM in our population. CONCLUSION: This study involved the analyses of selected single nucleotide polymorphisms across the folate receptor genes and the folate carrier gene in a large population sample. It provided evidence that the rare alleles of specific single nucleotide polymorphisms within these genes appear to be statistically significant for association to MM in the patient population that was tested.
Resumo:
The molecular analysis of genes influencing human height has been notoriously difficult. Genome-wide association studies (GWAS) for height in humans based on tens of thousands to hundreds of thousands of samples so far revealed ∼200 loci for human height explaining only 20% of the heritability. In domestic animals isolated populations with a greatly reduced genetic heterogeneity facilitate a more efficient analysis of complex traits. We performed a genome-wide association study on 1,077 Franches-Montagnes (FM) horses using ∼40,000 SNPs. Our study revealed two QTL for height at withers on chromosomes 3 and 9. The association signal on chromosome 3 is close to the LCORL/NCAPG genes. The association signal on chromosome 9 is close to the ZFAT gene. Both loci have already been shown to influence height in humans. Interestingly, there are very large intergenic regions at the association signals. The two detected QTL together explain ∼18.2% of the heritable variation of height in horses. However, another large fraction of the variance for height in horses results from ECA 1 (11.0%), although the association analysis did not reveal significantly associated SNPs on this chromosome. The QTL region on ECA 3 associated with height at withers was also significantly associated with wither height, conformation of legs, ventral border of mandible, correctness of gaits, and expression of the head. The region on ECA 9 associated with height at withers was also associated with wither height, length of croup and length of back. In addition to these two QTL regions on ECA 3 and ECA 9 we detected another QTL on ECA 6 for correctness of gaits. Our study highlights the value of domestic animal populations for the genetic analysis of complex traits.
Resumo:
Androgens are precursors for sex steroids and are predominantly produced in the human gonads and the adrenal cortex. They are important for intrauterine and postnatal sexual development and human reproduction. Although human androgen biosynthesis has been extensively studied in the past, exact mechanisms underlying the regulation of androgen production in health and disease remain vague. Here, the knowledge on human androgen biosynthesis and regulation is reviewed with a special focus on human adrenal androgen production and the hyperandrogenic disorder of polycystic ovary syndrome (PCOS). Since human androgen regulation is highly specific without a good animal model, most studies are performed on patients harboring inborn errors of androgen biosynthesis, on human biomaterials and human (tumor) cell models. In the past, most studies used a candidate gene approach while newer studies use high throughput technologies to identify novel regulators of androgen biosynthesis. Using genome wide association studies on cohorts of patients, novel PCOS candidate genes have been recently described. Variant 2 of the DENND1A gene was found overexpressed in PCOS theca cells and confirmed to enhance androgen production. Transcriptome profiling of dissected adrenal zones established a role for BMP4 in androgen synthesis. Similarly, transcriptome analysis of human adrenal NCI-H295 cells identified novel regulators of androgen production. Kinase p38α (MAPK14) was found to phosphorylate CYP17 for enhanced 17,20 lyase activity and RARB and ANGPTL1 were detected in novel networks regulating androgens. The discovery of novel players for androgen biosynthesis is of clinical significance as it provides targets for diagnostic and therapeutic use.
Resumo:
Linkage and association studies are major analytical tools to search for susceptibility genes for complex diseases. With the availability of large collection of single nucleotide polymorphisms (SNPs) and the rapid progresses for high throughput genotyping technologies, together with the ambitious goals of the International HapMap Project, genetic markers covering the whole genome will be available for genome-wide linkage and association studies. In order not to inflate the type I error rate in performing genome-wide linkage and association studies, multiple adjustment for the significant level for each independent linkage and/or association test is required, and this has led to the suggestion of genome-wide significant cut-off as low as 5 × 10 −7. Almost no linkage and/or association study can meet such a stringent threshold by the standard statistical methods. Developing new statistics with high power is urgently needed to tackle this problem. This dissertation proposes and explores a class of novel test statistics that can be used in both population-based and family-based genetic data by employing a completely new strategy, which uses nonlinear transformation of the sample means to construct test statistics for linkage and association studies. Extensive simulation studies are used to illustrate the properties of the nonlinear test statistics. Power calculations are performed using both analytical and empirical methods. Finally, real data sets are analyzed with the nonlinear test statistics. Results show that the nonlinear test statistics have correct type I error rates, and most of the studied nonlinear test statistics have higher power than the standard chi-square test. This dissertation introduces a new idea to design novel test statistics with high power and might open new ways to mapping susceptibility genes for complex diseases. ^
Resumo:
Genome-wide association studies (GWAS) have rapidly become a standard method for disease gene discovery. Many recent GWAS indicate that for most disorders, only a few common variants are implicated and the associated SNPs explain only a small fraction of the genetic risk. The current study incorporated gene network information into gene-based analysis of GWAS data for Crohn's disease (CD). The purpose was to develop statistical models to boost the power of identifying disease-associated genes and gene subnetworks by maximizing the use of existing biological knowledge from multiple sources. The results revealed that Markov random field (MRF) based mixture model incorporating direct neighborhood information from a single gene network is not efficient in identifying CD-related genes based on the GWAS data. The incorporation of solely direct neighborhood information might lead to the low efficiency of these models. Alternative MRF models looking beyond direct neighboring information are necessary to be developed in the future for the purpose of this study.^
Resumo:
Studies have shown that rare genetic variants have stronger effects in predisposing common diseases, and several statistical methods have been developed for association studies involving rare variants. In order to better understand how these statistical methods perform, we seek to compare two recently developed rare variant statistical methods (VT and C-alpha) on 10,000 simulated re-sequencing data sets with disease status and the corresponding 10,000 simulated null data sets. The SLC1A1 gene has been suggested to be associated with diastolic blood pressure (DBP) in previous studies. In the current study, we applied VT and C-alpha methods to the empirical re-sequencing data for the SLC1A1 gene from 300 whites and 200 blacks. We found that VT method obtains higher power and performs better than C-alpha method with the simulated data we used. The type I errors were well-controlled for both methods. In addition, both VT and C-alpha methods suggested no statistical evidence for the association between the SLC1A1 gene and DBP. Overall, our findings provided an important comparison of the two statistical methods for future reference and provided preliminary and pioneer findings on the association between the SLC1A1 gene and blood pressure.^
Resumo:
Genome-wide association studies (GWAS) have successfully identified several genetic loci associated with inherited predisposition to primary biliary cirrhosis (PBC), the most common autoimmune disease of the liver. Pathway-based tests constitute a novel paradigm for GWAS analysis. By evaluating genetic variation across a biological pathway (gene set), these tests have the potential to determine the collective impact of variants with subtle effects that are individually too weak to be detected in traditional single variant GWAS analysis. To identify biological pathways associated with the risk of development of PBC, GWAS of PBC from Italy (449 cases and 940 controls) and Canada (530 cases and 398 controls) were independently analyzed. The linear combination test (LCT), a recently developed pathway-level statistical method was used for this analysis. For additional validation, pathways that were replicated at the P <0.05 level of significance in both GWAS on LCT analysis were also tested for association with PBC in each dataset using two complementary GWAS pathway approaches. The complementary approaches included a modification of the gene set enrichment analysis algorithm (i-GSEA4GWAS) and Fisher's exact test for pathway enrichment ratios. Twenty-five pathways were associated with PBC risk on LCT analysis in the Italian dataset at P<0.05, of which eight had an FDR<0.25. The top pathway in the Italian dataset was the TNF/stress related signaling pathway (p=7.38×10 -4, FDR=0.18). Twenty-six pathways were associated with PBC at the P<0.05 level using the LCT in the Canadian dataset with the regulation and function of ChREBP in liver pathway (p=5.68×10-4, FDR=0.285) emerging as the most significant pathway. Two pathways, phosphatidylinositol signaling system (Italian: p=0.016, FDR=0.436; Canadian: p=0.034, FDR=0.693) and hedgehog signaling (Italian: p=0.044, FDR=0.636; Canadian: p=0.041, FDR=0.693), were replicated at LCT P<0.05 in both datasets. Statistically significant association of both pathways with PBC genetic susceptibility was confirmed in the Italian dataset on i-GSEA4GWAS. Results for the phosphatidylinositol signaling system were also significant in both datasets on applying Fisher's exact test for pathway enrichment ratios. This study identified a combination of known and novel pathway-level associations with PBC risk. If functionally validated, the findings may yield fresh insights into the etiology of this complex autoimmune disease with possible preventive and therapeutic application.^
Resumo:
Complex diseases such as cancer result from multiple genetic changes and environmental exposures. Due to the rapid development of genotyping and sequencing technologies, we are now able to more accurately assess causal effects of many genetic and environmental factors. Genome-wide association studies have been able to localize many causal genetic variants predisposing to certain diseases. However, these studies only explain a small portion of variations in the heritability of diseases. More advanced statistical models are urgently needed to identify and characterize some additional genetic and environmental factors and their interactions, which will enable us to better understand the causes of complex diseases. In the past decade, thanks to the increasing computational capabilities and novel statistical developments, Bayesian methods have been widely applied in the genetics/genomics researches and demonstrating superiority over some regular approaches in certain research areas. Gene-environment and gene-gene interaction studies are among the areas where Bayesian methods may fully exert its functionalities and advantages. This dissertation focuses on developing new Bayesian statistical methods for data analysis with complex gene-environment and gene-gene interactions, as well as extending some existing methods for gene-environment interactions to other related areas. It includes three sections: (1) Deriving the Bayesian variable selection framework for the hierarchical gene-environment and gene-gene interactions; (2) Developing the Bayesian Natural and Orthogonal Interaction (NOIA) models for gene-environment interactions; and (3) extending the applications of two Bayesian statistical methods which were developed for gene-environment interaction studies, to other related types of studies such as adaptive borrowing historical data. We propose a Bayesian hierarchical mixture model framework that allows us to investigate the genetic and environmental effects, gene by gene interactions (epistasis) and gene by environment interactions in the same model. It is well known that, in many practical situations, there exists a natural hierarchical structure between the main effects and interactions in the linear model. Here we propose a model that incorporates this hierarchical structure into the Bayesian mixture model, such that the irrelevant interaction effects can be removed more efficiently, resulting in more robust, parsimonious and powerful models. We evaluate both of the 'strong hierarchical' and 'weak hierarchical' models, which specify that both or one of the main effects between interacting factors must be present for the interactions to be included in the model. The extensive simulation results show that the proposed strong and weak hierarchical mixture models control the proportion of false positive discoveries and yield a powerful approach to identify the predisposing main effects and interactions in the studies with complex gene-environment and gene-gene interactions. We also compare these two models with the 'independent' model that does not impose this hierarchical constraint and observe their superior performances in most of the considered situations. The proposed models are implemented in the real data analysis of gene and environment interactions in the cases of lung cancer and cutaneous melanoma case-control studies. The Bayesian statistical models enjoy the properties of being allowed to incorporate useful prior information in the modeling process. Moreover, the Bayesian mixture model outperforms the multivariate logistic model in terms of the performances on the parameter estimation and variable selection in most cases. Our proposed models hold the hierarchical constraints, that further improve the Bayesian mixture model by reducing the proportion of false positive findings among the identified interactions and successfully identifying the reported associations. This is practically appealing for the study of investigating the causal factors from a moderate number of candidate genetic and environmental factors along with a relatively large number of interactions. The natural and orthogonal interaction (NOIA) models of genetic effects have previously been developed to provide an analysis framework, by which the estimates of effects for a quantitative trait are statistically orthogonal regardless of the existence of Hardy-Weinberg Equilibrium (HWE) within loci. Ma et al. (2012) recently developed a NOIA model for the gene-environment interaction studies and have shown the advantages of using the model for detecting the true main effects and interactions, compared with the usual functional model. In this project, we propose a novel Bayesian statistical model that combines the Bayesian hierarchical mixture model with the NOIA statistical model and the usual functional model. The proposed Bayesian NOIA model demonstrates more power at detecting the non-null effects with higher marginal posterior probabilities. Also, we review two Bayesian statistical models (Bayesian empirical shrinkage-type estimator and Bayesian model averaging), which were developed for the gene-environment interaction studies. Inspired by these Bayesian models, we develop two novel statistical methods that are able to handle the related problems such as borrowing data from historical studies. The proposed methods are analogous to the methods for the gene-environment interactions on behalf of the success on balancing the statistical efficiency and bias in a unified model. By extensive simulation studies, we compare the operating characteristics of the proposed models with the existing models including the hierarchical meta-analysis model. The results show that the proposed approaches adaptively borrow the historical data in a data-driven way. These novel models may have a broad range of statistical applications in both of genetic/genomic and clinical studies.
Resumo:
A classic T-cell phenotype in systemic lupus erythematosus (SLE) is the downregulation and replacement of the CD3ζ chain that alters T-cell receptor signaling. However, genetic associations with SLE in the human CD247 locus that encodes CD3ζ are not well established and require replication in independent cohorts. Our aim was therefore to examine, localize and validate CD247-SLE association in a large multiethnic population. We typed 44 contiguous CD247 single-nucleotide polymorphisms (SNPs) in 8922 SLE patients and 8077 controls from four ethnically distinct populations. The strongest associations were found in the Asian population (11 SNPs in intron 1, 4.99 × 10(-4) < P < 4.15 × 10(-2)), where we further identified a five-marker haplotype (rs12141731-rs2949655-rs16859085-rs12144621-rs858554; G-G-A-G-A; P(hap) = 2.12 × 10(-5)) that exceeded the most associated single SNP rs858554 (minor allele frequency in controls = 13%; P = 4.99 × 10(-4), odds ratio = 1.32) in significance. Imputation and subsequent association analysis showed evidence of association (P < 0.05) at 27 additional SNPs within intron 1. Cross-ethnic meta-analysis, assuming an additive genetic model adjusted for population proportions, showed five SNPs with significant P-values (1.40 × 10(-3) < P< 3.97 × 10(-2)), with one (rs704848) remaining significant after Bonferroni correction (P(meta) = 2.66 × 10(-2)). Our study independently confirms and extends the association of SLE with CD247, which is shared by various autoimmune disorders and supports a common T-cell-mediated mechanism.