929 resultados para Genomewide association studies
Resumo:
The identification of associations between interleukin-28B (IL-28B) variants and the spontaneous clearance of hepatitis C virus (HCV) raises the issues of causality and the net contribution of host genetics to the trait. To estimate more precisely the net effect of IL-28B genetic variation on HCV clearance, we optimized genotyping and compared the host contributions in multiple- and single-source cohorts to control for viral and demographic effects. The analysis included individuals with chronic or spontaneously cleared HCV infections from a multiple-source cohort (n = 389) and a single-source cohort (n = 71). We performed detailed genotyping in the coding region of IL-28B and searched for copy number variations to identify the genetic variant or haplotype carrying the strongest association with viral clearance. This analysis was used to compare the effects of IL-28B variation in the two cohorts. Haplotypes characterized by carriage of the major alleles at IL-28B single-nucleotide polymorphisms (SNPs) were highly overrepresented in individuals with spontaneous clearance versus those with chronic HCV infections (66.1% versus 38.6%, P = 6 × 10(-9) ). The odds ratios for clearance were 2.1 [95% confidence interval (CI) = 1.6-3.0] and 3.9 (95% CI = 1.5-10.2) in the multiple- and single-source cohorts, respectively. Protective haplotypes were in perfect linkage (r(2) = 1.0) with a nonsynonymous coding variant (rs8103142). Copy number variants were not detected. CONCLUSION: We identified IL-28B haplotypes highly predictive of spontaneous HCV clearance. The high linkage disequilibrium between IL-28B SNPs indicates that association studies need to be complemented by functional experiments to identify single causal variants. The point estimate for the genetic effect was higher in the single-source cohort, which was used to effectively control for viral diversity, sex, and coinfections and, therefore, offered a precise estimate of the net host genetic contribution.
Resumo:
The domestic dog offers a unique opportunity to explore the genetic basis of disease, morphology and behaviour. Humans share many diseases with our canine companions, making dogs an ideal model organism for comparative disease genetics. Using newly developed resources, genome-wide association studies in dog breeds are proving to be exceptionally powerful. Towards this aim, veterinarians and geneticists from 12 European countries are collaborating to collect and analyse the DNA from large cohorts of dogs suffering from a range of carefully defined diseases of relevance to human health. This project, named LUPA, has already delivered considerable results. The consortium has collaborated to develop a new high density single nucleotide polymorphism (SNP) array. Mutations for four monogenic diseases have been identified and the information has been utilised to find mutations in human patients. Several complex diseases have been mapped and fine mapping is underway. These findings should ultimately lead to a better understanding of the molecular mechanisms underlying complex diseases in both humans and their best friend.
Resumo:
Attention-deficit/hyperactivity disorder (ADHD) is a common, highly heritable neurodevelopmental disorder. Genetic loci have not yet been identified by genome-wide association studies. Rare copy number variations (CNVs), such as chromosomal deletions or duplications, have been implicated in ADHD and other neurodevelopmental disorders. To identify rare (frequency 1%) CNVs that increase the risk of ADHD, we performed a whole-genome CNV analysis based on 489 young ADHD patients and 1285 adult population-based controls and identified one significantly associated CNV region. In tests for a global burden of large (>500 kb) rare CNVs, we observed a nonsignificant (P=0.271) 1.126-fold enriched rate of subjects carrying at least one such CNV in the group of ADHD cases. Locus-specific tests of association were used to assess if there were more rare CNVs in cases compared with controls. Detected CNVs, which were significantly enriched in the ADHD group, were validated by quantitative (q)PCR. Findings were replicated in an independent sample of 386 young patients with ADHD and 781 young population-based healthy controls. We identified rare CNVs within the parkinson protein 2 gene (PARK2) with a significantly higher prevalence in ADHD patients than in controls (P=2.8 × 10(-4) after empirical correction for genome-wide testing). In total, the PARK2 locus (chr 6: 162 659 756-162 767 019) harboured three deletions and nine duplications in the ADHD patients and two deletions and two duplications in the controls. By qPCR analysis, we validated 11 of the 12 CNVs in ADHD patients (P=1.2 × 10(-3) after empirical correction for genome-wide testing). In the replication sample, CNVs at the PARK2 locus were found in four additional ADHD patients and one additional control (P=4.3 × 10(-2)). Our results suggest that copy number variants at the PARK2 locus contribute to the genetic susceptibility of ADHD. Mutations and CNVs in PARK2 are known to be associated with Parkinson disease.Molecular Psychiatry advance online publication, 20 November 2012; doi:10.1038/mp.2012.161.
Resumo:
Doublecortin and calmodulin like kinase 1 (DCLK1) is implicated in synaptic plasticity and neurodevelopment. Genetic variants in DCLK1 are associated with cognitive traits, specifically verbal memory and general cognition. We investigated the role of DCLK1 variants in three psychiatric disorders that have neuro-cognitive dysfunctions: schizophrenia (SCZ), bipolar affective disorder (BP) and attention deficit/hyperactivity disorder (ADHD). We mined six genome wide association studies (GWASs) that were available publically or through collaboration; three for BP, two for SCZ and one for ADHD. We also genotyped the DCLK1 region in additional samples of cases with SCZ, BP or ADHD and controls that had not been whole-genome typed. In total, 9895 subjects were analysed, including 5308 normal controls and 4,587 patients (1,125 with SCZ, 2,496 with BP and 966 with ADHD). Several DCLK1 variants were associated with disease phenotypes in the different samples. The main effect was observed for rs7989807 in intron 3, which was strongly associated with SCZ alone and even more so when cases with SCZ and ADHD were combined (P-value = 4 × 10(-5) and 4 × 10(-6), respectively). Associations were also observed with additional markers in intron 3 (combination of SCZ, ADHD and BP), intron 19 (SCZ+BP) and the 3'UTR (SCZ+BP). Our results suggest that genetic variants in DCLK1 are associated with SCZ and, to a lesser extent, with ADHD and BP. Interestingly the association is strongest when SCZ and ADHD are considered together, suggesting common genetic susceptibility. Given that DCLK1 variants were previously found to be associated with cognitive traits, these results are consistent with the role of DCLK1 in neurodevelopment and synaptic plasticity.
Resumo:
Crohn's disease and ulcerative colitis, the two common forms of inflammatory bowel disease (IBD), affect over 2.5 million people of European ancestry, with rising prevalence in other populations. Genome-wide association studies and subsequent meta-analyses of these two diseases as separate phenotypes have implicated previously unsuspected mechanisms, such as autophagy, in their pathogenesis and showed that some IBD loci are shared with other inflammatory diseases. Here we expand on the knowledge of relevant pathways by undertaking a meta-analysis of Crohn's disease and ulcerative colitis genome-wide association scans, followed by extensive validation of significant findings, with a combined total of more than 75,000 cases and controls. We identify 71 new associations, for a total of 163 IBD loci, that meet genome-wide significance thresholds. Most loci contribute to both phenotypes, and both directional (consistently favouring one allele over the course of human history) and balancing (favouring the retention of both alleles within populations) selection effects are evident. Many IBD loci are also implicated in other immune-mediated disorders, most notably with ankylosing spondylitis and psoriasis. We also observe considerable overlap between susceptibility loci for IBD and mycobacterial infection. Gene co-expression network analysis emphasizes this relationship, with pathways shared between host responses to mycobacteria and those predisposing to IBD.
Resumo:
High-throughput SNP arrays provide estimates of genotypes for up to one million loci, often used in genome-wide association studies. While these estimates are typically very accurate, genotyping errors do occur, which can influence in particular the most extreme test statistics and p-values. Estimates for the genotype uncertainties are also available, although typically ignored. In this manuscript, we develop a framework to incorporate these genotype uncertainties in case-control studies for any genetic model. We verify that using the assumption of a “local alternative” in the score test is very reasonable for effect sizes typically seen in SNP association studies, and show that the power of the score test is simply a function of the correlation of the genotype probabilities with the true genotypes. We demonstrate that the power to detect a true association can be substantially increased for difficult to call genotypes, resulting in improved inference in association studies.
Resumo:
Statistical approaches to evaluate higher order SNP-SNP and SNP-environment interactions are critical in genetic association studies, as susceptibility to complex disease is likely to be related to the interaction of multiple SNPs and environmental factors. Logic regression (Kooperberg et al., 2001; Ruczinski et al., 2003) is one such approach, where interactions between SNPs and environmental variables are assessed in a regression framework, and interactions become part of the model search space. In this manuscript we extend the logic regression methodology, originally developed for cohort and case-control studies, for studies of trios with affected probands. Trio logic regression accounts for the linkage disequilibrium (LD) structure in the genotype data, and accommodates missing genotypes via haplotype-based imputation. We also derive an efficient algorithm to simulate case-parent trios where genetic risk is determined via epistatic interactions.
Resumo:
Genome-wide association studies (GWAS) are used to discover genes underlying complex, heritable disorders for which less powerful study designs have failed in the past. The number of GWAS has skyrocketed recently with findings reported in top journals and the mainstream media. Mircorarrays are the genotype calling technology of choice in GWAS as they permit exploration of more than a million single nucleotide polymorphisms (SNPs)simultaneously. The starting point for the statistical analyses used by GWAS, to determine association between loci and disease, are genotype calls (AA, AB, or BB). However, the raw data, microarray probe intensities, are heavily processed before arriving at these calls. Various sophisticated statistical procedures have been proposed for transforming raw data into genotype calls. We find that variability in microarray output quality across different SNPs, different arrays, and different sample batches has substantial inuence on the accuracy of genotype calls made by existing algorithms. Failure to account for these sources of variability, GWAS run the risk of adversely affecting the quality of reported findings. In this paper we present solutions based on a multi-level mixed model. Software implementation of the method described in this paper is available as free and open source code in the crlmm R/BioConductor.
Resumo:
Submicroscopic changes in chromosomal DNA copy number dosage are common and have been implicated in many heritable diseases and cancers. Recent high-throughput technologies have a resolution that permits the detection of segmental changes in DNA copy number that span thousands of basepairs across the genome. Genome-wide association studies (GWAS) may simultaneously screen for copy number-phenotype and SNP-phenotype associations as part of the analytic strategy. However, genome-wide array analyses are particularly susceptible to batch effects as the logistics of preparing DNA and processing thousands of arrays often involves multiple laboratories and technicians, or changes over calendar time to the reagents and laboratory equipment. Failure to adjust for batch effects can lead to incorrect inference and requires inefficient post-hoc quality control procedures that exclude regions that are associated with batch. Our work extends previous model-based approaches for copy number estimation by explicitly modeling batch effects and using shrinkage to improve locus-specific estimates of copy number uncertainty. Key features of this approach include the use of diallelic genotype calls from experimental data to estimate batch- and locus-specific parameters of background and signal without the requirement of training data. We illustrate these ideas using a study of bipolar disease and a study of chromosome 21 trisomy. The former has batch effects that dominate much of the observed variation in quantile-normalized intensities, while the latter illustrates the robustness of our approach to datasets where as many as 25% of the samples have altered copy number. Locus-specific estimates of copy number can be plotted on the copy-number scale to investigate mosaicism and guide the choice of appropriate downstream approaches for smoothing the copy number as a function of physical position. The software is open source and implemented in the R package CRLMM available at Bioconductor (http:www.bioconductor.org).
Resumo:
Chronic alcohol consumption is a major risk factor for the development of chronic pancreatitis. However, chronic pancreatitis occurs only in a minority of heavy drinkers. This variability may be due to yet unidentified genetic factors. Several enzymes involved in the degradation of reactive oxidants and xenobiotics, such as glutathione-S-transferase P1 (GSTP1) and manganese-superoxide dismutase (MnSOD) reveal functional polymorphisms that affect the antioxidative capacity and may therefore modulate the development of chronic pancreatitis and long-term complications like endocrine and exocrine pancreatic insufficiency. Two functional polymorphisms of the MnSOD and the GSTP1 gene were assessed by polymerase chain reaction and restriction fragment length polymorphism in 165 patients with chronic alcoholic pancreatitis, 140 alcoholics without evidence of pancreatic disease and 160 healthy control subjects. The distribution of GSTP1 and MnSOD genotypes were in Hardy-Weinberg equilibrium in the total cohort. Genotype and allele frequencies for both genes were not statistically different between the three groups. Although genotype MnSOD Ala/Val was seemingly associated with the presence of exocrine pancreatic insufficiency, this subgroup was too small and the association statistically underpowered. None of the tested genotypes affected the development of endocrine pancreatic insufficiency. Polymorphisms of MnSOD and GSTP1 are not associated with chronic alcoholic pancreatitis. The present data emphasize the need for stringently designed candidate gene association studies with well-characterized cases and controls and sufficient statistical power to exclude chance observations.
Resumo:
BACKGROUND: As only a minority of alcoholics develop cirrhosis, polymorphic genes, whose products are involved in fibrosis development were suggested to confer individual susceptibility. We tested whether a functional promoter polymorphism in the gene encoding matrix metalloproteinase-3 (MMP-3; 1171 5A/6A) was associated liver cirrhosis in alcoholics. METHODS: Independent cohorts from the UK and Germany were studied. (i) UK cohort: 320 alcoholic cirrhotics and 183 heavy drinkers without liver damage and (ii) German cohort: 149 alcoholic cirrhotics, 220 alcoholic cirrhotics who underwent liver transplantation and 151 alcoholics without liver disease. Patients were genotyped for MMP-3 variants by restriction fragment length polymorphism, single strand confirmation polymorphism, and direct sequencing. In addition, MMP-3 transcript levels were correlated with MMP-3 genotype in normal liver tissues. RESULTS: Matrix metalloproteinase-3 genotype and allele distribution in all 1023 alcoholic patients were in Hardy-Weinberg equilibrium. No significant differences in MMP-3 genotype and allele frequencies were observed either between alcoholics with or without cirrhosis. There were no differences in hepatic mRNA transcription levels according to MMP-3 genotype. CONCLUSIONS: Matrix metalloproteinase-3 1171 promoter polymorphism plays no role in the genetic predisposition for liver cirrhosis in alcoholics. Stringently designed candidate gene association studies are required to exclude chance observations.
Resumo:
BACKGROUND: HIV-infected individuals have an increased risk of myocardial infarction. Antiretroviral therapy (ART) is regarded as a major determinant of dyslipidemia in HIV-infected individuals. Previous genetic studies have been limited by the validity of the single-nucleotide polymorphisms (SNPs) interrogated and by cross-sectional design. Recent genome-wide association studies have reliably associated common SNPs to dyslipidemia in the general population. METHODS AND RESULTS: We validated the contribution of 42 SNPs (33 identified in genome-wide association studies and 9 previously reported SNPs not included in genome-wide association study chips) and of longitudinally measured key nongenetic variables (ART, underlying conditions, sex, age, ethnicity, and HIV disease parameters) to dyslipidemia in 745 HIV-infected study participants (n=34 565 lipid measurements; median follow-up, 7.6 years). The relative impact of SNPs and ART to lipid variation in the study population and their cumulative influence on sustained dyslipidemia at the level of the individual were calculated. SNPs were associated with lipid changes consistent with genome-wide association study estimates. SNPs explained up to 7.6% (non-high-density lipoprotein cholesterol), 6.2% (high-density lipoprotein cholesterol), and 6.8% (triglycerides) of lipid variation; ART explained 3.9% (non-high-density lipoprotein cholesterol), 1.5% (high-density lipoprotein cholesterol), and 6.2% (triglycerides). An individual with the most dyslipidemic antiretroviral and genetic background had an approximately 3- to 5-fold increased risk of sustained dyslipidemia compared with an individual with the least dyslipidemic therapy and genetic background. CONCLUSIONS: In the HIV-infected population treated with ART, the weight of the contribution of common SNPs and ART to dyslipidemia was similar. When selecting an ART regimen, genetic information should be considered in addition to the dyslipidemic effects of ART agents.
Resumo:
Acute infection with the hepatitis C virus (HCV) induces a wide range of innate and adaptive immune responses. A total of 20-50% of acutely HCV-infected individuals permanently control the virus, referred to as 'spontaneous hepatitis C clearance', while the infection progresses to chronic hepatitis C in the majority of cases. Numerous studies have examined host genetic determinants of hepatitis C infection outcome and revealed the influence of genetic polymorphisms of human leukocyte antigens, killer immunoglobulin-like receptors, chemokines, interleukins and interferon-stimulated genes on spontaneous hepatitis C clearance. However, most genetic associations were not confirmed in independent cohorts, revealed opposing results in diverse populations or were limited by varying definitions of hepatitis C outcomes or small sample size. Coordinated efforts are needed in the search for key genetic determinants of spontaneous hepatitis C clearance that include well-conducted candidate genetic and genome-wide association studies, direct sequencing and follow-up functional studies.
Resumo:
Attention deficit/hyperactivity disorder (ADHD) is a highly heritable neurodevelopmental disorder of childhood onset. Clinical and biological evidence points to shared common central nervous system (CNS) pathology of ADHD and restless legs syndrome (RLS). It was hypothesized that variants previously found to be associated with RLS in two large genome-wide association studies (GWA), will also be associated with ADHD. SNPs located in MEIS1 (rs2300478), BTBD9 (rs9296249, rs3923809, rs6923737), and MAP2K5 (rs12593813, rs4489954) as well as three SNPs tagging the identified haplotype in MEIS1 (rs6710341, rs12469063, rs4544423) were genotyped in a well characterized German sample of 224 families comprising one or more affected sibs (386 children) and both parents. We found no evidence for preferential transmission of the hypothesized variants to ADHD. Subsequent analyses elicited nominal significant association with haplotypes consisting of the three SNPs in BTBD9 (chi2 = 14.8, df = 7, nominal p = 0.039). According to exploratory post hoc analyses, the major contribution to this finding came from the A-A-A-haplotype with a haplotype-wise nominal p-value of 0.009. However, this result did not withstand correction for multiple testing. In view of our results, RLS risk alleles may have a lower effect on ADHD than on RLS or may not be involved in ADHD. The negative findings may additionally result from genetic heterogeneity of ADHD, i.e. risk alleles for RLS may only be relevant for certain subtypes of ADHD. Genes relevant to RLS remain interesting candidates for ADHD; particularly BTBD9 needs further study, as it has been related to iron storage, a potential pathophysiological link between RLS and certain subtypes of ADHD.
Resumo:
Horses were domesticated from the Eurasian steppes 5,000-6,000 years ago. Since then, the use of horses for transportation, warfare, and agriculture, as well as selection for desired traits and fitness, has resulted in diverse populations distributed across the world, many of which have become or are in the process of becoming formally organized into closed, breeding populations (breeds). This report describes the use of a genome-wide set of autosomal SNPs and 814 horses from 36 breeds to provide the first detailed description of equine breed diversity. F(ST) calculations, parsimony, and distance analysis demonstrated relationships among the breeds that largely reflect geographic origins and known breed histories. Low levels of population divergence were observed between breeds that are relatively early on in the process of breed development, and between those with high levels of within-breed diversity, whether due to large population size, ongoing outcrossing, or large within-breed phenotypic diversity. Populations with low within-breed diversity included those which have experienced population bottlenecks, have been under intense selective pressure, or are closed populations with long breed histories. These results provide new insights into the relationships among and the diversity within breeds of horses. In addition these results will facilitate future genome-wide association studies and investigations into genomic targets of selection.