939 results for genome wide complex trait analysis
Abstract:
Bipolar disorder (BD) and attention deficit/hyperactivity disorder (ADHD) may share common genetic risk factors as indicated by the high co-morbidity of BD and ADHD, their phenotypic overlap especially in pediatric populations, the high heritability of both disorders, and the co-occurrence in families. We therefore examined whether known polygenic BD risk alleles are associated with ADHD. We chose the eight best SNPs of the recent genome-wide association study (GWAS) of BD patients of German ancestry and the nine SNPs from international GWAS meeting a 'genome-wide significance' level of α = 5 × 10⁻⁸. A GWAS was performed in 495 ADHD children and 1,300 population-based controls using HumanHap550v3 and Human660 W-Quadv1 BeadArrays. We found no significant association of childhood ADHD with single BD risk alleles surviving adjustment for multiple testing. Yet, risk alleles for BD and ADHD were directionally consistent at eight of nine loci with the strongest support for three SNPs in or near NCAN, BRE, and LMAN2L. The polygene analysis for the BD risk alleles at all 14 loci indicated a higher probability of being a BD risk allele carrier in the ADHD cases as compared to the controls. At a moderate power to detect association with ADHD, if true effects were close to estimates from GWAS for BD, our results suggest that the possible contribution of BD risk variants to childhood ADHD risk is considerably lower than for BD. Yet, our findings should encourage researchers to search for common genetic risk factors in BD and childhood ADHD in future studies.
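The 'genome-wide significance' level quoted in this abstract is conventionally read as a Bonferroni correction of a 5% family-wise error rate over roughly one million independent tests. The sketch below illustrates that arithmetic only. The test counts are assumptions made for illustration, and the abstract does not say which multiple-testing adjustment the authors actually applied.

```python
def bonferroni_threshold(alpha: float, n_tests: int) -> float:
    """Per-test significance level that keeps the family-wise error rate at alpha."""
    if n_tests <= 0:
        raise ValueError("n_tests must be positive")
    return alpha / n_tests


# Assumption: about one million independent common-variant tests genome-wide.
genome_wide = bonferroni_threshold(0.05, 1_000_000)

# Assumption: a candidate look-up restricted to 14 loci, the number of loci
# named in the abstract; the authors' own correction may differ.
candidate = bonferroni_threshold(0.05, 14)

print(f"genome-wide threshold: {genome_wide:.1e}")
print(f"candidate-locus threshold: {candidate:.4f}")
```

A look-up of a few pre-selected loci needs a far less stringent per-test threshold than a full genome scan, which is why candidate-SNP designs like this one can run with fewer samples.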
Abstract:
Crohn's disease and ulcerative colitis, the two common forms of inflammatory bowel disease (IBD), affect over 2.5 million people of European ancestry, with rising prevalence in other populations. Genome-wide association studies and subsequent meta-analyses of these two diseases as separate phenotypes have implicated previously unsuspected mechanisms, such as autophagy, in their pathogenesis and showed that some IBD loci are shared with other inflammatory diseases. Here we expand on the knowledge of relevant pathways by undertaking a meta-analysis of Crohn's disease and ulcerative colitis genome-wide association scans, followed by extensive validation of significant findings, with a combined total of more than 75,000 cases and controls. We identify 71 new associations, for a total of 163 IBD loci, that meet genome-wide significance thresholds. Most loci contribute to both phenotypes, and both directional (consistently favouring one allele over the course of human history) and balancing (favouring the retention of both alleles within populations) selection effects are evident. Many IBD loci are also implicated in other immune-mediated disorders, most notably with ankylosing spondylitis and psoriasis. We also observe considerable overlap between susceptibility loci for IBD and mycobacterial infection. Gene co-expression network analysis emphasizes this relationship, with pathways shared between host responses to mycobacteria and those predisposing to IBD.
Abstract:
Genome-wide association studies (GWAS) are used to discover genes underlying complex, heritable disorders for which less powerful study designs have failed in the past. The number of GWAS has skyrocketed recently with findings reported in top journals and the mainstream media. Microarrays are the genotype calling technology of choice in GWAS as they permit exploration of more than a million single nucleotide polymorphisms (SNPs) simultaneously. The starting point for the statistical analyses used by GWAS to determine association between loci and disease is the set of genotype calls (AA, AB, or BB). However, the raw data, microarray probe intensities, are heavily processed before arriving at these calls. Various sophisticated statistical procedures have been proposed for transforming raw data into genotype calls. We find that variability in microarray output quality across different SNPs, different arrays, and different sample batches has substantial influence on the accuracy of genotype calls made by existing algorithms. By failing to account for these sources of variability, GWAS run the risk of adversely affecting the quality of reported findings. In this paper we present solutions based on a multi-level mixed model. Software implementation of the method described in this paper is available as free and open source code in the crlmm R/Bioconductor package.
Abstract:
Simulation-based assessment is a popular and frequently necessary approach to evaluation of statistical procedures. Sometimes overlooked is the ability to take advantage of underlying mathematical relations and we focus on this aspect. We show how to take advantage of large-sample theory when conducting a simulation using the analysis of genomic data as a motivating example. The approach uses convergence results to provide an approximation to smaller-sample results, results that are available only by simulation. We consider evaluating and comparing a variety of ranking-based methods for identifying the most highly associated SNPs in a genome-wide association study, derive integral equation representations of the pre-posterior distribution of percentiles produced by three ranking methods, and provide examples comparing performance. These results are of interest in their own right and set the framework for a more extensive set of comparisons.
Abstract:
OBJECTIVE: To report the study of a multigenerational Swiss family with dopa-responsive dystonia (DRD). METHODS: Clinical investigation was made of available family members, including historical and chart reviews. Subject examinations were video recorded. Genetic analysis included a genome-wide linkage study with microsatellite markers (STR), GTP cyclohydrolase I (GCH1) gene sequencing, and dosage analysis. RESULTS: We evaluated 32 individuals, of whom 6 were clinically diagnosed with DRD, with childhood-onset progressive foot dystonia, later generalizing, followed by parkinsonism in the two older patients. The response to levodopa was very good. Two additional patients had late onset dopa-responsive parkinsonism. Three other subjects had DRD symptoms on historical grounds. We found suggestive linkage to the previously reported DYT14 locus, which excluded GCH1. However, further study with more stringent criteria for disease status attribution showed linkage to a larger region, which included GCH1. No mutation was found in GCH1 by gene sequencing but dosage methods identified a novel heterozygous deletion of exons 3 to 6 of GCH1. The mutation was found in seven subjects. One of the patients with dystonia represented a phenocopy. CONCLUSIONS: This study rules out the previously reported DYT14 locus as a cause of disease, as a novel multiexonic deletion was identified in GCH1. This work highlights the necessity of an accurate clinical diagnosis in linkage studies as well as the need for appropriate allele frequencies, penetrance, and phenocopy estimates. Comprehensive sequencing and dosage analysis of known genes is recommended prior to genome-wide linkage analysis.
Abstract:
Little is known about the genes and proteins involved in the process of human memory. To identify genetic factors related to human episodic memory performance, we conducted an ultra-high-density genome-wide screen at > 500 000 single nucleotide polymorphisms (SNPs) in a sample of normal young adults stratified for performance on an episodic recall memory test. Analysis of these data identified SNPs within the calmodulin-binding transcription activator 1 (CAMTA1) gene that were significantly associated with memory performance. A follow-up study, focused on the CAMTA1 locus in an independent cohort consisting of cognitively normal young adults, singled out SNP rs4908449 with a P-value of 0.0002 as the most significantly associated SNP in the region. These validated genetic findings were further supported by the identification of CAMTA1 transcript enrichment in memory-related human brain regions and through a functional magnetic resonance imaging experiment on individuals matched for memory performance that identified CAMTA1 allele-specific upregulation of medial temporal lobe brain activity in those individuals harboring the 'at-risk' allele for poorer memory performance. The CAMTA1 locus encodes a purported transcription factor that interfaces with the calcium-calmodulin system of the cell to alter gene expression patterns. Our validated genomic and functional biological findings described herein suggest a role for CAMTA1 in human episodic memory.
Abstract:
In this study, we demonstrate the power of applying complementary DNA (cDNA) microarray technology to identifying candidate loci that exhibit subtle differences in expression levels associated with a complex trait in natural populations of a nonmodel organism. Using a highly replicated experimental design involving 180 cDNA microarray experiments, we measured gene-expression levels from 1098 transcript probes in 90 individuals originating from six brown trout (Salmo trutta) and one Atlantic salmon (Salmo salar) population, which follow either a migratory or a sedentary life history. We identified several candidate genes associated with preparatory adaptations to different life histories in salmonids, including genes encoding for transaldolase 1, constitutive heat-shock protein HSC70-1 and endozepine. Some of these genes clustered into functional groups, providing insight into the physiological pathways potentially involved in the expression of life-history related phenotypic differences. Such differences included the down-regulation of genes involved in the respiratory system of future migratory individuals. In addition, we used linear discriminant analysis to identify a set of 12 genes that correctly classified immature individuals as migratory or sedentary with high accuracy. Using the expression levels of these 12 genes, 17 out of 18 individuals used for cross-validation were correctly assigned to their respective life-history phenotype. Finally, we found various candidate genes associated with physiological changes that are likely to be involved in preadaptations to seawater in anadromous populations of the genus Salmo, one of which was identified to encode for nucleophosmin 1. Our findings thus provide new molecular insights into salmonid life-history variation, opening new perspectives in the study of this complex trait.
Abstract:
Lung function measures are heritable, predict mortality and are relevant in diagnosis of chronic obstructive pulmonary disease (COPD). COPD and asthma are diseases of the airways with major public health impacts and each has a heritable component. Genome-wide association studies of SNPs have revealed novel genetic associations with both diseases but only account for a small proportion of the heritability. Complex copy number variation may account for some of the missing heritability. A well-characterised genomic region of complex copy number variation contains beta-defensin genes (DEFB103, DEFB104 and DEFB4), which have a role in the innate immune response. Previous studies have implicated these and related genes as being associated with asthma or COPD. We hypothesised that copy number variation of these genes may play a role in lung function in the general population and in COPD and asthma risk. We undertook copy number typing of this locus in 1149 adults and 689 children using a paralogue ratio test and investigated association with COPD, asthma and lung function. Replication of findings was assessed in a larger independent sample of COPD cases and smoking controls. We found evidence for an association of beta-defensin copy number with COPD in the adult cohort (OR = 1.4, 95% CI: 1.02-1.92, P = 0.039) but this finding, and findings from a previous study, were not replicated in a larger follow-up sample (OR = 0.89, 95% CI: 0.72-1.07, P = 0.217). No robust evidence of association with asthma in children was observed. We found no evidence for association between beta-defensin copy number and lung function in the general population. Our findings suggest that previous reports of association of beta-defensin copy number with COPD should be viewed with caution. Suboptimal measurement of copy number can lead to spurious associations. Further beta-defensin copy number measurement in larger samples of COPD cases and children with asthma is needed.
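An odds ratio and its 95% confidence interval, as reported in this abstract, carry enough information to approximate the accompanying P-value. The sketch below assumes a Wald test on the log-odds scale with a symmetric interval; this is an assumption about the analysis, not something the abstract states. The function name `p_from_or_ci` is hypothetical.

```python
import math


def p_from_or_ci(odds_ratio, ci_low, ci_high, z_crit=1.959964):
    """Approximate two-sided Wald P-value from an odds ratio and its 95% CI.

    Assumes the interval is symmetric on the log-odds scale, so the
    standard error is the log-width of the interval divided by 2 * z_crit.
    """
    se = (math.log(ci_high) - math.log(ci_low)) / (2 * z_crit)
    z = math.log(odds_ratio) / se
    # Two-sided normal tail probability: 2 * (1 - Phi(|z|)) = erfc(|z| / sqrt(2)).
    p = math.erfc(abs(z) / math.sqrt(2))
    return z, p


# Values copied from the abstract: discovery cohort, then replication sample.
z_disc, p_disc = p_from_or_ci(1.4, 1.02, 1.92)
z_rep, p_rep = p_from_or_ci(0.89, 0.72, 1.07)

print(f"discovery:   z = {z_disc:.2f}, p = {p_disc:.3f}")
print(f"replication: z = {z_rep:.2f}, p = {p_rep:.3f}")
```

Because the published odds ratios and interval bounds are rounded, the recomputed P-values should be close to the reported ones but need not match them exactly. The qualitative picture is the same: a nominally significant discovery signal with an interval that nearly touches 1, and a replication interval that spans 1.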
Abstract:
Hereditary nasal parakeratosis (HNPK), an inherited monogenic autosomal recessive skin disorder, leads to crusts and fissures on the nasal planum of Labrador Retrievers. We performed a genome-wide association study (GWAS) using 13 HNPK cases and 23 controls. We obtained a single strong association signal on chromosome 2 (p(raw) = 4.4×10⁻¹⁴). The analysis of shared haplotypes among the 13 cases defined a critical interval of 1.6 Mb with 25 predicted genes. We re-sequenced the genome of one case at 38× coverage and detected 3 non-synonymous variants in the critical interval with respect to the reference genome assembly. We genotyped these variants in larger cohorts of dogs and only one was perfectly associated with the HNPK phenotype in a cohort of more than 500 dogs. This candidate causative variant is a missense variant in the SUV39H2 gene encoding a histone 3 lysine 9 (H3K9) methyltransferase, which mediates chromatin silencing. The variant c.972T>G is predicted to change an evolutionarily conserved asparagine into a lysine in the catalytically active domain of the enzyme (p.N324K). We further studied the histopathological alterations in the epidermis in vivo. Our data suggest that the HNPK phenotype is not caused by hyperproliferation, but rather by delayed terminal differentiation of keratinocytes. Thus, our data provide evidence that SUV39H2 is involved in the epigenetic regulation of keratinocyte differentiation ensuring proper stratification and tight sealing of the mammalian epidermis.
Abstract:
Imerslund-Gräsbeck syndrome (IGS) or selective cobalamin malabsorption has been described in humans and dogs. IGS occurs in Border Collies and is inherited as a monogenic autosomal recessive trait in this breed. Using 7 IGS cases and 7 non-affected controls we mapped the causative mutation by genome-wide association and homozygosity mapping to a 3.53 Mb interval on chromosome 2. We re-sequenced the genome of one affected dog at ∼10× coverage and detected 17 non-synonymous variants in the critical interval. Two of these non-synonymous variants were in the cubilin gene (CUBN), which is known to play an essential role in cobalamin uptake from the ileum. We tested these two CUBN variants for association with IGS in larger cohorts of dogs and found that only one of them was perfectly associated with the phenotype. This variant, a single base pair deletion (c.8392delC), is predicted to cause a frameshift and premature stop codon in the CUBN gene. The resulting mutant open reading frame is 821 codons shorter than the wildtype open reading frame (p.Q2798Rfs*3). Interestingly, we observed an additional nonsense mutation in the MRC1 gene encoding the mannose receptor, C type 1, which was in perfect linkage disequilibrium with the CUBN frameshift mutation. Based on our genetic data and the known role of CUBN for cobalamin uptake we conclude that the identified CUBN frameshift mutation is most likely causative for IGS in Border Collies.
Abstract:
We describe a mild form of disproportionate dwarfism in Labrador Retrievers, which is not associated with any obvious health problems such as secondary arthrosis. We designate this phenotype as skeletal dysplasia 2 (SD2). It is inherited as a monogenic autosomal recessive trait with incomplete penetrance primarily in working lines of the Labrador Retriever breed. Using 23 cases and 37 controls we mapped the causative mutation by genome-wide association and homozygosity mapping to a 4.44 Mb interval on chromosome 12. We re-sequenced the genome of one affected dog at 30x coverage and detected 92 non-synonymous variants in the critical interval. Only two of these variants, located in the lymphotoxin A (LTA) and collagen alpha-2(XI) chain gene (COL11A2), respectively, were perfectly associated with the trait. Previously described COL11A2 variants in humans or mice lead to skeletal dysplasias and/or deafness. The dog variant associated with disproportionate dwarfism, COL11A2:c.143G>C or p.R48P, probably has only a minor effect on collagen XI function, which might explain the comparatively mild phenotype seen in our study. The identification of this candidate causative mutation thus widens the known phenotypic spectrum of COL11A2 mutations. We speculate that non-pathogenic COL11A2 variants might even contribute to the heritable variation in height.
Abstract:
Background: Persons infected with human immunodeficiency virus (HIV) have increased rates of coronary artery disease (CAD). The relative contribution of genetic background, HIV-related factors, antiretroviral medications, and traditional risk factors to CAD has not been fully evaluated in the setting of HIV infection. Methods: In the general population, 23 common single-nucleotide polymorphisms (SNPs) were shown to be associated with CAD through genome-wide association analysis. Using the Metabochip, we genotyped 1875 HIV-positive, white individuals enrolled in 24 HIV observational studies, including 571 participants with a first CAD event during the 9-year study period and 1304 controls matched on sex and cohort. Results: A genetic risk score built from 23 CAD-associated SNPs contributed significantly to CAD (P = 2.9×10⁻⁴). In the final multivariable model, participants with an unfavorable genetic background (top genetic score quartile) had a CAD odds ratio (OR) of 1.47 (95% confidence interval [CI], 1.05–2.04). This effect was similar to hypertension (OR = 1.36; 95% CI, 1.06–1.73), hypercholesterolemia (OR = 1.51; 95% CI, 1.16–1.96), diabetes (OR = 1.66; 95% CI, 1.10–2.49), ≥1 year lopinavir exposure (OR = 1.36; 95% CI, 1.06–1.73), and current abacavir treatment (OR = 1.56; 95% CI, 1.17–2.07). The effect of the genetic risk score was additive to the effect of nongenetic CAD risk factors, and did not change after adjustment for family history of CAD. Conclusions: In the setting of HIV infection, the effect of an unfavorable genetic background was similar to traditional CAD risk factors and certain adverse antiretroviral exposures. Genetic testing may provide prognostic information complementary to family history of CAD.
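A genetic risk score of the kind described in this abstract is commonly computed as a sum of risk-allele counts (0, 1 or 2 per SNP), optionally weighted by per-SNP effect sizes, with the top quartile of scores flagged as the unfavorable group. The sketch below shows that general construction only. The weighting scheme, the function names, and the toy numbers are assumptions, and the abstract does not specify how the authors built or weighted their 23-SNP score.

```python
from statistics import quantiles


def genetic_risk_score(risk_allele_counts, weights=None):
    """Sum of risk-allele counts (0, 1 or 2 per SNP), optionally weighted.

    With weights=None every SNP counts equally (an unweighted score).
    """
    if weights is None:
        weights = [1.0] * len(risk_allele_counts)
    if len(weights) != len(risk_allele_counts):
        raise ValueError("need one weight per SNP")
    return sum(w * c for w, c in zip(weights, risk_allele_counts))


def top_quartile_flags(scores):
    """Flag individuals whose score lies above the third quartile cut point."""
    _, _, q3 = quantiles(scores, n=4)
    return [s > q3 for s in scores]


# Toy data, purely illustrative: three SNPs for one person.
unweighted = genetic_risk_score([0, 1, 2])
weighted = genetic_risk_score([0, 1, 2], weights=[0.1, 0.2, 0.3])

# Toy cohort of eight scores; the two highest fall in the top quartile.
flags = top_quartile_flags([1, 2, 3, 4, 5, 6, 7, 8])
print(unweighted, weighted, flags)
```

In an analysis like the one summarized above, the top-quartile flag would then enter a multivariable model alongside the nongenetic risk factors, which is how a single odds ratio for "unfavorable genetic background" can be compared with those for hypertension or diabetes.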
Abstract:
Different life-cycle stages of Trypanosoma brucei are characterized by stage-specific glycoprotein coats. GPEET procyclin, the major surface protein of early procyclic (insect midgut) forms, is transcribed in the nucleolus by RNA polymerase I as part of a polycistronic precursor that is processed to monocistronic mRNAs. In culture, when differentiation to late procyclic forms is triggered by removal of glycerol, the precursor is still transcribed, but accumulation of GPEET mRNA is prevented by a glycerol-responsive element in the 3' UTR. A genome-wide RNAi screen for persistent expression of GPEET in glycerol-free medium identified a novel protein, NRG1 (Nucleolar Regulator of GPEET 1), as a negative regulator. NRG1 associates with GPEET mRNA and with several nucleolar proteins. These include two PUF proteins, TbPUF7 and TbPUF10, and BOP1, a protein required for rRNA processing in other organisms. RNAi against each of these components prolonged or even increased GPEET expression in the absence of glycerol as well as causing a significant reduction in 5.8S rRNA and its immediate precursor. These results indicate that components of a complex used for rRNA maturation can have an additional role in regulating mRNAs that originate in the nucleolus.
Abstract:
DMRT (Doublesex and Mab-3 related transcription factor) proteins, generally associated with sexual differentiation in many organisms, share a common DNA binding domain and are often expressed in reproductive tissues. Aside from doublesex, which is a central factor in the regulation of sex determination, Drosophila possesses three different dmrt genes that are of unknown function. Because the association with sexual differentiation and reproduction is not universal and some DMRT proteins have been found to play other developmental roles, we chose to further characterize one of these Drosophila genes. We carried out genetic analysis of dmrt93B, which was previously found to be expressed sex-specifically in the developing somatic gonad and to affect testis morphogenesis in RNAi knockdowns. In order to disrupt this gene, the GAL4 yeast transcriptional activator followed by a polyadenylation signal was inserted after the dmrt93B start codon and introduced into the genome by homologous recombination. Analysis of the knock-in mutation as well as a small deletion removing all dmrt93B sequence demonstrates that loss of function causes partial lethality at the late pupal stage. Surprisingly, these mutations have no significant effect on gonad formation or male fertility. Analysis of GAL4-driven GFP reporter expression indicates that the dmrt93B promoter activity is highly specific to neurons in the suboesophageal and proventricular ganglion in larvae and adults of both sexes, suggesting a possible role in digestive tract function. Using the Capillary Feeder (CAFÉ) assay to measure daily food intake, we find that reduction in this gene’s function leads to an increase in food consumption. These results suggest dmrt93B plays an important role in the formation or maintenance of neurons that affect feeding and support the idea that dmrt genes may not be restricted to roles in sexual differentiation.