983 resultados para Complex traits
Resumo:
The molecular analysis of genes influencing human height has been notoriously difficult. Genome-wide association studies (GWAS) for height in humans based on tens of thousands to hundreds of thousands of samples so far revealed ∼200 loci for human height explaining only 20% of the heritability. In domestic animals isolated populations with a greatly reduced genetic heterogeneity facilitate a more efficient analysis of complex traits. We performed a genome-wide association study on 1,077 Franches-Montagnes (FM) horses using ∼40,000 SNPs. Our study revealed two QTL for height at withers on chromosomes 3 and 9. The association signal on chromosome 3 is close to the LCORL/NCAPG genes. The association signal on chromosome 9 is close to the ZFAT gene. Both loci have already been shown to influence height in humans. Interestingly, there are very large intergenic regions at the association signals. The two detected QTL together explain ∼18.2% of the heritable variation of height in horses. However, another large fraction of the variance for height in horses results from ECA 1 (11.0%), although the association analysis did not reveal significantly associated SNPs on this chromosome. The QTL region on ECA 3 associated with height at withers was also significantly associated with wither height, conformation of legs, ventral border of mandible, correctness of gaits, and expression of the head. The region on ECA 9 associated with height at withers was also associated with wither height, length of croup and length of back. In addition to these two QTL regions on ECA 3 and ECA 9 we detected another QTL on ECA 6 for correctness of gaits. Our study highlights the value of domestic animal populations for the genetic analysis of complex traits.
Resumo:
Univariate linkage analysis is used routinely to localise genes for human complex traits. Often, many traits are analysed but the significance of linkage for each trait is not corrected for multiple trait testing, which increases the experiment-wise type-I error rate. In addition, univariate analyses do not realise the full power provided by multivariate data sets. Multivariate linkage is the ideal solution but it is computationally intensive, so genome-wide analysis and evaluation of empirical significance are often prohibitive. We describe two simple methods that efficiently alleviate these caveats by combining P-values from multiple univariate linkage analyses. The first method estimates empirical pointwise and genome-wide significance between one trait and one marker when multiple traits have been tested. It is as robust as an appropriate Bonferroni adjustment, with the advantage that no assumptions are required about the number of independent tests performed. The second method estimates the significance of linkage between multiple traits and one marker and, therefore, it can be used to localise regions that harbour pleiotropic quantitative trait loci (QTL). We show that this method has greater power than individual univariate analyses to detect a pleiotropic QTL across different situations. In addition, when traits are moderately correlated and the QTL influences all traits, it can outperform formal multivariate VC analysis. This approach is computationally feasible for any number of traits and was not affected by the residual correlation between traits. We illustrate the utility of our approach with a genome scan of three asthma traits measured in families with a twin proband.
Resumo:
Height is a complex physical trait that displays strong heritability. Adult height is related to length of the long bones, which is determined by growth at the epiphyseal growth plate. Longitudinal bone growth occurs via the process of endochondral ossification, where bone forms over the differentiating cartilage template at the growth plate. Estrogen plays a major role in regulating longitudinal bone growth and is responsible for inducing the pubertal growth spurt and fusion of the epiphyseal growth plate. However, the mechanism by which estrogen promotes epiphyseal fusion is poorly understood. It has been hypothesised that estrogen functions to regulate growth plate fusion by stimulating chondrocyte apoptosis, angiogenesis and bone cell invasion in the growth plate. Another theory has suggested that estrogen exposure exhausts the proliferative capacity of growth plate chondrocytes, which accelerates the process of chondrocyte senescence, leading to growth plate fusion. The overall objective of this study was to gain a greater understanding of the molecular mechanisms behind estrogen-mediated growth and height attainment by examining gene regulation in chondrocytes and the role of some of these genes in normal height inheritance. With the heritability of height so well established, the initial hypothesis was that genetic variation in candidate genes associated with longitudinal bone growth would be involved in normal adult height variation. The height-related genes FGFR3, CBFA1, ER and CBFA1 were screened for novel polymorphisms using denaturing HPLC and RFLP analysis. In total, 24 polymorphisms were identified. Two SNPs in ER (rs3757323 C>T and rs1801132 G>C) were strongly associated with adult male height and displayed an 8 cm and 9 cm height difference between homozygous genotypes, respectively. The TC haplotype of these SNPs was associated with a 6 cm decrease in height and remarkably, no homozygous carriers of the TC haplotype were identified in tall subjects. No significant associations with height were found for polymorphisms in the FGFR3, CBFA1 or VDR genes. In the epiphyseal growth plate, chondrocyte proliferation, matrix synthesis and chondrocyte hypertrophy are all major contributors to long bone growth. As estrogen plays such a significant role in both growth and final height attainment, another hypothesis of this study was that estrogen exerted its effects in the growth plate by influencing chondrocyte proliferation and mediating the expression of chondrocyte marker genes. The examination of genes regulated by estrogen in chondrocyte-like cells aimed to identify potential regulators of growth plate fusion, which may further elucidate mechanisms involved in the cessation of linear growth. While estrogen did not dramatically alter the proliferation of the SW1353 cell line, gene expression experiments identified several estrogen regulated genes. Sixteen chondrocyte marker genes were examined in response to estrogen concentrations ranging from 10-12 M to 10-8 M over varying time points. Of the genes analysed, IHH, FGFR3, collagen II and collagen X were not readily detectable and PTHrP, GHR, ER, BMP6, SOX9 and TGF1 mRNAs showed no significant response to estrogen treatments. However, the expression of MMP13, CBFA1, BCL-2 and BAX genes were significantly decreased. Interestingly, the majority of estrogen regulated genes in SW1353 cells are expressed in the hypertrophic zone of the growth plate. Estrogen is also known to regulate systemic GH secretion and local GH action. At the molecular level, estrogen functions to inhibit GH action by negatively regulating GH signalling. GH treated SW1353 cells displayed increases in MMP9 mRNA expression (4.4-fold) and MMP13 mRNA expression (64-fold) in SW1353 cells. Increases were also detected in their respective proteins. Treatment with AG490, an established JAK2 inhibitor, blocked the GH mediated stimulation of both MMP9 and MMP13 mRNA expression. The application of estrogen and GH to SW1353 cells attenuated GH-stimulated MMP13 levels, but did not affect MMP9 levels. Investigation of GH signalling revealed that SW1353 cells have high levels of activated JAK2 and exposure to GH, estrogen, AG490 and other signalling inhibitors did not affect JAK2 phosphorylation. Interestingly, AG490 treatment dramatically decreased ERK2 signalling, although GH did stimulate ERK2 phosphorylation above control levels. AG490 also decreased CBFA1 expression, a transcription factor known to activate MMP9 and MMP13. Finally, GH and estrogen treatment increased expression of SOCS3 mRNA, suggesting that SOCS3 may regulate JAK/STAT signalling in SW1353 cells. The modulation of GH-mediated MMP expression by estrogen in SW1353 cells represents a potentially novel mechanism by which estrogen may regulate longitudinal bone growth. However, further investigation is required in order to elucidate the precise mechanisms behind estrogen and GH regulation of MMP13 expression in SW1353 cells. This study has provided additional evidence that estrogen and the ER gene are major factors in the regulation of growth and the determination of adult height. Newly identified polymorphisms in the ER gene not only contribute to our understanding of the genetic basis of human height, but may also be useful in association studies examining other complex traits. This study also identified several estrogen regulated genes and indicated that estrogen modifies the expression of genes which are primarily expressed in the hypertrophic region of the epiphyseal growth plate. Furthermore, synergistic studies incorporating GH and estrogen have revealed the ability of estrogen to attenuate the effects of GH on MMP13 expression, revealing potential pathways by which estrogen may modulate growth plate fusion, longitudinal bone growth and even arthritis.
Resumo:
Motivation: Unravelling the genetic architecture of complex traits requires large amounts of data, sophisticated models and large computational resources. The lack of user-friendly software incorporating all these requisites is delaying progress in the analysis of complex traits. Methods: Linkage disequilibrium and linkage analysis (LDLA) is a high-resolution gene mapping approach based on sophisticated mixed linear models, applicable to any population structure. LDLA can use population history information in addition to pedigree and molecular markers to decompose traits into genetic components. Analyses are distributed in parallel over a large public grid of computers in the UK. Results: We have proven the performance of LDLA with analyses of simulated data. There are real gains in statistical power to detect quantitative trait loci when using historical information compared with traditional linkage analysis. Moreover, the use of a grid of computers significantly increases computational speed, hence allowing analyses that would have been prohibitive on a single computer. © The Author 2009. Published by Oxford University Press. All rights reserved.
Resumo:
The past five years have seen many scientific and biological discoveries made through the experimental design of genome-wide association studies (GWASs). These studies were aimed at detecting variants at genomic loci that are associated with complex traits in the population and, in particular, at detecting associations between common single-nucleotide polymorphisms (SNPs) and common diseases such as heart disease, diabetes, auto-immune diseases, and psychiatric disorders. We start by giving a number of quotes from scientists and journalists about perceived problems with GWASs. We will then briefly give the history of GWASs and focus on the discoveries made through this experimental design, what those discoveries tell us and do not tell us about the genetics and biology of complex traits, and what immediate utility has come out of these studies. Rather than giving an exhaustive review of all reported findings for all diseases and other complex traits, we focus on the results for auto-immune diseases and metabolic diseases. We return to the perceived failure or disappointment about GWASs in the concluding section. © 2012 The American Society of Human Genetics.
Resumo:
Vertebral fracture risk is a heritable complex trait. The aim of this study was to identify genetic susceptibility factors for osteoporotic vertebral fractures applying a genome-wide association study (GWAS) approach. The GWAS discovery was based on the Rotterdam Study, a population-based study of elderly Dutch individuals aged >55years; and comprising 329 cases and 2666 controls with radiographic scoring (McCloskey-Kanis) and genetic data. Replication of one top-associated SNP was pursued by de-novo genotyping of 15 independent studies across Europe, the United States, and Australia and one Asian study. Radiographic vertebral fracture assessment was performed using McCloskey-Kanis or Genant semi-quantitative definitions. SNPs were analyzed in relation to vertebral fracture using logistic regression models corrected for age and sex. Fixed effects inverse variance and Han-Eskin alternative random effects meta-analyses were applied. Genome-wide significance was set at p<5×10-8. In the discovery, a SNP (rs11645938) on chromosome 16q24 was associated with the risk for vertebral fractures at p=4.6×10-8. However, the association was not significant across 5720 cases and 21,791 controls from 14 studies. Fixed-effects meta-analysis summary estimate was 1.06 (95% CI: 0.98-1.14; p=0.17), displaying high degree of heterogeneity (I2=57%; Qhet p=0.0006). Under Han-Eskin alternative random effects model the summary effect was significant (p=0.0005). The SNP maps to a region previously found associated with lumbar spine bone mineral density (LS-BMD) in two large meta-analyses from the GEFOS consortium. A false positive association in the GWAS discovery cannot be excluded, yet, the low-powered setting of the discovery and replication settings (appropriate to identify risk effect size >1.25) may still be consistent with an effect size <1.10, more of the type expected in complex traits. Larger effort in studies with standardized phenotype definitions is needed to confirm or reject the involvement of this locus on the risk for vertebral fractures.
Resumo:
A major challenge in human genetics is to devise a systematic strategy to integrate disease-associated variants with diverse genomic and biological data sets to provide insight into disease pathogenesis and guide drug discovery for complex traits such as rheumatoid arthritis (RA)1. Here we performed a genome-wide association study meta-analysis in a total of >100,000 subjects of European and Asian ancestries (29,880 RA cases and 73,758 controls), by evaluating ~10 million single-nucleotide polymorphisms. We discovered 42 novel RA risk loci at a genome-wide level of significance, bringing the total to 101 (refs 2, 3, 4). We devised an in silico pipeline using established bioinformatics methods based on functional annotation5, cis-acting expression quantitative trait loci6 and pathway analyses7, 8, 9—as well as novel methods based on genetic overlap with human primary immunodeficiency, haematological cancer somatic mutations and knockout mouse phenotypes—to identify 98 biological candidate genes at these 101 risk loci. We demonstrate that these genes are the targets of approved therapies for RA, and further suggest that drugs approved for other indications may be repurposed for the treatment of RA. Together, this comprehensive genetic study sheds light on fundamental genes, pathways and cell types that contribute to RA pathogenesis, and provides empirical evidence that the genetics of RA can provide important information for drug discovery.
Resumo:
Genotype-environment interactions (GEI) limit genetic gain for complex traits such as tolerance to drought. Characterization of the crop environment is an important step in understanding GEI. A modelling approach is proposed here to characterize broadly (large geographic area, long-term period) and locally (field experiment) drought-related environmental stresses, which enables breeders to analyse their experimental trials with regard to the broad population of environments that they target. Water-deficit patterns experienced by wheat crops were determined for drought-prone north-eastern Australia, using the APSIM crop model to account for the interactions of crops with their environment (e.g. feedback of plant growth on water depletion). Simulations based on more than 100 years of historical climate data were conducted for representative locations, soils, and management systems, for a check cultivar, Hartog. The three main environment types identified differed in their patterns of simulated water stress around flowering and during grain-filling. Over the entire region, the terminal drought-stress pattern was most common (50% of production environments) followed by a flowering stress (24%), although the frequencies of occurrence of the three types varied greatly across regions, years, and management. This environment classification was applied to 16 trials relevant to late stages testing of a breeding programme. The incorporation of the independently-determined environment types in a statistical analysis assisted interpretation of the GEI for yield among the 18 representative genotypes by reducing the relative effect of GEI compared with genotypic variance, and helped to identify opportunities to improve breeding and germplasm-testing strategies for this region.
Resumo:
Prior genome-wide association studies (GWAS) of major depressive disorder (MDD) have met with limited success. We sought to increase statistical power to detect disease loci by conducting a GWAS mega-analysis for MDD. In the MDD discovery phase, we analyzed more than 1.2 million autosomal and X chromosome single-nucleotide polymorphisms (SNPs) in 18 759 independent and unrelated subjects of recent European ancestry (9240 MDD cases and 9519 controls). In the MDD replication phase, we evaluated 554 SNPs in independent samples (6783 MDD cases and 50 695 controls). We also conducted a cross-disorder meta-analysis using 819 autosomal SNPs with P<0.0001 for either MDD or the Psychiatric GWAS Consortium bipolar disorder (BIP) mega-analysis (9238 MDD cases/8039 controls and 6998 BIP cases/7775 controls). No SNPs achieved genome-wide significance in the MDD discovery phase, the MDD replication phase or in pre-planned secondary analyses (by sex, recurrent MDD, recurrent early-onset MDD, age of onset, pre-pubertal onset MDD or typical-like MDD from a latent class analyses of the MDD criteria). In the MDD-bipolar cross-disorder analysis, 15 SNPs exceeded genome-wide significance (P<5 x 10(-8)), and all were in a 248 kb interval of high LD on 3p21.1 (chr3:52 425 083-53 822 102, minimum P=5.9 x 10(-9) at rs2535629). Although this is the largest genome-wide analysis of MDD yet conducted, its high prevalence means that the sample is still underpowered to detect genetic effects typical for complex traits. Therefore, we were unable to identify robust and replicable findings. We discuss what this means for genetic research for MDD. The 3p21.1 MDD-BIP finding should be interpreted with caution as the most significant SNP did not replicate in MDD samples, and genotyping in independent samples will be needed to resolve its status.
Inference of the genetic architecture underlying BMI and height with the use of 20,240 sibling pairs
Resumo:
Evidence that complex traits are highly polygenic has been presented by population-based genome-wide association studies (GWASs) through the identification of many significant variants, as well as by family-based de novo sequencing studies indicating that several traits have a large mutational target size. Here, using a third study design, we show results consistent with extreme polygenicity for body mass index (BMI) and height. On a sample of 20,240 siblings (from 9,570 nuclear families), we used a within-family method to obtain narrow-sense heritability estimates of 0.42 (SE = 0.17, p = 0.01) and 0.69 (SE = 0.14, p = 6 x 10(-)(7)) for BMI and height, respectively, after adjusting for covariates. The genomic inflation factors from locus-specific linkage analysis were 1.69 (SE = 0.21, p = 0.04) for BMI and 2.18 (SE = 0.21, p = 2 x 10(-10)) for height. This inflation is free of confounding and congruent with polygenicity, consistent with observations of ever-increasing genomic-inflation factors from GWASs with large sample sizes, implying that those signals are due to true genetic signals across the genome rather than population stratification. We also demonstrate that the distribution of the observed test statistics is consistent with both rare and common variants underlying a polygenic architecture and that previous reports of linkage signals in complex traits are probably a consequence of polygenic architecture rather than the segregation of variants with large effects. The convergent empirical evidence from GWASs, de novo studies, and within-family segregation implies that family-based sequencing studies for complex traits require very large sample sizes because the effects of causal variants are small on average.
Resumo:
There is evidence across several species for genetic control of phenotypic variation of complex traits1, 2, 3, 4, such that the variance among phenotypes is genotype dependent. Understanding genetic control of variability is important in evolutionary biology, agricultural selection programmes and human medicine, yet for complex traits, no individual genetic variants associated with variance, as opposed to the mean, have been identified. Here we perform a meta-analysis of genome-wide association studies of phenotypic variation using ~170,000 samples on height and body mass index (BMI) in human populations. We report evidence that the single nucleotide polymorphism (SNP) rs7202116 at the FTO gene locus, which is known to be associated with obesity (as measured by mean BMI for each rs7202116 genotype)5, 6, 7, is also associated with phenotypic variability. We show that the results are not due to scale effects or other artefacts, and find no other experiment-wise significant evidence for effects on variability, either at loci other than FTO for BMI or at any locus for height. The difference in variance for BMI among individuals with opposite homozygous genotypes at the FTO locus is approximately 7%, corresponding to a difference of ~0.5 kilograms in the standard deviation of weight. Our results indicate that genetic variants can be discovered that are associated with variability, and that between-person variability in obesity can partly be explained by the genotype at the FTO locus. The results are consistent with reported FTO by environment interactions for BMI8, possibly mediated by DNA methylation9, 10. Our BMI results for other SNPs and our height results for all SNPs suggest that most genetic variants, including those that influence mean height or mean BMI, are not associated with phenotypic variance, or that their effects on variability are too small to detect even with samples sizes greater than 100,000.
Resumo:
This project aims to use simulatiion modelling to improve our understanding of the genetics and physiology of complex traits with a view to increasing the rate of genetic gain in plant breeding programs.
Resumo:
SNPs discovered by genome-wide association studies (GWASs) account for only a small fraction of the genetic variation of complex traits in human populations. Where is the remaining heritability? We estimated the proportion of variance for human height explained by 294,831 SNPs genotyped on 3,925 unrelated individuals using a linear model analysis, and validated the estimation method with simulations based on the observed genotype data. We show that 45% of variance can be explained by considering all SNPs simultaneously. Thus, most of the heritability is not missing but has not previously been detected because the individual effects are too small to pass stringent significance tests. We provide evidence that the remaining heritability is due to incomplete linkage disequilibrium between causal variants and genotyped SNPs, exacerbated by causal variants having lower minor allele frequency than the SNPs explored to date.
Resumo:
Most information in linkage analysis for quantitative traits comes from pairs of relatives that are phenotypically most discordant or concordant. Confounding this, within-family outliers from non-genetic causes may create false positives and negatives. We investigated the influence of within-family outliers empirically, using one of the largest genome-wide linkage scans for height. The subjects were drawn from Australian twin cohorts consisting of 8447 individuals in 2861 families, providing a total of 5815 possible pairs of siblings in sibships. A variance component linkage analysis was performed, either including or excluding the within-family outliers. Using the entire dataset, the largest LOD scores were on chromosome 15q (LOD 2.3) and 11q (1.5). Excluding within-family outliers increased the LOD score for most regions, but the LOD score on chromosome 15 decreased from 2.3 to 1.2, suggesting that the outliers may create false negatives and false positives, although rare alleles of large effect may also be an explanation. Several regions suggestive of linkage to height were found after removing the outliers, including 1q23.1 (2.0), 3q22.1 (1.9) and 5q32 (2.3). We conclude that the investigation of the effect of within-family outliers, which is usually neglected, should be a standard quality control measure in linkage analysis for complex traits and may reduce the noise for the search of common variants of modest effect size as well as help identify rare variants of large effect and clinical significance. We suggest that the effect of within-family outliers deserves further investigation via theoretical and simulation studies.
Resumo:
The commonly used "end diagnosis" phenotype that is adopted in linkage and association studies of complex traits is likely to represent an oversimplified model of the genetic background of a disease. This is also likely to be the case for common types of migraine, for which no convincingly associated genetic variants have been reported. In headache disorders, most genetic studies have used end diagnoses of the International Headache Society (IHS) classification as phenotypes. Here, we introduce an alternative strategy; we use trait components--individual clinical symptoms of migraine--to determine affection status in genomewide linkage analyses of migraine-affected families. We identified linkage between several traits and markers on chromosome 4q24 (highest LOD score under locus heterogeneity [HLOD] 4.52), a locus we previously reported to be linked to the end diagnosis migraine with aura. The pulsation trait identified a novel locus on 17p13 (HLOD 4.65). Additionally, a trait combination phenotype (IHS full criteria) revealed a locus on 18q12 (HLOD 3.29), and the age at onset trait revealed a locus on 4q28 (HLOD 2.99). Furthermore, suggestive or nearly suggestive evidence of linkage to four additional loci was observed with the traits phonophobia (10q22) and aggravation by physical exercise (12q21, 15q14, and Xp21), and, interestingly, these loci have been linked to migraine in previous studies. Our findings suggest that the use of symptom components of migraine instead of the end diagnosis provides a useful tool in stratifying the sample for genetic studies.