23 resultados para Genomewide association studies
em CentAUR: Central Archive University of Reading - UK
Resumo:
Pharmacovigilance, the monitoring of adverse events (AEs), is an integral part in the clinical evaluation of a new drug. Until recently, attempts to relate the incidence of AEs to putative causes have been restricted to the evaluation of simple demographic and environmental factors. The advent of large-scale genotyping, however, provides an opportunity to look for associations between AEs and genetic markers, such as single nucleotides polymorphisms (SNPs). It is envisaged that a very large number of SNPs, possibly over 500 000, will be used in pharmacovigilance in an attempt to identify any genetic difference between patients who have experienced an AE and those who have not. We propose a sequential genome-wide association test for analysing AEs as they arise, allowing evidence-based decision-making at the earliest opportunity. This gives us the capability of quickly establishing whether there is a group of patients at high-risk of an AE based upon their DNA. Our method provides a valid test which takes account of linkage disequilibrium and allows for the sequential nature of the procedure. The method is more powerful than using a correction, such as idák, that assumes that the tests are independent. Copyright © 2006 John Wiley & Sons, Ltd.
Resumo:
We introduce a procedure for association based analysis of nuclear families that allows for dichotomous and more general measurements of phenotype and inclusion of covariate information. Standard generalized linear models are used to relate phenotype and its predictors. Our test procedure, based on the likelihood ratio, unifies the estimation of all parameters through the likelihood itself and yields maximum likelihood estimates of the genetic relative risk and interaction parameters. Our method has advantages in modelling the covariate and gene-covariate interaction terms over recently proposed conditional score tests that include covariate information via a two-stage modelling approach. We apply our method in a study of human systemic lupus erythematosus and the C-reactive protein that includes sex as a covariate.
Resumo:
Genome-wide association studies (GWAS) have been widely used in genetic dissection of complex traits. However, common methods are all based on a fixed-SNP-effect mixed linear model (MLM) and single marker analysis, such as efficient mixed model analysis (EMMA). These methods require Bonferroni correction for multiple tests, which often is too conservative when the number of markers is extremely large. To address this concern, we proposed a random-SNP-effect MLM (RMLM) and a multi-locus RMLM (MRMLM) for GWAS. The RMLM simply treats the SNP-effect as random, but it allows a modified Bonferroni correction to be used to calculate the threshold p value for significance tests. The MRMLM is a multi-locus model including markers selected from the RMLM method with a less stringent selection criterion. Due to the multi-locus nature, no multiple test correction is needed. Simulation studies show that the MRMLM is more powerful in QTN detection and more accurate in QTN effect estimation than the RMLM, which in turn is more powerful and accurate than the EMMA. To demonstrate the new methods, we analyzed six flowering time related traits in Arabidopsis thaliana and detected more genes than previous reported using the EMMA. Therefore, the MRMLM provides an alternative for multi-locus GWAS.
Resumo:
A Bayesian approach to analysing data from family-based association studies is developed. This permits direct assessment of the range of possible values of model parameters, such as the recombination frequency and allelic associations, in the light of the data. In addition, sophisticated comparisons of different models may be handled easily, even when such models are not nested. The methodology is developed in such a way as to allow separate inferences to be made about linkage and association by including theta, the recombination fraction between the marker and disease susceptibility locus under study, explicitly in the model. The method is illustrated by application to a previously published data set. The data analysis raises some interesting issues, notably with regard to the weight of evidence necessary to convince us of linkage between a candidate locus and disease.
Resumo:
A study or experiment can be described as sequential if its design includes one or more interim analyses at which it is possible to stop the study, having reached a definitive conclusion concerning the primary question of interest. The potential of the sequential study to terminate earlier than the equivalent fixed sample size study means that, typically, there are ethical and economic advantages to be gained from using a sequential design. These advantages have secured a place for the methodology in the conduct of many clinical trials of novel therapies. Recently, there has been increasing interest in pharmacogenetics: the study of how DNA variation in the human genome affects the safety and efficacy of drugs. The potential for using sequential methodology in pharmacogenetic studies is considered and the conduct of candidate gene association studies, family-based designs and genome-wide association studies within the sequential setting is explored. The objective is to provide a unified framework for the conduct of these types of studies as sequential designs and hence allow experimenters to consider using sequential methodology in their future pharmacogenetic studies.
Resumo:
BACKGROUND: Evidence suggests the wide variation in platelet response within the population is genetically controlled. Unraveling the complex relationship between sequence variation and platelet phenotype requires accurate and reproducible measurement of platelet response. OBJECTIVE: To develop a methodology suitable for measuring signaling pathway-specific platelet phenotype, to use this to measure platelet response in a large cohort, and to demonstrate the effect size of sequence variation in a relevant model gene. METHODS: Three established platelet assays were evaluated: mobilization of [Ca(2+)](i), aggregometry and flow cytometry, each in response to adenosine 5'-diphosphate (ADP) or the glycoprotein (GP) VI-specific crosslinked collagen-related peptide (CRP). Flow cytometric measurement of fibrinogen binding and P-selectin expression in response to a single, intermediate dose of each agonist gave the best combination of reproducibility and inter-individual variability and was used to measure the platelet response in 506 healthy volunteers. Pathway specificity was ensured by blocking the main subsidiary signaling pathways. RESULTS: Individuals were identified who were hypo- or hyper-responders for both pathways, or who had differential responses to the two agonists, or between outcomes. 89 individuals, retested three months later using the same methodology, showed high concordance between the two visits in all four assays (r(2) = 0.872, 0.868, 0.766 and 0.549); all subjects retaining their phenotype at recall. The effect of sequence variation at the GP6 locus accounted for approximately 35% of the variation in the CRP-XL response. CONCLUSION: Genotyping-phenotype association studies in a well-characterized, large cohort provides a powerful strategy to measure the effect of sequence variation in genes regulating the platelet response.
Resumo:
Before the advent of genome-wide association studies (GWASs), hundreds of candidate genes for obesity-susceptibility had been identified through a variety of approaches. We examined whether those obesity candidate genes are enriched for associations with body mass index (BMI) compared with non-candidate genes by using data from a large-scale GWAS. A thorough literature search identified 547 candidate genes for obesity-susceptibility based on evidence from animal studies, Mendelian syndromes, linkage studies, genetic association studies and expression studies. Genomic regions were defined to include the genes ±10 kb of flanking sequence around candidate and non-candidate genes. We used summary statistics publicly available from the discovery stage of the genome-wide meta-analysis for BMI performed by the genetic investigation of anthropometric traits consortium in 123 564 individuals. Hypergeometric, rank tail-strength and gene-set enrichment analysis tests were used to test for the enrichment of association in candidate compared with non-candidate genes. The hypergeometric test of enrichment was not significant at the 5% P-value quantile (P = 0.35), but was nominally significant at the 25% quantile (P = 0.015). The rank tail-strength and gene-set enrichment tests were nominally significant for the full set of genes and borderline significant for the subset without SNPs at P < 10(-7). Taken together, the observed evidence for enrichment suggests that the candidate gene approach retains some value. However, the degree of enrichment is small despite the extensive number of candidate genes and the large sample size. Studies that focus on candidate genes have only slightly increased chances of detecting associations, and are likely to miss many true effects in non-candidate genes, at least for obesity-related traits.
Resumo:
The INSIG2 rs7566605 polymorphism was identified for obesity (BMI> or =30 kg/m(2)) in one of the first genome-wide association studies, but replications were inconsistent. We collected statistics from 34 studies (n = 74,345), including general population (GP) studies, population-based studies with subjects selected for conditions related to a better health status ('healthy population', HP), and obesity studies (OB). We tested five hypotheses to explore potential sources of heterogeneity. The meta-analysis of 27 studies on Caucasian adults (n = 66,213) combining the different study designs did not support overall association of the CC-genotype with obesity, yielding an odds ratio (OR) of 1.05 (p-value = 0.27). The I(2) measure of 41% (p-value = 0.015) indicated between-study heterogeneity. Restricting to GP studies resulted in a declined I(2) measure of 11% (p-value = 0.33) and an OR of 1.10 (p-value = 0.015). Regarding the five hypotheses, our data showed (a) some difference between GP and HP studies (p-value = 0.012) and (b) an association in extreme comparisons (BMI> or =32.5, 35.0, 37.5, 40.0 kg/m(2) versus BMI<25 kg/m(2)) yielding ORs of 1.16, 1.18, 1.22, or 1.27 (p-values 0.001 to 0.003), which was also underscored by significantly increased CC-genotype frequencies across BMI categories (10.4% to 12.5%, p-value for trend = 0.0002). We did not find evidence for differential ORs (c) among studies with higher than average obesity prevalence compared to lower, (d) among studies with BMI assessment after the year 2000 compared to those before, or (e) among studies from older populations compared to younger. Analysis of non-Caucasian adults (n = 4889) or children (n = 3243) yielded ORs of 1.01 (p-value = 0.94) or 1.15 (p-value = 0.22), respectively. There was no evidence for overall association of the rs7566605 polymorphism with obesity. Our data suggested an association with extreme degrees of obesity, and consequently heterogeneous effects from different study designs may mask an underlying association when unaccounted for. The importance of study design might be under-recognized in gene discovery and association replication so far.
A hierarchical Bayesian model for predicting the functional consequences of amino-acid polymorphisms
Resumo:
Genetic polymorphisms in deoxyribonucleic acid coding regions may have a phenotypic effect on the carrier, e.g. by influencing susceptibility to disease. Detection of deleterious mutations via association studies is hampered by the large number of candidate sites; therefore methods are needed to narrow down the search to the most promising sites. For this, a possible approach is to use structural and sequence-based information of the encoded protein to predict whether a mutation at a particular site is likely to disrupt the functionality of the protein itself. We propose a hierarchical Bayesian multivariate adaptive regression spline (BMARS) model for supervised learning in this context and assess its predictive performance by using data from mutagenesis experiments on lac repressor and lysozyme proteins. In these experiments, about 12 amino-acid substitutions were performed at each native amino-acid position and the effect on protein functionality was assessed. The training data thus consist of repeated observations at each position, which the hierarchical framework is needed to account for. The model is trained on the lac repressor data and tested on the lysozyme mutations and vice versa. In particular, we show that the hierarchical BMARS model, by allowing for the clustered nature of the data, yields lower out-of-sample misclassification rates compared with both a BMARS and a frequen-tist MARS model, a support vector machine classifier and an optimally pruned classification tree.
High throughput, high resolution selection of polymorphic microsatellite loci for multiplex analysis
Resumo:
Background Large-scale genetic profiling, mapping and genetic association studies require access to a series of well-characterised and polymorphic microsatellite markers with distinct and broad allele ranges. Selection of complementary microsatellite markers with non-overlapping allele ranges has historically proved to be a bottleneck in the development of multiplex microsatellite assays. The characterisation process for each microsatellite locus can be laborious and costly given the need for numerous, locus-specific fluorescent primers. Results Here, we describe a simple and inexpensive approach to select useful microsatellite markers. The system is based on the pooling of multiple unlabelled PCR amplicons and their subsequent ligation into a standard cloning vector. A second round of amplification utilising generic labelled primers targeting the vector and unlabelled locus-specific primers targeting the microsatellite flanking region yield allelic profiles that are representative of all individuals contained within the pool. Suitability of various DNA pool sizes was then tested for this purpose. DNA template pools containing between 8 and 96 individuals were assessed for the determination of allele ranges of individual microsatellite markers across a broad population. This helped resolve the balance between using pools that are large enough to allow the detection of many alleles against the risk of including too many individuals in a pool such that rare alleles are over-diluted and so do not appear in the pooled microsatellite profile. Pools of DNA from 12 individuals allowed the reliable detection of all alleles present in the pool. Conclusion The use of generic vector-specific fluorescent primers and unlabelled locus-specific primers provides a high resolution, rapid and inexpensive approach for the selection of highly polymorphic microsatellite loci that possess non-overlapping allele ranges for use in large-scale multiplex assays.
Resumo:
Objective: SNPs identified from genome wide association studies associate with lipid risk markers of cardiovascular disease. This study investigated whether these SNPs altered the plasma lipid response to diet in the ‘RISCK’ study cohort. Methods: Participants (n = 490) from a dietary intervention to lower saturated fat by replacement with carbohydrate or monounsaturated fat, were genotyped for 39 lipid-associated SNPs. The association of each individual SNP, and of the SNPs combined (using genetic predisposition scores), with plasma lipid concentrations was assessed at baseline, and on change in response to 24 weeks on diets. Results: The associations between SNPs and lipid concentrations were directionally consistent with previous findings. The genetic predisposition scores were associated with higher baseline concentrations of plasma total(P = 0.02) and LDL (P = 0.002) cholesterol, triglycerides (P = 0.001) and apolipoprotein B (P = 0.004), and with lower baseline concentrations of HDL cholesterol (P < 0.001) and apolipoprotein A-I (P < 0.001). None of the SNPs showed significant association with the reduction of plasma lipids in response to the dietary interventions and there was no evidence of diet-gene interactions. Conclusion: Results from this exploratory study have shown that increased genetic predisposition was associated with an unfavourable plasma lipid profile at baseline, but did not influence the improvement in lipid profiles by the low-saturated-fat diets.
Resumo:
Autism Spectrum Conditions (ASC) are much more common in males, a bias that may offer clues to the etiology of this condition. Although the cause of this bias remains a mystery, we argue that it occurs because ASC is an extreme manifestation of the male brain. The extreme male brain (EMB) theory, first proposed in 1997, is an extension of the Empathizing-Systemizing (E-S) theory of typical sex differences that proposes that females on average have a stronger drive to empathize while males on average have a stronger drive to systemize. In this first major update since 2005, we describe some of the evidence relating to the EMB theory of ASC and consider how typical sex differences in brain structure may be relevant to ASC. One possible biological mechanism to account for the male bias is the effect of fetal testosterone (fT). We also consider alternative biological theories, the X and Y chromosome theories, and the reduced autosomal penetrance theory. None of these theories has yet been fully confirmed or refuted, though the weight of evidence in favor of the fT theory is growing from converging sources (longitudinal amniocentesis studies from pregnancy to age 10 years old, current hormone studies, and genetic association studies of SNPs in the sex steroid pathways). Ultimately, as these theories are not mutually exclusive and ASC is multi-factorial, they may help explain the male prevalence of ASC.
Resumo:
Genome-wide association studies have identified SNPs reproducibly associated with type 2 diabetes (T2D). We examined the effect of genetic predisposition to T2D on insulin sensitivity and secretion using detailed phenotyping in overweight individuals with no diagnosis of T2D. Furthermore, we investigated whether this genetic predisposition modifies the responses in beta-cell function and insulin sensitivity to a 24-week dietary intervention. We genotyped 25 T2D-associated SNPs in 377 white participants from the RISCK study. Participants underwent an IVGTT prior to and following a dietary intervention that aimed to lower saturated fat intake by replacement with monounsaturated fat or carbohydrate. We composed a genetic predisposition score (T2D-GPS) by summing the T2D risk-increasing alleles of the 25 SNPs and tested for association with insulin secretion and sensitivity at baseline, and with the change in response to the dietary intervention. At baseline, a higher T2D-GPS was associated with lower acute insulin secretion (AIRg 4% lower/risk allele, P = 0.006) and lower insulin secretion for a given level of insulin sensitivity, assessed by the disposition index (DI 5% lower/risk allele, P = 0.002), but not with insulin sensitivity (Si). T2D-GPS did not modify changes in insulin secretion, insulin sensitivity or the disposition index in response to the dietary interventions to lower saturated fat. Participants genetically predisposed to T2D have an impaired ability to compensate for peripheral insulin resistance with insulin secretion at baseline, but this does not modify the response to a reduction in dietary saturated fat through iso-energetic replacement with carbohydrate or monounsaturated fat.
Resumo:
Background: Autism spectrum conditions have a strong genetic component. Atypical sensory sensitivities are one of the core but neglected features of autism spectrum conditions. GABRB3 is a well-characterised candidate gene for autism spectrum conditions. In mice, heterozygous Gabrb3 deletion is associated with increased tactile sensitivity. However, no study has examined if tactile sensitivity is associated with GABRB3 genetic variation in humans. To test this, we conducted two pilot genetic association studies in the general population, analysing two phenotypic measures of tactile sensitivity (a parent-report and a behavioural measure) for association with 43 SNPs in GABRB3. Findings: Across both tactile sensitivity measures, three SNPs (rs11636966, rs8023959 and rs2162241) were nominally associated with both phenotypes, providing a measure of internal validation. Parent-report scores were nominally associated with six SNPs (P <0.05). Behaviourally measured tactile sensitivity was nominally associated with 10 SNPs (three after Bonferroni correction). Conclusions: This is the first human study to show an association between GABRB3 variation and tactile sensitivity. This provides support for the evidence from animal models implicating the role of GABRB3 variation in the atypical sensory sensitivity in autism spectrum conditions. Future research is underway to directly test this association in cases of autism spectrum conditions.
Resumo:
Multilocus digenic linkage disequilibria (LD) and their population structure were investigated in eleven landrace populations of barley (Hordeum vulgare ssp. vulgare L.) in Sardinia, using 134 dominant simple-sequence amplified polymorphism markers. The analysis of molecular variance for these markers indicated that the populations were partially differentiated (F ST = 0.18), and clustered into three geographic areas. Consistent with this population pattern, STRUCTURE analysis allocated individuals from a bulk of all populations into four genetic groups, and these groups also showed geographic patterns. In agreement with other molecular studies in barley, the general level of LD was low (13 % of locus pairs, with P < 0.01) in the bulk of 337 lines, and decayed steeply with map distance between markers. The partitioning of multilocus associations into various components indicated that genetic drift and founder effects played a major role in determining the overall genetic makeup of the diversity in these landrace populations, but that epistatic homogenising or diversifying selection was also present. Notably, the variance of the disequilibrium component was relatively high, which implies caution in the pooling of barley lines for association studies. Finally, we compared the analyses of multilocus structure in barley landrace populations with parallel analyses in both composite crosses of barley on the one hand and in natural populations of wild barley on the other. Neither of these serves as suitable mimics of landraces in barley, which require their own study. Overall, the results suggest that these populations can be exploited for LD mapping if population structure is controlled.