902 resultados para Genome-wide association study


Relevância:

100.00% 100.00%

Publicador:

Resumo:

INTRODUCTION Although the high heritability of BMD variation has long been established, few genes have been conclusively shown to affect the variation of BMD in the general population. Extreme truncate selection has been proposed as a more powerful alternative to unselected cohort designs in quantitative trait association studies. We sought to test these theoretical predictions in studies of the bone densitometry measures BMD, BMC, and femoral neck area, by investigating their association with members of the Wnt pathway, some of which have previously been shown to be associated with BMD in much larger cohorts, in a moderate-sized extreme truncate selected cohort (absolute value BMD Z-scores = 1.5-4.0; n = 344). MATERIALS AND METHODS Ninety-six tag-single nucleotide polymorphism (SNPs) lying in 13 Wnt signaling pathway genes were selected to tag common genetic variation (minor allele frequency [MAF] > 5% with an r(2) > 0.8) within 5 kb of all exons of 13 Wnt signaling pathway genes. The genes studied included LRP1, LRP5, LRP6, Wnt3a, Wnt7b, Wnt10b, SFRP1, SFRP2, DKK1, DKK2, FZD7, WISP3, and SOST. Three hundred forty-four cases with either high or low BMD were genotyped by Illumina Goldengate microarray SNP genotyping methods. Association was tested either by Cochrane-Armitage test for dichotomous variables or by linear regression for quantitative traits. RESULTS Strong association was shown with LRP5, polymorphisms of which have previously been shown to influence total hip BMD (minimum p = 0.0006). In addition, polymorphisms of the Wnt antagonist, SFRP1, were significantly associated with BMD and BMC (minimum p = 0.00042). Previously reported associations of LRP1, LRP6, and SOST with BMD were confirmed. Two other Wnt pathway genes, Wnt3a and DKK2, also showed nominal association with BMD. CONCLUSIONS This study shows that polymorphisms of multiple members of the Wnt pathway are associated with BMD variation. Furthermore, this study shows in a practical trial that study designs involving extreme truncate selection and moderate sample sizes can robustly identify genes of relevant effect sizes involved in BMD variation in the general population. This has implications for the design of future genome-wide studies of quantitative bone phenotypes relevant to osteoporosis.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

A comprehensive analysis was conducted using 48 sorghum QTL studies published from 1995 to 2010 to make information from historical sorghum QTL experiments available in a form that could be more readily used by sorghum researchers and plant breeders. In total, 771 QTL relating to 161 unique traits from 44 studies were projected onto a sorghum consensus map. Confidence intervals (CI) of QTL were estimated so that valid comparisons could be made between studies. The method accounted for the number of lines used and the phenotypic variation explained by individual QTL from each study. In addition, estimated centimorgan (cM) locations were calculated for the predicted sorghum gene models identified in Phytozome (JGI GeneModels SBI v1.4) and compared with QTL distribution genome-wide, both on genetic linkage (cM) and physical (base-pair/bp) map scales. QTL and genes were distributed unevenly across the genome. Heterochromatic enrichment for QTL was observed, with approximately 22% of QTL either entirely or partially located in the heterochromatic regions. Heterochromatic gene enrichment was also observed based on their predicted cM locations on the sorghum consensus map, due to suppressed recombination in heterochromatic regions, in contrast to the euchromatic gene enrichment observed on the physical, sequence-based map. The finding of high gene density in recombination-poor regions, coupled with the association with increased QTL density, has implications for the development of more efficient breeding systems in sorghum to better exploit heterosis. The projected QTL information described, combined with the physical locations of sorghum sequence-based markers and predicted gene models, provides sorghum researchers with a useful resource for more detailed analysis of traits and development of efficient marker-assisted breeding strategies.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Cardiovascular diseases (CVD) are major contributors to morbidity and mortality worldwide. Several interacting environmental, biochemical, and genetic risk factors can increase disease susceptibility. While some of the genes involved in the etiology of CVD are known, many are yet to be discovered. During the last few decades, scientists have searched for these genes with genome-wide linkage and association methods, and with more targeted candidate gene studies. This thesis investigates variation within the upstream transcription factor 1 (USF1) gene locus in relation to CVD risk factors, atherosclerosis, and incidence and prevalence of CVD. This candidate gene was first identified in Finnish families ascertained for familial combined hyperlipidemia, a common dyslipidemia predisposing to coronary heart disease. The gene is a ubiquitously expressed transcription factor regulating expression of several genes from lipid and glucose metabolism, inflammation, and endothelial function. First, we examined association between USF1 variants and several CVD risk factors, such as lipid phenotypes, body composition measures, and metabolic syndrome, in two prospective population cohorts. Our data suggested that USF1 contributes to these CVD risk factors at the population level. Notably, the associations with quantitative measurements were mostly detected among study subjects with CVD or metabolic syndrome, suggesting complex interactions between USF1 effects and the pathophysiological state of an individual. Second, we investigated how variation at the USF1 locus contributes to atherosclerotic lesions of the coronary arteries and abdominal aorta. For this, we used two study samples of middle-aged men with detailed measurements of atherosclerosis obtained in autopsy. USF1 variation significantly associated with areas of several types of lesions, especially with calcification of the arteries. Next, we tested what effect the USF1 risk variants have on sudden cardiac death and incidence of CVD. The atherosclerosis-associated risk variant increased the risk of sudden cardiac death of the same study subjects. Furthermore, USF1 alleles associated with incidence of CVD in the Finnish population follow-up cohorts. These associations were especially prominent among women, suggesting a sex specific effect, which has also been detected in subsequent studies. Finally, as some of the low-yield DNA samples of the Finnish follow-up study cohort needed to be whole-genome amplified (WGA) prior to genotyping, we evaluated whether the produced WGA genotypes were of good quality. Although the samples giving genotype discrepancies could not be detected before genotyping with standard laboratory quality control methods, our results suggested that enhanced quality control at the time of the genotyping could identify such samples. In addition, combining two WGA reactions into one pooled DNA sample for genotyping markedly reduced the number of discrepancies and samples showing them. In conclusion, USF1 seems to have a role in the etiology of CVD. Additional studies are warranted to identify functional variants and to study interactions between USF1 and other genetic or environmental factors. This USF1 study, and other studies with low DNA yield of some samples, can benefit from whole genome amplification of the low-yield samples prior to genotyping. Careful quality control procedures are, however, needed in WGA genotyping.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Migraine is a highly prevalent disease, and despite several important breakthroughs there are still a many questions unanswered in the clinical, genetic and pathophysiological aspects of migraine research. Migraine has been linked to several other diseases such as epilepsy and stroke, but there are still unsolved issues concerning the true nature of these associations. Three genes predisposing to hemiplegic migraine and several loci associated to migraine have been identified, but so far no genes responsible for common forms of migraine have been recognized. Triptans have provided an important step in migraine treatment, but their usefulness in rare forms of migraine have been controversial. The Finnish Migraine Gene Project (FMGP) includes more than 1600 families and 7500 individuals. We evaluated comorbidity from 1000 consecutive subjects in the FMGP. To search for novel loci, we performed a genome-wide linkage scan in 36 families with high prevalences of migraine with visual aura. We collected 76 subjects from the FMGP who suffer from hemiplegic migraine and have used triptans. Finally, to study possible links between stroke and migraine we evaluated the prevalence of migraine in subjects with cervical artery dissection (CAD) and healthy controls. Migraine was associated with increased prevalence of allergy, hypotension and psychiatric diseases. Additionally, men suffering from migraine with aura had increased prevalence of epilepsy and stroke. Further evidence of association between migraine and epilepsy was found in our linkage study. The parametric two-point linkage analysis showed significant evidence of linkage between migraine aura and a locus on 9q21-q22. Interestingly, the same locus has been associated with occipitotemporal epilepsy. CAD seems to be a migraine risk factor, and therefore a link between stroke and migraine. Notably, CAD seems to alleviate migraine activity further indicating the association between these two conditions. Despite the contraindications of triptans, it seems that they are safe and effective in the abortive treatment of hemiplegic migraine.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Association studies of quantitative traits have often relied on methods in which a normal distribution of the trait is assumed. However, quantitative phenotypes from complex human diseases are often censored, highly skewed, or contaminated with outlying values. We recently developed a rank-based association method that takes into account censoring and makes no distributional assumptions about the trait. In this study, we applied our new method to age-at-onset data on ALDX1 and ALDX2. Both traits are highly skewed (skewness > 1.9) and often censored. We performed a whole genome association study of age at onset of the ALDX1 trait using Illumina single-nucleotide polymorphisms. Only slightly more than 5% of markers were significant. However, we identified two regions on chromosomes 14 and 15, which each have at least four significant markers clustering together. These two regions may harbor genes that regulate age at onset of ALDX1 and ALDX2. Future fine mapping of these two regions with densely spaced markers is warranted.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

We formed the GEnetics of Nephropathy–an International Effort (GENIE) consortium to examine previously reported genetic associations with diabetic nephropathy (DN) in type 1 diabetes. GENIE consists of 6,366 similarly ascertained participants of European ancestry with type 1 diabetes, with and without DN, from the All Ireland-Warren 3-Genetics of Kidneys in Diabetes U.K. and Republic of Ireland (U.K.-R.O.I.) collection and the Finnish Diabetic Nephropathy Study (FinnDiane), combined with reanalyzed data from the Genetics of Kidneys in Diabetes U.S. Study (U.S. GoKinD). We found little evidence for the association of the EPO promoter polymorphism, rs161740, with the combined phenotype of proliferative retinopathy and end-stage renal disease in U.K.-R.O.I. (odds ratio [OR] 1.14, P = 0.19) or FinnDiane (OR 1.06, P = 0.60). However, a fixed-effects meta-analysis that included the previously reported cohorts retained a genome-wide significant association with that phenotype (OR 1.31, P = 2 × 10-9). An expanded investigation of the ELMO1 locus and genetic regions reported to be associated with DN in the U.S. GoKinD yielded only nominal statistical significance for these loci. Finally, top candidates identified in a recent meta-analysis failed to reach genome-wide significance. In conclusion, we were unable to replicate most of the previously reported genetic associations for DN, and significance for the EPO promoter association was attenuated.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Background: Association mapping, initially developed in human disease genetics, is now being applied to plant species. The model species Arabidopsis provided some of the first examples of association mapping in plants, identifying previously cloned flowering time genes, despite high population sub-structure. More recently, association genetics has been applied to barley, where breeding activity has resulted in a high degree of population sub-structure. A major genotypic division within barley is that between winter- and spring-sown varieties, which differ in their requirement for vernalization to promote subsequent flowering. To date, all attempts to validate association genetics in barley by identifying major flowering time loci that control vernalization requirement (VRN-H1 and VRN-H2) have failed. Here, we validate the use of association genetics in barley by identifying VRN-H1 and VRN-H2, despite their prominent role in determining population sub-structure. Results: By taking barley as a typical inbreeding crop, and seasonal growth habit as a major partitioning phenotype, we develop an association mapping approach which successfully identifies VRN-H1 and VRN-H2, the underlying loci largely responsible for this agronomic division. We find a combination of Structured Association followed by Genomic Control to correct for population structure and inflation of the test statistic, resolved significant associations only with VRN-H1 and the VRN-H2 candidate genes, as well as two genes closely linked to VRN-H1 (HvCSFs1 and HvPHYC). Conclusion: We show that, after employing appropriate statistical methods to correct for population sub-structure, the genome-wide partitioning effect of allelic status at VRN-H1 and VRN-H2 does not result in the high levels of spurious association expected to occur in highly structured samples. Furthermore, we demonstrate that both VRN-H1 and the candidate VRN-H2 genes can be identified using association mapping. Discrimination between intragenic VRN-H1 markers was achieved, indicating that candidate causative polymorphisms may be discerned and prioritised within a larger set of positive associations. This proof of concept study demonstrates the feasibility of association mapping in barley, even within highly structured populations. A major advantage of this method is that it does not require large numbers of genome-wide markers, and is therefore suitable for fine mapping and candidate gene evaluation, especially in species for which large numbers of genetic markers are either unavailable or too costly.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The publication of the human genome sequence in 2001 was a major step forward in knowledge necessary to understand the variations between individuals. For farmed species, genomic sequence information will facilitate the selection of animals optimised to live, and be productive, in particular environments. The availability of cattle genome sequence has allowed the breeding industry to take the first steps towards predicting phenotypes from genotypes by estimating a genomic breeding value (gEBV) for bulls using genome-wide DNA markers. The sequencing of the buffalo genome and creation of a panel of DNA markers has created the opportunity to apply molecular selection approaches for this species.The genomes of several buffalo of different breeds were sequenced and aligned with the bovine genome, which facilitated the identification of millions of sequence variants in the buffalo genomes. Based on frequencies of variants within and among buffalo breeds, and their distribution across the genome compared with the bovine genome, 90,000 putative single nucleotide polymorphisms (SNP) were selected to create an Axiom (R) Buffalo Genotyping Array 90K. This SNP Chip was tested in buffalo populations from Italy and Brazil and found to have at least 75% high quality and polymorphic markers in these populations. The 90K SNP chip was then used to investigate the structure of buffalo populations, and to localise the variations having a major effect on milk production.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

High-throughput assays, such as yeast two-hybrid system, have generated a huge amount of protein-protein interaction (PPI) data in the past decade. This tremendously increases the need for developing reliable methods to systematically and automatically suggest protein functions and relationships between them. With the available PPI data, it is now possible to study the functions and relationships in the context of a large-scale network. To data, several network-based schemes have been provided to effectively annotate protein functions on a large scale. However, due to those inherent noises in high-throughput data generation, new methods and algorithms should be developed to increase the reliability of functional annotations. Previous work in a yeast PPI network (Samanta and Liang, 2003) has shown that the local connection topology, particularly for two proteins sharing an unusually large number of neighbors, can predict functional associations between proteins, and hence suggest their functions. One advantage of the work is that their algorithm is not sensitive to noises (false positives) in high-throughput PPI data. In this study, we improved their prediction scheme by developing a new algorithm and new methods which we applied on a human PPI network to make a genome-wide functional inference. We used the new algorithm to measure and reduce the influence of hub proteins on detecting functionally associated proteins. We used the annotations of the Gene Ontology (GO) and the Kyoto Encyclopedia of Genes and Genomes (KEGG) as independent and unbiased benchmarks to evaluate our algorithms and methods within the human PPI network. We showed that, compared with the previous work from Samanta and Liang, our algorithm and methods developed in this study improved the overall quality of functional inferences for human proteins. By applying the algorithms to the human PPI network, we obtained 4,233 significant functional associations among 1,754 proteins. Further comparisons of their KEGG and GO annotations allowed us to assign 466 KEGG pathway annotations to 274 proteins and 123 GO annotations to 114 proteins with estimated false discovery rates of <21% for KEGG and <30% for GO. We clustered 1,729 proteins by their functional associations and made pathway analysis to identify several subclusters that are highly enriched in certain signaling pathways. Particularly, we performed a detailed analysis on a subcluster enriched in the transforming growth factor β signaling pathway (P<10-50) which is important in cell proliferation and tumorigenesis. Analysis of another four subclusters also suggested potential new players in six signaling pathways worthy of further experimental investigations. Our study gives clear insight into the common neighbor-based prediction scheme and provides a reliable method for large-scale functional annotations in this post-genomic era.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Understanding the complexities that are involved in the genetics of multifactorial diseases is still a monumental task. In addition to environmental factors that can influence the risk of disease, there is also a number of other complicating factors. Genetic variants associated with age of disease onset may be different from those variants associated with overall risk of disease, and variants may be located in positions that are not consistent with the traditional protein coding genetic paradigm. Latent Variable Models are well suited for the analysis of genetic data. A latent variable is one that we do not directly observe, but which is believed to exist or is included for computational or analytic convenience in a model. This thesis presents a mixture of methodological developments utilising latent variables, and results from case studies in genetic epidemiology and comparative genomics. Epidemiological studies have identified a number of environmental risk factors for appendicitis, but the disease aetiology of this oft thought useless vestige remains largely a mystery. The effects of smoking on other gastrointestinal disorders are well documented, and in light of this, the thesis investigates the association between smoking and appendicitis through the use of latent variables. By utilising data from a large Australian twin study questionnaire as both cohort and case-control, evidence is found for the association between tobacco smoking and appendicitis. Twin and family studies have also found evidence for the role of heredity in the risk of appendicitis. Results from previous studies are extended here to estimate the heritability of age-at-onset and account for the eect of smoking. This thesis presents a novel approach for performing a genome-wide variance components linkage analysis on transformed residuals from a Cox regression. This method finds evidence for a dierent subset of genes responsible for variation in age at onset than those associated with overall risk of appendicitis. Motivated by increasing evidence of functional activity in regions of the genome once thought of as evolutionary graveyards, this thesis develops a generalisation to the Bayesian multiple changepoint model on aligned DNA sequences for more than two species. This sensitive technique is applied to evaluating the distributions of evolutionary rates, with the finding that they are much more complex than previously apparent. We show strong evidence for at least 9 well-resolved evolutionary rate classes in an alignment of four Drosophila species and at least 7 classes in an alignment of four mammals, including human. A pattern of enrichment and depletion of genic regions in the profiled segments suggests they are functionally significant, and most likely consist of various functional classes. Furthermore, a method of incorporating alignment characteristics representative of function such as GC content and type of mutation into the segmentation model is developed within this thesis. Evidence of fine-structured segmental variation is presented.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Genetic research of complex diseases is a challenging, but exciting, area of research. The early development of the research was limited, however, until the completion of the Human Genome and HapMap projects, along with the reduction in the cost of genotyping, which paves the way for understanding the genetic composition of complex diseases. In this thesis, we focus on the statistical methods for two aspects of genetic research: phenotype definition for diseases with complex etiology and methods for identifying potentially associated Single Nucleotide Polymorphisms (SNPs) and SNP-SNP interactions. With regard to phenotype definition for diseases with complex etiology, we firstly investigated the effects of different statistical phenotyping approaches on the subsequent analysis. In light of the findings, and the difficulties in validating the estimated phenotype, we proposed two different methods for reconciling phenotypes of different models using Bayesian model averaging as a coherent mechanism for accounting for model uncertainty. In the second part of the thesis, the focus is turned to the methods for identifying associated SNPs and SNP interactions. We review the use of Bayesian logistic regression with variable selection for SNP identification and extended the model for detecting the interaction effects for population based case-control studies. In this part of study, we also develop a machine learning algorithm to cope with the large scale data analysis, namely modified Logic Regression with Genetic Program (MLR-GEP), which is then compared with the Bayesian model, Random Forests and other variants of logic regression.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Prostate cancer is the most frequently diagnosed cancer in males in developed countries. To identify common prostate cancer susceptibility alleles, we genotyped 211,155 SNPs on a custom Illumina array (iCOGS) in blood DNA from 25,074 prostate cancer cases and 24,272 controls from the international PRACTICAL Consortium. Twenty-three new prostate cancer susceptibility loci were identified at genome-wide significance (P < 5 × 10−8). More than 70 prostate cancer susceptibility loci, explaining ~30% of the familial risk for this disease, have now been identified. On the basis of combined risks conferred by the new and previously known risk loci, the top 1% of the risk distribution has a 4.7-fold higher risk than the average of the population being profiled. These results will facilitate population risk stratification for clinical studies.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The power of testing for a population-wide association between a biallelic quantitative trait locus and a linked biallelic marker locus is predicted both empirically and deterministically for several tests. The tests were based on the analysis of variance (ANOVA) and on a number of transmission disequilibrium tests (TDT). Deterministic power predictions made use of family information, and were functions of population parameters including linkage disequilibrium, allele frequencies, and recombination rate. Deterministic power predictions were very close to the empirical power from simulations in all scenarios considered in this study. The different TDTs had very similar power, intermediate between one-way and nested ANOVAs. One-way ANOVA was the only test that was not robust against spurious disequilibrium. Our general framework for predicting power deterministically can be used to predict power in other association tests. Deterministic power calculations are a powerful tool for researchers to plan and evaluate experiments and obviate the need for elaborate simulation studies.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Aim: To describe the recruitment, ophthalmic examination methods and distribution of ocular biometry of participants in the Norfolk Island Eye Study, who were individuals descended from the English Bounty mutineers and their Polynesian wives. Methods: All 1,275 permanent residents of Norfolk Island aged over 15 years were invited to participate, including 602 individuals involved in a 2001 cardiovascular disease study. Participants completed a detailed questionnaire and underwent a comprehensive eye assessment including stereo disc and retinal photography, ocular coherence topography and conjunctival autofluorescence assessment. Additionally, blood or saliva was taken for DNA testing. Results: 781 participants aged over 15 years were seen (54% female), comprising 61% of the permanent Island population. 343 people (43.9%) could trace their family history to the Pitcairn Islanders (Norfolk Island Pitcairn Pedigree). Mean anterior chamber depth was 3.32mm, mean axial length (AL) was 23.5mm, and mean central corneal thickness was 546 microns. There were no statistically significant differences in these characteristics between persons with and without Pitcairn Island ancestry. Mean intra-ocular pressure was lower in people with Pitcairn Island ancestry: 15.89mmHg compared to those without Pitcairn Island ancestry 16.49mmHg (P = .007). The mean keratometry value was lower in people with Pitcairn Island ancestry (43.22 vs. 43.52, P = .007). The corneas were flatter in people of Pitcairn ancestry but there was no corresponding difference in AL or refraction. Conclusion: Our study population is highly representative of the permanent population of Norfolk Island. Ocular biometry was similar to that of other white populations. Heritability estimates, linkage analysis and genome-wide studies will further elucidate the genetic determinants of chronic ocular diseases in this genetic isolate.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

We conducted an association study across the human leukocyte antigen (HLA) complex to identify loci associated with multiple sclerosis (MS). Comparing 1927 SNPs in 1618 MS cases and 3413 controls of European ancestry, we identified seven SNPs that were independently associated with MS conditional on the others (each ). All associations were significant in an independent replication cohort of 2212 cases and 2251 controls () and were highly significant in the combined dataset (). The associated SNPs included proxies for HLA-DRB1*15:01 and HLA-DRB1*03:01, and SNPs in moderate linkage disequilibrium (LD) with HLA-A*02:01, HLA-DRB1*04:01 and HLA-DRB1*13:03. We also found a strong association with rs9277535 in the class II gene HLA-DPB1 (discovery set , replication set , combined ). HLA-DPB1 is located centromeric of the more commonly typed class II genes HLA-DRB1, -DQA1 and -DQB1. It is separated from these genes by a recombination hotspot, and the association is not affected by conditioning on genotypes at DRB1, DQA1 and DQB1. Hence rs9277535 represents an independent MS-susceptibility locus of genome-wide significance. It is correlated with the HLA-DPB1*03:01 allele, which has been implicated previously in MS in smaller studies. Further genotyping in large datasets is required to confirm and resolve this association.