Digital Library

932 results for deviance information criteria, model averaging, MCMC, genomewide association studies, epistasis, logistic regression, stochastic search algorithm, case-control studies, Type I diabetes, single nucleotide polymorphism, gene expression programming

A brief guide to model selection, multimodel inference and model averaging in behavioural ecology using Akaike's information criterion

A genomewide association mapping study using ultrasound-scanned information identifies potential genomic regions and candidate genes affecting carcass traits in Nellore cattle

Abstract:

The aim of this study was to identify candidate genes and genomic regions associated with ultrasound-derived measurements of the rib-eye area (REA), backfat thickness (BFT) and rumpfat thickness (RFT) in Nellore cattle. Data from 640 Nellore steers and young bulls with genotypes for 290 863 single nucleotide polymorphisms (SNPs) were used for genomewide association mapping. Significant SNP associations were explored to find possible candidate genes related to physiological processes. Several of the significant markers detected were mapped onto functional candidate genes including ARFGAP3, CLSTN2 and DPYD for REA; OSBPL3 and SUDS3 for BFT; and RARRES1 and VEPH1 for RFT. The physiological pathway related to lipid metabolism (CLSTN2, OSBPL3, RARRES1 and VEPH1) was identified. The significant markers within previously reported QTLs reinforce the importance of the genomic regions, and the other loci offer candidate genes that have not been related to carcass traits in previous investigations.

Using information criteria to select the correct variance–covariance structure for longitudinal data in ecology

Abstract:

1. Ecological data sets often use clustered measurements or use repeated sampling in a longitudinal design. Choosing the correct covariance structure is an important step in the analysis of such data, as the covariance describes the degree of similarity among the repeated observations. 2. Three methods for choosing the covariance are: the Akaike information criterion (AIC), the quasi-information criterion (QIC), and the deviance information criterion (DIC). We compared the methods using a simulation study and using a data set that explored effects of forest fragmentation on avian species richness over 15 years. 3. The overall success was 80.6% for the AIC, 29.4% for the QIC and 81.6% for the DIC. For the forest fragmentation study the AIC and DIC selected the unstructured covariance, whereas the QIC selected the simpler autoregressive covariance. Graphical diagnostics suggested that the unstructured covariance was probably correct. 4. We recommend using DIC for selecting the correct covariance structure.
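
As a rough illustration of the AIC comparison described in this abstract, the sketch below ranks candidate covariance structures by AIC = 2k − 2 ln L, where k is the number of covariance parameters. The log-likelihood values, the four-occasion design and the function names are hypothetical placeholders, not results from the study. QIC and DIC need fitted GEE or MCMC output and are not shown.

```python
def aic(log_likelihood: float, n_params: int) -> float:
    """Akaike information criterion: smaller values indicate a better trade-off."""
    return 2 * n_params - 2 * log_likelihood


def n_cov_params(structure: str, n_times: int) -> int:
    """Number of covariance parameters for a few common structures."""
    if structure == "independence":
        return 1                              # one shared variance
    if structure == "autoregressive":
        return 2                              # variance + AR(1) correlation
    if structure == "unstructured":
        return n_times * (n_times + 1) // 2   # every variance and covariance
    raise ValueError(f"unknown structure: {structure}")


# Hypothetical design and maximised log-likelihoods, for illustration only.
N_TIMES = 4
HYPOTHETICAL_LOGLIK = {
    "independence": -520.0,
    "autoregressive": -498.5,
    "unstructured": -485.0,
}

# The fixed-effect parameters are the same in every model, so they are left
# out of k here; including them would shift all scores by the same constant.
scores = {
    name: aic(loglik, n_cov_params(name, N_TIMES))
    for name, loglik in HYPOTHETICAL_LOGLIK.items()
}
best = min(scores, key=scores.get)

for name, score in sorted(scores.items(), key=lambda item: item[1]):
    print(f"{name:15s} AIC = {score:.1f}")
print("selected:", best)
```

With these made-up numbers the unstructured covariance wins despite its larger parameter count, because its gain in log-likelihood outweighs the 2k penalty.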

BAYESIAN MODEL SEARCH AND MULTILEVEL INFERENCE FOR SNP ASSOCIATION STUDIES.

Abstract:

Technological advances in genotyping have given rise to hypothesis-based association studies of increasing scope. As a result, the scientific hypotheses addressed by these studies have become more complex and more difficult to address using existing analytic methodologies. Obstacles to analysis include inference in the face of multiple comparisons, complications arising from correlations among the SNPs (single nucleotide polymorphisms), choice of their genetic parametrization and missing data. In this paper we present an efficient Bayesian model search strategy that searches over the space of genetic markers and their genetic parametrization. The resulting method for Multilevel Inference of SNP Associations, MISA, allows computation of multilevel posterior probabilities and Bayes factors at the global, gene and SNP level, with the prior distribution on SNP inclusion in the model providing an intrinsic multiplicity correction. We use simulated data sets to characterize MISA's statistical power, and show that MISA has higher power to detect association than standard procedures. Using data from the North Carolina Ovarian Cancer Study (NCOCS), MISA identifies variants that were not identified by standard methods and have been externally "validated" in independent studies. We examine sensitivity of the NCOCS results to prior choice and method for imputing missing data. MISA is available in an R package on CRAN.

Genomewide association for a dominant pigmentation gene in sheep

Abstract:

Most published genomewide association studies (GWAS) in sheep have investigated recessively inherited monogenic traits. The objective here was to assess the feasibility of performing GWAS for a dominant trait for which the genetic basis was already known. A total of 42 Manchega and Rasa Aragonesa sheep that segregate solid black or white coat pigmentation were genotyped using the SNP50 BeadChip. Previous analysis in Manchegas demonstrated a complete association between the pigmentation trait and alleles of the MC1R gene, setting an a priori expectation for GWAS. Multiple methods were used to identify and quantify the strength of population substructure between black and white animals, before allelic association testing was performed for 49 034 SNPs. Following correction for substructure, GWAS showed that the most strongly associated SNP (s26449) was also the closest to the MC1R gene. The finding was strongly supported by the permutation tree-based random forest (RF) analysis. Importantly, GWAS identified unlinked SNPs with only slightly higher p-values than that of s26449. Random forest analysis indicated these were false positives, suggesting that interpretation based on both approaches was beneficial. The results indicate that a combined analytical approach can be successful in studies where only a modest number of animals is available and substantial population stratification exists.

Increasing the power of genome wide association studies in natural populations using repeated measures: evaluation and implementation

Abstract:

1. Genomewide association studies (GWAS) enable detailed dissections of the genetic basis for organisms' ability to adapt to a changing environment. In long-term studies of natural populations, individuals are often marked at one point in their life and then repeatedly recaptured. It is therefore essential that a method for GWAS includes the process of repeated sampling. In a GWAS, the effects of thousands of single-nucleotide polymorphisms (SNPs) need to be fitted and any model development is constrained by the computational requirements. A method is therefore required that can fit a highly hierarchical model and at the same time is computationally fast enough to be useful. 2. Our method fits fixed SNP effects in a linear mixed model that can include both random polygenic effects and permanent environmental effects. In this way, the model can correct for population structure and model repeated measures. The covariance structure of the linear mixed model is first estimated and subsequently used in a generalized least squares setting to fit the SNP effects. The method was evaluated in a simulation study based on observed genotypes from a long-term study of collared flycatchers in Sweden. 3. The method we present here was successful in estimating permanent environmental effects from simulated repeated measures data. Additionally, we found that especially for variable phenotypes having large variation between years, the repeated measurements model has a substantial increase in power compared to a model using average phenotypes as a response. 4. The method is available in the R package RepeatABEL. It increases the power in GWAS having repeated measures, especially for long-term studies of natural populations, and the R implementation is expected to facilitate modelling of longitudinal data for studies of both animal and human populations.

Nonparametric Evaluation of Quantitative Traits in Population-Based Association Studies when the Genetic Model is Unknown

Abstract:

Statistical association between a single nucleotide polymorphism (SNP) genotype and a quantitative trait in genome-wide association studies is usually assessed using a linear regression model, or, in the case of non-normally distributed trait values, using the Kruskal-Wallis test. While linear regression models assume an additive mode of inheritance via equidistant genotype scores, the Kruskal-Wallis test merely tests global differences in trait values associated with the three genotype groups. Both approaches thus exhibit suboptimal power when the underlying inheritance mode is dominant or recessive. Furthermore, these tests do not perform well in the common situations when only a few trait values are available in a rare genotype category (imbalance), or when the values associated with the three genotype categories exhibit unequal variance (variance heterogeneity). We propose a maximum test based on a Marcus-type multiple contrast test for relative effect sizes. This test allows model-specific testing of either dominant, additive or recessive mode of inheritance, and it is robust against variance heterogeneity. We show how to obtain mode-specific simultaneous confidence intervals for the relative effect sizes to aid in interpreting the biological relevance of the results. Further, we discuss the use of a related all-pairwise comparisons contrast test with range-preserving confidence intervals as an alternative to the Kruskal-Wallis heterogeneity test. We applied the proposed maximum test to the Bogalusa Heart Study dataset, and gained a remarkable increase in the power to detect association, particularly for rare genotypes. Our simulation study also demonstrated that the proposed non-parametric tests control the family-wise error rate in the presence of non-normality and variance heterogeneity, contrary to the standard parametric approaches. We provide a publicly available R library nparcomp that can be used to estimate simultaneous confidence intervals or compatible multiplicity-adjusted p-values associated with the proposed maximum test.

Genome-wide association studies

Abstract:

Genome-wide association studies (GWAS) are a powerful hypothesis-free tool for the dissection of susceptibility to common heritable human diseases, including osteoporosis. To date, more than 2000 loci for common human diseases have been identified by GWAS. Success using the GWAS model depends on genetic risk being determined by shared stretches of DNA carried with different frequencies in cases and controls, inherited from ancient ancestors, termed the “common disease–common variant” hypothesis. Not all disease risk is caused by common variants, however, and thus GWAS will not detect all variants involved. Successful GWAS performance requires careful quality control, especially as the effect sizes under study are modest, and there are multiple potential sources of error. Conservative interpretation, use of stringent significance thresholds, and replication in independent cohorts are required to ensure results are robust. Despite these challenging parameters, much has been learnt from GWAS and, as the approach matures and is modified to identify a wider range of variants, significantly more will be learnt about the etiopathogenesis of common diseases such as osteoporosis.

A versatile gene-based test for genome-wide association studies

Abstract:

We have derived a versatile gene-based test for genome-wide association studies (GWAS). Our approach, called VEGAS (versatile gene-based association study), is applicable to all GWAS designs, including family-based GWAS, meta-analyses of GWAS on the basis of summary data, and DNA-pooling-based GWAS, where existing approaches based on permutation are not possible, as well as singleton data, where they are. The test incorporates information from a full set of markers (or a defined subset) within a gene and accounts for linkage disequilibrium between markers by using simulations from the multivariate normal distribution. We show that for an association study using singletons, our approach produces results equivalent to those obtained via permutation in a fraction of the computation time. We demonstrate proof-of-principle by using the gene-based test to replicate several genes known to be associated on the basis of results from a family-based GWAS for height in 11,536 individuals and a DNA-pooling-based GWAS for melanoma in approximately 1300 cases and controls. Our method has the potential to identify novel associated genes; provide a basis for selecting SNPs for replication; and be directly used in network (pathway) approaches that require per-gene association test statistics. We have implemented the approach in both an easy-to-use web interface, which only requires the uploading of markers with their association p-values, and a separate downloadable application.
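
The idea of accounting for linkage disequilibrium by simulating from a multivariate normal distribution can be sketched as follows. This is a minimal standard-library illustration, not the VEGAS implementation: it assumes the gene-level statistic is the sum of per-SNP 1-df chi-square statistics, and the function names, the LD matrix and the add-one p-value estimator are all choices made for this example.

```python
import math
import random


def cholesky(matrix):
    """Lower-triangular factor L with L * L^T == matrix (positive definite input)."""
    n = len(matrix)
    lower = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1):
            partial = sum(lower[i][k] * lower[j][k] for k in range(j))
            if i == j:
                lower[i][j] = math.sqrt(matrix[i][i] - partial)
            else:
                lower[i][j] = (matrix[i][j] - partial) / lower[j][j]
    return lower


def gene_based_p(snp_chi2, ld, n_sim=10_000, seed=0):
    """Empirical p-value for the sum of per-SNP chi-square statistics.

    Null statistics are simulated as sum(z_i^2) with z ~ N(0, ld), so that
    correlated SNPs are not counted as independent evidence.
    """
    rng = random.Random(seed)
    lower = cholesky(ld)
    n_snps = len(snp_chi2)
    observed = sum(snp_chi2)
    hits = 0
    for _ in range(n_sim):
        noise = [rng.gauss(0.0, 1.0) for _ in range(n_snps)]
        # z = L * noise has covariance equal to the LD matrix
        z = [sum(lower[i][k] * noise[k] for k in range(i + 1)) for i in range(n_snps)]
        if sum(value * value for value in z) >= observed:
            hits += 1
    # add-one estimator keeps the p-value strictly above zero
    return (hits + 1) / (n_sim + 1)


# Hypothetical pairwise LD (correlation) matrix for three SNPs in one gene.
LD = [
    [1.0, 0.8, 0.2],
    [0.8, 1.0, 0.3],
    [0.2, 0.3, 1.0],
]

print(gene_based_p([6.5, 5.9, 0.4], LD))
```

Only marker-level statistics and an LD estimate are needed as input, which is consistent with the abstract's point that the approach can work from summary data where permutation is not possible.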

Identification of a novel FGFRL1 MicroRNA target site polymorphism for bone mineral density in meta-analyses of genome-wide association studies

Abstract:

MicroRNAs (miRNAs) are critical post-transcriptional regulators. Based on a previous genome-wide association (GWA) scan, we conducted a polymorphism in microRNAs' Target Sites (poly-miRTS)-centric multistage meta-analysis for lumbar spine (LS)-, total hip (HIP)-, and femoral neck (FN)-bone mineral density (BMD). In stage I, 41,102 poly-miRTSs were meta-analyzed in 7 cohorts with a genome-wide significance (GWS) α = 0.05/41,102 = 1.22×10⁻⁶. By applying α = 5×10⁻⁵ (suggestive significance), 11 poly-miRTSs were selected, with FGFRL1 rs4647940 and PRR5 rs3213550 as top signals for FN-BMD (P-value = 7.67×10⁻⁶ and 1.58×10⁻⁵) in the gender-combined sample. In stage II in silico replication (two cohorts), FGFRL1 rs4647940 was the only signal marginally replicated for FN-BMD (P-value = 5.08×10⁻³) at α = 0.10/11 = 9.09×10⁻³. PRR5 rs3213550 was also selected based on biological significance. In stage III de novo genotyping replication (two cohorts), FGFRL1 rs4647940 was the only signal significantly replicated for FN-BMD (P-value = 7.55×10⁻⁶) at α = 0.05/2 = 0.025 in the gender-combined sample. Aggregating three stages, FGFRL1 rs4647940 was the single stage I-discovered and stages II- and III-replicated signal attaining GWS for FN-BMD (P-value = 8.87×10⁻¹²). Dual-luciferase reporter assays demonstrated that the FGFRL1 3' untranslated region harboring rs4647940 appears to be hsa-miR-140-5p's target site. In a zebrafish microinjection experiment, dre-miR-140-5p is shown to exert a dramatic impact on craniofacial skeleton formation. Taken together, we provided functional evidence for a novel FGFRL1 poly-miRTS rs4647940 in a previously known 4p16.3 locus, and experimental and clinical genetics studies have shown both FGFRL1 and hsa-miR-140-5p are important for bone formation. © The Author 2015. Published by Oxford University Press. All rights reserved.
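
The stage-wise significance thresholds quoted in this abstract are Bonferroni-style corrections (a nominal α divided by the number of tests). The short sketch below recomputes them and checks the reported FGFRL1 rs4647940 P-values against each one. The helper name and dictionary layout are illustrative. All numbers are taken from the abstract.

```python
def bonferroni_threshold(alpha: float, n_tests: int) -> float:
    """Per-test significance threshold after Bonferroni correction."""
    return alpha / n_tests


# (nominal alpha, number of tests, reported P-value for rs4647940, FN-BMD)
stages = {
    "stage II (in silico replication)": (0.10, 11, 5.08e-3),
    "stage III (de novo genotyping)": (0.05, 2, 7.55e-6),
    "combined (genome-wide significance)": (0.05, 41_102, 8.87e-12),
}

results = {}
for name, (alpha, n_tests, p_value) in stages.items():
    threshold = bonferroni_threshold(alpha, n_tests)
    results[name] = (threshold, p_value < threshold)
    print(f"{name}: threshold = {threshold:.2e}, P = {p_value:.2e}, "
          f"significant = {p_value < threshold}")

# In stage I the signal passed only the suggestive cutoff, not genome-wide significance.
stage1_gws = bonferroni_threshold(0.05, 41_102)
stage1_p = 7.67e-6
print("stage I suggestive:", stage1_p < 5e-5, "| genome-wide:", stage1_p < stage1_gws)
```

The recomputed thresholds round to the values given in the abstract (1.22×10⁻⁶, 9.09×10⁻³ and 0.025).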

Genetic variants influencing human aging from late-onset Alzheimer's disease (LOAD) genome-wide association studies (GWAS)

Abstract:

Genetics plays a crucial role in human aging with up to 30% of those living to the mid-80s being determined by genetic variation. Survival to older ages likely entails an even greater genetic contribution. There is increasing evidence that genes implicated in age-related diseases, such as cancer and neuronal disease, play a role in affecting human life span. We have selected the 10 most promising late-onset Alzheimer's disease (LOAD) susceptibility genes identified through several recent large genome-wide association studies (GWAS). These 10 LOAD genes (APOE, CLU, PICALM, CR1, BIN1, ABCA7, MS4A6A, CD33, CD2AP, and EPHA1) have been tested for association with human aging in our dataset (1385 samples with documented age at death [AAD], age range: 58-108 years; mean age at death: 80.2) using the most significant single nucleotide polymorphisms (SNPs) found in the previous studies. Apart from the APOE locus (rs2075650) which showed compelling evidence of association with risk on human life span (p = 5.27 × 10⁻⁴), none of the other LOAD gene loci demonstrated significant evidence of association. In addition to examining the known LOAD genes, we carried out analyses using age at death as a quantitative trait. No genome-wide significant SNPs were discovered. Increasing sample size and statistical power will be imperative to detect genuine aging-associated variants in the future. In this report, we also discuss issues relating to the analysis of genome-wide association studies data from different centers and the bioinformatic approach required to distinguish spurious genome-wide significant signals from real SNP associations.

Meta-analysis of three genome-wide association studies identifies susceptibility loci for colorectal cancer at 1q41, 3q26.2, 12q13.13 and 20q13.33.

Abstract:

Genome-wide association studies (GWAS) have identified ten loci harboring common variants that influence risk of developing colorectal cancer (CRC). To enhance the power to identify additional CRC risk loci, we conducted a meta-analysis of three GWAS from the UK which included a total of 3,334 affected individuals (cases) and 4,628 controls followed by multiple validation analyses including a total of 18,095 cases and 20,197 controls. We identified associations at four new CRC risk loci: 1q41 (rs6691170, odds ratio (OR) = 1.06, P = 9.55 × 10⁻¹⁰ and rs6687758, OR = 1.09, P = 2.27 × 10⁻⁹), 3q26.2 (rs10936599, OR = 0.93, P = 3.39 × 10⁻⁸), 12q13.13 (rs11169552, OR = 0.92, P = 1.89 × 10⁻¹⁰ and rs7136702, OR = 1.06, P = 4.02 × 10⁻⁸) and 20q13.33 (rs4925386, OR = 0.93, P = 1.89 × 10⁻¹⁰). In addition to identifying new CRC risk loci, this analysis provides evidence that additional CRC-associated variants of similar effect size remain to be discovered.