973 resultados para Association studies Genotypes
Resumo:
High-throughput SNP arrays provide estimates of genotypes for up to one million loci, often used in genome-wide association studies. While these estimates are typically very accurate, genotyping errors do occur, which can influence in particular the most extreme test statistics and p-values. Estimates for the genotype uncertainties are also available, although typically ignored. In this manuscript, we develop a framework to incorporate these genotype uncertainties in case-control studies for any genetic model. We verify that using the assumption of a “local alternative” in the score test is very reasonable for effect sizes typically seen in SNP association studies, and show that the power of the score test is simply a function of the correlation of the genotype probabilities with the true genotypes. We demonstrate that the power to detect a true association can be substantially increased for difficult to call genotypes, resulting in improved inference in association studies.
Resumo:
Interpretability and power of genome-wide association studies can be increased by imputing unobserved genotypes, using a reference panel of individuals genotyped at higher marker density. For many markers, genotypes cannot be imputed with complete certainty, and the uncertainty needs to be taken into account when testing for association with a given phenotype. In this paper, we compare currently available methods for testing association between uncertain genotypes and quantitative traits. We show that some previously described methods offer poor control of the false-positive rate (FPR), and that satisfactory performance of these methods is obtained only by using ad hoc filtering rules or by using a harsh transformation of the trait under study. We propose new methods that are based on exact maximum likelihood estimation and use a mixture model to accommodate nonnormal trait distributions when necessary. The new methods adequately control the FPR and also have equal or better power compared to all previously described methods. We provide a fast software implementation of all the methods studied here; our new method requires computation time of less than one computer-day for a typical genome-wide scan, with 2.5 M single nucleotide polymorphisms and 5000 individuals.
Resumo:
Background: Genome wide association studies (GWAS) are becoming the approach of choice to identify genetic determinants of complex phenotypes and common diseases. The astonishing amount of generated data and the use of distinct genotyping platforms with variable genomic coverage are still analytical challenges. Imputation algorithms combine directly genotyped markers information with haplotypic structure for the population of interest for the inference of a badly genotyped or missing marker and are considered a near zero cost approach to allow the comparison and combination of data generated in different studies. Several reports stated that imputed markers have an overall acceptable accuracy but no published report has performed a pair wise comparison of imputed and empiric association statistics of a complete set of GWAS markers. Results: In this report we identified a total of 73 imputed markers that yielded a nominally statistically significant association at P < 10(-5) for type 2 Diabetes Mellitus and compared them with results obtained based on empirical allelic frequencies. Interestingly, despite their overall high correlation, association statistics based on imputed frequencies were discordant in 35 of the 73 (47%) associated markers, considerably inflating the type I error rate of imputed markers. We comprehensively tested several quality thresholds, the haplotypic structure underlying imputed markers and the use of flanking markers as predictors of inaccurate association statistics derived from imputed markers. Conclusions: Our results suggest that association statistics from imputed markers showing specific MAF (Minor Allele Frequencies) range, located in weak linkage disequilibrium blocks or strongly deviating from local patterns of association are prone to have inflated false positive association signals. The present study highlights the potential of imputation procedures and proposes simple procedures for selecting the best imputed markers for follow-up genotyping studies.
Resumo:
The prevalence of hypertension in African Americans (AAs) is higher than in other US groups; yet, few have performed genome-wide association studies (GWASs) in AA. Among people of European descent, GWASs have identified genetic variants at 13 loci that are associated with blood pressure. It is unknown if these variants confer susceptibility in people of African ancestry. Here, we examined genome-wide and candidate gene associations with systolic blood pressure (SBP) and diastolic blood pressure (DBP) using the Candidate Gene Association Resource (CARe) consortium consisting of 8591 AAs. Genotypes included genome-wide single-nucleotide polymorphism (SNP) data utilizing the Affymetrix 6.0 array with imputation to 2.5 million HapMap SNPs and candidate gene SNP data utilizing a 50K cardiovascular gene-centric array (ITMAT-Broad-CARe [IBC] array). For Affymetrix data, the strongest signal for DBP was rs10474346 (P= 3.6 × 10(-8)) located near GPR98 and ARRDC3. For SBP, the strongest signal was rs2258119 in C21orf91 (P= 4.7 × 10(-8)). The top IBC association for SBP was rs2012318 (P= 6.4 × 10(-6)) near SLC25A42 and for DBP was rs2523586 (P= 1.3 × 10(-6)) near HLA-B. None of the top variants replicated in additional AA (n = 11 882) or European-American (n = 69 899) cohorts. We replicated previously reported European-American blood pressure SNPs in our AA samples (SH2B3, P= 0.009; TBX3-TBX5, P= 0.03; and CSK-ULK3, P= 0.0004). These genetic loci represent the best evidence of genetic influences on SBP and DBP in AAs to date. More broadly, this work supports that notion that blood pressure among AAs is a trait with genetic underpinnings but also with significant complexity.
Resumo:
Molecular biology techniques are of help in genetic improvement since they permit the identification, mapping and analysis of polymorphisms of genes encoding proteins that act on metabolic pathways involved in economically interesting traits. The somatotrophic axis, which essentially consists of growth hormone releasing hormone (GHRH), growth hormone (GH), insulin-like growth factors I and II (IGF-I and IGF-II), and their associated binding proteins and receptors (GHRHR, GHR, IGF-IR and IGF-IIR), plays a key role in the metabolism and physiology of mammalian growth. The objectives of the present study were to estimate the allele and genotype frequencies of the IGF-I/SnaBI, IGF-IR/TaqI and GHRH/HaeIII gene polymorphisms in different genetic groups of beef cattle and to determine associations between these polymorphisms and growth and carcass traits. For this purpose, genotyping was performed on 79 Nellore animals, 30 Canchim (5/8 Charolais+3/8 Zebu) animals and 275 crossbred cattle originating from the crosses of Simmental (n=30) and Angus (n=245) sires with Nellore females. In the association studies, traits of interest were analyzed using the GLM procedure of SAS and least square means of the genotypes were compared by the Tukey test. Associations of IGF-I/SnaBI genotypes with body weight and subcutaneous backfat were significant (p < 0.05), and nearly significant for longissimus dorsi area (p=0.06), with the 1313 genotype being favorable compared to the AB genotype. No significant associations were observed between this polymorphism and weight gain or carcass yield (P > 0.05). The IGF-IR/TaqI and GHRH/HaeIII polymorphisms showed no association with production traits. (c) 2004 Elsevier B.V All rights reserved.
Resumo:
Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq)
Resumo:
The aim of this study was to identify candidate genes and genomic regions associated with ultrasound-derived measurements of the rib-eye area (REA), backfat thickness (BFT) and rumpfat thickness (RFT) in Nellore cattle. Data from 640 Nellore steers and young bulls with genotypes for 290 863 single nucleotide polymorphisms (SNPs) were used for genomewide association mapping. Significant SNP associations were explored to find possible candidate genes related to physiological processes. Several of the significant markers detected were mapped onto functional candidate genes including ARFGAP3, CLSTN2 and DPYD for REA; OSBPL3 and SUDS3 for BFT; and RARRES1 and VEPH1 for RFT. The physiological pathway related to lipid metabolism (CLSTN2, OSBPL3, RARRES1 and VEPH1) was identified. The significant markers within previously reported QTLs reinforce the importance of the genomic regions, and the other loci offer candidate genes that have not been related to carcass traits in previous investigations.
Resumo:
Based on our clinical experience on bovine mastitis, we hypothesized that subtypes of Staphylococcus aureus (S. aureus) exist which differ in their contagious and pathogenic properties. In order to investigate this hypothesis, we analyzed strains of S. aureus isolated from spontaneous intramammary infection (IMI) with their virulence gene patterns and genotypes obtained by PCR amplification of the 16S-23S rRNA intergenic spacer (RS-PCR). The genotypes were then associated with epidemiological and clinical data including 26 herds. The results demonstrated a high association between genotypes and virulence gene patterns as well as between epidemiological and pathogenic properties of S. aureus. In particular, genotype B was related to high contagiosity and increased pathogenicity whereas the other types (C, OG) were found with infection of single cows. Because of the high clinical relevance, our results indicate the need to subtype the IMI-associated strains of S. aureus in the future.
Resumo:
BACKGROUND A cost-effective strategy to increase the density of available markers within a population is to sequence a small proportion of the population and impute whole-genome sequence data for the remaining population. Increased densities of typed markers are advantageous for genome-wide association studies (GWAS) and genomic predictions. METHODS We obtained genotypes for 54 602 SNPs (single nucleotide polymorphisms) in 1077 Franches-Montagnes (FM) horses and Illumina paired-end whole-genome sequencing data for 30 FM horses and 14 Warmblood horses. After variant calling, the sequence-derived SNP genotypes (~13 million SNPs) were used for genotype imputation with the software programs Beagle, Impute2 and FImpute. RESULTS The mean imputation accuracy of FM horses using Impute2 was 92.0%. Imputation accuracy using Beagle and FImpute was 74.3% and 77.2%, respectively. In addition, for Impute2 we determined the imputation accuracy of all individual horses in the validation population, which ranged from 85.7% to 99.8%. The subsequent inclusion of Warmblood sequence data further increased the correlation between true and imputed genotypes for most horses, especially for horses with a high level of admixture. The final imputation accuracy of the horses ranged from 91.2% to 99.5%. CONCLUSIONS Using Impute2, the imputation accuracy was higher than 91% for all horses in the validation population, which indicates that direct imputation of 50k SNP-chip data to sequence level genotypes is feasible in the FM population. The individual imputation accuracy depended mainly on the applied software and the level of admixture.
Resumo:
Cauliflower (Brassica oleracea var. botrytis) is a vernalization-responsive crop. High ambient temperatures delay harvest time. The elucidation of the genetic regulation of floral transition is highly interesting for a precise harvest scheduling and to ensure stable market supply. This study aims at genetic dissection of temperature-dependent curd induction in cauliflower by genome-wide association studies and gene expression analysis. To assess temperature dependent curd induction, two greenhouse trials under distinct temperature regimes were conducted on a diversity panel consisting of 111 cauliflower commercial parent lines, genotyped with 14,385 SNPs. Broad phenotypic variation and high heritability (0.93) were observed for temperature-related curd induction within the cauliflower population. GWA mapping identified a total of 18 QTL localized on chromosomes O1, O2, O3, O4, O6, O8, and O9 for curding time under two distinct temperature regimes. Among those, several QTL are localized within regions of promising candidate flowering genes. Inferring population structure and genetic relatedness among the diversity set assigned three main genetic clusters. Linkage disequilibrium (LD) patterns estimated global LD extent of r(2) = 0.06 and a maximum physical distance of 400 kb for genetic linkage. Transcriptional profiling of flowering genes FLOWERING LOCUS C (BoFLC) and VERNALIZATION 2 (BoVRN2) was performed, showing increased expression levels of BoVRN2 in genotypes with faster curding. However, functional relevance of BoVRN2 and BoFLC2 could not consistently be supported, which probably suggests to act facultative and/or might evidence for BoVRN2/BoFLC-independent mechanisms in temperature regulated floral transition in cauliflower. Genetic insights in temperature-regulated curd induction can underpin genetically informed phenology models and benefit molecular breeding strategies toward the development of thermo-tolerant cultivars.
Resumo:
Association studies between ADIPOR1 genetic variants and predisposition to type 2 diabetes (DM2) have provided contradictory results. We determined if two single nucleotide polymorphisms (SNP c.-8503G>A and SNP c.10225C>G) in regulatory regions of ADIPOR1 in 567 Brazilian individuals of European (EA; N = 443) or African (AfA; N = 124) ancestry from rural (quilombo remnants; N = 439) and urban (N = 567) areas. We detected a significant effect of ethnicity on the distribution of the allelic frequencies of both SNPs in these populations (EA: -8503A = 0.27; AfA: -8503A = 0.16; P = 0.001 and EA: 10225G = 0.35; AfA: 10225G = 0.51; P < 0.001). Neither of the polymorphisms were associated with DM2 in the case-control study in EA (SNP c.-8503G>A: DM2 group -8503A = 0.26; control group -8503A = 0.30; P = 0.14/SNP 10225C>G: DM2 group 10225G = 0.37; control group 10225G = 0.32; P = 0.40) and AfA populations (SNP c.-8503G>A: DM2 group -8503A = 0.16; control group -8503A = 0.15; P = 0.34/SNP 10225C>G: DM2 group 10225G = 0.51; control group 10225G = 0.52; P = 0.50). Similarly, none of the polymorphisms were associated with metabolic/anthropometric risk factors for DM2 in any of the three populations, except for HDL cholesterol, which was significantly higher in AfA heterozygotes (GC = 53.75 ± 17.26 mg/dL) than in homozygotes. We conclude that ADIPOR1 polymorphisms are unlikely to be major risk factors for DM2 or for metabolic/anthropometric measurements that represent risk factors for DM2 in populations of European and African ancestries.
Resumo:
Genome-wide association studies (GWAS) have been successful in identifying common genetic variation involved in susceptibility to etiologically complex disease. We conducted a GWAS to identify common genetic variation involved in susceptibility to upper aero-digestive tract (UADT) cancers. Genome-wide genotyping was carried out using the Illumina HumanHap300 beadchips in 2,091 UADT cancer cases and 3,513 controls from two large European multi-centre UADT cancer studies, as well as 4,821 generic controls. The 19 top-ranked variants were investigated further in an additional 6,514 UADT cancer cases and 7,892 controls of European descent from an additional 13 UADT cancer studies participating in the INHANCE consortium. Five common variants presented evidence for significant association in the combined analysis (p <= 5 x 10(-7)). Two novel variants were identified, a 4q21 variant (rs1494961, p = 1 x 10(-8)) located near DNA repair related genes HEL308 and FAM175A (or Abraxas) and a 12q24 variant (rs4767364, p = 2 x 10(-8)) located in an extended linkage disequilibrium region that contains multiple genes including the aldehyde dehydrogenase 2 (ALDH2) gene. Three remaining variants are located in the ADH gene cluster and were identified previously in a candidate gene study involving some of these samples. The association between these three variants and UADT cancers was independently replicated in 5,092 UADT cancer cases and 6,794 controls non-overlapping samples presented here (rs1573496-ADH7, p = 5 x 10(-8); rs1229984-ADH1B, p = 7 x 10(-9); and rs698-ADH1C, p = 0.02). These results implicate two variants at 4q21 and 12q24 and further highlight three ADH variants in UADT cancer susceptibility.
Resumo:
Apart from very few families who have a direct cause from genetic mutation, causes of most Parkinson's disease (PD) remain unclear. Many allelic association studies on polymorphism of different candidate genes have been studied. Although these association studies do not imply a causal relationship, it does warrant further studies to elucidate the pathophysiologic significance. CYP1A1 polymorphisms have been reported to be associated with PD in a Japanese population sample. Since CYP1A1 transforms aromatic hydrocarbons into products that may be neurotoxic and perhaps lead to PD, we therefore undertook a study to look at the possible association of CYP1A1 polymorphism and PD in a Chinese population. Contrary to the Japanese result, we did not find any statistically significant difference between the PD group and the control group in our study with a bigger sample size.
Resumo:
The International Agency for Research on Cancer classified formaldehyde as carcinogenic to humans because there is “sufficient epidemiological evidence that it causes nasopharyngeal cancer in humans”. Genes involved in DNA repair and maintenance of genome integrity are critically involved in protecting against mutations that lead to cancer and/or inherited genetic disease. Association studies have recently provided evidence for a link between DNA repair polymorphisms and micronucleus (MN) induction. We used the cytokinesis-block micronucleus (CBMN assay) in peripheral lymphocytes and MN test in buccal cells to investigate the effects of XRCC3 Thr241Met, ADH5 Val309Ile, and Asp353Glu polymorphisms on the frequency of genotoxicity biomarkers in individuals occupationally exposed to formaldehyde (n = 54) and unexposed workers (n = 82). XRCC3 participates in DNA double-strand break/recombination repair, while ADH5 is an important component of cellular metabolism for the elimination of formaldehyde. Exposed workers had significantly higher frequencies (P < 0.01) than controls for all genotoxicity biomarkers evaluated in this study. Moreover, there were significant associations between XRCC3 genotypes and nuclear buds, namely XRCC3 Met/Met (OR = 3.975, CI 1.053–14.998, P = 0.042) and XRCC3 Thr/Met (OR = 5.632, CI 1.673–18.961, P = 0.005) in comparison with XRCC3 Thr/Thr. ADH5 polymorphisms did not show significant effects. This study highlights the importance of integrating genotoxicity biomarkers and genetic polymorphisms in human biomonitoring studies.
Resumo:
Sickle cell disease (SCD) is a genetic disorder with recessive transmission, caused by the mutation HBB:c.20A>T. It originates hemoglobin S that forms polymers inside the erythrocyte, upon deoxygenation, deforming it and ultimately leading to premature hemolysis. The disease presents with high heterogeneity of clinical manifestations, the most devastating of which, ischemic stroke, occurs in 11% of patients until 20 years of age. In this study, we tried to identify genetic modifiers of risk and episodes of stroke by studying 66 children with SCD, grouped according to the degree of cerebral vasculopathy (Stroke, Risk and Control). Association studies were performed between the three phenotypic groups and hematological and biochemical parameters of patients, as well as with 23 polymorphic regions in genes related to vascular cell adhesion (VCAM-1, THBS-1 and CD36), vascular tonus (NOS3 and ET-1) and inflammation (TNF-α and HMOX-1). Relevant data was collected from patient’s medical records. Known genetic modulators of SCD (beta-globin cluster haplotype and HBA and BCL11A genotypes) and putative genetic modifiers of cerebral vasculopathy were characterized. Differences in their distribution among groups were assessed. VCAM-1 rs1409419 allele C and NOS3 rs207044 allele C were associated to stroke events, while VCAM-1 rs1409419 allele T was found to be protective. Alleles 4a and 4b of NOS3 27 bp VNTR appeared to be respectively associated to stroke risk and protection. HMOX-1 longer STRs seemed to predispose to stroke. Higher hemoglobin F levels were found in Control group, as a result of Senegal haplotype or of BCL11A rs11886868 allele T, and higher lactate dehydrogenase levels, marker of hemolysis, were found in Risk group. Molecular mechanisms underlying the modifier functions of the relevant genetic variants are discussed.