22 resultados para Single nucleotide polymorphism
em DigitalCommons@The Texas Medical Center
Resumo:
With hundreds of single nucleotide polymorphisms (SNPs) in a candidate gene and millions of SNPs across the genome, selecting an informative subset of SNPs to maximize the ability to detect genotype-phenotype association is of great interest and importance. In addition, with a large number of SNPs, analytic methods are needed that allow investigators to control the false positive rate resulting from large numbers of SNP genotype-phenotype analyses. This dissertation uses simulated data to explore methods for selecting SNPs for genotype-phenotype association studies. I examined the pattern of linkage disequilibrium (LD) across a candidate gene region and used this pattern to aid in localizing a disease-influencing mutation. The results indicate that the r2 measure of linkage disequilibrium is preferred over the common D′ measure for use in genotype-phenotype association studies. Using step-wise linear regression, the best predictor of the quantitative trait was not usually the single functional mutation. Rather it was a SNP that was in high linkage disequilibrium with the functional mutation. Next, I compared three strategies for selecting SNPs for application to phenotype association studies: based on measures of linkage disequilibrium, based on a measure of haplotype diversity, and random selection. The results demonstrate that SNPs selected based on maximum haplotype diversity are more informative and yield higher power than randomly selected SNPs or SNPs selected based on low pair-wise LD. The data also indicate that for genes with small contribution to the phenotype, it is more prudent for investigators to increase their sample size than to continuously increase the number of SNPs in order to improve statistical power. When typing large numbers of SNPs, researchers are faced with the challenge of utilizing an appropriate statistical method that controls the type I error rate while maintaining adequate power. We show that an empirical genotype based multi-locus global test that uses permutation testing to investigate the null distribution of the maximum test statistic maintains a desired overall type I error rate while not overly sacrificing statistical power. The results also show that when the penetrance model is simple the multi-locus global test does as well or better than the haplotype analysis. However, for more complex models, haplotype analyses offer advantages. The results of this dissertation will be of utility to human geneticists designing large-scale multi-locus genotype-phenotype association studies. ^
Resumo:
Hereditary nonpolyposis colorectal cancer (HNPCC) is an autosomal dominant disease caused by germline mutations in DNA mismatch repair(MMR) genes. The nucleotide excision repair(NER) pathway plays a very important role in cancer development. We systematically studied interactions between NER and MMR genes to identify NER gene single nucleotide polymorphism (SNP) risk factors that modify the effect of MMR mutations on risk for cancer in HNPCC. We analyzed data from polymorphisms in 10 NER genes that had been genotyped in HNPCC patients that carry MSH2 and MLH1 gene mutations. The influence of the NER gene SNPs on time to onset of colorectal cancer (CRC) was assessed using survival analysis and a semiparametric proportional hazard model. We found the median age of onset for CRC among MMR mutation carriers with the ERCC1 mutation was 3.9 years earlier than patients with wildtype ERCC1(median 47.7 vs 51.6, log-rank test p=0.035). The influence of Rad23B A249V SNP on age of onset of HNPCC is age dependent (likelihood ratio test p=0.0056). Interestingly, using the likelihood ratio test, we also found evidence of genetic interactions between the MMR gene mutations and SNPs in ERCC1 gene(C8092A) and XPG/ERCC5 gene(D1104H) with p-values of 0.004 and 0.042, respectively. An assessment using tree structured survival analysis (TSSA) showed distinct gene interactions in MLH1 mutation carriers and MSH2 mutation carriers. ERCC1 SNP genotypes greatly modified the age onset of HNPCC in MSH2 mutation carriers, while no effect was detected in MLH1 mutation carriers. Given the NER genes in this study play different roles in NER pathway, they may have distinct influences on the development of HNPCC. The findings of this study are very important for elucidation of the molecular mechanism of colon cancer development and for understanding why some mutation carriers of the MSH2 and MLH1 gene develop CRC early and others never develop CRC. Overall, the findings also have important implications for the development of early detection strategies and prevention as well as understanding the mechanism of colorectal carcinogenesis in HNPCC. ^
Resumo:
Systemic sclerosis (SSc) or Scleroderma is a complex disease and its etiopathogenesis remains unelucidated. Fibrosis in multiple organs is a key feature of SSc and studies have shown that transforming growth factor-β (TGF-β) pathway has a crucial role in fibrotic responses. For a complex disease such as SSc, expression quantitative trait loci (eQTL) analysis is a powerful tool for identifying genetic variations that affect expression of genes involved in this disease. In this study, a multilevel model is described to perform a multivariate eQTL for identifying genetic variation (SNPs) specifically associated with the expression of three members of TGF-β pathway, CTGF, SPARC and COL3A1. The uniqueness of this model is that all three genes were included in one model, rather than one gene being examined at a time. A protein might contribute to multiple pathways and this approach allows the identification of important genetic variations linked to multiple genes belonging to the same pathway. In this study, 29 SNPs were identified and 16 of them located in known genes. Exploring the roles of these genes in TGF-β regulation will help elucidate the etiology of SSc, which will in turn help to better manage this complex disease. ^
Resumo:
A wealth of genetic associations for cardiovascular and metabolic phenotypes in humans has been accumulating over the last decade, in particular a large number of loci derived from recent genome wide association studies (GWAS). True complex disease-associated loci often exert modest effects, so their delineation currently requires integration of diverse phenotypic data from large studies to ensure robust meta-analyses. We have designed a gene-centric 50 K single nucleotide polymorphism (SNP) array to assess potentially relevant loci across a range of cardiovascular, metabolic and inflammatory syndromes. The array utilizes a "cosmopolitan" tagging approach to capture the genetic diversity across approximately 2,000 loci in populations represented in the HapMap and SeattleSNPs projects. The array content is informed by GWAS of vascular and inflammatory disease, expression quantitative trait loci implicated in atherosclerosis, pathway based approaches and comprehensive literature searching. The custom flexibility of the array platform facilitated interrogation of loci at differing stringencies, according to a gene prioritization strategy that allows saturation of high priority loci with a greater density of markers than the existing GWAS tools, particularly in African HapMap samples. We also demonstrate that the IBC array can be used to complement GWAS, increasing coverage in high priority CVD-related loci across all major HapMap populations. DNA from over 200,000 extensively phenotyped individuals will be genotyped with this array with a significant portion of the generated data being released into the academic domain facilitating in silico replication attempts, analyses of rare variants and cross-cohort meta-analyses in diverse populations. These datasets will also facilitate more robust secondary analyses, such as explorations with alternative genetic models, epistasis and gene-environment interactions.
Resumo:
Glutathione S-transferase (GST) genes detoxify and metabolize carcinogens, including oxygen free radicals which may contribute to salivary gland carcinogenesis. This cancer center-based case-control association study included 166 patients with incident salivary gland carcinoma (SGC) and 511 cancer-free controls. We performed multiplex polymerase chain reaction-based polymorphism genotyping assays for GSTM1 and GSTT1 null genotypes. Odds ratios (ORs) and 95% confidence intervals (CIs) were calculated with multivariable logistic regression analyses adjusted for age, sex, ethnicity, tobacco use, family history of cancer, alcohol use and radiation exposure. In our results, 27.7% of the SGC cases and 20.6% of the controls were null for the GSTT1 (P = 0.054), and 53.0% of the SGC cases and 50.9% of the controls were null for the GSTM1 (P = 0.633). The results of the adjusted multivariale regression analysis suggested that having GSTT1 null genotype was associated with a significantly increased risk for SGC (odds ratio 1.5, 95% confidence interval 1.0-2.3). Additionally, 13.9% of the SGC cases but only 8.4% of the controls were null for both genes and the results of the adjusted multivariable regression analysis suggested that having both null genotypes was significantly associated with an approximately 2-fold increased risk for SGC (odds ratio 1.9, 95% confidence interval 1.0-3.5). The presence of GSTT1 null genotype and the simultaneous presence of GSTM1 and GSTT1 null genotypes appear associated with significantly increased SGC risk. These findings warrant further study with larger sample sizes.
Resumo:
PURPOSE: The present study defines genomic loci underlying coordinate changes in gene expression following retinal injury. METHODS: A group of acute phase genes expressed in diverse nervous system tissues was defined by combining microarray results from injury studies from rat retina, brain, and spinal cord. Genomic loci regulating the brain expression of acute phase genes were identified using a panel of BXD recombinant inbred (RI) mouse strains. Candidate upstream regulators within a locus were defined using single nucleotide polymorphism databases and promoter motif databases. RESULTS: The acute phase response of rat retina, brain, and spinal cord was dominated by transcription factors. Three genomic loci control transcript expression of acute phase genes in brains of BXD RI mouse strains. One locus was identified on chromosome 12 and was highly correlated with the expression of classic acute phase genes. Within the locus we identified the inhibitor of DNA binding 2 (Id2) as a candidate upstream regulator. Id2 was upregulated as an acute phase transcript in injury models of rat retina, brain, and spinal cord. CONCLUSIONS: We defined a group of transcriptional changes associated with the retinal acute injury response. Using genetic linkage analysis of natural transcript variation, we identified regulatory loci and candidate regulators that control transcript levels of acute phase genes.
Resumo:
Nonsyndromic cleft lip with or without cleft palate (NSCLP) is a common birth anomaly that requires prolonged multidisciplinary rehabilitation. Although variation in several genes has been identified as contributing to NSCLP, most of the genetic susceptibility loci have yet to be defined. To identify additional contributory genes, a high-throughput genomic scan was performed using the Illumina Linkage IVb Panel platform. We genotyped 6008 SNPs in nine non-Hispanic white NSCLP multiplex families and a single large African-American NSCLP multiplex family. Fourteen chromosomal regions were identified with LOD>1.5, including six regions not previously reported. Analysis of the data from the African-American and non-Hispanic white families revealed two likely chromosomal regions: 8q21.3-24.12 and 22q12.2-12.3 with LOD scores of 2.98 and 2.66, respectively. On the basis of biological function, syndecan 2 (SDC2) and growth differentiation factor 6 (GDF6) in 8q21.3-24.12 and myosin heavy-chain 9, non-muscle (MYH9) in 22q12.2-12.3 were selected as candidate genes. Association analyses from these genes yielded marginally significant P-values for SNPs in SDC2 and GDF6 (0.01
Resumo:
The ventricular system is a critical component of the central nervous system (CNS) that is formed early in the developmental stages and remains functional through the lifetime. Changes in the ventricular system can be easily discerned via neuroimaging procedures and most of the time it reflects changes in the physiology of the CNS. In this study we attempted to identify specific genes associated with variation in ventricular volume in humans. Methods. We conducted a genome wide association (GWA) analysis of the volume of the lateral ventricles among 1605 individuals of European ancestry from two community based cohorts, the Genetics of Microangiopathic Brain Injury (GMBI; N=814) and Atherosclerosis Risk in Communities (ARIC; N=791). Significant findings from the analysis were tested for replication in both the cohorts and then meta-analyzed to get an estimate of overall significance. Results. In our GWA analyses, no single nucleotide polymorphism (SNP) reached a genome-wide significance of p<10−8. There were 25 SNPs in GMBI and 9 SNPs in ARIC that reached a threshold of p<10 −5. However, none of the top SNPs from each cohort were replicated in the other. In the meta-analysis, no SNP reached the genome-wide threshold of 5×10−8, but we identified five novel SNPs associated with variation in ventricular volume at the p<10 −5 level. Strongest association was for rs2112536 in an intergenic region on chromosome 5q33 (Pmeta= 8.46×10−7 ). The remaining four SNPs were located on chromosome 3q23 encompassing the gene for Calsyntenin-2 (CLSTN2). The SNPs with strongest association in this region were rs17338555 (Pmeta= 5.28×10 −6), rs9812091 (Pmeta= 5.89×10−6 ), rs9812283 (Pmeta= 5.97×10−6) and rs9833213 (Pmeta= 6.96×10−6). Conclusions. This GWA study of ventricular volumes in the community-based cohorts of European descent identifies potential locus on chromosomes 3 and 5. Further characterization of these loci may provide insights into pathophysiology of ventricular involvement in various neurological diseases.^
Resumo:
This thesis project is motivated by the potential problem of using observational data to draw inferences about a causal relationship in observational epidemiology research when controlled randomization is not applicable. Instrumental variable (IV) method is one of the statistical tools to overcome this problem. Mendelian randomization study uses genetic variants as IVs in genetic association study. In this thesis, the IV method, as well as standard logistic and linear regression models, is used to investigate the causal association between risk of pancreatic cancer and the circulating levels of soluble receptor for advanced glycation end-products (sRAGE). Higher levels of serum sRAGE were found to be associated with a lower risk of pancreatic cancer in a previous observational study (255 cases and 485 controls). However, such a novel association may be biased by unknown confounding factors. In a case-control study, we aimed to use the IV approach to confirm or refute this observation in a subset of study subjects for whom the genotyping data were available (178 cases and 177 controls). Two-stage IV method using generalized method of moments-structural mean models (GMM-SMM) was conducted and the relative risk (RR) was calculated. In the first stage analysis, we found that the single nucleotide polymorphism (SNP) rs2070600 of the receptor for advanced glycation end-products (AGER) gene meets all three general assumptions for a genetic IV in examining the causal association between sRAGE and risk of pancreatic cancer. The variant allele of SNP rs2070600 of the AGER gene was associated with lower levels of sRAGE, and it was neither associated with risk of pancreatic cancer, nor with the confounding factors. It was a potential strong IV (F statistic = 29.2). However, in the second stage analysis, the GMM-SMM model failed to converge due to non- concaveness probably because of the small sample size. Therefore, the IV analysis could not support the causality of the association between serum sRAGE levels and risk of pancreatic cancer. Nevertheless, these analyses suggest that rs2070600 was a potentially good genetic IV for testing the causality between the risk of pancreatic cancer and sRAGE levels. A larger sample size is required to conduct a credible IV analysis.^
Resumo:
BACKGROUND: Variants in the complement cascade genes and the LOC387715/HTRA1, have been widely reported to associate with age-related macular degeneration (AMD), the most common cause of visual impairment in industrialized countries. METHODS/PRINCIPAL FINDINGS: We investigated the association between the LOC387715 A69S and complement component C3 R102G risk alleles in the Finnish case-control material and found a significant association with both variants (OR 2.98, p = 3.75 x 10(-9); non-AMD controls and OR 2.79, p = 2.78 x 10(-19), blood donor controls and OR 1.83, p = 0.008; non-AMD controls and OR 1.39, p = 0.039; blood donor controls), respectively. Previously, we have shown a strong association between complement factor H (CFH) Y402H and AMD in the Finnish population. A carrier of at least one risk allele in each of the three susceptibility loci (LOC387715, C3, CFH) had an 18-fold risk of AMD when compared to a non-carrier homozygote in all three loci. A tentative gene-gene interaction between the two major AMD-associated loci, LOC387715 and CFH, was found in this study using a multiplicative (logistic regression) model, a synergy index (departure-from-additivity model) and the mutual information method (MI), suggesting that a common causative pathway may exist for these genes. Smoking (ever vs. never) exerted an extra risk for AMD, but somewhat surprisingly, only in connection with other factors such as sex and the C3 genotype. Population attributable risks (PAR) for the CFH, LOC387715 and C3 variants were 58.2%, 51.4% and 5.8%, respectively, the summary PAR for the three variants being 65.4%. CONCLUSIONS/SIGNIFICANCE: Evidence for gene-gene interaction between two major AMD associated loci CFH and LOC387715 was obtained using three methods, logistic regression, a synergy index and the mutual information (MI) index.
Resumo:
Persistently low white blood cell count (WBC) and neutrophil count is a well-described phenomenon in persons of African ancestry, whose etiology remains unknown. We recently used admixture mapping to identify an approximately 1-megabase region on chromosome 1, where ancestry status (African or European) almost entirely accounted for the difference in WBC between African Americans and European Americans. To identify the specific genetic change responsible for this association, we analyzed genotype and phenotype data from 6,005 African Americans from the Jackson Heart Study (JHS), the Health, Aging and Body Composition (Health ABC) Study, and the Atherosclerosis Risk in Communities (ARIC) Study. We demonstrate that the causal variant must be at least 91% different in frequency between West Africans and European Americans. An excellent candidate is the Duffy Null polymorphism (SNP rs2814778 at chromosome 1q23.2), which is the only polymorphism in the region known to be so differentiated in frequency and is already known to protect against Plasmodium vivax malaria. We confirm that rs2814778 is predictive of WBC and neutrophil count in African Americans above beyond the previously described admixture association (P = 3.8 x 10(-5)), establishing a novel phenotype for this genetic variant.
Resumo:
Extremes of electrocardiographic QT interval are associated with increased risk for sudden cardiac death (SCD); thus, identification and characterization of genetic variants that modulate QT interval may elucidate the underlying etiology of SCD. Previous studies have revealed an association between a common genetic variant in NOS1AP and QT interval in populations of European ancestry, but this finding has not been extended to other ethnic populations. We sought to characterize the effects of NOS1AP genetic variants on QT interval in the multi-ethnic population-based Dallas Heart Study (DHS, n = 3,072). The SNP most strongly associated with QT interval in previous samples of European ancestry, rs16847548, was the most strongly associated in White (P = 0.005) and Black (P = 3.6 x 10(-5)) participants, with the same direction of effect in Hispanics (P = 0.17), and further showed a significant SNP x sex-interaction (P = 0.03). A second SNP, rs16856785, uncorrelated with rs16847548, was also associated with QT interval in Blacks (P = 0.01), with qualitatively similar results in Whites and Hispanics. In a previously genotyped cohort of 14,107 White individuals drawn from the combined Atherosclerotic Risk in Communities (ARIC) and Cardiovascular Health Study (CHS) cohorts, we validated both the second locus at rs16856785 (P = 7.63 x 10(-8)), as well as the sex-interaction with rs16847548 (P = 8.68 x 10(-6)). These data extend the association of genetic variants in NOS1AP with QT interval to a Black population, with similar trends, though not statistically significant at P<0.05, in Hispanics. In addition, we identify a strong sex-interaction and the presence of a second independent site within NOS1AP associated with the QT interval. These results highlight the consistent and complex role of NOS1AP genetic variants in modulating QT interval.
Resumo:
Methylating agents are involved in carcinogenesis, and the DNA repair protein O(6)-methylguanine-DNA methyltransferase (MGMT) removes methyl group from O(6)-methylguanine. Genetic variation in DNA repair genes has been shown to contribute to susceptibility to squamous cell carcinoma of the head and neck (SCCHN). We hypothesize that MGMT polymorphisms are associated with risk of SCCHN. In a hospital-based case-control study of 721 patients with SCCHN and 1234 cancer-free controls frequency-matched by age, sex and ethnicity, we genotyped four MGMT polymorphisms, two in exon 3, 16195C>T and 16286C>T and two in the promoter region, 45996G>T and 46346C>A. We found that none of these polymorphisms alone had a significant effect on risk of SCCHN. However, when these four polymorphisms were evaluated together by the number of putative risk genotypes (i.e. 16195CC, 16286CC, 45996GT+TT, and 46346CA+AA), a statistically significantly increased risk of SCCHN was associated with the combined genotypes with three to four risk genotypes, compared with those with zero to two risk genotypes (adjusted odds ratio (OR)=1.27; 95% confidence interval (CI)=1.05-1.53). This increased risk was also more pronounced among young subjects (OR=1.81; 95% CI=1.11-2.96), men (OR=1.24; 95% CI=1.00-1.55), ever smokers (OR=1.25; 95%=1.01-1.56), ever drinkers (OR=1.29; 95% CI=1.04-1.60), patients with oropharyngeal cancer (OR=1.45; 95% CI=1.12-1.87), and oropharyngeal cancer with regional lymph node metastasis (OR=1.52; 95% CI=1.16-1.89). In conclusion, our results suggest that any one of MGMT variants may not have a substantial effect on SCCHN risk, but a joint effect of several MGMT variants may contribute to risk and progression of SCCHN, particularly for oropharyngeal cancer, in non-Hispanic whites.
Resumo:
BACKGROUND: Neural tube defects (NTDs) occur in as many as 0.5-2 per 1000 live births in the United States. One of the most common and severe neural tube defects is meningomyelocele (MM) resulting from failed closure of the caudal end of the neural tube. MM has been induced by retinoic acid teratogenicity in rodent models. We hypothesized that genetic variants influencing retinoic acid (RA) induction via retinoic acid receptors (RARs) may be associated with risk for MM. METHODS: We analyzed 47 single nucleotide polymorphisms (SNPs) that span across the three retinoic acid receptor genes using the SNPlex genotyping platform. Our cohort consisted of 610 MM families. RESULTS: One variant in the RARA gene (rs12051734), three variants in the RARB gene (rs6799734, rs12630816, rs17016462), and a single variant in the RARG gene (rs3741434) were found to be statistically significant at p < 0.05. CONCLUSION: RAR genes were associated with risk for MM. For all associated SNPs, the rare allele conferred a protective effect for MM susceptibility.
Resumo:
BACKGROUND: Meningomyelocele (MM) results from lack of closure of the neural tube during embryologic development. Periconceptional folic acid supplementation is a modifier of MM risk in humans, leading toan interest in the folate transport genes as potential candidates for association to MM. METHODS: This study used the SNPlex Genotyping (ABI, Foster City, CA) platform to genotype 20 single polymorphic variants across the folate receptor genes (FOLR1, FOLR2, FOLR3) and the folate carrier gene (SLC19A1) to assess their association to MM. The study population included 329 trio and 281 duo families. Only cases with MM were included. Genetic association was assessed using the transmission disequilibrium test in PLINK. RESULTS: A variant in the FOLR2 gene (rs13908), three linked variants in the FOLR3 gene (rs7925545, rs7926875, rs7926987), and two variants in the SLC19A1 gene (rs1888530 and rs3788200) were statistically significant for association to MM in our population. CONCLUSION: This study involved the analyses of selected single nucleotide polymorphisms across the folate receptor genes and the folate carrier gene in a large population sample. It provided evidence that the rare alleles of specific single nucleotide polymorphisms within these genes appear to be statistically significant for association to MM in the patient population that was tested.