36 resultados para Genome-Wide Association


Relevância:

100.00% 100.00%

Publicador:

Resumo:

The discovery of expanded simple repeated sequences causing or associated with human disease has lead to a new area of research involved in the elucidation of how the expanded repeat causes disease and how the repeat becomes unstable. ^ To study the genetic basis of the (CTG)n repeat instability in the DMPK gene in myotonic dystrophy (DM1) patients, somatic cell hybrids were constructed between the lymphocytes of DM1 patients and a variety of Chinese hamster ovary (CHO) cell DNA repair gene deficient mutants. By using small pool PCR (SP-PCR), the instability of the (CTG)n can be quantitated for both the frequency and sizes of length change mutations. ^ Additional SP-PCR analysis on 2/11 subclones generated from this original hybrid showed a marked increase in large repeat deletions, ∼50%. A bimodal distribution of repeats was seen around the progenitor allele and at a large deleted product (within the normal range) with no intermediate products present. ^ To determine if the repair capacity of the CHO cell led to a mutator phenotype in the hamster and hybrid clones, SP-PCR was also done on 3 hamster microsatellites in a variety of hamster cell backgrounds. No variant alleles were seen in over 2500 genome equivalents screened. ^ Human-hamster hybrids have long been shown to be chromosomally unstable, yet information about the stability of repeated sequences was not known. To test if repeat instability was associated with either intact or non-intact human chromosomes, more than 300 microsatellite repeats on 13 human chromosomes (intact and non-intact) were analyzed in eight hybrid cells. No variants were seen between the hybrid and patient alleles in the hybrids. ^ To identify whether DM1 patients have a previously undetected level of genome wide instability or if the instability is truly locus specific, SP-PCR was done on 6 human microsatellites within the patient used to make the hybrid cells. No variants were seen in over 1000 genomes screened. ^ These studies show that the somatic cell hybrid approach is a genetically stable system that allows for the determination of factors that could lead to changes in microsatellite instability. It also shows that there is something inherent about the DM1 expanded (CTG)n repeat that it is solely targeted by, as of yet, and unknown mechanism that causes the repeat to be unstable. (Abstract shortened by UMI.)^

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Extremes of electrocardiographic QT interval are associated with increased risk for sudden cardiac death (SCD); thus, identification and characterization of genetic variants that modulate QT interval may elucidate the underlying etiology of SCD. Previous studies have revealed an association between a common genetic variant in NOS1AP and QT interval in populations of European ancestry, but this finding has not been extended to other ethnic populations. We sought to characterize the effects of NOS1AP genetic variants on QT interval in the multi-ethnic population-based Dallas Heart Study (DHS, n = 3,072). The SNP most strongly associated with QT interval in previous samples of European ancestry, rs16847548, was the most strongly associated in White (P = 0.005) and Black (P = 3.6 x 10(-5)) participants, with the same direction of effect in Hispanics (P = 0.17), and further showed a significant SNP x sex-interaction (P = 0.03). A second SNP, rs16856785, uncorrelated with rs16847548, was also associated with QT interval in Blacks (P = 0.01), with qualitatively similar results in Whites and Hispanics. In a previously genotyped cohort of 14,107 White individuals drawn from the combined Atherosclerotic Risk in Communities (ARIC) and Cardiovascular Health Study (CHS) cohorts, we validated both the second locus at rs16856785 (P = 7.63 x 10(-8)), as well as the sex-interaction with rs16847548 (P = 8.68 x 10(-6)). These data extend the association of genetic variants in NOS1AP with QT interval to a Black population, with similar trends, though not statistically significant at P<0.05, in Hispanics. In addition, we identify a strong sex-interaction and the presence of a second independent site within NOS1AP associated with the QT interval. These results highlight the consistent and complex role of NOS1AP genetic variants in modulating QT interval.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

C-Reactive Protein (CRP) is a biomarker indicating tissue damage, inflammation, and infection. High-sensitivity CRP (hsCRP) is an emerging biomarker often used to estimate an individual’s risk for future coronary heart disease (CHD). hsCRP levels falling below 1.00 mg/l indicate a low risk for developing CHD, levels ranging between 1.00 mg/l and 3.00 mg/l indicate an elevated risk, and levels exceeding 3.00 mg/l indicate high risk. Multiple Genome-Wide Association Studies (GWAS) have identified a number of genetic polymorphisms which influence CRP levels. SNPs implicated in such studies have been found in or near genes of interest including: CRP, APOE, APOC, IL-6, HNF1A, LEPR, and GCKR. A strong positive correlation has also been found to exist between CRP levels and BMI, a known risk factor for CHD and a state of chronic inflammation. We conducted a series of analyses designed to identify loci which interact with BMI to influence CRP levels in a subsample of European-Americans in the ARIC cohort. In a stratified GWA analysis, 15 genetic regions were identified as having significantly (p-value < 2.00*10-3) distinct effects on hsCRP levels between the two obesity strata: lean (18.50 kg/m2 < BMI < 24.99 kg/m2) and obese (BMI ≥ 30.00 kg/m2). A GWA analysis performed on all individuals combined (i.e. not a priori stratified for obesity status) with the inclusion of an additional parameter for BMI by gene interaction, identified 11 regions which interact with BMI to influence hsCRP levels. Two regions containing the genes GJA5 and GJA8 (on chromosome 1) and FBXO11 (on chromosome 2) were identified in both methods of analysis suggesting that these genes possibly interact with BMI to influence hsCRP levels. We speculate that atrial fibrillation (AF), age-related cataracts and the TGF-β pathway may be the biological processes influenced by the interaction of GJA5, GJA8 and FBXO11, respectively, with BMI to cause changes in hsCRP levels. Future studies should focus on the influence of gene x bmi interaction on AF, age-related cataracts and TGF-β.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Stroke is the third leading cause of death and a major debilitating disease in the United States. Multiple factors, including genetic factors, contribute to the development of the disease. Genome-wide association studies (GWAS) have contributed to the identification of genetic loci influencing risk for complex diseases, such as stroke. In 2010, a GWAS of incident stroke was performed in four large prospective cohorts from the USA and Europe and identified an association of two Single Nucleotide Polymorphisms (SNPs) on chromosome 12p13 with a greater risk of ischemic stroke in individuals of European and African-American ancestry. These SNPs are located 11 Kb upstream of the nerve injury-induced gene 2, Ninjurin2 (NINJ2), suggesting that this gene may be involved in stroke pathogenesis. NINJ2 is a cell adhesion molecule induced in the distal glial cells from a sciatic-nerve injury at 7-days after injury. In an effort to ascribe a possible role of NINJ2 in stroke, we have assessed changes in the level of gene and protein expression of NINJ2 following a time-course from a transiently induced middle cerebral artery ischemic stroke in mice brains. We report an increase in the gene expression of NINJ2 in the ischemic and peri-infarct (ipsilateral) cortical tissues at 7 and 14-days after stroke. We also report an increase in the protein expression of NINJ2 in the cortex of both the ipsilateral and contralateral cortical tissues at the same time-points. We conclude that the expression of NINJ2 is regulated by an ischemic stroke in the cortex and is consistent with NINJ2 being involved in the recovery time-points of stroke.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Prostate cancer (PrCa) is a leading cause of morbidity and mortality, yet the etiology remains uncertain. Meta-analyses show that PrCa risk is reduced by 16% in men with type 2 diabetes (T2D), but the mechanism is unknown. Recent genome-wide association studies and meta-analyses have found single nucleotide polymorphisms (SNPs) that consistently predict T2D risk. We evaluated associations of incident PrCa with 14 T2D SNPs in the Atherosclerosis Risk in Communities (ARIC) study. From 1987-2000, there were 397 incident PrCa cases ascertained from state or local cancer registries among 6,642 men (1,560 blacks and 5,082 whites) aged 45-64 years at baseline. Genotypes were determined by TaqMan assay. Cox proportional hazards models were used to assess the association between PrCa and increasing number of T2D risk-raising alleles for individual SNPs and for genetic risk scores (GRS) comprised of the number of T2D risk-raising alleles across SNPs. Two-way gene-gene interactions were evaluated with likelihood ratio tests. Using additive genetic models, the T2D risk-raising allele was associated with significantly reduced risk of PrCa for IGF2BP2 rs4402960 (hazard ratio [HR]=0.79; P=0.07 among blacks only), SLC2A2 rs5400 (race-adjusted HR=0.85; P=0.05) and UCP2 rs660339 (race-adjusted HR=0.84; P=0.02), but significantly increased risk of PrCa for CAPN10 rs3792267 (race-adjusted HR=1.20; P=0.05). No other SNPs were associated with PrCa using an additive genetic model. However, at least one copy of the T2D risk-raising allele for TCF7L2 rs7903146 was associated with reduced PrCa risk using a dominant genetic model (race-adjusted HR=0.79; P=0.03). These results imply that the T2D-PrCa association may be partly due to shared genetic variation, but these results should be verified since multiple tests were performed. When the combined, additive effects of these SNPs were tested using a GRS, there was nearly a 10% reduction in risk of PrCa per T2D risk-raising allele (race-adjusted HR=0.92; P=0.02). SNPs in IGF2BP2, KCNJ11 and SLC2A2 were also involved in multiple synergistic gene-gene interactions on a multiplicative scale. In conclusion, it appears that the T2D-PrCa association may be due, in part, to common genetic variation. Further knowledge of T2D gene-PrCa mechanisms may improve understanding of PrCa etiology and may inform PrCa prevention and treatment.^

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Despite extensive research, the etiology of adult glioma remains largely unknown. We sought to further explore the role of immune and genetic factors in glioma etiology using data from the Harris County Brain Tumor Study and the first U.S. genome-wide association study of glioma. First, using a case-control study design, we examined the association between adult glioma risk and surrogates of the timing and frequency of common early childhood infections, birth order and sibship size, respectively. We found that each one-unit increase in birth order was associated with a 12% decreased risk of glioma development in adulthood (OR=0.88, 95% CI=0.81-0.96); however, sibship size was not associated with adult glioma risk (OR=0.96, 95% CI=0.91-1.02). Second, we used a multi-strategic approach to explore the relationships between glioma risk, history of asthma/allergies, and 23 functional SNPs in 11 inflammation genes. We found three inflammation gene SNPs to be significantly associated with glioma risk (COX2/PTGS2 rs20417 [OR=1.41]; IL10 rs1800896 [OR=1.57]; and IL13 rs20541 [OR=0.39]). Joint effects analysis of the risk-conferring alleles of these three SNPs revealed a trend of increasing risk with increasing number of adverse alleles among those without asthma/allergies (p<0.0001). Finally, we conducted a case-only study to explore pairwise SNP-SNP interactions in immune-related pathways among a population of 1304 non-Hispanic white glioma cases. After correction for multiple comparisons, we found 279 significant SNP-SNP interactions among polymorphisms of immune-related genes, many of which have not been previously examined. Our results, cumulatively, suggest an important role for immune and genetic factors in glioma etiology and provide several new hypotheses for future studies.^

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Response to pharmacological treatment is variable among individuals. Some patients respond favorably to a drug while others develop adverse reactions. Early investigations showed evidence of variation in genes that code for drug receptors, drug transporters, and drug metabolizing enzymes; and pharmacogenetics appeared as the science that studies the relationship between drug response and genetic variation. Thiazide diuretics are the recommended first-line monotherapy for hypertension (i.e. SBP>140 or DBP>90). Even so, diuretics are associated with adverse metabolic side effects, such as hyperglycemia, which increase the risk of developing type 2 diabetes. Published approaches testing variation in candidate genes (e.g. the renin-angiotensin-aldosteron system (RAAS) and salt–sensitivity genes) have met with only limited success. We conducted the first genome wide association study to identify genes influencing hyperglycemia as an adverse effect of thiazide diuretics in non-Hispanic White hypertensive patients participating in the Genetic Epidemiology of Responses to Antihypertensives (GERA) and Pharmacogenomic Evaluation of Antihypertensive Responses (PEAR) clinical trials. No SNP reached the a priori defined threshold of statistical significance (p<5x10-8). We detected 50 SNPs in 9 genomic regions with suggestive p-values (p<1x10-5). Two of them, rs6870564 (p-value=3.28 X 10-6) and rs7702121 (p-value=5.09 X 10-6), were located close to biologic candidate genes, MYO and MGAT1, and one SNP in a genomic region in chromosome 6, rs7762018 (p-value=4.59 X 10-6) has been previously related to Insulin-Dependent Diabetes Mellitus (IDDM8). I conclude that 1) there are unlikely to be common SNPs with large effects on the adverse metabolic effects to hydrochlorothiazide treatment and 2) larger sample sizes are needed for pharmacogenetic studies of inter-individual variation in response to commonly prescribed medication.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Lung cancer is the leading cause of cancer-related mortality in the US. Emerging evidence has shown that host genetic factors can interact with environmental exposures to influence patient susceptibility to the diseases as well as clinical outcomes, such as survival and recurrence. We aimed to identify genetic prognostic markers for non-small cell lung cancer (NSCLC), a major (85%) subtype of lung cancer, and also in other subgroups. With the fast evolution of genotyping technology, genetic association studies have went through candidate gene approach, to pathway-based approach, to the genome wide association study (GWAS). Even in the era of GWAS, pathway-based approach has its own advantages on studying cancer clinical outcomes: it is cost-effective, requiring a smaller sample size than GWAS easier to identify a validation population and explore gene-gene interactions. In the current study, we adopted pathway-based approach focusing on two critical pathways - miRNA and inflammation pathways. MicroRNAs (miRNA) post-transcriptionally regulate around 30% of human genes. Polymorphisms within miRNA processing pathways and binding sites may influence patients’ prognosis through altered gene regulation. Inflammation plays an important role in cancer initiation and progression, and also has shown to impact patients’ clinical outcomes. We first evaluated 240 single nucleotide polymorphisms (SNPs) in miRNA biogenesis genes and predicted binding sites in NSCLC patients to determine associations with clinical outcomes in early-stage (stage I and II) and late-stage (stage III and IV) lung cancer patients, respectively. First, in 535 early-stage patients, after correcting multiple comparisons, FZD4:rs713065 (hazard ratio [HR]:0.46, 95% confidence interval [CI]:0.32-0.65) showed a significant inverse association with survival in early stage surgery-only patients. SP1:rs17695156 (HR:2.22, 95% CI:1.44-3.41) and DROSHA:rs6886834 (HR:6.38, 95% CI:2.49-16.31) conferred increased risk of progression in the all patients and surgery-only populations, respectively. FAS:rs2234978 was significantly associated with improved survival in all patients (HR:0.59, 95% CI:0.44-0.77) and in the surgery plus chemotherapy populations (HR:0.19, 95% CI:0.07-0.46).. Functional genomics analysis demonstrated that this variant creates a miR-651 binding site resulting in altered miRNA regulation of FAS, providing biological plausibility for the observed association. We then analyzed these associations in 598 late-stage patients. After multiple comparison corrections, no SNPs remained significant in the late stage group, while the top SNP NAT1:rs15561 (HR=1.98, 96%CI=1.32-2.94) conferred a significantly increased risk of death in the chemotherapy subgroup. To test the hypothesis that genetic variants in the inflammation-related pathways may be associated with survival in NSCLC patients, we first conducted a three-stage study. In the discovery phase, we investigated a comprehensive panel of 11,930 inflammation-related SNPs in three independent lung cancer populations. A missense SNP (rs2071554) in HLA-DOB was significantly associated with poor survival in the discovery population (HR: 1.46, 95% CI: 1.02-2.09), internal validation population (HR: 1.51, 95% CI: 1.02-2.25), and external validation (HR: 1.52, 95% CI: 1.01-2.29) population. Rs2900420 in KLRK1 was significantly associated with a reduced risk for death in the discovery (HR: 0.76, 95% CI: 0.60-0.96) and internal validation (HR: 0.77, 95% CI: 0.61-0.99) populations, and the association reached borderline significance in the external validation population (HR: 0.80, 95% CI: 0.63-1.02). We also evaluated these inflammation-related SNPs in NSCLC patients in never smokers. Lung cancer in never smokers has been increasingly recognized as distinct disease from that in ever-smokers. A two-stage study was performed using a discovery population from MD Anderson (411 patients) and a validation population from Mayo Clinic (311 patients). Three SNPs (IL17RA:rs879576, BMP8A:rs698141, and STK:rs290229) that were significantly associated with survival were validated (pCD74:rs1056400 and CD38:rs10805347) were borderline significant (p=0.08) in the Mayo Clinic population. In the combined analysis, IL17RA:rs879576 resulted in a 40% reduction in the risk for death (p=4.1 × 10-5 [p=0.61, heterogeneity test]). We also validated a survival tree created in MD Anderson population in the Mayo Clinic population. In conclusion, our results provided strong evidence that genetic variations in specific pathways that examined (miRNA and inflammation pathways) influenced clinical outcomes in NSCLC patients, and with further functional studies, the novel loci have potential to be translated into clinical use.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Nonsyndromic cleft lip with or without cleft palate (NSCLP) is a common birth defect with a multifactorial etiology. Despite decades of research, the genetic underpinnings of NSCLP still remain largely unexplained. A genome wide association study (GWAS) of a large NSCLP African American family with seven affected individuals across three generations found evidence for linkage at 8q21.3-24.12 (LOD = 2.98). This region contained three biologically relevant candidate genes: Frizzled-6 (FZD6) (LOD = 2.8), Matrilin-2 (MATN2) (LOD = 2.3), and Solute Carrier Family 25, Member 32 (SLC26A32) (LOD = 1.6). Sequencing of the coding regions and the 5’ and 3’ UTRs of these genes in two affected family members identified a rare intronic variant, rs138557689 (c.-153+432A>C), in FZD6. The rs138557689/C allele segregated with the NSCLP phenotype; in silico analysis predicted and EMSA analysis showed that the 138557689/C allele creates new DNA binding sites. FZD6 is part of the WNT pathway, which is involved in craniofacial development, including midface development and upper lip fusion. Our novel findings suggest that an alteration in FZD6 gene regulation may perturb this tightly controlled biological pathway and in turn contribute to the development of NSCLP in this family. Studies are underway to further define how the rs138557689/C variant affects expression of FZD6.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Introduction. Distant metastasis remains the leading cause of death among prostate cancer patients. Several genetic susceptibility loci associated with Prostate cancer have been identified by the Genome Wide Association Studies (GWAS). To date, few studies have explored the ability of these SNPs to identify metastatic prostate cancer. Based on the identification of genetic variants as predictors of aggressive disease, a case comparison study of prostate cancer patients was designed to explore the association of 96 GWAS single nucleotide polymorphisms (SNPs) with metastatic disease. ^ Method. 1242 histologically confirmed prostate cancer patients, with and without metastatic disease, were enrolled into the study. Data were collected from personal interviews, hospital database and abstraction of medical records. Ninety six SNPs identified from GWAS studies based on their associations with prostate cancer risk were genotyped in the study population. Univariate and multivariate logistic regression analyses were used to explore the relationships of the variants with metastatic prostate cancer in Whites and African American men. ^ Results. Four SNPs showed independent associations with metastatic prostate cancer (rs721048 in EHBP1 (2p15), rs3025039 in VEGF (6p12), rs11228565 in Intergenic(11q13.2) and rs2735839 in KLK3(19q13.33)) in the White population. For SNP rs2735839 in KLK3, genotype GA was 1.71 times as likely to be associated with metastatic prostate cancer diagnosis as genotype AA after adjusting for other significant SNPs and covariates (95% CI, 1.12-2.60; p=0.012). In men of African descent, three SNPs: rs1512268 in NKX3-1(8p21.2), rs12155172 in intergenic (7p15.3) & rs10486567 in JAZF1 (7p15.2) were positively associated with metastatic disease in the multivariate analysis. The strongest SNP was rs1512268 heterozygous genotype AG in NKX3-1(8p21.2) which was associated with 3.97-fold increased risk of metastatic prostate cancer diagnosis (95% CI, 1.69-9.34; p =0.002). ^ Conclusion. Genetic variants associated with metastatic prostate cancer were different in Whites and African American men. Given the high mortality rate recorded in men diagnosed with metastatic prostate tumor, further studies are needed to validate associations and establish their clinical application.^

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The interplay between obesity, physical activity, weight gain and genetic variants in mTOR pathway have not been studied in renal cell carcinoma (RCC). We examined the associations between obesity, weight gain, physical activity and RCC risk. We also analyzed whether genetic variants in the mTOR pathway could modify the association. Incident renal cell carcinoma cases and healthy controls were recruited from the University of Texas MD Anderson Cancer Center in Houston, Texas. Cases and controls were frequency-matched by age (±5 years), ethnicity, sex, and county of residence. Epidemiologic data were collected via in-person interview. A total of 577 cases and 593 healthy controls (all white) were included. One hundred ninety-two (192) SNPs from 22 genes were available and their genotyping data were extracted from previous genome-wide association studies. Logistic regression and regression spline were performed to obtain odds ratios. Obesity at age 20, 40, and 3 years prior to diagnosis/recruitment, and moderate and large weight gain from age 20 to 40 were each significantly associated with increased RCC risk. Low physical activity was associated with a 4.08-fold (95% CI: 2.92-5.70) increased risk. Five single nucleotide polymorphisms (SNPs) were significantly associated with RCC risk and their cumulative effect increased the risk by up to 72% (95% CI: 1.20-2.46). Strata specific effects for weight change and genotyping cumulative groups were observed. However, no interaction was suggested by our study. In conclusion, energy balance related risk factors and genetic variants in the mTOR pathway may jointly influence susceptibility to RCC. ^

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Pancreatic cancer is the 4th most common cause for cancer death in the United States, accompanied by less than 5% five-year survival rate based on current treatments, particularly because it is usually detected at a late stage. Identifying a high-risk population to launch an effective preventive strategy and intervention to control this highly lethal disease is desperately needed. The genetic etiology of pancreatic cancer has not been well profiled. We hypothesized that unidentified genetic variants by previous genome-wide association study (GWAS) for pancreatic cancer, due to stringent statistical threshold or missing interaction analysis, may be unveiled using alternative approaches. To achieve this aim, we explored genetic susceptibility to pancreatic cancer in terms of marginal associations of pathway and genes, as well as their interactions with risk factors. We conducted pathway- and gene-based analysis using GWAS data from 3141 pancreatic cancer patients and 3367 controls with European ancestry. Using the gene set ridge regression in association studies (GRASS) method, we analyzed 197 pathways from the Kyoto Encyclopedia of Genes and Genomes (KEGG) database. Using the logistic kernel machine (LKM) test, we analyzed 17906 genes defined by University of California Santa Cruz (UCSC) database. Using the likelihood ratio test (LRT) in a logistic regression model, we analyzed 177 pathways and 17906 genes for interactions with risk factors in 2028 pancreatic cancer patients and 2109 controls with European ancestry. After adjusting for multiple comparisons, six pathways were marginally associated with risk of pancreatic cancer ( P < 0.00025): Fc epsilon RI signaling, maturity onset diabetes of the young, neuroactive ligand-receptor interaction, long-term depression (Ps < 0.0002), and the olfactory transduction and vascular smooth muscle contraction pathways (P = 0.0002; Nine genes were marginally associated with pancreatic cancer risk (P < 2.62 × 10−5), including five reported genes (ABO, HNF1A, CLPTM1L, SHH and MYC), as well as four novel genes (OR13C4, OR 13C3, KCNA6 and HNF4 G); three pathways significantly interacted with risk factors on modifying the risk of pancreatic cancer (P < 2.82 × 10−4): chemokine signaling pathway with obesity ( P < 1.43 × 10−4), calcium signaling pathway (P < 2.27 × 10−4) and MAPK signaling pathway with diabetes (P < 2.77 × 10−4). However, none of the 17906 genes tested for interactions survived the multiple comparisons corrections. In summary, our current GWAS study unveiled unidentified genetic susceptibility to pancreatic cancer using alternative methods. These novel findings provide new perspectives on genetic susceptibility to and molecular mechanisms of pancreatic cancer, once confirmed, will shed promising light on the prevention and treatment of this disease. ^

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Complex diseases such as cancer result from multiple genetic changes and environmental exposures. Due to the rapid development of genotyping and sequencing technologies, we are now able to more accurately assess causal effects of many genetic and environmental factors. Genome-wide association studies have been able to localize many causal genetic variants predisposing to certain diseases. However, these studies only explain a small portion of variations in the heritability of diseases. More advanced statistical models are urgently needed to identify and characterize some additional genetic and environmental factors and their interactions, which will enable us to better understand the causes of complex diseases. In the past decade, thanks to the increasing computational capabilities and novel statistical developments, Bayesian methods have been widely applied in the genetics/genomics researches and demonstrating superiority over some regular approaches in certain research areas. Gene-environment and gene-gene interaction studies are among the areas where Bayesian methods may fully exert its functionalities and advantages. This dissertation focuses on developing new Bayesian statistical methods for data analysis with complex gene-environment and gene-gene interactions, as well as extending some existing methods for gene-environment interactions to other related areas. It includes three sections: (1) Deriving the Bayesian variable selection framework for the hierarchical gene-environment and gene-gene interactions; (2) Developing the Bayesian Natural and Orthogonal Interaction (NOIA) models for gene-environment interactions; and (3) extending the applications of two Bayesian statistical methods which were developed for gene-environment interaction studies, to other related types of studies such as adaptive borrowing historical data. We propose a Bayesian hierarchical mixture model framework that allows us to investigate the genetic and environmental effects, gene by gene interactions (epistasis) and gene by environment interactions in the same model. It is well known that, in many practical situations, there exists a natural hierarchical structure between the main effects and interactions in the linear model. Here we propose a model that incorporates this hierarchical structure into the Bayesian mixture model, such that the irrelevant interaction effects can be removed more efficiently, resulting in more robust, parsimonious and powerful models. We evaluate both of the 'strong hierarchical' and 'weak hierarchical' models, which specify that both or one of the main effects between interacting factors must be present for the interactions to be included in the model. The extensive simulation results show that the proposed strong and weak hierarchical mixture models control the proportion of false positive discoveries and yield a powerful approach to identify the predisposing main effects and interactions in the studies with complex gene-environment and gene-gene interactions. We also compare these two models with the 'independent' model that does not impose this hierarchical constraint and observe their superior performances in most of the considered situations. The proposed models are implemented in the real data analysis of gene and environment interactions in the cases of lung cancer and cutaneous melanoma case-control studies. The Bayesian statistical models enjoy the properties of being allowed to incorporate useful prior information in the modeling process. Moreover, the Bayesian mixture model outperforms the multivariate logistic model in terms of the performances on the parameter estimation and variable selection in most cases. Our proposed models hold the hierarchical constraints, that further improve the Bayesian mixture model by reducing the proportion of false positive findings among the identified interactions and successfully identifying the reported associations. This is practically appealing for the study of investigating the causal factors from a moderate number of candidate genetic and environmental factors along with a relatively large number of interactions. The natural and orthogonal interaction (NOIA) models of genetic effects have previously been developed to provide an analysis framework, by which the estimates of effects for a quantitative trait are statistically orthogonal regardless of the existence of Hardy-Weinberg Equilibrium (HWE) within loci. Ma et al. (2012) recently developed a NOIA model for the gene-environment interaction studies and have shown the advantages of using the model for detecting the true main effects and interactions, compared with the usual functional model. In this project, we propose a novel Bayesian statistical model that combines the Bayesian hierarchical mixture model with the NOIA statistical model and the usual functional model. The proposed Bayesian NOIA model demonstrates more power at detecting the non-null effects with higher marginal posterior probabilities. Also, we review two Bayesian statistical models (Bayesian empirical shrinkage-type estimator and Bayesian model averaging), which were developed for the gene-environment interaction studies. Inspired by these Bayesian models, we develop two novel statistical methods that are able to handle the related problems such as borrowing data from historical studies. The proposed methods are analogous to the methods for the gene-environment interactions on behalf of the success on balancing the statistical efficiency and bias in a unified model. By extensive simulation studies, we compare the operating characteristics of the proposed models with the existing models including the hierarchical meta-analysis model. The results show that the proposed approaches adaptively borrow the historical data in a data-driven way. These novel models may have a broad range of statistical applications in both of genetic/genomic and clinical studies.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

High-throughput assays, such as yeast two-hybrid system, have generated a huge amount of protein-protein interaction (PPI) data in the past decade. This tremendously increases the need for developing reliable methods to systematically and automatically suggest protein functions and relationships between them. With the available PPI data, it is now possible to study the functions and relationships in the context of a large-scale network. To data, several network-based schemes have been provided to effectively annotate protein functions on a large scale. However, due to those inherent noises in high-throughput data generation, new methods and algorithms should be developed to increase the reliability of functional annotations. Previous work in a yeast PPI network (Samanta and Liang, 2003) has shown that the local connection topology, particularly for two proteins sharing an unusually large number of neighbors, can predict functional associations between proteins, and hence suggest their functions. One advantage of the work is that their algorithm is not sensitive to noises (false positives) in high-throughput PPI data. In this study, we improved their prediction scheme by developing a new algorithm and new methods which we applied on a human PPI network to make a genome-wide functional inference. We used the new algorithm to measure and reduce the influence of hub proteins on detecting functionally associated proteins. We used the annotations of the Gene Ontology (GO) and the Kyoto Encyclopedia of Genes and Genomes (KEGG) as independent and unbiased benchmarks to evaluate our algorithms and methods within the human PPI network. We showed that, compared with the previous work from Samanta and Liang, our algorithm and methods developed in this study improved the overall quality of functional inferences for human proteins. By applying the algorithms to the human PPI network, we obtained 4,233 significant functional associations among 1,754 proteins. Further comparisons of their KEGG and GO annotations allowed us to assign 466 KEGG pathway annotations to 274 proteins and 123 GO annotations to 114 proteins with estimated false discovery rates of <21% for KEGG and <30% for GO. We clustered 1,729 proteins by their functional associations and made pathway analysis to identify several subclusters that are highly enriched in certain signaling pathways. Particularly, we performed a detailed analysis on a subcluster enriched in the transforming growth factor β signaling pathway (P<10-50) which is important in cell proliferation and tumorigenesis. Analysis of another four subclusters also suggested potential new players in six signaling pathways worthy of further experimental investigations. Our study gives clear insight into the common neighbor-based prediction scheme and provides a reliable method for large-scale functional annotations in this post-genomic era.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The genetic etiology of stroke likely reflects the influence of multiple loci with small effects, each modulating different pathophysiological processes. This research project utilized three analytical strategies to address the paucity of information related to the identification and characterization of genetic variation associated with stroke in the general population. ^ First, the general contribution of familial factors to stroke susceptibility was evaluated in a population-based sample of unrelated individuals. Increased risk of subclinical cerebral infarction was observed among individuals with a positive parental history of stroke. This association did not appear to be mediated by established stroke risk factors, specifically blood pressure levels or hypertension status. ^ The need to identify specific gene variation associated with stroke in the general population was addressed by evaluating seven candidate gene polymorphisms in a population-based sample of unrelated individuals. Three polymorphisms were significantly associated with increased subclinical cerebral infarction or incident clinical ischemic stroke risk. These relationships include the G-protein β3 subunit 825C/T polymorphism and clinical stroke in Whites, the lipoprotein lipase S/X447 polymorphism and subclinical and clinical stroke in men, and the angiotensin I-converting enzyme Ins/Del polymorphism and subclinical stroke in White men. These associations did not appear to be obfuscated by the stroke risk factors adjusted for in the analysis models specifically blood pressure levels or anti-hypertensive medication use. ^ The final research strategy considered, on a genome-wide scale, the idea that genetic variation may contribute to the occurrence of hypertension or stroke through a common etiologic pathway. Genomic regions were identified for which significant evidence of heterogeneity was observed among hypertensive sibpairs stratified by family history of stroke information. Regions identified on chromosome 15 in African Americans, and chromosome 13 in Whites and African Americans, suggest the presence of genes influencing hypertension and stroke susceptibility. ^ Insight into the role of genetics in stroke is useful for the potential early identification of individuals at increased risk for stroke and improved understanding of the etiology of the disease. The ultimate goal of these endeavors is to guide the development of therapeutic intervention and informed prevention to provide a lasting and positive impact on public health. ^