7 resultados para QUANTITATIVE TRAITS
em DigitalCommons@The Texas Medical Center
Resumo:
Linkage disequilibrium methods can be used to find genes influencing quantitative trait variation in humans. Linkage disequilibrium methods can require smaller sample sizes than linkage equilibrium methods, such as the variance component approach to find loci with a specific effect size. The increase in power is at the expense of requiring more markers to be typed to scan the entire genome. This thesis compares different linkage disequilibrium methods to determine which factors influence the power to detect disequilibrium. The costs of disequilibrium and equilibrium tests were compared to determine whether the savings in phenotyping costs when using disequilibrium methods outweigh the additional genotyping costs.^ Nine linkage disequilibrium tests were examined by simulation. Five tests involve selecting isolated unrelated individuals while four involved the selection of parent child trios (TDT). All nine tests were found to be able to identify disequilibrium with the correct significance level in Hardy-Weinberg populations. Increasing linked genetic variance and trait allele frequency were found to increase the power to detect disequilibrium, while increasing the number of generations and distance between marker and trait loci decreased the power to detect disequilibrium. Discordant sampling was used for several of the tests. It was found that the more stringent the sampling, the greater the power to detect disequilibrium in a sample of given size. The power to detect disequilibrium was not affected by the presence of polygenic effects.^ When the trait locus had more than two trait alleles, the power of the tests maximized to less than one. For the simulation methods used here, when there were more than two-trait alleles there was a probability equal to 1-heterozygosity of the marker locus that both trait alleles were in disequilibrium with the same marker allele, resulting in the marker being uninformative for disequilibrium.^ The five tests using isolated unrelated individuals were found to have excess error rates when there was disequilibrium due to population admixture. Increased error rates also resulted from increased unlinked major gene effects, discordant trait allele frequency, and increased disequilibrium. Polygenic effects did not affect the error rates. The TDT, Transmission Disequilibrium Test, based tests were not liable to any increase in error rates.^ For all sample ascertainment costs, for recent mutations ($<$100 generations) linkage disequilibrium tests were less expensive than the variance component test to carry out. Candidate gene scans saved even more money. The use of recently admixed populations also decreased the cost of performing a linkage disequilibrium test. ^
Resumo:
Obesity is a complex multifactorial disease and is a public health priority. Perilipin coats the surface of lipid droplets in adipocytes and is believed to stabilize these lipid bodies by protecting triglyceride from early lipolysis. This research project evaluated the association between genetic variation within the human perilipin (PLIN) gene and obesity-related quantitative traits and disease-related phenotypes in Non-Hispanic White (NHW) and African American (AA) participants from the Atherosclerosis Risk in Communities (ARIC) Study. ^ Multivariate linear regression, multivariate logistic regression, and Cox proportional hazards models evaluated the association between single gene variants (rs2304794, rs894160, rs8179071, and rs2304795) and multilocus variation (rs894160 and rs2304795) within the PLIN gene and both obesity-related quantitative traits (body weight, body mass index [BMI], waist girth, waist-to-hip ratio [WHR], estimated percent body fat, and plasma total triglycerides) and disease-related phenotypes (prevalent obesity, metabolic syndrome [MetS], prevalent coronary heart disease [CHD], and incident CHD). Single variant analyses were stratified by race and gender within race while multilocus analyses were stratified by race. ^ Single variant analyses revealed that rs2304794 and rs894160 were significantly related to plasma triglyceride levels in all NHWs and NHW women. Among AA women, variant rs8179071 was associated with triglyceride levels and rs2304794 was associated with risk-raising waist circumference (>0.8 in women). The multilocus effects of variants rs894160 and rs2304795 were significantly associated with body weight, waist girth, WHR, estimated percent body fat, class II obesity (BMI ≥ 35 kg/m2), class III obesity (BMI ≥ 35 kg/m2), and risk-raising WHR (>0.9 in men and >0.8 in women) in AAs. Variant rs2304795 was significantly related to prevalent MetS among AA males and prevalent CHD in NHW women; multilocus effects of the PLIN gene were associated with prevalent CHD among NHWs. Rs2304794 was associated with incident CHD in the absence of the MetS among AAs. These findings support the hypothesis that variation within the PLIN gene influences obesity-related traits and disease-related phenotypes. ^ Understanding these effects of the PLIN genotype on the development of obesity can potentially lead to tailored health promotion interventions that are more effective. ^
Resumo:
My dissertation focuses on developing methods for gene-gene/environment interactions and imprinting effect detections for human complex diseases and quantitative traits. It includes three sections: (1) generalizing the Natural and Orthogonal interaction (NOIA) model for the coding technique originally developed for gene-gene (GxG) interaction and also to reduced models; (2) developing a novel statistical approach that allows for modeling gene-environment (GxE) interactions influencing disease risk, and (3) developing a statistical approach for modeling genetic variants displaying parent-of-origin effects (POEs), such as imprinting. In the past decade, genetic researchers have identified a large number of causal variants for human genetic diseases and traits by single-locus analysis, and interaction has now become a hot topic in the effort to search for the complex network between multiple genes or environmental exposures contributing to the outcome. Epistasis, also known as gene-gene interaction is the departure from additive genetic effects from several genes to a trait, which means that the same alleles of one gene could display different genetic effects under different genetic backgrounds. In this study, we propose to implement the NOIA model for association studies along with interaction for human complex traits and diseases. We compare the performance of the new statistical models we developed and the usual functional model by both simulation study and real data analysis. Both simulation and real data analysis revealed higher power of the NOIA GxG interaction model for detecting both main genetic effects and interaction effects. Through application on a melanoma dataset, we confirmed the previously identified significant regions for melanoma risk at 15q13.1, 16q24.3 and 9p21.3. We also identified potential interactions with these significant regions that contribute to melanoma risk. Based on the NOIA model, we developed a novel statistical approach that allows us to model effects from a genetic factor and binary environmental exposure that are jointly influencing disease risk. Both simulation and real data analyses revealed higher power of the NOIA model for detecting both main genetic effects and interaction effects for both quantitative and binary traits. We also found that estimates of the parameters from logistic regression for binary traits are no longer statistically uncorrelated under the alternative model when there is an association. Applying our novel approach to a lung cancer dataset, we confirmed four SNPs in 5p15 and 15q25 region to be significantly associated with lung cancer risk in Caucasians population: rs2736100, rs402710, rs16969968 and rs8034191. We also validated that rs16969968 and rs8034191 in 15q25 region are significantly interacting with smoking in Caucasian population. Our approach identified the potential interactions of SNP rs2256543 in 6p21 with smoking on contributing to lung cancer risk. Genetic imprinting is the most well-known cause for parent-of-origin effect (POE) whereby a gene is differentially expressed depending on the parental origin of the same alleles. Genetic imprinting affects several human disorders, including diabetes, breast cancer, alcoholism, and obesity. This phenomenon has been shown to be important for normal embryonic development in mammals. Traditional association approaches ignore this important genetic phenomenon. In this study, we propose a NOIA framework for a single locus association study that estimates both main allelic effects and POEs. We develop statistical (Stat-POE) and functional (Func-POE) models, and demonstrate conditions for orthogonality of the Stat-POE model. We conducted simulations for both quantitative and qualitative traits to evaluate the performance of the statistical and functional models with different levels of POEs. Our results showed that the newly proposed Stat-POE model, which ensures orthogonality of variance components if Hardy-Weinberg Equilibrium (HWE) or equal minor and major allele frequencies is satisfied, had greater power for detecting the main allelic additive effect than a Func-POE model, which codes according to allelic substitutions, for both quantitative and qualitative traits. The power for detecting the POE was the same for the Stat-POE and Func-POE models under HWE for quantitative traits.
Resumo:
Cardiovascular disease (CVD) is a threat to public health. It has been reported to be the leading cause of death in United States. The invention of next generation sequencing (NGS) technology has revolutionized the biomedical research. To investigate NGS data of CVD related quantitative traits would contribute to address the unknown etiology and disease mechanism of CVD. NHLBI's Exome Sequencing Project (ESP) contains CVD related phenotypes and their associated NGS exomes sequence data. Initially, a subset of next generation sequencing data consisting of 13 CVD-related quantitative traits was investigated. Only 6 traits, systolic blood pressure (SBP), diastolic blood pressure (DBP), height, platelet counts, waist circumference, and weight, were analyzed by functional linear model (FLM) and 7 currently existing methods. FLM outperformed all currently existing methods by identifying the highest number of significant genes and had identified 96, 139, 756, 1162, 1106, and 298 genes associated with SBP, DBP, Height, Platelet, Waist, and Weight respectively. ^
Resumo:
Obesity and related chronic diseases represent a tremendous public health burden among Mexican Americans, a young and rapidly-expanding population. This study investigated the impact of variation within eight candidate obesity genes, which include leptin (LEP), leptin receptor (LEPR), neuropeptide Y (NPY), NPYY1 receptor (NPYY1), glucagon-like peptide-1 (GLP-1), GLP-1 receptor (GLP1R), beta-3 adrenergic receptor (β3AR), and uncoupling protein (UCP1), on variation in human obesity status and/or quantitative traits related to obesity in Mexican Americans from Starr County, Texas. The Trp64Arg polymorphism within β3AR was typed in 820 random individuals and 240 pedigrees (N = 2,044). The Arg allele frequency was significantly greater in obese versus non-obese individuals (0.20 versus 0. 15, respectively). In addition, within the random sample, the Arg allele was associated with significantly greater body weight (p = 0.031) and body mass index (BMI, p = 0.008) than the Trp allele. In the family sample, the Trp64Arg locus was also linked to percent fat (p = 0.045) but not to body weight or BMI. No linkage between obesity, diabetes, hypertension, or gallbladder disease and the Trp64Arg mutation was observed in families using affected sib pair linkage analysis or the transmission disequilibrium test. Microsatellite markers proximate to the remaining seven genes were typed in 302 individuals from 59 families. Sib pair linkage analysis provided evidence for linkage between obesity and NPY within affected sibling pairs (p = 0.042; n = 170 pairs). NPY was also linked to weight (p = 0.020), abdominal circumference (p = 0.031), hip circumference (p = 0.012), DBP (p ≤ 0.005), and a composite measure of body mass/fat (p ≤ 0.048) in all sibling pairs (n = 545 pairs). Additionally, LEP was linked to waist/hip ratio (p ≤ 0.009), total cholesterol (p ≤ 0.030), and HDL cholesterol (p ≤ 0.026), and LEPR was linked to fasting blood glucose (p ≤ 0.018) and DBP (p ≤ 0.003). Subsequent to the linkage analyses, the NPY gene was sequenced and eight variant sites identified. Two variant sites (-880I/D and 69I/D) were typed in a random sample of 914 individuals. The 880I/D variant was significantly associated with waist/hip ratio (p = 0.035) in the entire sample (N = 914) and with BMI (p = 0. 031), abdominal circumference (p = 0.044), and waist/hip ratio (p = 0.041) in a non-obese subsample (BW < 30 kg/m2, n = 594). The 69I/D variant was a rare mutation observed in only one pedigree and was not associated with obesity or body size/mass within this pedigree. Results of this study indicate that variation at or near β3AR, LEP, LEPR, and NPY may exert effects which increase obesity susceptibility and influence obesity-related measures in this population. ^
Resumo:
Next-generation DNA sequencing platforms can effectively detect the entire spectrum of genomic variation and is emerging to be a major tool for systematic exploration of the universe of variants and interactions in the entire genome. However, the data produced by next-generation sequencing technologies will suffer from three basic problems: sequence errors, assembly errors, and missing data. Current statistical methods for genetic analysis are well suited for detecting the association of common variants, but are less suitable to rare variants. This raises great challenge for sequence-based genetic studies of complex diseases.^ This research dissertation utilized genome continuum model as a general principle, and stochastic calculus and functional data analysis as tools for developing novel and powerful statistical methods for next generation of association studies of both qualitative and quantitative traits in the context of sequencing data, which finally lead to shifting the paradigm of association analysis from the current locus-by-locus analysis to collectively analyzing genome regions.^ In this project, the functional principal component (FPC) methods coupled with high-dimensional data reduction techniques will be used to develop novel and powerful methods for testing the associations of the entire spectrum of genetic variation within a segment of genome or a gene regardless of whether the variants are common or rare.^ The classical quantitative genetics suffer from high type I error rates and low power for rare variants. To overcome these limitations for resequencing data, this project used functional linear models with scalar response to develop statistics for identifying quantitative trait loci (QTLs) for both common and rare variants. To illustrate their applications, the functional linear models were applied to five quantitative traits in Framingham heart studies. ^ This project proposed a novel concept of gene-gene co-association in which a gene or a genomic region is taken as a unit of association analysis and used stochastic calculus to develop a unified framework for testing the association of multiple genes or genomic regions for both common and rare alleles. The proposed methods were applied to gene-gene co-association analysis of psoriasis in two independent GWAS datasets which led to discovery of networks significantly associated with psoriasis.^
Resumo:
Radiotherapy involving the thoracic cavity and chemotherapy with the drug bleomycin are both dose limited by the development of pulmonary fibrosis. From evidence that there is variation in the population in susceptibility to pulmonary fibrosis, and animal data, it was hypothesized that individual variation in susceptibility to bleomycin-induced, or radiation-induced, pulmonary fibrosis is, in part, genetically controlled. In this thesis a three generation mouse genetic model of C57BL/6J (fibrosis prone) and C3Hf/Kam (fibrosis resistant) mouse strains and F1 and F2 (F1 intercross) progeny derived from the parental strains was developed to investigate the genetic basis of susceptibility to fibrosis. In the bleomycin studies the mice received 100 mg/kg (125 for females) of bleomycin, via mini osmotic pump. The animals were sacrificed at eight weeks following treatment or when their breathing rate indicated respiratory distress. In the radiation studies the mice were given a single dose of 14 or 16 Gy (Co$\sp{60})$ to the whole thorax and were sacrificed when moribund. The phenotype was defined as the percent of fibrosis area in the left lung as quantified with image analysis of histological sections. Quantitative trait loci (QTL) mapping was used to identify the chromosomal location of genes which contribute to susceptibility to bleomycin-induced pulmonary fibrosis in C57BL/6J mice compared to C3Hf/Kam mice and to determine if the QTL's which influence susceptibility to bleomycin-induced lung fibrosis in these progenitor strains could be implicated in susceptibility to radiation-induced lung fibrosis. For bleomycin, a genome wide scan revealed QTL's on chromosome 17, at the MHC, (LOD = 11.7 for males and 7.2 for females) accounting for approximately 21% of the phenotypic variance, and on chromosome 11 (LOD = 4.9), in male mice only, adding 8% of phenotypic variance. The bleomycin QTL on chromosome 17 was also implicated for susceptibility to radiation-induced fibrosis (LOD = 5.0) and contributes 7% of the phenotypic variance in the radiation study. In conclusion, susceptibility to both bleomycin-induced and radiation-induced pulmonary fibrosis are heritable traits, and are influenced by a genetic factor which maps to a genomic region containing the MHC. ^