972 resultados para Single-nucleotide-polymorphism
Resumo:
We present the most comprehensive comparison to date of the predictive benefit of genetics in addition to currently used clinical variables, using genotype data for 33 single-nucleotide polymorphisms (SNPs) in 1,547 Caucasian men from the placebo arm of the REduction by DUtasteride of prostate Cancer Events (REDUCE®) trial. Moreover, we conducted a detailed comparison of three techniques for incorporating genetics into clinical risk prediction. The first method was a standard logistic regression model, which included separate terms for the clinical covariates and for each of the genetic markers. This approach ignores a substantial amount of external information concerning effect sizes for these Genome Wide Association Study (GWAS)-replicated SNPs. The second and third methods investigated two possible approaches to incorporating meta-analysed external SNP effect estimates - one via a weighted PCa 'risk' score based solely on the meta analysis estimates, and the other incorporating both the current and prior data via informative priors in a Bayesian logistic regression model. All methods demonstrated a slight improvement in predictive performance upon incorporation of genetics. The two methods that incorporated external information showed the greatest receiver-operating-characteristic AUCs increase from 0.61 to 0.64. The value of our methods comparison is likely to lie in observations of performance similarities, rather than difference, between three approaches of very different resource requirements. The two methods that included external information performed best, but only marginally despite substantial differences in complexity.
Resumo:
Discussion on improving the power of genome-wide association studies to identify candidate variants and genes is generally centered on issues of maximizing sample size; less attention is given to the role of phenotype definition and ascertainment. The authors used genome-wide data from patients infected with human immunodeficiency virus type 1 (HIV-1) to assess whether differences in type of population (622 seroconverters vs. 636 seroprevalent subjects) or the number of measurements available for defining the phenotype resulted in differences in the effect sizes of associations between single nucleotide polymorphisms and the phenotype, HIV-1 viral load at set point. The effect estimate for the top 100 single nucleotide polymorphisms was 0.092 (95% confidence interval: 0.074, 0.110) log(10) viral load (log(10) copies of HIV-1 per mL of blood) greater in seroconverters than in seroprevalent subjects. The difference was even larger when the authors focused on chromosome 6 variants (0.153 log(10) viral load) or on variants that achieved genome-wide significance (0.232 log(10) viral load). The estimates of the genetic effects tended to be slightly larger when more viral load measurements were available, particularly among seroconverters and for variants that achieved genome-wide significance. Differences in phenotype definition and ascertainment may affect the estimated magnitude of genetic effects and should be considered in optimizing power for discovering new associations.
Resumo:
Genetic variants influence the risk to develop certain diseases or give rise to differences in drug response. Recent progresses in cost-effective, high-throughput genome-wide techniques, such as microarrays measuring Single Nucleotide Polymorphisms (SNPs), have facilitated genotyping of large clinical and population cohorts. Combining the massive genotypic data with measurements of phenotypic traits allows for the determination of genetic differences that explain, at least in part, the phenotypic variations within a population. So far, models combining the most significant variants can only explain a small fraction of the variance, indicating the limitations of current models. In particular, researchers have only begun to address the possibility of interactions between genotypes and the environment. Elucidating the contributions of such interactions is a difficult task because of the large number of genetic as well as possible environmental factors.In this thesis, I worked on several projects within this context. My first and main project was the identification of possible SNP-environment interactions, where the phenotypes were serum lipid levels of patients from the Swiss HIV Cohort Study (SHCS) treated with antiretroviral therapy. Here the genotypes consisted of a limited set of SNPs in candidate genes relevant for lipid transport and metabolism. The environmental variables were the specific combinations of drugs given to each patient over the treatment period. My work explored bioinformatic and statistical approaches to relate patients' lipid responses to these SNPs, drugs and, importantly, their interactions. The goal of this project was to improve our understanding and to explore the possibility of predicting dyslipidemia, a well-known adverse drug reaction of antiretroviral therapy. Specifically, I quantified how much of the variance in lipid profiles could be explained by the host genetic variants, the administered drugs and SNP-drug interactions and assessed the predictive power of these features on lipid responses. Using cross-validation stratified by patients, we could not validate our hypothesis that models that select a subset of SNP-drug interactions in a principled way have better predictive power than the control models using "random" subsets. Nevertheless, all models tested containing SNP and/or drug terms, exhibited significant predictive power (as compared to a random predictor) and explained a sizable proportion of variance, in the patient stratified cross-validation context. Importantly, the model containing stepwise selected SNP terms showed higher capacity to predict triglyceride levels than a model containing randomly selected SNPs. Dyslipidemia is a complex trait for which many factors remain to be discovered, thus missing from the data, and possibly explaining the limitations of our analysis. In particular, the interactions of drugs with SNPs selected from the set of candidate genes likely have small effect sizes which we were unable to detect in a sample of the present size (<800 patients).In the second part of my thesis, I performed genome-wide association studies within the Cohorte Lausannoise (CoLaus). I have been involved in several international projects to identify SNPs that are associated with various traits, such as serum calcium, body mass index, two-hour glucose levels, as well as metabolic syndrome and its components. These phenotypes are all related to major human health issues, such as cardiovascular disease. I applied statistical methods to detect new variants associated with these phenotypes, contributing to the identification of new genetic loci that may lead to new insights into the genetic basis of these traits. This kind of research will lead to a better understanding of the mechanisms underlying these pathologies, a better evaluation of disease risk, the identification of new therapeutic leads and may ultimately lead to the realization of "personalized" medicine.
Resumo:
Although experimental studies have suggested that insulin-like growth factor I (IGF-I) and its binding protein IGFBP-3 might have a role in the aetiology of coronary artery disease (CAD), the relevance of circulating IGFs and their binding proteins in the development of CAD in human populations is unclear. We conducted a nested case-control study, with a mean follow-up of six years, within the EPIC-Norfolk cohort to assess the association between circulating levels of IGF-I and IGFBP-3 and risk of CAD in up to 1,013 cases and 2,055 controls matched for age, sex and study enrolment date. After adjustment for cardiovascular risk factors, we found no association between circulating levels of IGF-I or IGFBP-3 and risk of CAD (odds ratio: 0.98 (95% Cl 0.90-1.06) per 1 SD increase in circulating IGF-I; odds ratio: 1.02 (95% Cl 0.94-1.12) for IGFBP-3). We examined associations between tagging single nucleotide polymorphisms (tSNPs) at the IGF1 and IGFBP3 loci and circulating IGF-I and IGFBP-3 levels in up to 1,133 cases and 2,223 controls and identified three tSNPs (rs1520220, rs3730204, rs2132571) that showed independent association with either circulating IGF-I or IGFBP-3 levels. In an assessment of 31 SNPs spanning the IGF1 or IGFBP3 loci, none were associated with risk of CAD in a meta-analysis that included EPIC-Norfolk and eight additional studies comprising up to 9,319 cases and 19,964 controls. Our results indicate that IGF-I and IGFBP-3 are unlikely to be importantly involved in the aetiology of CAD in human populations.
Resumo:
Pseudomonas knackmussii B13 was the first strain to be isolated in 1974 that could degrade chlorinated aromatic hydrocarbons. This discovery was the prologue for subsequent characterization of numerous bacterial metabolic pathways, for genetic and biochemical studies, and which spurred ideas for pollutant bioremediation. In this study, we determined the complete genome sequence of B13 using next generation sequencing technologies and optical mapping. Genome annotation indicated that B13 has a variety of metabolic pathways for degrading monoaromatic hydrocarbons including chlorobenzoate, aminophenol, anthranilate and hydroxyquinol, but not polyaromatic compounds. Comparative genome analysis revealed that B13 is closest to Pseudomonas denitrificans and Pseudomonas aeruginosa. The B13 genome contains at least eight genomic islands [prophages and integrative conjugative elements (ICEs)], which were absent in closely related pseudomonads. We confirm that two ICEs are identical copies of the 103 kb self-transmissible element ICEclc that carries the genes for chlorocatechol metabolism. Comparison of ICEclc showed that it is composed of a variable and a 'core' region, which is very conserved among proteobacterial genomes, suggesting a widely distributed family of so far uncharacterized ICE. Resequencing of two spontaneous B13 mutants revealed a number of single nucleotide substitutions, as well as excision of a large 220 kb region and a prophage that drastically change the host metabolic capacity and survivability.
Resumo:
Background: V itamin D insufficiency has been associated with the occurrence of various types of cancer, but causal relationships remain elusive. Methods: Associations between t he r isk o f HCV-related HCC development and CYP2R1 , GC, and DHCR7 genotypes, which are genetic determinants of reduced 25-OH-vitamin D3 (25[OH]D3) serum levels, were determined. Results: A t otal of 5604 HCV-infected patients, 1279 with a nd 4325 without progression to HCC, w ere identified. The well-known association between 25(OH)D3 s erum levels and variations in CYP2R1 ( rs1993116, rs10741657), GC ( rs2282679), a nd DHCR7 ( rs7944926, rs12785878) g enotypes was also apparent in patients w ith chronic hepatitis C. The same genotypes of t hese single nucleotide polymorphisms (SNPs), w hich are associated with reduced 25(OH)D3 s erum levels, were significantly associated with HCV-associated HCC (P=0.07 [OR=1.13] for CYP2R1 , P=0.007 [OR=1.56] for GC, P=0.003 [OR=1.42] for DHCR7; ORs for risk genotypes). In contrast, no association between t hese genetic variations and the o utcome of antiviral therapy with pegylated interferon-α and ribavirin ( P>0.2 for e ach SNP) or liver fibrosis progression rate (P>0.2 for each SNP) was observed, s uggesting a specific influence o f the genetic d eterminants of 25(OH)D3 s erum levels o n hepatocarcinogenesis. Conclusions: Our data suggest a relatively weak but functionally relevant role for vitamin D in the prevention of HCV-related HCC development. Controlled clinical trials to assess the benefit of vitamin D supplementation in HCVinfected patients with advanced liver fibrosis or cirrhosis are warranted.
Resumo:
BACKGROUND: Genetic factors play a role in chronic obstructive pulmonary disease (COPD) but are poorly understood. A number of candidate genes have been proposed on the basis of the pathogenesis of COPD. These include the matrix metalloproteinase (MMP) genes which play a role in tissue remodelling and fit in with the protease--antiprotease imbalance theory for the cause of COPD. Previous genetic studies of MMPs in COPD have had inadequate coverage of the genes, and have reported conflicting associations of both single nucleotide polymorphisms (SNPs) and SNP haplotypes, plausibly due to under-powered studies. METHODS: To address these issues we genotyped 26 SNPs, providing comprehensive coverage of reported SNP variation, in MMPs- 1, 9 and 12 from 977 COPD patients and 876 non-diseased smokers of European descent and evaluated their association with disease singly and in haplotype combinations. We used logistic regression to adjust for age, gender, centre and smoking history. RESULTS: Haplotypes of two SNPs in MMP-12 (rs652438 and rs2276109), showed an association with severe/very severe disease, corresponding to GOLD Stages III and IV. CONCLUSIONS: Those with the common A-A haplotype for these two SNPs were at greater risk of developing severe/very severe disease (p = 0.0039) while possession of the minor G variants at either SNP locus had a protective effect (adjusted odds ratio of 0.76; 95% CI 0.61 - 0.94). The A-A haplotype was also associated with significantly lower predicted FEV1 (42.62% versus 44.79%; p = 0.0129). This implicates haplotypes of MMP-12 as modifiers of disease severity.
Resumo:
BACKGROUND: Persons infected with human immunodeficiency virus (HIV) have increased rates of coronary artery disease (CAD). The relative contribution of genetic background, HIV-related factors, antiretroviral medications, and traditional risk factors to CAD has not been fully evaluated in the setting of HIV infection. METHODS: In the general population, 23 common single-nucleotide polymorphisms (SNPs) were shown to be associated with CAD through genome-wide association analysis. Using the Metabochip, we genotyped 1875 HIV-positive, white individuals enrolled in 24 HIV observational studies, including 571 participants with a first CAD event during the 9-year study period and 1304 controls matched on sex and cohort. RESULTS: A genetic risk score built from 23 CAD-associated SNPs contributed significantly to CAD (P = 2.9 × 10(-4)). In the final multivariable model, participants with an unfavorable genetic background (top genetic score quartile) had a CAD odds ratio (OR) of 1.47 (95% confidence interval [CI], 1.05-2.04). This effect was similar to hypertension (OR = 1.36; 95% CI, 1.06-1.73), hypercholesterolemia (OR = 1.51; 95% CI, 1.16-1.96), diabetes (OR = 1.66; 95% CI, 1.10-2.49), ≥ 1 year lopinavir exposure (OR = 1.36; 95% CI, 1.06-1.73), and current abacavir treatment (OR = 1.56; 95% CI, 1.17-2.07). The effect of the genetic risk score was additive to the effect of nongenetic CAD risk factors, and did not change after adjustment for family history of CAD. CONCLUSIONS: In the setting of HIV infection, the effect of an unfavorable genetic background was similar to traditional CAD risk factors and certain adverse antiretroviral exposures. Genetic testing may provide prognostic information complementary to family history of CAD.
Resumo:
La industria de la producción de camarón es una de las industrias acuícolas que se encuentra en más crecimiento en la actualidad. Los estudios para encontrar marcadores genéticos son muy efectivos para la mejora de sus propiedades y de gran interés para los productores de camarón. En este trabajo se utilizaron seis individuos de una población de Litopenaeus vannamei, donde se encontraron cuatro polimorfismos de nucleótido único (SNPs) en el gen 5HT1R (5-hidroxitriptamina receptor1) y un SNP en el gen STAT (transductor de señal y activador de la transcripción). Sin embargo, el polimorfismo en el gen STAT resultó ser homocigoto en una población diferente utilizada para análisis de asociación. Los presentes análisis revelaron que el alelo C, en dos polimorfismos SNP (C109T y C395G) del gen 5HT1R, tiende a estar asociado con el aumento del peso corporal. Consideramos que hay necesidad de hacer nuevos estudios utilizando una muestra más amplia y diversa de la población en cuestión.
Resumo:
BACKGROUND: Single-nucleotide polymorphisms (SNPs) in immune genes have been associated with susceptibility to invasive mold infection (IMI) among hematopoietic stem cell but not solid-organ transplant (SOT) recipients. METHODS: Twenty-four SNPs from systematically selected genes were genotyped among 1101 SOT recipients (715 kidney transplant recipients, 190 liver transplant recipients, 102 lung transplant recipients, 79 heart transplant recipients, and 15 recipients of other transplants) from the Swiss Transplant Cohort Study. Association between SNPs and the end point were assessed by log-rank test and Cox regression models. Cytokine production upon Aspergillus stimulation was measured by enzyme-linked immunosorbent assay in peripheral blood mononuclear cells (PBMCs) from healthy volunteers and correlated with relevant genotypes. RESULTS: Mold colonization (n = 45) and proven/probable IMI (n = 26) were associated with polymorphisms in the genes encoding interleukin 1β (IL1B; rs16944; recessive mode, P = .001 for colonization and P = .00005 for IMI, by the log-rank test), interleukin 1 receptor antagonist (IL1RN; rs419598; P = .01 and P = .02, respectively), and β-defensin 1 (DEFB1; rs1800972; P = .001 and P = .0002, respectively). The associations with IL1B and DEFB1 remained significant in a multivariate regression model (P = .002 for IL1B rs16944; P = .01 for DEFB1 rs1800972). The presence of 2 copies of the rare allele of rs16944 or rs419598 was associated with reduced Aspergillus-induced interleukin 1β and tumor necrosis factor α secretion by PBMCs. CONCLUSIONS: Functional polymorphisms in IL1B and DEFB1 influence susceptibility to mold infection in SOT recipients. This observation may contribute to individual risk stratification.
Resumo:
There is growing public concern about reducing saturated fat intake. Stearoyl-CoA desaturase (SCD) is the lipogenic enzyme responsible for the biosynthesis of oleic acid (18:1) by desaturating stearic acid (18:0). Here we describe a total of 18 mutations in the promoter and 3′ non-coding region of the pig SCD gene and provide evidence that allele T at AY487830:g.2228T>C in the promoter region enhances fat desaturation (the ratio 18:1/18:0 in muscle increases from 3.78 to 4.43 in opposite homozygotes) without affecting fat content (18:0+18:1, intramuscular fat content, and backfat thickness). No mutations that could affect the functionality of the protein were found in the coding region. First, we proved in a purebred Duroc line that the C-T-A haplotype of the 3 single nucleotide polymorphisms (SNPs) (g.2108C>T; g.2228T>C; g.2281A>G) of the promoter region was additively associated to enhanced 18:1/18:0 both in muscle and subcutaneous fat, but not in liver. We show that this association was consistent over a 10-year period of overlapping generations and, in line with these results, that the C-T-A haplotype displayed greater SCD mRNA expression in muscle. The effect of this haplotype was validated both internally, by comparing opposite homozygote siblings, and externally, by using experimental Duroc-based crossbreds. Second, the g.2281A>G and the g.2108C>T SNPs were excluded as causative mutations using new and previously published data, restricting the causality to g.2228T>C SNP, the last source of genetic variation within the haplotype. This mutation is positioned in the core sequence of several putative transcription factor binding sites, so that there are several plausible mechanisms by which allele T enhances 18:1/18:0 and, consequently, the proportion of monounsaturated to saturated fat.
Resumo:
Genome-wide association studies (GWASs) have identified multiple loci associated with cross-sectional eGFR, but a systematic genetic analysis of kidney function decline over time is missing. Here we conducted a GWAS meta-analysis among 63,558 participants of European descent, initially from 16 cohorts with serial kidney function measurements within the CKDGen Consortium, followed by independent replication among additional participants from 13 cohorts. In stage 1 GWAS meta-analysis, single-nucleotide polymorphisms (SNPs) at MEOX2, GALNT11, IL1RAP, NPPA, HPCAL1, and CDH23 showed the strongest associations for at least one trait, in addition to the known UMOD locus, which showed genome-wide significance with an annual change in eGFR. In stage 2 meta-analysis, the significant association at UMOD was replicated. Associations at GALNT11 with Rapid Decline (annual eGFR decline of 3 ml/min per 1.73 m(2) or more), and CDH23 with eGFR change among those with CKD showed significant suggestive evidence of replication. Combined stage 1 and 2 meta-analyses showed significance for UMOD, GALNT11, and CDH23. Morpholino knockdowns of galnt11 and cdh23 in zebrafish embryos each had signs of severe edema 72 h after gentamicin treatment compared with controls, but no gross morphological renal abnormalities before gentamicin administration. Thus, our results suggest a role in the deterioration of kidney function for the loci GALNT11 and CDH23, and show that the UMOD locus is significantly associated with kidney function decline.Kidney International advance online publication, 10 December 2014; doi:10.1038/ki.2014.361.
Resumo:
Coffee, a major dietary source of caffeine, is among the most widely consumed beverages in the world and has received considerable attention regarding health risks and benefits. We conducted a genome-wide (GW) meta-analysis of predominately regular-type coffee consumption (cups per day) among up to 91 462 coffee consumers of European ancestry with top single-nucleotide polymorphisms (SNPs) followed-up in ~30 062 and 7964 coffee consumers of European and African-American ancestry, respectively. Studies from both stages were combined in a trans-ethnic meta-analysis. Confirmed loci were examined for putative functional and biological relevance. Eight loci, including six novel loci, met GW significance (log10Bayes factor (BF)>5.64) with per-allele effect sizes of 0.03-0.14 cups per day. Six are located in or near genes potentially involved in pharmacokinetics (ABCG2, AHR, POR and CYP1A2) and pharmacodynamics (BDNF and SLC6A4) of caffeine. Two map to GCKR and MLXIPL genes related to metabolic traits but lacking known roles in coffee consumption. Enhancer and promoter histone marks populate the regions of many confirmed loci and several potential regulatory SNPs are highly correlated with the lead SNP of each. SNP alleles near GCKR, MLXIPL, BDNF and CYP1A2 that were associated with higher coffee consumption have previously been associated with smoking initiation, higher adiposity and fasting insulin and glucose but lower blood pressure and favorable lipid, inflammatory and liver enzyme profiles (P<5 × 10-8).Our genetic findings among European and African-American adults reinforce the role of caffeine in mediating habitual coffee consumption and may point to molecular mechanisms underlying inter-individual variability in pharmacological and health effects of coffee.
Resumo:
Background: Toll-like receptors (TLRs) are critical components for host pathogen recognition and variants in genes participating in this response influence susceptibility to infections. Recently, TLR1 gene polymorphisms have been found correlated with whole blood hyper-inflammatory responses to pathogen-associated molecules and associated with sepsis-associated multiorgan dysfunction and acute lung injury (ALI). We examined the association of common variants of TLR1 gene with sepsis-derived complications in an independent study and with serum levels for four inflammatory biomarker among septic patients. Methodology/Principal Findings: Seven tagging single nucleotide polymorphisms of the TLR1 gene were genotyped in samples from a prospective multicenter case-only study of patients with severe sepsis admitted into a network of intensive care units followed for disease severity. Interleukin (IL)-1 b, IL-6, IL-10, and C-reactive protein (CRP) serum levels were measured at study entry, at 48 h and at 7th day. Alleles -7202G and 248Ser, and the 248Ser-602Ile haplotype were associated with circulatory dysfunction among severe septic patients (0.001<=p <= 0.022), and with reduced IL-10 (0.012<= p <=0.047) and elevated CRP (0.011<= p <=0.036) serum levels during the first week of sepsis development. Additionally, the -7202GG genotype was found to be associated with hospital mortality (p =0.017) and ALI (p =0.050) in a combined analysis with European Americans, suggesting common risk effects among studies Conclusions/Significance: These results partially replicate and extend previous findings, supporting that variants of TLR1 gene are determinants of severe complications during sepsis.
Resumo:
Next-generation sequencing (NGS) technologies have become the standard for data generation in studies of population genomics, as the 1000 Genomes Project (1000G). However, these techniques are known to be problematic when applied to highly polymorphic genomic regions, such as the human leukocyte antigen (HLA) genes. Because accurate genotype calls and allele frequency estimations are crucial to population genomics analyses, it is important to assess the reliability of NGS data. Here, we evaluate the reliability of genotype calls and allele frequency estimates of the single-nucleotide polymorphisms (SNPs) reported by 1000G (phase I) at five HLA genes (HLA-A, -B, -C, -DRB1, and -DQB1). We take advantage of the availability of HLA Sanger sequencing of 930 of the 1092 1000G samples and use this as a gold standard to benchmark the 1000G data. We document that 18.6% of SNP genotype calls in HLA genes are incorrect and that allele frequencies are estimated with an error greater than ±0.1 at approximately 25% of the SNPs in HLA genes. We found a bias toward overestimation of reference allele frequency for the 1000G data, indicating mapping bias is an important cause of error in frequency estimation in this dataset. We provide a list of sites that have poor allele frequency estimates and discuss the outcomes of including those sites in different kinds of analyses. Because the HLA region is the most polymorphic in the human genome, our results provide insights into the challenges of using of NGS data at other genomic regions of high diversity.