21 resultados para terminological variant
Resumo:
Determination of copy number variants (CNVs) inferred in genome wide single nucleotide polymorphism arrays has shown increasing utility in genetic variant disease associations. Several CNV detection methods are available, but differences in CNV call thresholds and characteristics exist. We evaluated the relative performance of seven methods: circular binary segmentation, CNVFinder, cnvPartition, gain and loss of DNA, Nexus algorithms, PennCNV and QuantiSNP. Tested data included real and simulated Illumina HumHap 550 data from the Singapore cohort study of the risk factors for Myopia (SCORM) and simulated data from Affymetrix 6.0 and platform-independent distributions. The normalized singleton ratio (NSR) is proposed as a metric for parameter optimization before enacting full analysis. We used 10 SCORM samples for optimizing parameter settings for each method and then evaluated method performance at optimal parameters using 100 SCORM samples. The statistical power, false positive rates, and receiver operating characteristic (ROC) curve residuals were evaluated by simulation studies. Optimal parameters, as determined by NSR and ROC curve residuals, were consistent across datasets. QuantiSNP outperformed other methods based on ROC curve residuals over most datasets. Nexus Rank and SNPRank have low specificity and high power. Nexus Rank calls oversized CNVs. PennCNV detects one of the fewest numbers of CNVs.
Resumo:
BACKGROUND: Mutations in podocin (NPHS2) are the most common cause of childhood onset autosomal recessive steroid-resistant nephrotic syndrome (SRNS). The disease is characterized by early-onset proteinuria, resistance to immunosuppressive therapy and rapid progression to end-stage renal disease. Compound heterozygous changes involving the podocin variant R229Q combined with another pathogenic mutation have been associated with a mild phenotype with disease onset often in adulthood. METHODS: We screened 19 families with early-onset SRNS for mutations in NPHS2 and WT1 and identified four disease-causing mutations (three in NPHS2 and one in WT1) prior to planned whole-exome sequencing. RESULTS: We describe two families with three individuals presenting in childhood who are compound heterozygous for R229Q and one other pathogenic NPHS2 mutation, either L327F or A297V. One child presented at age 4 years (A297V plus R229Q) and the other two at age 13 (L327F plus R229Q), one with steadily deteriorating renal function. CONCLUSIONS: These cases highlight the phenotypic variability associated with the NPHS2 R229Q variant plus pathogenic mutation. Individuals may present with early aggressive disease.
Resumo:
We performed a whole-genome association study of human immunodeficiency virus type 1 (HIV-1) set point among a cohort of African Americans (n = 515), and an intronic single-nucleotide polymorphism (SNP) in the HLA-B gene showed one of the strongest associations. We use a subset of patients to demonstrate that this SNP reflects the effect of the HLA-B*5703 allele, which shows a genome-wide statistically significant association with viral load set point (P = 5.6 x 10(-10)). These analyses therefore confirm a member of the HLA-B*57 group of alleles as the most important common variant that influences viral load variation in African Americans, which is consistent with what has been observed for individuals of European ancestry, among whom the most important common variant is HLA-B*5701.
Resumo:
BACKGROUND: Several studies have noted that genetic variants of SCARB1, a lipoprotein receptor involved in reverse cholesterol transport, are associated with serum lipid levels in a sex-dependent fashion. However, the mechanism underlying this gene by sex interaction has not been explored. METHODS: We utilized both epidemiological and molecular methods to study how estrogen and gene variants interact to influence SCARB1 expression and lipid levels. Interaction between 35 SCARB1 haplotype-tagged polymorphisms and endogenous estradiol levels was assessed in 498 postmenopausal Caucasian women from the population-based Rancho Bernardo Study. We further examined associated variants with overall and SCARB1 splice variant (SR-BI and SR-BII) expression in 91 human liver tissues using quantitative real-time PCR. RESULTS: Several variants on a haplotype block spanning intron 11 to intron 12 of SCARB1 showed significant gene by estradiol interaction affecting serum lipid levels, the strongest for rs838895 with HDL-cholesterol (p=9.2x10(-4)) and triglycerides (p=1.3x10(-3)) and the triglyceride:HDL cholesterol ratio (p=2.7x10(-4)). These same variants were associated with expression of the SR-BI isoform in a sex-specific fashion, with the strongest association found among liver tissue from 52 young women<45 years old (p=0.002). CONCLUSIONS: Estrogen and SCARB1 genotype may act synergistically to regulate expression of SCARB1 isoforms and impact serum levels of HDL cholesterol and triglycerides. This work highlights the importance of considering sex-dependent effects of gene variants on serum lipid levels.
Resumo:
To extend the understanding of host genetic determinants of HIV-1 control, we performed a genome-wide association study in a cohort of 2,554 infected Caucasian subjects. The study was powered to detect common genetic variants explaining down to 1.3% of the variability in viral load at set point. We provide overwhelming confirmation of three associations previously reported in a genome-wide study and show further independent effects of both common and rare variants in the Major Histocompatibility Complex region (MHC). We also examined the polymorphisms reported in previous candidate gene studies and fail to support a role for any variant outside of the MHC or the chemokine receptor cluster on chromosome 3. In addition, we evaluated functional variants, copy-number polymorphisms, epistatic interactions, and biological pathways. This study thus represents a comprehensive assessment of common human genetic variation in HIV-1 control in Caucasians.
Resumo:
We used ultra-deep sequencing to obtain tens of thousands of HIV-1 sequences from regions targeted by CD8+ T lymphocytes from longitudinal samples from three acutely infected subjects, and modeled viral evolution during the critical first weeks of infection. Previous studies suggested that a single virus established productive infection, but these conclusions were tempered because of limited sampling; now, we have greatly increased our confidence in this observation through modeling the observed earliest sample diversity based on vastly more extensive sampling. Conventional sequencing of HIV-1 from acute/early infection has shown different patterns of escape at different epitopes; we investigated the earliest escapes in exquisite detail. Over 3-6 weeks, ultradeep sequencing revealed that the virus explored an extraordinary array of potential escape routes in the process of evading the earliest CD8 T-lymphocyte responses--using 454 sequencing, we identified over 50 variant forms of each targeted epitope during early immune escape, while only 2-7 variants were detected in the same samples via conventional sequencing. In contrast to the diversity seen within epitopes, non-epitope regions, including the Envelope V3 region, which was sequenced as a control in each subject, displayed very low levels of variation. In early infection, in the regions sequenced, the consensus forms did not have a fitness advantage large enough to trigger reversion to consensus amino acids in the absence of immune pressure. In one subject, a genetic bottleneck was observed, with extensive diversity at the second time point narrowing to two dominant escape forms by the third time point, all within two months of infection. Traces of immune escape were observed in the earliest samples, suggesting that immune pressure is present and effective earlier than previously reported; quantifying the loss rate of the founder virus suggests a direct role for CD8 T-lymphocyte responses in viral containment after peak viremia. Dramatic shifts in the frequencies of epitope variants during the first weeks of infection revealed a complex interplay between viral fitness and immune escape.
Resumo:
Complex diseases will have multiple functional sites, and it will be invaluable to understand the cross-locus interaction in terms of linkage disequilibrium (LD) between those sites (epistasis) in addition to the haplotype-LD effects. We investigated the statistical properties of a class of matrix-based statistics to assess this epistasis. These statistical methods include two LD contrast tests (Zaykin et al., 2006) and partial least squares regression (Wang et al., 2008). To estimate Type 1 error rates and power, we simulated multiple two-variant disease models using the SIMLA software package. SIMLA allows for the joint action of up to two disease genes in the simulated data with all possible multiplicative interaction effects between them. Our goal was to detect an interaction between multiple disease-causing variants by means of their linkage disequilibrium (LD) patterns with other markers. We measured the effects of marginal disease effect size, haplotype LD, disease prevalence and minor allele frequency have on cross-locus interaction (epistasis). In the setting of strong allele effects and strong interaction, the correlation between the two disease genes was weak (r=0.2). In a complex system with multiple correlations (both marginal and interaction), it was difficult to determine the source of a significant result. Despite these complications, the partial least squares and modified LD contrast methods maintained adequate power to detect the epistatic effects; however, for many of the analyses we often could not separate interaction from a strong marginal effect. While we did not exhaust the entire parameter space of possible models, we do provide guidance on the effects that population parameters have on cross-locus interaction.
Resumo:
Alzheimer's disease is a complex and progressive neurodegenerative disease leading to loss of memory, cognitive impairment, and ultimately death. To date, six large-scale genome-wide association studies have been conducted to identify SNPs that influence disease predisposition. These studies have confirmed the well-known APOE epsilon4 risk allele, identified a novel variant that influences disease risk within the APOE epsilon4 population, found a SNP that modifies the age of disease onset, as well as reported the first sex-linked susceptibility variant. Here we report a genome-wide scan of Alzheimer's disease in a set of 331 cases and 368 controls, extending analyses for the first time to include assessments of copy number variation. In this analysis, no new SNPs show genome-wide significance. We also screened for effects of copy number variation, and while nothing was significant, a duplication in CHRNA7 appears interesting enough to warrant further investigation.
Resumo:
PURPOSE: Evaluating genetic susceptibility may clarify effects of known environmental factors and also identify individuals at high risk. We evaluated the association of four insulin-related pathway gene polymorphisms in insulin-like growth factor-1 (IGF-I) (CA)( n ) repeat, insulin-like growth factor-2 (IGF-II) (rs680), insulin-like growth factor-binding protein-3 (IGFBP-3) (rs2854744), and adiponectin (APM1 rs1501299) with colon cancer risk, as well as relationships with circulating IGF-I, IGF-II, IGFBP-3, and C-peptide in a population-based study. METHODS: Participants were African Americans (231 cases and 306 controls) and Whites (297 cases, 530 controls). Consenting subjects provided blood specimens and lifestyle/diet information. Genotyping for all genes except IGF-I was performed by the 5'-exonuclease (Taqman) assay. The IGF-I (CA)(n) repeat was assayed by PCR and fragment analysis. Circulating proteins were measured by enzyme immunoassays. Odds ratios (ORs) and 95 % confidence intervals (CIs) were calculated by logistic regression. RESULTS: The IGF-I (CA)( 19 ) repeat was higher in White controls (50 %) than African American controls (31 %). Whites homozygous for the IGF-I (CA)(19) repeat had a nearly twofold increase in risk of colon cancer (OR = 1.77; 95 % CI = 1.15-2.73), but not African Americans (OR = 0.73, 95 % CI 0.50-1.51). We observed an inverse association between the IGF-II Apa1 A-variant and colon cancer risk (OR = 0.49, 95 % CI 0.28-0.88) in Whites only. Carrying the IGFBP-3 variant alleles was associated with lower IGFBP-3 protein levels, a difference most pronounced in Whites (p-trend <0.05). CONCLUSIONS: These results support an association between insulin pathway-related genes and elevated colon cancer risk in Whites but not in African Americans.
Resumo:
BACKGROUND: Genetic association studies are conducted to discover genetic loci that contribute to an inherited trait, identify the variants behind these associations and ascertain their functional role in determining the phenotype. To date, functional annotations of the genetic variants have rarely played more than an indirect role in assessing evidence for association. Here, we demonstrate how these data can be systematically integrated into an association study's analysis plan. RESULTS: We developed a Bayesian statistical model for the prior probability of phenotype-genotype association that incorporates data from past association studies and publicly available functional annotation data regarding the susceptibility variants under study. The model takes the form of a binary regression of association status on a set of annotation variables whose coefficients were estimated through an analysis of associated SNPs in the GWAS Catalog (GC). The functional predictors examined included measures that have been demonstrated to correlate with the association status of SNPs in the GC and some whose utility in this regard is speculative: summaries of the UCSC Human Genome Browser ENCODE super-track data, dbSNP function class, sequence conservation summaries, proximity to genomic variants in the Database of Genomic Variants and known regulatory elements in the Open Regulatory Annotation database, PolyPhen-2 probabilities and RegulomeDB categories. Because we expected that only a fraction of the annotations would contribute to predicting association, we employed a penalized likelihood method to reduce the impact of non-informative predictors and evaluated the model's ability to predict GC SNPs not used to construct the model. We show that the functional data alone are predictive of a SNP's presence in the GC. Further, using data from a genome-wide study of ovarian cancer, we demonstrate that their use as prior data when testing for association is practical at the genome-wide scale and improves power to detect associations. CONCLUSIONS: We show how diverse functional annotations can be efficiently combined to create 'functional signatures' that predict the a priori odds of a variant's association to a trait and how these signatures can be integrated into a standard genome-wide-scale association analysis, resulting in improved power to detect truly associated variants.
Association between DNA damage response and repair genes and risk of invasive serous ovarian cancer.
Resumo:
BACKGROUND: We analyzed the association between 53 genes related to DNA repair and p53-mediated damage response and serous ovarian cancer risk using case-control data from the North Carolina Ovarian Cancer Study (NCOCS), a population-based, case-control study. METHODS/PRINCIPAL FINDINGS: The analysis was restricted to 364 invasive serous ovarian cancer cases and 761 controls of white, non-Hispanic race. Statistical analysis was two staged: a screen using marginal Bayes factors (BFs) for 484 SNPs and a modeling stage in which we calculated multivariate adjusted posterior probabilities of association for 77 SNPs that passed the screen. These probabilities were conditional on subject age at diagnosis/interview, batch, a DNA quality metric and genotypes of other SNPs and allowed for uncertainty in the genetic parameterizations of the SNPs and number of associated SNPs. Six SNPs had Bayes factors greater than 10 in favor of an association with invasive serous ovarian cancer. These included rs5762746 (median OR(odds ratio)(per allele) = 0.66; 95% credible interval (CI) = 0.44-1.00) and rs6005835 (median OR(per allele) = 0.69; 95% CI = 0.53-0.91) in CHEK2, rs2078486 (median OR(per allele) = 1.65; 95% CI = 1.21-2.25) and rs12951053 (median OR(per allele) = 1.65; 95% CI = 1.20-2.26) in TP53, rs411697 (median OR (rare homozygote) = 0.53; 95% CI = 0.35 - 0.79) in BACH1 and rs10131 (median OR( rare homozygote) = not estimable) in LIG4. The six most highly associated SNPs are either predicted to be functionally significant or are in LD with such a variant. The variants in TP53 were confirmed to be associated in a large follow-up study. CONCLUSIONS/SIGNIFICANCE: Based on our findings, further follow-up of the DNA repair and response pathways in a larger dataset is warranted to confirm these results.
Resumo:
Antigenically variable RNA viruses are significant contributors to the burden of infectious disease worldwide. One reason for their ubiquity is their ability to escape herd immunity through rapid antigenic evolution and thereby to reinfect previously infected hosts. However, the ways in which these viruses evolve antigenically are highly diverse. Some have only limited diversity in the long-run, with every emergence of a new antigenic variant coupled with a replacement of the older variant. Other viruses rapidly accumulate antigenic diversity over time. Others still exhibit dynamics that can be considered evolutionary intermediates between these two extremes. Here, we present a theoretical framework that aims to understand these differences in evolutionary patterns by considering a virus's epidemiological dynamics in a given host population. Our framework, based on a dimensionless number, probabilistically anticipates patterns of viral antigenic diversification and thereby quantifies a virus's evolutionary potential. It is therefore similar in spirit to the basic reproduction number, the well-known dimensionless number which quantifies a pathogen's reproductive potential. We further outline how our theoretical framework can be applied to empirical viral systems, using influenza A/H3N2 as a case study. We end with predictions of our framework and work that remains to be done to further integrate viral evolutionary dynamics with disease ecology.
Resumo:
Early interventions are a preferred method for addressing behavioral problems in high-risk children, but often have only modest effects. Identifying sources of variation in intervention effects can suggest means to improve efficiency. One potential source of such variation is the genome. We conducted a genetic analysis of the Fast Track randomized control trial, a 10-year-long intervention to prevent high-risk kindergarteners from developing adult externalizing problems including substance abuse and antisocial behavior. We tested whether variants of the glucocorticoid receptor gene NR3C1 were associated with differences in response to the Fast Track intervention. We found that in European-American children, a variant of NR3C1 identified by the single-nucleotide polymorphism rs10482672 was associated with increased risk for externalizing psychopathology in control group children and decreased risk for externalizing psychopathology in intervention group children. Variation in NR3C1 measured in this study was not associated with differential intervention response in African-American children. We discuss implications for efforts to prevent externalizing problems in high-risk children and for public policy in the genomic era.
Resumo:
Antigenically evolving pathogens such as influenza viruses are difficult to control owing to their ability to evade host immunity by producing immune escape variants. Experimental studies have repeatedly demonstrated that viral immune escape variants emerge more often from immunized hosts than from naive hosts. This empirical relationship between host immune status and within-host immune escape is not fully understood theoretically, nor has its impact on antigenic evolution at the population level been evaluated. Here, we show that this relationship can be understood as a trade-off between the probability that a new antigenic variant is produced and the level of viraemia it reaches within a host. Scaling up this intra-host level trade-off to a simple population level model, we obtain a distribution for variant persistence times that is consistent with influenza A/H3N2 antigenic variant data. At the within-host level, our results show that target cell limitation, or a functional equivalent, provides a parsimonious explanation for how host immune status drives the generation of immune escape mutants. At the population level, our analysis also offers an alternative explanation for the observed tempo of antigenic evolution, namely that the production rate of immune escape variants is driven by the accumulation of herd immunity. Overall, our results suggest that disease control strategies should be further assessed by considering the impact that increased immunity--through vaccination--has on the production of new antigenic variants.
Resumo:
CD133 is one of the most common stem cell markers, and functional single nucleotide polymorphisms (SNPs) of CD133 may modulate its gene functions and thus cancer risk and patient survival. We hypothesized that potentially functional CD133 SNPs are associated with gastric cancer (GC) risk and survival. To test this hypothesis, we conducted a case-control study of 371 GC patients and 313 cancer-free controls frequency-matched by age, sex, and ethnicity. We genotyped four selected, potentially functional CD133 SNPs (rs2240688A>C, rs7686732C>G, rs10022537T>A, and rs3130C>T) and used logistic regression analysis for associations of these SNPs with GC risk and Cox hazards regression analysis for survival. We found that compared with the miRNA binding site rs2240688 AA genotype, AC + CC genotypes were associated with significantly increased GC risk (adjusted OR = 1.52, 95% CI = 1.09-2.13); for another miRNA binding site rs3130C>T SNP, the TT genotype was associated with significantly reduced GC risk (adjusted OR = 0.68, 95% CI = 0.48-0.97), compared with CC + CT genotypes. In all patients, the risk rs3130 TT variant genotype was significantly associated with overall survival (OS) (adjusted P(trend) = 0.016 and 0.007 under additive and recessive models, respectively). These findings suggest that these two CD133 miRNA binding site variants, rs2240688 and rs3130, may be potential biomarkers for genetic susceptibility to GC and possible predictors for survival in GC patients but require further validation by larger studies.