16 resultados para COPY-NUMBER VARIATION
em Duke University
Resumo:
Alzheimer's disease is a complex and progressive neurodegenerative disease leading to loss of memory, cognitive impairment, and ultimately death. To date, six large-scale genome-wide association studies have been conducted to identify SNPs that influence disease predisposition. These studies have confirmed the well-known APOE epsilon4 risk allele, identified a novel variant that influences disease risk within the APOE epsilon4 population, found a SNP that modifies the age of disease onset, as well as reported the first sex-linked susceptibility variant. Here we report a genome-wide scan of Alzheimer's disease in a set of 331 cases and 368 controls, extending analyses for the first time to include assessments of copy number variation. In this analysis, no new SNPs show genome-wide significance. We also screened for effects of copy number variation, and while nothing was significant, a duplication in CHRNA7 appears interesting enough to warrant further investigation.
Resumo:
Determination of copy number variants (CNVs) inferred in genome wide single nucleotide polymorphism arrays has shown increasing utility in genetic variant disease associations. Several CNV detection methods are available, but differences in CNV call thresholds and characteristics exist. We evaluated the relative performance of seven methods: circular binary segmentation, CNVFinder, cnvPartition, gain and loss of DNA, Nexus algorithms, PennCNV and QuantiSNP. Tested data included real and simulated Illumina HumHap 550 data from the Singapore cohort study of the risk factors for Myopia (SCORM) and simulated data from Affymetrix 6.0 and platform-independent distributions. The normalized singleton ratio (NSR) is proposed as a metric for parameter optimization before enacting full analysis. We used 10 SCORM samples for optimizing parameter settings for each method and then evaluated method performance at optimal parameters using 100 SCORM samples. The statistical power, false positive rates, and receiver operating characteristic (ROC) curve residuals were evaluated by simulation studies. Optimal parameters, as determined by NSR and ROC curve residuals, were consistent across datasets. QuantiSNP outperformed other methods based on ROC curve residuals over most datasets. Nexus Rank and SNPRank have low specificity and high power. Nexus Rank calls oversized CNVs. PennCNV detects one of the fewest numbers of CNVs.
Resumo:
Because of the role that DNA damage and depletion play in human disease, it is important to develop and improve tools to assess these endpoints. This unit describes PCR-based methods to measure nuclear and mitochondrial DNA damage and copy number. Long amplicon quantitative polymerase chain reaction (LA-QPCR) is used to detect DNA damage by measuring the number of polymerase-inhibiting lesions present based on the amount of PCR amplification; real-time PCR (RT-PCR) is used to calculate genome content. In this unit, we provide step-by-step instructions to perform these assays in Homo sapiens, Mus musculus, Rattus norvegicus, Caenorhabditis elegans, Drosophila melanogaster, Danio rerio, Oryzias latipes, Fundulus grandis, and Fundulus heteroclitus, and discuss the advantages and disadvantages of these assays.
Resumo:
To extend the understanding of host genetic determinants of HIV-1 control, we performed a genome-wide association study in a cohort of 2,554 infected Caucasian subjects. The study was powered to detect common genetic variants explaining down to 1.3% of the variability in viral load at set point. We provide overwhelming confirmation of three associations previously reported in a genome-wide study and show further independent effects of both common and rare variants in the Major Histocompatibility Complex region (MHC). We also examined the polymorphisms reported in previous candidate gene studies and fail to support a role for any variant outside of the MHC or the chemokine receptor cluster on chromosome 3. In addition, we evaluated functional variants, copy-number polymorphisms, epistatic interactions, and biological pathways. This study thus represents a comprehensive assessment of common human genetic variation in HIV-1 control in Caucasians.
Resumo:
Improvements in genomic technology, both in the increased speed and reduced cost of sequencing, have expanded the appreciation of the abundance of human genetic variation. However the sheer amount of variation, as well as the varying type and genomic content of variation, poses a challenge in understanding the clinical consequence of a single mutation. This work uses several methodologies to interpret the observed variation in the human genome, and presents novel strategies for the prediction of allele pathogenicity.
Using the zebrafish model system as an in vivo assay of allele function, we identified a novel driver of Bardet-Biedl Syndrome (BBS) in CEP76. A combination of targeted sequencing of 785 cilia-associated genes in a cohort of BBS patients and subsequent in vivo functional assays recapitulating the human phenotype gave strong evidence for the role of CEP76 mutations in the pathology of an affected family. This portion of the work demonstrated the necessity of functional testing in validating disease-associated mutations, and added to the catalogue of known BBS disease genes.
Further study into the role of copy-number variations (CNVs) in a cohort of BBS patients showed the significant contribution of CNVs to disease pathology. Using high-density array comparative genomic hybridization (aCGH) we were able to identify pathogenic CNVs as small as several hundred bp. Dissection of constituent gene and in vivo experiments investigating epistatic interactions between affected genes allowed for an appreciation of several paradigms by which CNVs can contribute to disease. This study revealed that the contribution of CNVs to disease in BBS patients is much higher than previously expected, and demonstrated the necessity of consideration of CNV contribution in future (and retrospective) investigations of human genetic disease.
Finally, we used a combination of comparative genomics and in vivo complementation assays to identify second-site compensatory modification of pathogenic alleles. These pathogenic alleles, which are found compensated in other species (termed compensated pathogenic deviations [CPDs]), represent a significant fraction (from 3 – 10%) of human disease-associated alleles. In silico pathogenicity prediction algorithms, a valuable method of allele prioritization, often misrepresent these alleles as benign, leading to omission of possibly informative variants in studies of human genetic disease. We created a mathematical model that was able to predict CPDs and putative compensatory sites, and functionally showed in vivo that second-site mutation can mitigate the pathogenicity of disease alleles. Additionally, we made publically available an in silico module for the prediction of CPDs and modifier sites.
These studies have advanced the ability to interpret the pathogenicity of multiple types of human variation, as well as made available tools for others to do so as well.
Resumo:
Tumor microenvironmental stresses, such as hypoxia and lactic acidosis, play important roles in tumor progression. Although gene signatures reflecting the influence of these stresses are powerful approaches to link expression with phenotypes, they do not fully reflect the complexity of human cancers. Here, we describe the use of latent factor models to further dissect the stress gene signatures in a breast cancer expression dataset. The genes in these latent factors are coordinately expressed in tumors and depict distinct, interacting components of the biological processes. The genes in several latent factors are highly enriched in chromosomal locations. When these factors are analyzed in independent datasets with gene expression and array CGH data, the expression values of these factors are highly correlated with copy number alterations (CNAs) of the corresponding BAC clones in both the cell lines and tumors. Therefore, variation in the expression of these pathway-associated factors is at least partially caused by variation in gene dosage and CNAs among breast cancers. We have also found the expression of two latent factors without any chromosomal enrichment is highly associated with 12q CNA, likely an instance of "trans"-variations in which CNA leads to the variations in gene expression outside of the CNA region. In addition, we have found that factor 26 (1q CNA) is negatively correlated with HIF-1alpha protein and hypoxia pathways in breast tumors and cell lines. This agrees with, and for the first time links, known good prognosis associated with both a low hypoxia signature and the presence of CNA in this region. Taken together, these results suggest the possibility that tumor segmental aneuploidy makes significant contributions to variation in the lactic acidosis/hypoxia gene signatures in human cancers and demonstrate that latent factor analysis is a powerful means to uncover such a linkage.
Resumo:
Loss of PTEN and activation of phosphoinositide 3-kinase are commonly observed in advanced prostate cancer. Inhibition of mammalian target of rapamycin (mTOR), a downstream target of phosphoinositide 3-kinase signaling, results in cell cycle arrest and apoptosis in multiple in vitro and in vivo models of prostate cancer. However, single-agent use of mTOR inhibition has limited clinical success, and the identification of molecular events mitigating tumor response to mTOR inhibition remains a critical question. Here, using genetically engineered human prostate epithelial cells (PrEC), we show that MYC, a frequent target of genetic gain in prostate cancers, abrogates sensitivity to rapamycin by decreasing rapamycin-induced cytostasis and autophagy. Analysis of MYC and the mTOR pathway in human prostate tumors and PrEC showed selective increased expression of eukaryotic initiation factor 4E-binding protein 1 (4EBP1) with gain in MYC copy number or forced MYC expression, respectively. We have also found that MYC binds to regulatory regions of the 4EBP1 gene. Suppression of 4EBP1 expression resulted in resensitization of MYC-expressing PrEC to rapamycin and increased autophagy. Taken together, our findings suggest that MYC expression abrogates sensitivity to rapamycin through increased expression of 4EBP1 and reduced autophagy.
Resumo:
The nuclear respiratory factor-1 (NRF1) gene is activated by lipopolysaccharide (LPS), which might reflect TLR4-mediated mitigation of cellular inflammatory damage via initiation of mitochondrial biogenesis. To test this hypothesis, we examined NRF1 promoter regulation by NFκB, and identified interspecies-conserved κB-responsive promoter and intronic elements in the NRF1 locus. In mice, activation of Nrf1 and its downstream target, Tfam, by Escherichia coli was contingent on NFκB, and in LPS-treated hepatocytes, NFκB served as an NRF1 enhancer element in conjunction with NFκB promoter binding. Unexpectedly, optimal NRF1 promoter activity after LPS also required binding by the energy-state-dependent transcription factor CREB. EMSA and ChIP assays confirmed p65 and CREB binding to the NRF1 promoter and p65 binding to intron 1. Functionality for both transcription factors was validated by gene-knockdown studies. LPS regulation of NRF1 led to mtDNA-encoded gene expression and expansion of mtDNA copy number. In cells expressing plasmid constructs containing the NRF-1 promoter and GFP, LPS-dependent reporter activity was abolished by cis-acting κB-element mutations, and nuclear accumulation of NFκB and CREB demonstrated dependence on mitochondrial H(2)O(2). These findings indicate that TLR4-dependent NFκB and CREB activation co-regulate the NRF1 promoter with NFκB intronic enhancement and redox-regulated nuclear translocation, leading to downstream target-gene expression, and identify NRF-1 as an early-phase component of the host antibacterial defenses.
Resumo:
Extensive departures from balanced gene dose in aneuploids are highly deleterious. However, we know very little about the relationship between gene copy number and expression in aneuploid cells. We determined copy number and transcript abundance (expression) genome-wide in Drosophila S2 cells by DNA-Seq and RNA-Seq. We found that S2 cells are aneuploid for >43 Mb of the genome, primarily in the range of one to five copies, and show a male genotype ( approximately two X chromosomes and four sets of autosomes, or 2X;4A). Both X chromosomes and autosomes showed expression dosage compensation. X chromosome expression was elevated in a fixed-fold manner regardless of actual gene dose. In engineering terms, the system "anticipates" the perturbation caused by X dose, rather than responding to an error caused by the perturbation. This feed-forward regulation resulted in precise dosage compensation only when X dose was half of the autosome dose. Insufficient compensation occurred at lower X chromosome dose and excessive expression occurred at higher doses. RNAi knockdown of the Male Specific Lethal complex abolished feed-forward regulation. Both autosome and X chromosome genes show Male Specific Lethal-independent compensation that fits a first order dose-response curve. Our data indicate that expression dosage compensation dampens the effect of altered DNA copy number genome-wide. For the X chromosome, compensation includes fixed and dose-dependent components.
Resumo:
We present the analysis of twenty human genomes to evaluate the prospects for identifying rare functional variants that contribute to a phenotype of interest. We sequenced at high coverage ten "case" genomes from individuals with severe hemophilia A and ten "control" genomes. We summarize the number of genetic variants emerging from a study of this magnitude, and provide a proof of concept for the identification of rare and highly-penetrant functional variants by confirming that the cause of hemophilia A is easily recognizable in this data set. We also show that the number of novel single nucleotide variants (SNVs) discovered per genome seems to stabilize at about 144,000 new variants per genome, after the first 15 individuals have been sequenced. Finally, we find that, on average, each genome carries 165 homozygous protein-truncating or stop loss variants in genes representing a diverse set of pathways.
Resumo:
Synthetic biology seeks to enable programmed control of cellular behavior though engineered biological systems. These systems typically consist of synthetic circuits that function inside, and interact with, complex host cells possessing pre-existing metabolic and regulatory networks. Nevertheless, while designing systems, a simple well-defined interface between the synthetic gene circuit and the host is frequently assumed. We describe the generation of robust but unexpected oscillations in the densities of bacterium Escherichia coli populations by simple synthetic suicide circuits containing quorum components and a lysis gene. Contrary to design expectations, oscillations required neither the quorum sensing genes (luxR and luxI) nor known regulatory elements in the P(luxI) promoter. Instead, oscillations were likely due to density-dependent plasmid amplification that established a population-level negative feedback. A mathematical model based on this mechanism captures the key characteristics of oscillations, and model predictions regarding perturbations to plasmid amplification were experimentally validated. Our results underscore the importance of plasmid copy number and potential impact of "hidden interactions" on the behavior of engineered gene circuits - a major challenge for standardizing biological parts. As synthetic biology grows as a discipline, increasing value may be derived from tools that enable the assessment of parts in their final context.
Resumo:
Mitochondria are responsible for producing the vast majority of cellular ATP, and are therefore critical to organismal health [1]. They contain thir own genomes (mtDNA) which encode 13 proteins that are all subunits of the mitochondrial respiratory chain (MRC) and are essential for oxidative phosphorylation [2]. mtDNA is present in multiple copies per cell, usually between 103 and 104 , though this number is reduced during certain developmental stages [3, 4]. The health of the mitochondrial genome is also important to the health of the organism, as mutations in mtDNA lead to human diseases that collectively affect approximately 1 in 4000 people [5, 6]. mtDNA is more susceptible than nuclear DNA (nucDNA) to damage by many environmental pollutants, for reasons including the absence of Nucleotide Excision Repair (NER) in the mitochondria [7]. NER is a highly functionally conserved DNA repair pathway that removes bulky, helix distorting lesions such as those caused by ultraviolet C (UVC) radiation and also many environmental toxicants, including benzo[a]pyrene (BaP) [8]. While these lesions cannot be repaired, they are slowly removed through a process that involves mitochondrial dynamics and autophagy [9, 10]. However, when present during development in C. elegans, this damage reduces mtDNA copy number and ATP levels [11]. We hypothesize that this damage, when present during development, will result in mitochondrial dysfunction and increase the potential for adverse outcomes later in life.
To test this hypothesis, 1st larval stage (L1) C. elegans are exposed to 3 doses of 7.5J/m2 ultraviolet C radiation 24 hours apart, leading to the accumulation of mtDNA damage [9, 11]. After exposure, many mitochondrial endpoints are assessed at multiple time points later in life. mtDNA and nucDNA damage levels and genome copy numbers are measured via QPCR and real-time PCR , respectively, every 2 day for 10 days. Steady state ATP levels are measured via luciferase expressing reporter strains and traditional ATP extraction methods. Oxygen consumption is measured using a Seahorse XFe24 extra cellular flux analyzer. Gene expression changes are measured via real time PCR and targeted metabolomics via LC-MS are used to investigate changes in organic acid, amino acid and acyl-carnitine levels. Lastly, nematode developmental delay is assessed as growth, and measured via imaging and COPAS biosort.
I have found that despite being removed, UVC induced mtDNA damage during development leads to persistent deficits in energy production later in life. mtDNA copy number is permanently reduced, as are ATP levels, though oxygen consumption is increased, indicating inefficient or uncoupled respiration. Metabolomic data and mutant sensitivity indicate a role for NADPH and oxidative stress in these results, and exposed nematodes are more sensitive to the mitochondrial poison rotenone later in life. These results fit with the developmental origin of health and disease hypothesis, and show the potential for environmental exposures to have lasting effects on mitochondrial function.
Lastly, we are currently working to investigate the potential for irreparable mtDNA lesions to drive mutagenesis in mtDNA. Mutations in mtDNA lead to a wide range of diseases, yet we currently do not understand the environmental component of what causes them. In vitro evidence suggests that UVC induced thymine dimers can be mutagenic [12]. We are using duplex sequencing of C. elegans mtDNA to determine mutation rates in nematodes exposed to our serial UVC protocol. Furthermore, by including mutant strains deficient in mitochondrial fission and mitophagy, we hope to determine if deficiencies in these processes will further increase mtDNA mutation rates, as they are implicated in human diseases.
Resumo:
The science of genetics is undergoing a paradigm shift. Recent discoveries, including the activity of retrotransposons, the extent of copy number variations, somatic and chromosomal mosaicism, and the nature of the epigenome as a regulator of DNA expressivity, are challenging a series of dogmas concerning the nature of the genome and the relationship between genotype and phenotype. DNA, once held to be the unchanging template of heredity, now appears subject to a good deal of environmental change; considered to be identical in all cells and tissues of the body, there is growing evidence that somatic mosaicism is the normal human condition; and treated as the sole biological agent of heritability, we now know that the epigenome, which regulates gene expressivity, can be inherited via the germline. These developments are particularly significant for behavior genetics for at least three reasons: First, these phenomena appear to be particularly prevalent in the human brain, and likely are involved in much of human behavior; second, they have important implications for the validity of heritability and gene association studies, the methodologies that largely define the discipline of behavior genetics; and third, they appear to play a critical role in development during the perinatal period, and in enabling phenotypic plasticity in offspring in particular. I examine one of the central claims to emerge from the use of heritability studies in the behavioral sciences, the principle of “minimal shared maternal effects,” in light of the growing awareness that the maternal perinatal environment is a critical venue for the exercise of adaptive phenotypic plasticity. This consideration has important implications for both developmental and evolutionary biology
Resumo:
Twelve months of aerosol size distributions from 3 to 560nm, measured using scanning mobility particle sizers are presented with an emphasis on average number, surface, and volume distributions, and seasonal and diurnal variation. The measurements were made at the main sampling site of the Pittsburgh Air Quality Study from July 2001 to June 2002. These are supplemented with 5 months of size distribution data from 0.5 to 2.5μm measured with a TSI aerosol particle sizer and 2 months of size distributions measured at an upwind rural sampling site. Measurements at the main site were made continuously under both low and ambient relative humidity. The average Pittsburgh number concentration (3-500nm) is 22,000cm-3 with an average mode size of 40nm. Strong diurnal patterns in number concentrations are evident as a direct effect of the sources of particles (atmospheric nucleation, traffic, and other combustion sources). New particle formation from homogeneous nucleation is significant on 30-50% of study days and over a wide area (at least a hundred kilometers). Rural number concentrations are a factor of 2-3 lower (on average) than the urban values. Average measured distributions are different from model literature urban and rural size distributions. © 2004 Elsevier Ltd. All rights reserved.
Resumo:
Antigenically variable RNA viruses are significant contributors to the burden of infectious disease worldwide. One reason for their ubiquity is their ability to escape herd immunity through rapid antigenic evolution and thereby to reinfect previously infected hosts. However, the ways in which these viruses evolve antigenically are highly diverse. Some have only limited diversity in the long-run, with every emergence of a new antigenic variant coupled with a replacement of the older variant. Other viruses rapidly accumulate antigenic diversity over time. Others still exhibit dynamics that can be considered evolutionary intermediates between these two extremes. Here, we present a theoretical framework that aims to understand these differences in evolutionary patterns by considering a virus's epidemiological dynamics in a given host population. Our framework, based on a dimensionless number, probabilistically anticipates patterns of viral antigenic diversification and thereby quantifies a virus's evolutionary potential. It is therefore similar in spirit to the basic reproduction number, the well-known dimensionless number which quantifies a pathogen's reproductive potential. We further outline how our theoretical framework can be applied to empirical viral systems, using influenza A/H3N2 as a case study. We end with predictions of our framework and work that remains to be done to further integrate viral evolutionary dynamics with disease ecology.