29 resultados para genome wide complex trait analysis
Resumo:
The complete and faithful duplication of the genome is essential to ensure normal cell division and organismal development. Eukaryotic DNA replication is initiated at multiple sites termed origins of replication that are activated at different time through S phase. The replication timing program is regulated by the S-phase checkpoint, which signals and repairs replicative stress. Eukaryotic DNA is packaged with histones into chromatin, thus DNA-templated processes including replication are modulated by the local chromatin environment such as post-translational modifications (PTMs) of histones.
One such epigenetic mark, methylation of lysine 20 on histone H4 (H4K20), has been linked to chromatin compaction, transcription, DNA repair and DNA replication. H4K20 can be mono-, di- and tri-methylated. Monomethylation of H4K20 (H4K20me1) is mediated by the cell cycle-regulated histone methyltransferase PR-Set7 and subsequent di-/tri- methylation is catalyzed by Suv4-20. Prior studies have shown that PR-Set7 depletion in mammalian cells results in defective S phase progression and the accumulation of DNA damage, which may be partially attributed to defects in origin selection and activation. Meanwhile, overexpression of mammalian PR-Set7 recruits components of pre-Replication Complex (pre-RC) onto chromatin and licenses replication origins for re-replication. However, these studies were limited to only a handful of mammalian origins, and it remains unclear how PR-Set7 impacts the replication program on a genomic scale. Finally, the methylation substrates of PR-Set7 include both histone (H4K20) and non-histone targets, therefore it is necessary to directly test the role of H4K20 methylation in PR-Set7 regulated phenotypes.
I employed genetic, cytological, and genomic approaches to better understand the role of H4K20 methylation in regulating DNA replication and genome stability in Drosophila melanogaster cells. Depletion of Drosophila PR-Set7 by RNAi in cultured Kc167 cells led to an ATR-dependent cell cycle arrest with near 4N DNA content and the accumulation of DNA damage, indicating a defect in completing S phase. The cells were arrested at the second S phase following PR-Set7 downregulation, suggesting that it was an epigenetic effect that coupled to the dilution of histone modification over multiple cell cycles. To directly test the role of H4K20 methylation in regulating genome integrity, I collaborated with the Duronio Lab and observed spontaneous DNA damage on the imaginal wing discs of third instar mutant larvae that had an alanine substitution on H4K20 (H4K20A) thus unable to be methylated, confirming that H4K20 is a bona fide target of PR-Set7 in maintaining genome integrity.
One possible source of DNA damage due to loss of PR-Set7 is reduced origin activity. I used BrdU-seq to profile the genome-wide origin activation pattern. However, I found that deregulation of H4K20 methylation states by manipulating the H4K20 methyltransferases PR-Set7 and Suv4-20 had no impact on origin activation throughout the genome. I then mapped the genomic distribution of DNA damage upon PR-Set7 depletion. Surprisingly, ChIP-seq of the DNA damage marker γ-H2A.v located the DNA damage to late replicating euchromatic regions of the Drosophila genome, and the strength of γ-H2A.v signal was uniformly distributed and spanned the entire late replication domain, implying stochastic replication fork collapse within late replicating regions. Together these data suggest that PR-Set7-mediated monomethylation of H4K20 is critical for maintaining the genomic integrity of late replicating domains, presumably via stabilization of late replicating forks.
In addition to investigating the function of H4K20me, I also used immunofluorescence to characterize the cell cycle regulated chromatin loading of Mcm2-7 complex, the DNA helicase that licenses replication origins, using H4K20me1 level as a proxy for cell cycle stages. In parallel with chromatin spindown data by Powell et al. (Powell et al. 2015), we showed a continuous loading of Mcm2-7 during G1 and a progressive removal from chromatin through S phase.
Resumo:
The autosomal recessive kidney disease nephronophthisis (NPHP) constitutes the most frequent genetic cause of terminal renal failure in the first 3 decades of life. Ten causative genes (NPHP1-NPHP9 and NPHP11), whose products localize to the primary cilia-centrosome complex, support the unifying concept that cystic kidney diseases are "ciliopathies". Using genome-wide homozygosity mapping, we report here what we believe to be a new locus (NPHP-like 1 [NPHPL1]) for an NPHP-like nephropathy. In 2 families with an NPHP-like phenotype, we detected homozygous frameshift and splice-site mutations, respectively, in the X-prolyl aminopeptidase 3 (XPNPEP3) gene. In contrast to all known NPHP proteins, XPNPEP3 localizes to mitochondria of renal cells. However, in vivo analyses also revealed a likely cilia-related function; suppression of zebrafish xpnpep3 phenocopied the developmental phenotypes of ciliopathy morphants, and this effect was rescued by human XPNPEP3 that was devoid of a mitochondrial localization signal. Consistent with a role for XPNPEP3 in ciliary function, several ciliary cystogenic proteins were found to be XPNPEP3 substrates, for which resistance to N-terminal proline cleavage resulted in attenuated protein function in vivo in zebrafish. Our data highlight an emerging link between mitochondria and ciliary dysfunction, and suggest that further understanding the enzymatic activity and substrates of XPNPEP3 will illuminate novel cystogenic pathways.
Resumo:
To extend the understanding of host genetic determinants of HIV-1 control, we performed a genome-wide association study in a cohort of 2,554 infected Caucasian subjects. The study was powered to detect common genetic variants explaining down to 1.3% of the variability in viral load at set point. We provide overwhelming confirmation of three associations previously reported in a genome-wide study and show further independent effects of both common and rare variants in the Major Histocompatibility Complex region (MHC). We also examined the polymorphisms reported in previous candidate gene studies and fail to support a role for any variant outside of the MHC or the chemokine receptor cluster on chromosome 3. In addition, we evaluated functional variants, copy-number polymorphisms, epistatic interactions, and biological pathways. This study thus represents a comprehensive assessment of common human genetic variation in HIV-1 control in Caucasians.
Resumo:
BRCA1 has been implicated in numerous DNA repair pathways that maintain genome integrity, however the function responsible for its tumor suppressor activity in breast cancer remains obscure. To identify the most highly conserved of the many BRCA1 functions, we screened the evolutionarily distant eukaryote Saccharomyces cerevisiae for mutants that suppressed the G1 checkpoint arrest and lethality induced following heterologous BRCA1 expression. A genome-wide screen in the diploid deletion collection combined with a screen of ionizing radiation sensitive gene deletions identified mutants that permit growth in the presence of BRCA1. These genes delineate a metabolic mRNA pathway that temporally links transcription elongation (SPT4, SPT5, CTK1, DEF1) to nucleopore-mediated mRNA export (ASM4, MLP1, MLP2, NUP2, NUP53, NUP120, NUP133, NUP170, NUP188, POM34) and cytoplasmic mRNA decay at P-bodies (CCR4, DHH1). Strikingly, BRCA1 interacted with the phosphorylated RNA polymerase II (RNAPII) carboxy terminal domain (P-CTD), phosphorylated in the pattern specified by the CTDK-I kinase, to induce DEF1-dependent cleavage and accumulation of a RNAPII fragment containing the P-CTD. Significantly, breast cancer associated BRCT domain defects in BRCA1 that suppressed P-CTD cleavage and lethality in yeast also suppressed the physical interaction of BRCA1 with human SPT5 in breast epithelial cells, thus confirming SPT5 as a relevant target of BRCA1 interaction. Furthermore, enhanced P-CTD cleavage was observed in both yeast and human breast cells following UV-irradiation indicating a conserved eukaryotic damage response. Moreover, P-CTD cleavage in breast epithelial cells was BRCA1-dependent since damage-induced P-CTD cleavage was only observed in the mutant BRCA1 cell line HCC1937 following ectopic expression of wild type BRCA1. Finally, BRCA1, SPT5 and hyperphosphorylated RPB1 form a complex that was rapidly degraded following MMS treatment in wild type but not BRCA1 mutant breast cells. These results extend the mechanistic links between BRCA1 and transcriptional consequences in response to DNA damage and suggest an important role for RNAPII P-CTD cleavage in BRCA1-mediated cancer suppression.
Resumo:
While genome-wide gene expression data are generated at an increasing rate, the repertoire of approaches for pattern discovery in these data is still limited. Identifying subtle patterns of interest in large amounts of data (tens of thousands of profiles) associated with a certain level of noise remains a challenge. A microarray time series was recently generated to study the transcriptional program of the mouse segmentation clock, a biological oscillator associated with the periodic formation of the segments of the body axis. A method related to Fourier analysis, the Lomb-Scargle periodogram, was used to detect periodic profiles in the dataset, leading to the identification of a novel set of cyclic genes associated with the segmentation clock. Here, we applied to the same microarray time series dataset four distinct mathematical methods to identify significant patterns in gene expression profiles. These methods are called: Phase consistency, Address reduction, Cyclohedron test and Stable persistence, and are based on different conceptual frameworks that are either hypothesis- or data-driven. Some of the methods, unlike Fourier transforms, are not dependent on the assumption of periodicity of the pattern of interest. Remarkably, these methods identified blindly the expression profiles of known cyclic genes as the most significant patterns in the dataset. Many candidate genes predicted by more than one approach appeared to be true positive cyclic genes and will be of particular interest for future research. In addition, these methods predicted novel candidate cyclic genes that were consistent with previous biological knowledge and experimental validation in mouse embryos. Our results demonstrate the utility of these novel pattern detection strategies, notably for detection of periodic profiles, and suggest that combining several distinct mathematical approaches to analyze microarray datasets is a valuable strategy for identifying genes that exhibit novel, interesting transcriptional patterns.
Resumo:
In the event of a terrorist-mediated attack in the United States using radiological or improvised nuclear weapons, it is expected that hundreds of thousands of people could be exposed to life-threatening levels of ionizing radiation. We have recently shown that genome-wide expression analysis of the peripheral blood (PB) can generate gene expression profiles that can predict radiation exposure and distinguish the dose level of exposure following total body irradiation (TBI). However, in the event a radiation-mass casualty scenario, many victims will have heterogeneous exposure due to partial shielding and it is unknown whether PB gene expression profiles would be useful in predicting the status of partially irradiated individuals. Here, we identified gene expression profiles in the PB that were characteristic of anterior hemibody-, posterior hemibody- and single limb-irradiation at 0.5 Gy, 2 Gy and 10 Gy in C57Bl6 mice. These PB signatures predicted the radiation status of partially irradiated mice with a high level of accuracy (range 79-100%) compared to non-irradiated mice. Interestingly, PB signatures of partial body irradiation were poorly predictive of radiation status by site of injury (range 16-43%), suggesting that the PB molecular response to partial body irradiation was anatomic site specific. Importantly, PB gene signatures generated from TBI-treated mice failed completely to predict the radiation status of partially irradiated animals or non-irradiated controls. These data demonstrate that partial body irradiation, even to a single limb, generates a characteristic PB signature of radiation injury and thus may necessitate the use of multiple signatures, both partial body and total body, to accurately assess the status of an individual exposed to radiation.
Resumo:
PURPOSE: The endoplasmic reticulum-associated degradation pathway is responsible for the translocation of misfolded proteins across the endoplasmic reticulum membrane into the cytosol for subsequent degradation by the proteasome. To define the phenotype associated with a novel inherited disorder of cytosolic endoplasmic reticulum-associated degradation pathway dysfunction, we studied a series of eight patients with deficiency of N-glycanase 1. METHODS: Whole-genome, whole-exome, or standard Sanger sequencing techniques were employed. Retrospective chart reviews were performed in order to obtain clinical data. RESULTS: All patients had global developmental delay, a movement disorder, and hypotonia. Other common findings included hypolacrima or alacrima (7/8), elevated liver transaminases (6/7), microcephaly (6/8), diminished reflexes (6/8), hepatocyte cytoplasmic storage material or vacuolization (5/6), and seizures (4/8). The nonsense mutation c.1201A>T (p.R401X) was the most common deleterious allele. CONCLUSION: NGLY1 deficiency is a novel autosomal recessive disorder of the endoplasmic reticulum-associated degradation pathway associated with neurological dysfunction, abnormal tear production, and liver disease. The majority of patients detected to date carry a specific nonsense mutation that appears to be associated with severe disease. The phenotypic spectrum is likely to enlarge as cases with a broader range of mutations are detected.
Resumo:
Using A/J mice, which are susceptible to Staphylococcus aureus, we sought to identify genetic determinants of susceptibility to S. aureus, and evaluate their function with regard to S. aureus infection. One QTL region on chromosome 11 containing 422 genes was found to be significantly associated with susceptibility to S. aureus infection. Of these 422 genes, whole genome transcription profiling identified five genes (Dcaf7, Dusp3, Fam134c, Psme3, and Slc4a1) that were significantly differentially expressed in a) S. aureus -infected susceptible (A/J) vs. resistant (C57BL/6J) mice and b) humans with S. aureus blood stream infection vs. healthy subjects. Three of these genes (Dcaf7, Dusp3, and Psme3) were down-regulated in susceptible vs. resistant mice at both pre- and post-infection time points by qPCR. siRNA-mediated knockdown of Dusp3 and Psme3 induced significant increases of cytokine production in S. aureus-challenged RAW264.7 macrophages and bone marrow derived macrophages (BMDMs) through enhancing NF-κB signaling activity. Similar increases in cytokine production and NF-κB activity were also seen in BMDMs from CSS11 (C57BL/6J background with chromosome 11 from A/J), but not C57BL/6J. These findings suggest that Dusp3 and Psme3 contribute to S. aureus infection susceptibility in A/J mice and play a role in human S. aureus infection.
Resumo:
Fluctuations in nutrient availability profoundly impact gene expression. Previous work revealed postrecruitment regulation of RNA polymerase II (Pol II) during starvation and recovery in Caenorhabditis elegans, suggesting that promoter-proximal pausing promotes rapid response to feeding. To test this hypothesis, we measured Pol II elongation genome wide by two complementary approaches and analyzed elongation in conjunction with Pol II binding and expression. We confirmed bona fide pausing during starvation and also discovered Pol II docking. Pausing occurs at active stress-response genes that become downregulated in response to feeding. In contrast, "docked" Pol II accumulates without initiating upstream of inactive growth genes that become rapidly upregulated upon feeding. Beyond differences in function and expression, these two sets of genes have different core promoter motifs, suggesting alternative transcriptional machinery. Our work suggests that growth and stress genes are both regulated postrecruitment during starvation but at initiation and elongation, respectively, coordinating gene expression with nutrient availability.
Resumo:
Nutrient availability profoundly influences gene expression. Many animal genes encode multiple transcript isoforms, yet the effect of nutrient availability on transcript isoform expression has not been studied in genome-wide fashion. When Caenorhabditis elegans larvae hatch without food, they arrest development in the first larval stage (L1 arrest). Starved larvae can survive L1 arrest for weeks, but growth and post-embryonic development are rapidly initiated in response to feeding. We used RNA-seq to characterize the transcriptome during L1 arrest and over time after feeding. Twenty-seven percent of detectable protein-coding genes were differentially expressed during recovery from L1 arrest, with the majority of changes initiating within the first hour, demonstrating widespread, acute effects of nutrient availability on gene expression. We used two independent approaches to track expression of individual exons and mRNA isoforms, and we connected changes in expression to functional consequences by mining a variety of databases. These two approaches identified an overlapping set of genes with alternative isoform expression, and they converged on common functional patterns. Genes affecting mRNA splicing and translation are regulated by alternative isoform expression, revealing post-transcriptional consequences of nutrient availability on gene regulation. We also found that phosphorylation sites are often alternatively expressed, revealing a common mode by which alternative isoform expression modifies protein function and signal transduction. Our results detail rich changes in C. elegans gene expression as larvae initiate growth and post-embryonic development, and they provide an excellent resource for ongoing investigation of transcriptional regulation and developmental physiology.
Resumo:
cERMIT is a computationally efficient motif discovery tool based on analyzing genome-wide quantitative regulatory evidence. Instead of pre-selecting promising candidate sequences, it utilizes information across all sequence regions to search for high-scoring motifs. We apply cERMIT on a range of direct binding and overexpression datasets; it substantially outperforms state-of-the-art approaches on curated ChIP-chip datasets, and easily scales to current mammalian ChIP-seq experiments with data on thousands of non-coding regions.
Resumo:
Determination of copy number variants (CNVs) inferred in genome wide single nucleotide polymorphism arrays has shown increasing utility in genetic variant disease associations. Several CNV detection methods are available, but differences in CNV call thresholds and characteristics exist. We evaluated the relative performance of seven methods: circular binary segmentation, CNVFinder, cnvPartition, gain and loss of DNA, Nexus algorithms, PennCNV and QuantiSNP. Tested data included real and simulated Illumina HumHap 550 data from the Singapore cohort study of the risk factors for Myopia (SCORM) and simulated data from Affymetrix 6.0 and platform-independent distributions. The normalized singleton ratio (NSR) is proposed as a metric for parameter optimization before enacting full analysis. We used 10 SCORM samples for optimizing parameter settings for each method and then evaluated method performance at optimal parameters using 100 SCORM samples. The statistical power, false positive rates, and receiver operating characteristic (ROC) curve residuals were evaluated by simulation studies. Optimal parameters, as determined by NSR and ROC curve residuals, were consistent across datasets. QuantiSNP outperformed other methods based on ROC curve residuals over most datasets. Nexus Rank and SNPRank have low specificity and high power. Nexus Rank calls oversized CNVs. PennCNV detects one of the fewest numbers of CNVs.
Resumo:
BACKGROUND: The Notch signaling pathway is constitutively activated in human cutaneous melanoma to promote growth and aggressive metastatic potential of primary melanoma cells. Therefore, genetic variants in Notch pathway genes may affect the prognosis of cutaneous melanoma patients. METHODS: We identified 6,256 SNPs in 48 Notch genes in 858 cutaneous melanoma patients included in a previously published cutaneous melanoma genome-wide association study dataset. Multivariate and stepwise Cox proportional hazards regression and false-positive report probability corrections were performed to evaluate associations between putative functional SNPs and cutaneous melanoma disease-specific survival. Receiver operating characteristic curve was constructed, and area under the curve was used to assess the classification performance of the model. RESULTS: Four putative functional SNPs of Notch pathway genes had independent and joint predictive roles in survival of cutaneous melanoma patients. The most significant variant was NCOR2 rs2342924 T>C (adjusted HR, 2.71; 95% confidence interval, 1.73-4.23; Ptrend = 9.62 × 10(-7)), followed by NCSTN rs1124379 G>A, NCOR2 rs10846684 G>A, and MAML2 rs7953425 G>A (Ptrend = 0.005, 0.005, and 0.013, respectively). The receiver operating characteristic analysis revealed that area under the curve was significantly increased after adding the combined unfavorable genotype score to the model containing the known clinicopathologic factors. CONCLUSIONS: Our results suggest that SNPs in Notch pathway genes may be predictors of cutaneous melanoma disease-specific survival. IMPACT: Our discovery offers a translational potential for using genetic variants in Notch pathway genes as a genotype score of biomarkers for developing an improved prognostic assessment and personalized management of cutaneous melanoma patients.