950 resultados para Copy number variations and polymorphisms
Resumo:
Pluripotent cells have the potential to differentiate into all somatic cell types. As the adult human body is unable to regenerate various tissues, pluripotent cells provide an attractive source for regenerative medicine. Human embryonic stem cells (hESCs) can be isolated from blastocyst stage embryos and cultured in the laboratory environment. However, their use in regenerative medicine is restricted due to problems with immunosuppression by the host and ethical legislation. Recently, a new source of pluripotent cells was established via the direct reprogramming of somatic cells. These human induced pluripotent stem cells (hiPSCs) enable the production of patient specific cell types. However, numerous challenges, such as efficient reprogramming, optimal culture, directed differentiation, genetic stability and tumor risk need to be solved before the launch of therapeutic applications. The main objective of this thesis was to understand the unique properties of human pluripotent stem cells. The specific aims were to identify novel factors involved in maintaining pluripotency, characterize the effects of low oxygen culture on hESCs, and determine the high resolution changes in hESCs and hiPSCs during culture and reprogramming. As a result, the previously uncharacterized protein L1TD1 was determined to be specific for pluripotent cells and essential for the maintenance of pluripotency. The low oxygen culture supported undifferentiated growth and affected expression of stem cell associated transcripts. High resolution screening of hESCs identified a number of culture induced copy number variations and loss of heterozygosity changes. Further, screening of hiPSCs revealed that reprogramming induces high resolution alterations. The results obtained in this thesis have important implications for stem cell and cancer biology and the therapeutic potential of pluripotent cells.
Resumo:
I disturbi dello spettro autistico (DSA) ed il ritardo mentale (RM) sono caratterizzati da un’eziologia genetica complessa ed eterogenea. Grazie ai recenti sviluppi nella ricerca genomica, è stato possibile dimostrare il ruolo di numerose copy number variants (CNVs) nella patogenesi di questi disturbi, anche se nella maggior parte dei casi l’eziologia rimane ancora sconosciuta. Questo lavoro riguarda l’identificazione e la caratterizzazione dei CNVs in famiglie con DSA e RM. E’ stata studiata una microdelezione in 7q31 che coinvolge i geni IMMP2L e DOCK4, trasmessa dalla madre con dislessia a due figli con autismo ed una figlia con dislessia. Nella stessa famiglia segrega una seconda microdelezione in 2q14 che inattiva il gene CNTNAP5 ed è trasmessa dal padre (con tratti autistici) ai due figli con autismo. Abbiamo quindi ipotizzato che i geni DOCK4 e CNTNAP5 potessero essere implicati, rispettivamente, nella suscettibilità a dislessia e DSA. Lo screening di numerosi individui affetti ha supportato la nostra ipotesi, con l’identificazione di una nuova microdelezione di DOCK4 che segrega con la dislessia, e 3 nuove varianti missenso in CNTNAP5 in individui con autismo. Dall’analisi genomica comparativa su array (aCGH) di individui con RM, è stata identificata una delezione nella regione 7q31.32, che coinvolge il gene CADPS2, in due fratelli con RM e tratti autistici, probabilmente ereditata dalla madre. Lo screening di mutazione di questo gene in individui con autismo o RM, ha portato all’identificazione di 3 varianti non sinonime, assenti nei controlli, ed ereditate per via materna. Poiché CADPS2 risiede in una regione genomica che contiene loci soggetti ad imprinting, abbiamo ipotizzato che il gene CADPS2 possa essere anch’esso caratterizzato da imprinting, con espressione monoallelica materna. Lo studio di espressione di CADPS2 in cellule del sangue ha avvalorato questa ipotesi, implicando perciò CADPS2 come un nuovo gene di suscettibilità per il RM e DSA.
Resumo:
Submicroscopic changes in chromosomal DNA copy number dosage are common and have been implicated in many heritable diseases and cancers. Recent high-throughput technologies have a resolution that permits the detection of segmental changes in DNA copy number that span thousands of basepairs across the genome. Genome-wide association studies (GWAS) may simultaneously screen for copy number-phenotype and SNP-phenotype associations as part of the analytic strategy. However, genome-wide array analyses are particularly susceptible to batch effects as the logistics of preparing DNA and processing thousands of arrays often involves multiple laboratories and technicians, or changes over calendar time to the reagents and laboratory equipment. Failure to adjust for batch effects can lead to incorrect inference and requires inefficient post-hoc quality control procedures that exclude regions that are associated with batch. Our work extends previous model-based approaches for copy number estimation by explicitly modeling batch effects and using shrinkage to improve locus-specific estimates of copy number uncertainty. Key features of this approach include the use of diallelic genotype calls from experimental data to estimate batch- and locus-specific parameters of background and signal without the requirement of training data. We illustrate these ideas using a study of bipolar disease and a study of chromosome 21 trisomy. The former has batch effects that dominate much of the observed variation in quantile-normalized intensities, while the latter illustrates the robustness of our approach to datasets where as many as 25% of the samples have altered copy number. Locus-specific estimates of copy number can be plotted on the copy-number scale to investigate mosaicism and guide the choice of appropriate downstream approaches for smoothing the copy number as a function of physical position. The software is open source and implemented in the R package CRLMM available at Bioconductor (http:www.bioconductor.org).
Resumo:
The recognition that colorectal cancer (CRC) is a heterogeneous disease in terms of clinical behaviour and response to therapy translates into an urgent need for robust molecular disease subclassifiers that can explain this heterogeneity beyond current parameters (MSI, KRAS, BRAF). Attempts to fill this gap are emerging. The Cancer Genome Atlas (TGCA) reported two main CRC groups, based on the incidence and spectrum of mutated genes, and another paper reported an EMT expression signature defined subgroup. We performed a prior free analysis of CRC heterogeneity on 1113 CRC gene expression profiles and confronted our findings to established molecular determinants and clinical, histopathological and survival data. Unsupervised clustering based on gene modules allowed us to distinguish at least five different gene expression CRC subtypes, which we call surface crypt-like, lower crypt-like, CIMP-H-like, mesenchymal and mixed. A gene set enrichment analysis combined with literature search of gene module members identified distinct biological motifs in different subtypes. The subtypes, which were not derived based on outcome, nonetheless showed differences in prognosis. Known gene copy number variations and mutations in key cancer-associated genes differed between subtypes, but the subtypes provided molecular information beyond that contained in these variables. Morphological features significantly differed between subtypes. The objective existence of the subtypes and their clinical and molecular characteristics were validated in an independent set of 720 CRC expression profiles. Our subtypes provide a novel perspective on the heterogeneity of CRC. The proposed subtypes should be further explored retrospectively on existing clinical trial datasets and, when sufficiently robust, be prospectively assessed for clinical relevance in terms of prognosis and treatment response predictive capacity. Original microarray data were uploaded to the ArrayExpress database (http://www.ebi.ac.uk/arrayexpress/) under Accession Nos E-MTAB-990 and E-MTAB-1026. © 2013 Swiss Institute of Bioinformatics. Journal of Pathology published by John Wiley & Sons Ltd on behalf of Pathological Society of Great Britain and Ireland.
Resumo:
Gene copy number polymorphism was studied in a population of the arbuscular mycorrhizal fungus Glomus intraradices by using a quantitative PCR approach on four different genomic regions. Variation in gene copy number was found for a pseudogene and for three ribosomal genes, providing conclusive evidence for a widespread occurrence of macromutational events in the population.
Resumo:
BACKGROUND The copy number variation (CNV) in beta-defensin genes (DEFB) on human chromosome 8p23 has been proposed to contribute to the phenotypic differences in inflammatory diseases. However, determination of exact DEFB CN is a major challenge in association studies. Quantitative real-time PCR (qPCR), paralog ratio tests (PRT) and multiplex ligation-dependent probe amplification (MLPA) have been extensively used to determine DEFB CN in different laboratories, but inter-method inconsistencies were observed frequently. In this study we asked which one is superior among the three methods for DEFB CN determination. RESULTS We developed a clustering approach for MLPA and PRT to statistically correlate data from a single experiment. Then we compared qPCR, a newly designed PRT and MLPA for DEFB CN determination in 285 DNA samples. We found MLPA had the best convergence and clustering results of the raw data and the highest call rate. In addition, the concordance rates between MLPA or PRT and qPCR (32.12% and 37.99%, respectively) were unacceptably low with underestimated CN by qPCR. Concordance rate between MLPA and PRT (90.52%) was high but PRT systematically underestimated CN by one in a subset of samples. In these samples a sequence variant which caused complete PCR dropout of the respective DEFB cluster copies was found in one primer binding site of one of the targeted paralogous pseudogenes. CONCLUSION MLPA is superior to PRT and even more to qPCR for DEFB CN determination. Although the applied PRT provides in most cases reliable results, such a test is particularly sensitive to low-frequency sequence variations preferably accumulating in loci like pseudogenes which are most likely not under selective pressure. In the light of the superior performance of multiplex assays, the drawbacks of such single PRTs could be overcome by combining more test markers.
Resumo:
We present a general method for rigorously identifying correlations between variations in large-scale molecular profiles and outcomes and apply it to chromosomal comparative genomic hybridization data from a set of 52 breast tumors. We identify two loci where copy number abnormalities are correlated with poor survival outcome (gain at 8q24 and loss at 9q13). We also identify a relationship between abnormalities at two loci and the mutational status of p53. Gain at 8q24 and loss at 5q15-5q21 are linked with mutant p53. The 9q and 5q losses suggest the possibility of gene products involved in breast cancer progression. The analytical techniques are general and also are applicable to the analysis of array-based expression data.
Resumo:
Tetralogy of Fallot (TOF), the most common severe congenital heart malformation, occurs sporadically, without other anomaly, and from unknown cause in 70% of cases. Through a genome-wide survey of 114 subjects with TOF and their unaffected parents, we identified 11 de novo copy number variants (CNVs) that were absent or extremely rare (<0.1%) in 2,265 controls. We then examined a second, independent TOF cohort (n = 398) for additional CNVs at these loci. We identified CNVs at chromosome 1q21.1 in 1% (5/512, P = 0.0002, OR = 22.3) of nonsyndromic sporadic TOF cases. We also identified recurrent CNVs at 3p25.1, 7p21.3 and 22q11.2. CNVs in a single subject with TOF occurred at six loci, two that encode known (NOTCH1, JAG1) disease-associated genes. Our findings predict that at least 10% (4.5-15.5%, 95% confidence interval) of sporadic nonsyndromic TOF cases result from de novo CNVs and suggest that mutations within these loci might be etiologic in other cases of TOF.
Resumo:
Plasmids are mobile genetic elements of bacteria that can impart important adaptive traits, such as increased virulence or antibiotic resistance. We report the existence of plasmids in Rickettsia (Rickettsiales; Rickettsiaceae) species, including Rickettsia akari, ""Candidatus Rickettsia amblyommii,"" R. bellii, R. rhipicephali, and REIS, the rickettsial endosymbiont of Ixodes scapularis. All of the rickettsiae were isolated from humans or North and South American ticks. R. parkeri isolates from both continents did not possess plasmids. We have now demonstrated plasmids in nearly all Rickettsia species that we have surveyed from three continents, which represent three of the four major proposed phylogenetic groups associated with blood-feeding arthropods. Gel-based evidence consistent with the existence of multiple plasmids in some species was confirmed by cloning plasmids with very different sequences from each of two ""Ca. Rickettsia amblyommii"" isolates. Phylogenetic analysis of rickettsial ParA plasmid partitioning proteins indicated multiple parA gene origins and plasmid incompatibility groups, consistent with possible multiple plasmid origins. Phylogenetic analysis of potentially host-adaptive rickettsial small heat shock proteins showed that hsp2 genes were plasmid specific and that hsp1 genes, found only on plasmids of ""Ca. Rickettsia amblyommii,"" R. felis, R. monacensis, and R. peacockii, were probably acquired independently of the hsp2 genes. Plasmid copy numbers in seven Rickettsia species ranged from 2.4 to 9.2 per chromosomal equivalent, as determined by real-time quantitative PCR. Plasmids may be of significance in rickettsial evolution and epidemiology by conferring genetic plasticity and host-adaptive traits via horizontal gene transfer that counteracts the reductive genome evolution typical of obligate intracellular bacteria.
Resumo:
A large fraction of genome variation between individuals is comprised of submicroscopic copy number variation of genomic DNA segments. We assessed the relative contribution of structural changes and gene dosage alterations on phenotypic outcomes with mouse models of Smith-Magenis and Potocki-Lupski syndromes. We phenotyped mice with 1n (Deletion/+), 2n (+/+), 3n (Duplication/+), and balanced 2n compound heterozygous (Deletion/Duplication) copies of the same region. Parallel to the observations made in humans, such variation in gene copy number was sufficient to generate phenotypic consequences: in a number of cases diametrically opposing phenotypes were associated with gain versus loss of gene content. Surprisingly, some neurobehavioral traits were not rescued by restoration of the normal gene copy number. Transcriptome profiling showed that a highly significant propensity of transcriptional changes map to the engineered interval in the five assessed tissues. A statistically significant overrepresentation of the genes mapping to the entire length of the engineered chromosome was also found in the top-ranked differentially expressed genes in the mice containing rearranged chromosomes, regardless of the nature of the rearrangement, an observation robust across different cell lineages of the central nervous system. Our data indicate that a structural change at a given position of the human genome may affect not only locus and adjacent gene expression but also "genome regulation." Furthermore, structural change can cause the same perturbation in particular pathways regardless of gene dosage. Thus, the presence of a genomic structural change, as well as gene dosage imbalance, contributes to the ultimate phenotype.
Resumo:
To develop a comprehensive overview of copy number aberrations (CNAs) in stage-II/III colorectal cancer (CRC), we characterized 302 tumors from the PETACC-3 clinical trial. Microsatellite-stable (MSS) samples (n = 269) had 66 minimal common CNA regions, with frequent gains on 20 q (72.5%), 7 (41.8%), 8 q (33.1%) and 13 q (51.0%) and losses on 18 (58.6%), 4 q (26%) and 21 q (21.6%). MSS tumors have significantly more CNAs than microsatellite-instable (MSI) tumors: within the MSI tumors a novel deletion of the tumor suppressor WWOX at 16 q23.1 was identified (p<0.01). Focal aberrations identified by the GISTIC method confirmed amplifications of oncogenes including EGFR, ERBB2, CCND1, MET, and MYC, and deletions of tumor suppressors including TP53, APC, and SMAD4, and gene expression was highly concordant with copy number aberration for these genes. Novel amplicons included putative oncogenes such as WNK1 and HNF4A, which also showed high concordance between copy number and expression. Survival analysis associated a specific patient segment featured by chromosome 20 q gains to an improved overall survival, which might be due to higher expression of genes such as EEF1B2 and PTK6. The CNA clustering also grouped tumors characterized by a poor prognosis BRAF-mutant-like signature derived from mRNA data from this cohort. We further revealed non-random correlation between CNAs among unlinked loci, including positive correlation between 20 q gain and 8 q gain, and 20 q gain and chromosome 18 loss, consistent with co-selection of these CNAs. These results reinforce the non-random nature of somatic CNAs in stage-II/III CRC and highlight loci and genes that may play an important role in driving the development and outcome of this disease.
Resumo:
Background: Prognostic and predictive markers are of great importance for future study designs and essential for the interpretation of clinical trials incorporating an EGFR-inhibitor. The current study prospectively assessed and validated KRAS, BRAF and PIK3CA mutations in rectal cancer patients screened for the trial SAKK41/07 of concomitant preoperative radio-chemotherapy with or without panitumumab.Methods: Macrodissection was performed on pretreatment formalin fixed paraffin embedded biopsy tissue sections to arrive at a minimum of 50% of tumor cells. DNA was extracted with the Maxwell 16 FFPE Tissue LEV DNA purification kit. After PCR amplification, mutations were identified by pyrosequencing. We prospectively analysed pretreatment biopsy material from 149 rectal cancer pts biopsies for KRAS (exon 2 codon 12 [2-12] and 13 [2-13], exon 3 codon 59 [3-59]) and 61 [3-61], exon 4 codon 117 [4-117] and 146 [4-146]). Sixty-eight pts (KRASwt exon 2, 3 only) were further analysed for BRAF (exon 15 codon 600) and PIK3CA (exon 9 codon 542, 545 and 546, exon 20 codon 1043 [20-1043] and 1047 [20-1047]) mutations, and EGFR copy number by qPCR. For the calculation of the EGFR copy number, we used KRAS copy number as internal reference standard. The calculation was done on the basis of the two standard curves relative quantification method.Results: In 149 screened pts with rectal cancer, the prevalence of KRAS mutations was 36%. Among the 68 pts enrolled in SAKK 41/07 based on initially presumed KRASwt status (exon 2/codons 12+13), 18 pts (26%) had a total of 23 mutations in the RAS/PIK3CA-pathways upon validation analysis. Twelve pts had a KRAS mutation, 7 pts had a PIK3CA mutation, 3 pts had a NRAS mutation, 1 patient a BRAF mutation. Surprisingly, five of these pts had double- mutations, including 4 pts with KRAS plus PIK3CA mutations, and 1 pt with NRAS plus PIK3CA mutations. The median normalized EGFR copy number was 1. Neither mutations of KRAS, BRAF, and PIK3CA, nor EGFR copy number were statistically associated with the primary study endpoint pCR (pathological complete regression).Conclusions: The prevalence of KRAS mutations in rectal and in colon cancer appears to be similar. BRAF mutations are rare; PIK3CA mutations are more common (10%). EGFR copy number is not increased in rectal cancer. A considerable number or KRAS exon 2 wt tumors harbored KRAS exon 3+4 mutations. Further study is needed to determine if KRAS testing should include exons 2-4.
Resumo:
Cancer genomes frequently contain somatic copy number alterations (SCNA) that can significantly perturb the expression level of affected genes and thus disrupt pathways controlling normal growth. In melanoma, many studies have focussed on the copy number and gene expression levels of the BRAF, PTEN and MITF genes, but little has been done to identify new genes using these parameters at the genome-wide scale. Using karyotyping, SNP and CGH arrays, and RNA-seq, we have identified SCNA affecting gene expression ('SCNA-genes') in seven human metastatic melanoma cell lines. We showed that the combination of these techniques is useful to identify candidate genes potentially involved in tumorigenesis. Since few of these alterations were recurrent across our samples, we used a protein network-guided approach to determine whether any pathways were enriched in SCNA-genes in one or more samples. From this unbiased genome-wide analysis, we identified 28 significantly enriched pathway modules. Comparison with two large, independent melanoma SCNA datasets showed less than 10% overlap at the individual gene level, but network-guided analysis revealed 66% shared pathways, including all but three of the pathways identified in our data. Frequently altered pathways included WNT, cadherin signalling, angiogenesis and melanogenesis. Additionally, our results emphasize the potential of the EPHA3 and FRS2 gene products, involved in angiogenesis and migration, as possible therapeutic targets in melanoma. Our study demonstrates the utility of network-guided approaches, for both large and small datasets, to identify pathways recurrently perturbed in cancer.
Resumo:
We report 24 unrelated individuals with deletions and 17 additional cases with duplications at 10q11.21q21.1 identified by chromosomal microarray analysis. The rearrangements range in size from 0.3 to 12 Mb. Nineteen of the deletions and eight duplications are flanked by large, directly oriented segmental duplications of >98% sequence identity, suggesting that nonallelic homologous recombination (NAHR) caused these genomic rearrangements. Nine individuals with deletions and five with duplications have additional copy number changes. Detailed clinical evaluation of 20 patients with deletions revealed variable clinical features, with developmental delay (DD) and/or intellectual disability (ID) as the only features common to a majority of individuals. We suggest that some of the other features present in more than one patient with deletion, including hypotonia, sleep apnea, chronic constipation, gastroesophageal and vesicoureteral refluxes, epilepsy, ataxia, dysphagia, nystagmus, and ptosis may result from deletion of the CHAT gene, encoding choline acetyltransferase, and the SLC18A3 gene, mapping in the first intron of CHAT and encoding vesicular acetylcholine transporter. The phenotypic diversity and presence of the deletion in apparently normal carrier parents suggest that subjects carrying 10q11.21q11.23 deletions may exhibit variable phenotypic expressivity and incomplete penetrance influenced by additional genetic and nongenetic modifiers.