949 resultados para copy number variation
Resumo:
Arbuscular mycorrhizal fungi (AMF) are ancient asexually reproducing organisms that form symbioses with the majority of plant species, improving plant nutrition and promoting plant diversity. Little is known about the evolution or organization of the genomes of any eukaryotic symbiont or ancient asexual organism. Direct evidence shows that one AMF species is heterokaryotic; that is, containing populations of genetically different nuclei. It has been suggested, however, that the genetic variation passed from generation to generation in AMF is simply due to multiple chromosome sets (that is, high ploidy). Here we show that previously documented genetic variation in Pol-like sequences, which are passed from generation to generation, cannot be due to either high ploidy or repeated gene duplications. Our results provide the clearest evidence so far for substantial genetic differences among nuclei in AMF. We also show that even AMF with a very large nuclear DNA content are haploid. An underlying principle of evolutionary theory is that an individual passes on one or half of its genome to each of its progeny. The coexistence of a population of many genomes in AMF and their transfer to subsequent generations, therefore, has far-reaching consequences for understanding genome evolution.
Resumo:
BACKGROUND Hirschsprung disease (HSCR) is a congenital malformation of the hindgut produced by a disruption in neural crest cell migration during embryonic development. HSCR has a complex genetic etiology and mutations in several genes, mainly the RET proto-oncogene, have been related to the disease. There is a clear predominance of missense/nonsense mutations in these genes whereas copy number variations (CNVs) have been seldom described, probably due to the limitations of conventional techniques usually employed for mutational analysis. METHODS In this study we have aimed to analyze the presence of CNVs in some HSCR genes (RET, EDN3, GDNF and ZFHX1B) using the Multiple Ligation-dependent Probe Amplification (MLPA) approach. RESULTS Two alterations in the MLPA profiles of RET and EDN3 were detected, but a detailed inspection showed that the decrease in the corresponding dosages were due to point mutations affecting the hybridization probes regions. CONCLUSION Our results indicate that CNVs of the gene coding regions analyzed here are not a common molecular cause of Hirschsprung disease. However, further studies are required to determine the presence of CNVs affecting non-coding regulatory regions, as well as other candidate genes.
Resumo:
Gene copy number polymorphism was studied in a population of the arbuscular mycorrhizal fungus Glomus intraradices by using a quantitative PCR approach on four different genomic regions. Variation in gene copy number was found for a pseudogene and for three ribosomal genes, providing conclusive evidence for a widespread occurrence of macromutational events in the population.
Resumo:
AbstractAlthough the genomes from any two human individuals are more than 99.99% identical at the sequence level, some structural variation can be observed. Differences between genomes include single nucleotide polymorphism (SNP), inversion and copy number changes (gain or loss of DNA). The latter can range from submicroscopic events (CNVs, at least 1kb in size) to complete chromosomal aneuploidies. Small copy number variations have often no (lethal) consequences to the cell, but a few were associated to disease susceptibility and phenotypic variations. Larger re-arrangements (i.e. complete chromosome gain) are frequently associated with more severe consequences on health such as genomic disorders and cancer. High-throughput technologies like DNA microarrays enable the detection of CNVs in a genome-wide fashion. Since the initial catalogue of CNVs in the human genome in 2006, there has been tremendous interest in CNVs both in the context of population and medical genetics. Understanding CNV patterns within and between human populations is essential to elucidate their possible contribution to disease. But genome analysis is a challenging task; the technology evolves rapidly creating needs for novel, efficient and robust analytical tools which need to be compared with existing ones. Also, while the link between CNV and disease has been established, the relative CNV contribution is not fully understood and the predisposition to disease from CNVs of the general population has not been yet investigated.During my PhD thesis, I worked on several aspects related to CNVs. As l will report in chapter 3, ! was interested in computational methods to detect CNVs from the general population. I had access to the CoLaus dataset, a population-based study with more than 6,000 participants from the Lausanne area. All these individuals were analysed on SNP arrays and extensive clinical information were available. My work explored existing CNV detection methods and I developed a variety of metrics to compare their performance. Since these methods were not producing entirely satisfactory results, I implemented my own method which outperformed two existing methods. I also devised strategies to combine CNVs from different individuals into CNV regions.I was also interested in the clinical impact of CNVs in common disease (chapter 4). Through an international collaboration led by the Centre Hospitalier Universitaire Vaudois (CHUV) and the Imperial College London I was involved as a main data analyst in the investigation of a rare deletion at chromosome 16p11 detected in obese patients. Specifically, we compared 8,456 obese patients and 11,856 individuals from the general population and we found that the deletion was accounting for 0.7% of the morbid obesity cases and was absent in healthy non- obese controls. This highlights the importance of rare variants with strong impact and provides new insights in the design of clinical studies to identify the missing heritability in common disease.Furthermore, I was interested in the detection of somatic copy number alterations (SCNA) and their consequences in cancer (chapter 5). This project was a collaboration initiated by the Ludwig Institute for Cancer Research and involved other groups from the Swiss Institute of Bioinformatics, the CHUV and Universities of Lausanne and Geneva. The focus of my work was to identify genes with altered expression levels within somatic copy number alterations (SCNA) in seven metastatic melanoma ceil lines, using CGH and SNP arrays, RNA-seq, and karyotyping. Very few SCNA genes were shared by even two melanoma samples making it difficult to draw any conclusions at the individual gene level. To overcome this limitation, I used a network-guided analysis to determine whether any pathways, defined by amplified or deleted genes, were common among the samples. Six of the melanoma samples were potentially altered in four pathways and five samples harboured copy-number and expression changes in components of six pathways. In total, this approach identified 28 pathways. Validation with two external, large melanoma datasets confirmed all but three of the detected pathways and demonstrated the utility of network-guided approaches for both large and small datasets analysis.RésuméBien que le génome de deux individus soit similaire à plus de 99.99%, des différences de structure peuvent être observées. Ces différences incluent les polymorphismes simples de nucléotides, les inversions et les changements en nombre de copies (gain ou perte d'ADN). Ces derniers varient de petits événements dits sous-microscopiques (moins de 1kb en taille), appelés CNVs (copy number variants) jusqu'à des événements plus large pouvant affecter des chromosomes entiers. Les petites variations sont généralement sans conséquence pour la cellule, toutefois certaines ont été impliquées dans la prédisposition à certaines maladies, et à des variations phénotypiques dans la population générale. Les réarrangements plus grands (par exemple, une copie additionnelle d'un chromosome appelée communément trisomie) ont des répercutions plus grave pour la santé, comme par exemple dans certains syndromes génomiques et dans le cancer. Les technologies à haut-débit telle les puces à ADN permettent la détection de CNVs à l'échelle du génome humain. La cartographie en 2006 des CNV du génome humain, a suscité un fort intérêt en génétique des populations et en génétique médicale. La détection de différences au sein et entre plusieurs populations est un élément clef pour élucider la contribution possible des CNVs dans les maladies. Toutefois l'analyse du génome reste une tâche difficile, la technologie évolue très rapidement créant de nouveaux besoins pour le développement d'outils, l'amélioration des précédents, et la comparaison des différentes méthodes. De plus, si le lien entre CNV et maladie a été établit, leur contribution précise n'est pas encore comprise. De même que les études sur la prédisposition aux maladies par des CNVs détectés dans la population générale n'ont pas encore été réalisées.Pendant mon doctorat, je me suis concentré sur trois axes principaux ayant attrait aux CNV. Dans le chapitre 3, je détaille mes travaux sur les méthodes d'analyses des puces à ADN. J'ai eu accès aux données du projet CoLaus, une étude de la population de Lausanne. Dans cette étude, le génome de plus de 6000 individus a été analysé avec des puces SNP et de nombreuses informations cliniques ont été récoltées. Pendant mes travaux, j'ai utilisé et comparé plusieurs méthodes de détection des CNVs. Les résultats n'étant pas complètement satisfaisant, j'ai implémenté ma propre méthode qui donne de meilleures performances que deux des trois autres méthodes utilisées. Je me suis aussi intéressé aux stratégies pour combiner les CNVs de différents individus en régions.Je me suis aussi intéressé à l'impact clinique des CNVs dans le cas des maladies génétiques communes (chapitre 4). Ce projet fut possible grâce à une étroite collaboration avec le Centre Hospitalier Universitaire Vaudois (CHUV) et l'Impérial College à Londres. Dans ce projet, j'ai été l'un des analystes principaux et j'ai travaillé sur l'impact clinique d'une délétion rare du chromosome 16p11 présente chez des patients atteints d'obésité. Dans cette collaboration multidisciplinaire, nous avons comparés 8'456 patients atteint d'obésité et 11 '856 individus de la population générale. Nous avons trouvés que la délétion était impliquée dans 0.7% des cas d'obésité morbide et était absente chez les contrôles sains (non-atteint d'obésité). Notre étude illustre l'importance des CNVs rares qui peuvent avoir un impact clinique très important. De plus, ceci permet d'envisager une alternative aux études d'associations pour améliorer notre compréhension de l'étiologie des maladies génétiques communes.Egalement, j'ai travaillé sur la détection d'altérations somatiques en nombres de copies (SCNA) et de leurs conséquences pour le cancer (chapitre 5). Ce projet fut une collaboration initiée par l'Institut Ludwig de Recherche contre le Cancer et impliquant l'Institut Suisse de Bioinformatique, le CHUV et les Universités de Lausanne et Genève. Je me suis concentré sur l'identification de gènes affectés par des SCNAs et avec une sur- ou sous-expression dans des lignées cellulaires dérivées de mélanomes métastatiques. Les données utilisées ont été générées par des puces ADN (CGH et SNP) et du séquençage à haut débit du transcriptome. Mes recherches ont montrées que peu de gènes sont récurrents entre les mélanomes, ce qui rend difficile l'interprétation des résultats. Pour contourner ces limitations, j'ai utilisé une analyse de réseaux pour définir si des réseaux de signalisations enrichis en gènes amplifiés ou perdus, étaient communs aux différents échantillons. En fait, parmi les 28 réseaux détectés, quatre réseaux sont potentiellement dérégulés chez six mélanomes, et six réseaux supplémentaires sont affectés chez cinq mélanomes. La validation de ces résultats avec deux larges jeux de données publiques, a confirmée tous ces réseaux sauf trois. Ceci démontre l'utilité de cette approche pour l'analyse de petits et de larges jeux de données.Résumé grand publicL'avènement de la biologie moléculaire, en particulier ces dix dernières années, a révolutionné la recherche en génétique médicale. Grâce à la disponibilité du génome humain de référence dès 2001, de nouvelles technologies telles que les puces à ADN sont apparues et ont permis d'étudier le génome dans son ensemble avec une résolution dite sous-microscopique jusque-là impossible par les techniques traditionnelles de cytogénétique. Un des exemples les plus importants est l'étude des variations structurales du génome, en particulier l'étude du nombre de copies des gènes. Il était établi dès 1959 avec l'identification de la trisomie 21 par le professeur Jérôme Lejeune que le gain d'un chromosome supplémentaire était à l'origine de syndrome génétique avec des répercussions graves pour la santé du patient. Ces observations ont également été réalisées en oncologie sur les cellules cancéreuses qui accumulent fréquemment des aberrations en nombre de copies (telles que la perte ou le gain d'un ou plusieurs chromosomes). Dès 2004, plusieurs groupes de recherches ont répertorié des changements en nombre de copies dans des individus provenant de la population générale (c'est-à-dire sans symptômes cliniques visibles). En 2006, le Dr. Richard Redon a établi la première carte de variation en nombre de copies dans la population générale. Ces découvertes ont démontrées que les variations dans le génome était fréquentes et que la plupart d'entre elles étaient bénignes, c'est-à-dire sans conséquence clinique pour la santé de l'individu. Ceci a suscité un très grand intérêt pour comprendre les variations naturelles entre individus mais aussi pour mieux appréhender la prédisposition génétique à certaines maladies.Lors de ma thèse, j'ai développé de nouveaux outils informatiques pour l'analyse de puces à ADN dans le but de cartographier ces variations à l'échelle génomique. J'ai utilisé ces outils pour établir les variations dans la population suisse et je me suis consacré par la suite à l'étude de facteurs pouvant expliquer la prédisposition aux maladies telles que l'obésité. Cette étude en collaboration avec le Centre Hospitalier Universitaire Vaudois a permis l'identification d'une délétion sur le chromosome 16 expliquant 0.7% des cas d'obésité morbide. Cette étude a plusieurs répercussions. Tout d'abord elle permet d'effectuer le diagnostique chez les enfants à naître afin de déterminer leur prédisposition à l'obésité. Ensuite ce locus implique une vingtaine de gènes. Ceci permet de formuler de nouvelles hypothèses de travail et d'orienter la recherche afin d'améliorer notre compréhension de la maladie et l'espoir de découvrir un nouveau traitement Enfin notre étude fournit une alternative aux études d'association génétique qui n'ont eu jusqu'à présent qu'un succès mitigé.Dans la dernière partie de ma thèse, je me suis intéressé à l'analyse des aberrations en nombre de copies dans le cancer. Mon choix s'est porté sur l'étude de mélanomes, impliqués dans le cancer de la peau. Le mélanome est une tumeur très agressive, elle est responsable de 80% des décès des cancers de la peau et est souvent résistante aux traitements utilisés en oncologie (chimiothérapie, radiothérapie). Dans le cadre d'une collaboration entre l'Institut Ludwig de Recherche contre le Cancer, l'Institut Suisse de Bioinformatique, le CHUV et les universités de Lausanne et Genève, nous avons séquencés l'exome (les gènes) et le transcriptome (l'expression des gènes) de sept mélanomes métastatiques, effectués des analyses du nombre de copies par des puces à ADN et des caryotypes. Mes travaux ont permis le développement de nouvelles méthodes d'analyses adaptées au cancer, d'établir la liste des réseaux de signalisation cellulaire affectés de façon récurrente chez le mélanome et d'identifier deux cibles thérapeutiques potentielles jusqu'alors ignorées dans les cancers de la peau.
Resumo:
The P transposable element copy numbers and the KP/full-sized P element ratios were determined in eight Brazilian strains of Drosophila melanogaster. Strains from tropical regions showed lower overall P element copy numbers than did strains from temperate regions. Variable numbers of full-sized and defective elements were detected, but the full-sized P and KP elements were the predominant classes of elements in all strains. The full-sized P and KP element ratios were calculated and compared with latitude. The northernmost and southernmost Brazilian strains showed fewer full-sized elements than KP elements per genome, and the strains from less extreme latitudes had many more full-sized P than KP elements. However, no clinal variation was observed. Strains from different localities, previously classified as having P cytotype, displayed a higher or a lower proportion of KP elements than of full-sized P elements, as well as an equal number of the two element types, showing that the same phenotype may be produced by different underlying genomic components of the P-M system.
Resumo:
Sandhoff disease (SD) is a lysosomal disorder caused by mutations in the HEXB gene. To date, 43 mutations of HEXB have been described, including 3 large deletions. Here, we have characterized 14 unrelated SD patients and developed a Multiplex Ligation-dependent Probe Amplification (MLPA) assay to investigate the presence of large HEXB deletions. Overall, we identified 16 alleles, 9 of which were novel, including 4 sequence variation leading to aminoacid changes [c.626C>T (p.T209I), c.634C>A (p.H212N), c.926G>T (p.C309F), c.1451G>A (p.G484E)] 3 intronic mutations (c.1082+5G>A, c.1242+1G>A, c.1169+5G>A), 1 nonsense mutation c.146C>A (p.S49X) and 1 small in-frame deletion c.1260_1265delAGTTGA (p.V421_E422del). Using the new MLPA assay, 2 previously described deletions were identified. In vitro expression studies showed that proteins bearing aminoacid changes p.T209I and p.G484E presented a very low or absent activity, while proteins bearing the p.H212N and p.C309F changes retained a significant residual activity. The detrimental effect of the 3 novel intronic mutations on the HEXB mRNA processing was demonstrated using a minigene assay. Unprecedentedly, minigene studies revealed the presence of a novel alternative spliced HEXB mRNA variant also present in normal cells. In conclusion, we provided new insights into the molecular basis of SD and validated an MLPA assay for detecting large HEXB deletions.
Resumo:
Specific language impairment (SLI) is a complex neurodevelopmental disorder defined as an unexpected failure to develop normal language abilities for no obvious reason. Copy number variants (CNVs) are an important source of variation in the susceptibility to neuropsychiatric disorders. Therefore, a CNV study within SLI families was performed to investigate the role of structural variants in SLI. Among the identified CNVs, we focused on CNVs on chromosome 15q11-q13, recurrently observed in neuropsychiatric conditions, and a homozygous exonic microdeletion in ZNF277. Since this microdeletion falls within the AUTS1 locus, a region linked to autism spectrum disorders (ASD), we investigated a potential role of ZNF277 in SLI and ASD. Frequency data and expression analysis of the ZNF277 microdeletion suggested that this variant may contribute to the risk of language impairments in a complex manner, that is independent of the autism risk previously described in this region. Moreover, we identified an affected individual with a dihydropyrimidine dehydrogenase (DPD) deficiency, caused by compound heterozygosity of two deleterious variants in the gene DPYD. Since DPYD represents a good candidate gene for both SLI and ASD, we investigated its involvement in the susceptibility to these two disorders, focusing on the splicing variant rs3918290, the most common mutation in the DPD deficiency. We observed a higher frequency of rs3918290 in SLI cases (1.2%), compared to controls (~0.6%), while no difference was observed in a large ASD cohort. DPYD mutation screening in 4 SLI and 7 ASD families carrying the splicing variant identified six known missense changes and a novel variant in the promoter region. These data suggest that the combined effect of the mutations identified in affected individuals may lead to an altered DPD activity and that rare variants in DPYD might contribute to a minority of cases, in conjunction with other genetic or non-genetic factors.
Resumo:
Autism spectrum disorder (ASD) and Intellectual Disability (ID) are complex neuropsychiatric disorders characterized by extensive clinical and genetic heterogeneity and with overlapping risk factors. The aim of my project was to further investigate the role of Copy Numbers Variants (CNVs), identified through genome-wide studies performed by the Autism Geome Project (AGP) and the CHERISH consortium in large cohorts of ASD and ID cases, respectively. Specifically, I focused on four rare genic CNVs, selected on the basis of their impact on interesting ASD/ID candidate genes: a) a compound heterozygous deletion involving CTNNA3, predicted to cause the lack of functional protein; b) a 15q13.3 duplication containing CHRNA7; c) a 2q31.1 microdeletion encompassing KLHL23, SSB and METTL5; d) Lastly, I investigated the putative imprinting regulation of the CADPS2 gene, disrupted by a maternal deletion in two siblings with ASD and ID. This study provides further evidence for the role of CTNNA3, CHRNA7, KLHL23 and CADPS2 as ASD and/or ID susceptibility genes, and highlights that rare genetic variation contributes to disease risk in different ways: some rare mutations, such as those impacting CTNNA3, act in a recessive mode of inheritance, while other CNVs, such as those occurring in the 15q13.3 region, are implicated in multiple developmental and/or neurological disorders possibly interacting with other susceptibility variants elsewhere in the genome. On the other hand, the discovery of a tissue-specific monoallelic expression for the CADPS2 gene, implicates the involvement of epigenetic regulatory mechanisms as risk factors conferring susceptibility to ASD/ID.
Resumo:
Submicroscopic changes in chromosomal DNA copy number dosage are common and have been implicated in many heritable diseases and cancers. Recent high-throughput technologies have a resolution that permits the detection of segmental changes in DNA copy number that span thousands of basepairs across the genome. Genome-wide association studies (GWAS) may simultaneously screen for copy number-phenotype and SNP-phenotype associations as part of the analytic strategy. However, genome-wide array analyses are particularly susceptible to batch effects as the logistics of preparing DNA and processing thousands of arrays often involves multiple laboratories and technicians, or changes over calendar time to the reagents and laboratory equipment. Failure to adjust for batch effects can lead to incorrect inference and requires inefficient post-hoc quality control procedures that exclude regions that are associated with batch. Our work extends previous model-based approaches for copy number estimation by explicitly modeling batch effects and using shrinkage to improve locus-specific estimates of copy number uncertainty. Key features of this approach include the use of diallelic genotype calls from experimental data to estimate batch- and locus-specific parameters of background and signal without the requirement of training data. We illustrate these ideas using a study of bipolar disease and a study of chromosome 21 trisomy. The former has batch effects that dominate much of the observed variation in quantile-normalized intensities, while the latter illustrates the robustness of our approach to datasets where as many as 25% of the samples have altered copy number. Locus-specific estimates of copy number can be plotted on the copy-number scale to investigate mosaicism and guide the choice of appropriate downstream approaches for smoothing the copy number as a function of physical position. The software is open source and implemented in the R package CRLMM available at Bioconductor (http:www.bioconductor.org).
Resumo:
The reliable quantification of gene copy number variations is a precondition for future investigations regarding their functional relevance. To date, there is no generally accepted gold standard method for copy number quantification, and methods in current use have given inconsistent results in selected cohorts. In this study, we compare two methods for copy number quantification. beta-defensin gene copy numbers were determined in parallel in 80 genomic DNA samples by real-time PCR and multiplex ligation-dependent probe amplification (MLPA). The pyrosequencing-based paralog ratio test (PPRT) was used as a standard of comparison in 79 out of 80 samples. Realtime PCR and MPLA results confirmed concordant DEFB4, DEFB103A, and DEFB104A copy numbers within samples. These two methods showed identical results in 32 out of 80 samples; 29 of these 32 samples comprised four or fewer copies. The coefficient of variation of MLPA is lower compared with PCR. In addition, the consistency between MLPA and PPRT is higher than either PCR/MLPA or PCR/PPRT consistency. In summary, these results suggest that MLPA is superior to real-time PCR in beta-defensin copy number quantification.
Resumo:
Metastasizing pleomorphic adenoma (MPA) is a rare tumour, and its mechanism of metastasis still is unknown. To date, there has been no study on MPA genomics. We analysed primary and secondary MPAs with array comparative genomic hybridization to identify somatic copy number alterations and affected genes. Tumour DNA samples from primary (parotid salivary gland) and secondary (scalp skin) MPAs were subjected to array comparative genomic hybridization investigation, and the data were analysed with NEXUS COPY NUMBER DISCOVERY. The primary MPA showed copy number losses affecting 3p22.2p14.3 and 19p13.3p123, and a complex pattern of four different deletions at chromosome 6. The 3p deletion encompassed several genes: CTNNB1, SETD2, BAP1, and PBRM1, among others. The secondary MPA showed a genomic profile similar to that of the primary MPA, with acquisition of additional copy number changes affecting 9p24.3p13.1 (loss), 19q11q13.43 (gain), and 22q11.1q13.33 (gain). Our findings indicated a clonal origin of the secondary MPA, as both tumours shared a common profile of genomic copy number alterations. Furthermore, we were able to detect in the primary tumour a specific pattern of copy number alterations that could explain the metastasizing characteristic, whereas the secondary MPA showed a more unbalanced genome.
Resumo:
Tetralogy of Fallot (TOF), the most common severe congenital heart malformation, occurs sporadically, without other anomaly, and from unknown cause in 70% of cases. Through a genome-wide survey of 114 subjects with TOF and their unaffected parents, we identified 11 de novo copy number variants (CNVs) that were absent or extremely rare (<0.1%) in 2,265 controls. We then examined a second, independent TOF cohort (n = 398) for additional CNVs at these loci. We identified CNVs at chromosome 1q21.1 in 1% (5/512, P = 0.0002, OR = 22.3) of nonsyndromic sporadic TOF cases. We also identified recurrent CNVs at 3p25.1, 7p21.3 and 22q11.2. CNVs in a single subject with TOF occurred at six loci, two that encode known (NOTCH1, JAG1) disease-associated genes. Our findings predict that at least 10% (4.5-15.5%, 95% confidence interval) of sporadic nonsyndromic TOF cases result from de novo CNVs and suggest that mutations within these loci might be etiologic in other cases of TOF.
Resumo:
Plasmids are mobile genetic elements of bacteria that can impart important adaptive traits, such as increased virulence or antibiotic resistance. We report the existence of plasmids in Rickettsia (Rickettsiales; Rickettsiaceae) species, including Rickettsia akari, ""Candidatus Rickettsia amblyommii,"" R. bellii, R. rhipicephali, and REIS, the rickettsial endosymbiont of Ixodes scapularis. All of the rickettsiae were isolated from humans or North and South American ticks. R. parkeri isolates from both continents did not possess plasmids. We have now demonstrated plasmids in nearly all Rickettsia species that we have surveyed from three continents, which represent three of the four major proposed phylogenetic groups associated with blood-feeding arthropods. Gel-based evidence consistent with the existence of multiple plasmids in some species was confirmed by cloning plasmids with very different sequences from each of two ""Ca. Rickettsia amblyommii"" isolates. Phylogenetic analysis of rickettsial ParA plasmid partitioning proteins indicated multiple parA gene origins and plasmid incompatibility groups, consistent with possible multiple plasmid origins. Phylogenetic analysis of potentially host-adaptive rickettsial small heat shock proteins showed that hsp2 genes were plasmid specific and that hsp1 genes, found only on plasmids of ""Ca. Rickettsia amblyommii,"" R. felis, R. monacensis, and R. peacockii, were probably acquired independently of the hsp2 genes. Plasmid copy numbers in seven Rickettsia species ranged from 2.4 to 9.2 per chromosomal equivalent, as determined by real-time quantitative PCR. Plasmids may be of significance in rickettsial evolution and epidemiology by conferring genetic plasticity and host-adaptive traits via horizontal gene transfer that counteracts the reductive genome evolution typical of obligate intracellular bacteria.
Resumo:
To develop a comprehensive overview of copy number aberrations (CNAs) in stage-II/III colorectal cancer (CRC), we characterized 302 tumors from the PETACC-3 clinical trial. Microsatellite-stable (MSS) samples (n = 269) had 66 minimal common CNA regions, with frequent gains on 20 q (72.5%), 7 (41.8%), 8 q (33.1%) and 13 q (51.0%) and losses on 18 (58.6%), 4 q (26%) and 21 q (21.6%). MSS tumors have significantly more CNAs than microsatellite-instable (MSI) tumors: within the MSI tumors a novel deletion of the tumor suppressor WWOX at 16 q23.1 was identified (p<0.01). Focal aberrations identified by the GISTIC method confirmed amplifications of oncogenes including EGFR, ERBB2, CCND1, MET, and MYC, and deletions of tumor suppressors including TP53, APC, and SMAD4, and gene expression was highly concordant with copy number aberration for these genes. Novel amplicons included putative oncogenes such as WNK1 and HNF4A, which also showed high concordance between copy number and expression. Survival analysis associated a specific patient segment featured by chromosome 20 q gains to an improved overall survival, which might be due to higher expression of genes such as EEF1B2 and PTK6. The CNA clustering also grouped tumors characterized by a poor prognosis BRAF-mutant-like signature derived from mRNA data from this cohort. We further revealed non-random correlation between CNAs among unlinked loci, including positive correlation between 20 q gain and 8 q gain, and 20 q gain and chromosome 18 loss, consistent with co-selection of these CNAs. These results reinforce the non-random nature of somatic CNAs in stage-II/III CRC and highlight loci and genes that may play an important role in driving the development and outcome of this disease.