139 resultados para Whole exome sequencing
Resumo:
Gastric cancer is a major cause of global cancer mortality. We surveyed the spectrum of somatic alterations in gastric cancer by sequencing the exomes of 15 gastric adenocarcinomas and their matched normal DNAs. Frequently mutated genes in the adenocarcinomas included TP53 (11/15 tumors), PIK3CA (3/15) and ARID1A (3/15). Cell adhesion was the most enriched biological pathway among the frequently mutated genes. A prevalence screening confirmed mutations in FAT4, a cadherin family gene, in 5% of gastric cancers (6/110) and FAT4 genomic deletions in 4% (3/83) of gastric tumors. Frequent mutations in chromatin remodeling genes (ARID1A, MLL3 and MLL) also occurred in 47% of the gastric cancers. We detected ARID1A mutations in 8% of tumors (9/110), which were associated with concurrent PIK3CA mutations and microsatellite instability. In functional assays, we observed both FAT4 and ARID1A to exert tumor-suppressor activity. Somatic inactivation of FAT4 and ARID1A may thus be key tumorigenic events in a subset of gastric cancers.
Resumo:
BACKGROUND: Prostate cancer (PCa) is a clinically and pathologically heterogeneous disease. The rapid development of sequencing technology has the potential to deliver new biomarkers with emphasis on aggressive disease and to revolutionise personalised cancer treatment. However, a prostate harbouring cancer commonly contains multiple separate tumour foci, with the potential to aggravate tumour sampling. The level of intraprostatic tumour heterogeneity remains to be determined.
OBJECTIVE: To determine the level of intraprostatic tumour heterogeneity through genome-wide, high-resolution profiling of multiple tumour samples from the same individual.
DESIGN, SETTINGS, AND PARTICIPANTS: Multiple tumour samples were obtained from four individuals following radical prostatectomy. One individual (SWE-1) contained >70% cancer cells in all tumour samples, whereas the other three (SWE-2 to SWE-4) required the use of laser capture microdissection for tumour cell enrichment. Subsequently, DNA was extracted from all tissue samples, and exome sequencing was performed. All tumour foci of SWE-1 were also profiled using a high-resolution array for the identification of copy number alterations (CNA).
OUTCOME MEASUREMENTS AND STATISTICAL ANALYSIS: Shared somatic high-frequency single nucleotide variants (SNV) and CNAs were used to infer the level of intraprostatic tumour heterogeneity.
RESULTS AND LIMITATIONS: No high-frequency mutations, common for the three tumour samples of SWE-1, were identified. Ten randomly chosen positions were validated with Sanger sequencing in all foci, which verified the exome data. The high level of intraprostatic heterogeneity was consistent in all individuals. In total, three out of four individuals harboured tumours without an apparent common somatic denominator. Although we cannot exclude the presence of common structural rearrangements, a high-density array was used for the detection of deletions and amplifications in SWE-1, which agreed with the exome data.
CONCLUSIONS: We present evidence for the presence of somatically independent tumours within the same prostate. This finding will have implications for personalised cancer treatment and biomarker discovery.
Resumo:
Whole genome sequencing (WGS) technology holds great promise as a tool for the forensic epidemiology of bacterial pathogens. It is likely to be particularly useful for studying the transmission dynamics of an observed epidemic involving a largely unsampled 'reservoir' host, as for bovine tuberculosis (bTB) in British and Irish cattle and badgers. BTB is caused by Mycobacterium bovis, a member of the M. tuberculosis complex that also includes the aetiological agent for human TB. In this study, we identified a spatio-temporally linked group of 26 cattle and 4 badgers infected with the same Variable Number Tandem Repeat (VNTR) type of M. bovis. Single-nucleotide polymorphisms (SNPs) between sequences identified differences that were consistent with bacterial lineages being persistent on or near farms for several years, despite multiple clear whole herd tests in the interim. Comparing WGS data to mathematical models showed good correlations between genetic divergence and spatial distance, but poor correspondence to the network of cattle movements or within-herd contacts. Badger isolates showed between zero and four SNP differences from the nearest cattle isolate, providing evidence for recent transmissions between the two hosts. This is the first direct genetic evidence of M. bovis persistence on farms over multiple outbreaks with a continued, ongoing interaction with local badgers. However, despite unprecedented resolution, directionality of transmission cannot be inferred at this stage. Despite the often notoriously long timescales between time of infection and time of sampling for TB, our results suggest that WGS data alone can provide insights into TB epidemiology even where detailed contact data are not available, and that more extensive sampling and analysis will allow for quantification of the extent and direction of transmission between cattle and badgers. © 2012 Biek et al.
Resumo:
Mycobacterium bovis is the causal agent of bovine tuberculosis, one of the most important diseases currently facing the UK cattle industry. Here, we use high-density whole genome sequencing (WGS) in a defined sub-population of M. bovis in 145 cattle across 66 herd breakdowns to gain insights into local spread and persistence. We show that despite low divergence among isolates, WGS can in principle expose contributions of under-sampled host populations to M. bovis transmission. However, we demonstrate that in our data such a signal is due to molecular type switching, which had been previously undocumented for M. bovis. Isolates from farms with a known history of direct cattle movement between them did not show a statistical signal of higher genetic similarity. Despite an overall signal of genetic isolation by distance, genetic distances also showed no apparent relationship with spatial distance among affected farms over distances <5 km. Using simulations, we find that even over the brief evolutionary timescale covered by our data, Bayesian phylogeographic approaches are feasible. Applying such approaches showed that M. bovis dispersal in this system is heterogeneous but slow overall, averaging 2 km/year. These results confirm that widespread application of WGS to M. bovis will bring novel and important insights into the dynamics of M. bovis spread and persistence, but that the current questions most pertinent to control will be best addressed using approaches that more directly integrate WGS with additional epidemiological data.
Resumo:
Neurodegenerative diseases affecting the macula constitute a major cause of incurable vision loss and exhibit considerable clinical and genetic heterogeneity, from early-onset monogenic disease to multifactorial late-onset age-related macular degeneration (AMD). As part of our continued efforts to define genetic causes of macular degeneration, we performed whole exome sequencing in four individuals of a two-generation family with autosomal dominant maculopathy and identified a rare variant p.Glu1144Lys in Fibrillin 2 (FBN2), a glycoprotein of the elastin-rich extracellular matrix (ECM). Sanger sequencing validated the segregation of this variant in the complete pedigree, including two additional affected and one unaffected individual. Sequencing of 192 maculopathy patients revealed additional rare variants, predicted to disrupt FBN2 function. We then undertook additional studies to explore the relationship of FBN2 to macular disease. We show that FBN2 localizes to Bruch's membrane and its expression appears to be reduced in aging and AMD eyes, prompting us to examine its relationship with AMD. We detect suggestive association of a common FBN2 non-synonymous variant, rs154001 (p.Val965Ile) with AMD in 10,337 cases and 11,174 controls (OR=1.10; p-value=3.79×10(-5)). Thus, it appears that rare and common variants in a single gene - FBN2 - can contribute to Mendelian and complex forms of macular degeneration. Our studies provide genetic evidence for a key role of elastin microfibers and Bruch's membrane in maintaining blood-retina homeostasis and establish the importance of studying orphan diseases for understanding more common clinical phenotypes.
Resumo:
Diabetes is the leading cause of end stage renal disease. Despite evidence for a substantial heritability of diabetic kidney disease, efforts to identify genetic susceptibility variants have had limited success. We extended previous efforts in three dimensions, examining a more comprehensive set of genetic variants in larger numbers of subjects with type 1 diabetes characterized for a wider range of cross-sectional diabetic kidney disease phenotypes. In 2,843 subjects, we estimated that the heritability of diabetic kidney disease was 35% ( p=6x10-3 ). Genome-wide association analysis and replication in 12,540 individuals identified no single variants reaching stringent levels of significance and, despite excellent power, provided little independent confirmation of previously published associated variants. Whole exome sequencing in 997 subjects failed to identify any large-effect coding alleles of lower frequency influencing the risk of diabetic kidney disease. However, sets of alleles increasing body mass index ( p=2.2×10-5) and the risk of type 2 diabetes (p=6.1x10-4 ) were associated with the risk of diabetic kidney disease. We also found genome-wide genetic correlation between diabetic kidney disease and failure at smoking cessation ( p=1.1×10-4 ). Pathway analysis implicated ascorbate and aldarate metabolism ( p=9×10-6), and pentose and glucuronate interconversions ( p=3×10-6) in pathogenesis of diabetic kidney disease. These data provide further evidence for the role of genetic factors influencing diabetic kidney disease in those with type 1 diabetes and highlight some key pathways that may be responsible. Altogether these results reveal important biology behind the major cause of kidney disease.
Resumo:
Hairy cell leukemia (HCL) is marked by near 100% mutational frequency of BRAFV600E mutations. Recurrent cooperating genetic events that may contribute to HCL pathogenesis or affect the clinical course of HCL are currently not described. Therefore, we performed whole exome sequencing to explore the mutational landscape of purine analog refractory HCL. In addition to the disease-defining BRAFV600E mutations, we identified mutations in EZH2, ARID1A, and recurrent inactivating mutations of the cell cycle inhibitor CDKN1B (p27). Targeted deep sequencing of CDKN1B in a larger cohort of HCL patients identify deleterious CDKN1B mutations in 16% of patients with HCL (n = 13 of 81). In 11 of 13 patients the CDKN1B mutation was clonal, implying an early role of CDKN1B mutations in the pathogenesis of HCL. CDKN1B mutations were not found to impact clinical characteristics or outcome in this cohort. These data identify HCL as having the highest frequency of CDKN1B mutations among cancers and identify CDNK1B as the second most common mutated gene in HCL. Moreover, given the known function of CDNK1B, these data suggest a novel role for alterations in regulation of cell cycle and senescence in HCL with CDKN1B mutations.
Resumo:
We have used whole exome sequencing to compare a group of presentation t(4;14) with t(11;14) cases of myeloma to define the mutational landscape. Each case was characterized by a median of 24.5 exonic nonsynonymous single-nucleotide variations, and there was a consistently higher number of mutations in the t(4;14) group, but this number did not reach statistical significance. We show that the transition and transversion rates in the 2 subgroups are similar, suggesting that there was no specific mechanism leading to mutation differentiating the 2 groups. Only 3% of mutations were seen in both groups, and recurrently mutated genes include NRAS, KRAS, BRAF, and DIS3 as well as DNAH5, a member of the axonemal dynein family. The pattern of mutation in each group was distinct, with the t(4;14) group being characterized by deregulation of chromatin organization, actin filament, and microfilament movement. Recurrent RAS pathway mutations identified subclonal heterogeneity at a mutational level in both groups, with mutations being present as either dominant or minor subclones. The presence of subclonal diversity was confirmed at a single-cell level using other tumor-acquired mutations. These results are consistent with a distinct molecular pathogenesis underlying each subgroup and have important impacts on targeted treatment strategies. The Medical Research Council Myeloma IX trial is registered under ISRCTN68454111.
Resumo:
Genome-wide association studies (GWAS) have identified several risk variants for late-onset Alzheimer's disease (LOAD)1, 2. These common variants have replicable but small effects on LOAD risk and generally do not have obvious functional effects. Low-frequency coding variants, not detected by GWAS, are predicted to include functional variants with larger effects on risk. To identify low-frequency coding variants with large effects on LOAD risk, we carried out whole-exome sequencing (WES) in 14 large LOAD families and follow-up analyses of the candidate variants in several large LOAD case–control data sets. A rare variant in PLD3 (phospholipase D3; Val232Met) segregated with disease status in two independent families and doubled risk for Alzheimer’s disease in seven independent case–control series with a total of more than 11,000 cases and controls of European descent. Gene-based burden analyses in 4,387 cases and controls of European descent and 302 African American cases and controls, with complete sequence data for PLD3, reveal that several variants in this gene increase risk for Alzheimer’s disease in both populations. PLD3 is highly expressed in brain regions that are vulnerable to Alzheimer’s disease pathology, including hippocampus and cortex, and is expressed at significantly lower levels in neurons from Alzheimer’s disease brains compared to control brains. Overexpression of PLD3 leads to a significant decrease in intracellular amyloid-β precursor protein (APP) and extracellular Aβ42 and Aβ40 (the 42- and 40-residue isoforms of the amyloid-β peptide), and knockdown of PLD3 leads to a significant increase in extracellular Aβ42 and Aβ40. Together, our genetic and functional data indicate that carriers of PLD3 coding variants have a twofold increased risk for LOAD and that PLD3 influences APP processing. This study provides an example of how densely affected families may help to identify rare variants with large effects on risk for disease or other complex traits.
Resumo:
To assess factors influencing the success of whole-genome sequencing for mainstream clinical diagnosis, we sequenced 217 individuals from 156 independent cases or families across a broad spectrum of disorders in whom previous screening had identified no pathogenic variants. We quantified the number of candidate variants identified using different strategies for variant calling, filtering, annotation and prioritization. We found that jointly calling variants across samples, filtering against both local and external databases, deploying multiple annotation tools and using familial transmission above biological plausibility contributed to accuracy. Overall, we identified disease-causing variants in 21% of cases, with the proportion increasing to 34% (23/68) for mendelian disorders and 57% (8/14) in family trios. We also discovered 32 potentially clinically actionable variants in 18 genes unrelated to the referral disorder, although only 4 were ultimately considered reportable. Our results demonstrate the value of genome sequencing for routine clinical diagnosis but also highlight many outstanding challenges.
Resumo:
Defects in primary cilium biogenesis underlie the ciliopathies, a growing group of genetic disorders. We describe a whole-genome siRNA-based reverse genetics screen for defects in biogenesis and/or maintenance of the primary cilium, obtaining a global resource. We identify 112 candidate ciliogenesis and ciliopathy genes, including 44 components of the ubiquitin-proteasome system, 12 G-protein-coupled receptors, and 3 pre-mRNA processing factors (PRPF6, PRPF8 and PRPF31) mutated in autosomal dominant retinitis pigmentosa. The PRPFs localize to the connecting cilium, and PRPF8- and PRPF31-mutated cells have ciliary defects. Combining the screen with exome sequencing data identified recessive mutations in PIBF1, also known as CEP90, and C21orf2, also known as LRRC76, as causes of the ciliopathies Joubert and Jeune syndromes. Biochemical approaches place C21orf2 within key ciliopathy-associated protein modules, offering an explanation for the skeletal and retinal involvement observed in individuals with C21orf2 variants. Our global, unbiased approaches provide insights into ciliogenesis complexity and identify roles for unanticipated pathways in human genetic disease.
Resumo:
The order Lagomorpha comprises about 90 living species, divided in 2 families: the pikas (Family Ochotonidae), and the rabbits, hares, and jackrabbits (Family Leporidae). Lagomorphs are important economically and scientifically as major human food resources, valued game species, pests of agricultural significance, model laboratory animals, and key elements in food webs. A quarter of the lagomorph species are listed as threatened. They are native to all continents except Antarctica, and occur up to 5000 m above sea level, from the equator to the Arctic, spanning a wide range of environmental conditions. The order has notable taxonomic problems presenting significant difficulties for defining a species due to broad phenotypic variation, overlap of morphological characteristics, and relatively recent speciation events. At present, only the genomes of 2 species, the European rabbit (Oryctolagus cuniculus) and American pika (Ochotona princeps) have been sequenced and assembled. Starting from a paucity of genome information, the main scientific aim of the Lagomorph Genomics Consortium (LaGomiCs), born from a cooperative initiative of the European COST Action “A Collaborative European Network on Rabbit Genome Biology—RGB-Net” and the World Lagomorph Society (WLS), is to provide an international framework for the sequencing of the genome of all extant and selected extinct lagomorphs. Sequencing the genomes of an entire order will provide a large amount of information to address biological problems not only related to lagomorphs but also to all mammals. We present current and planned sequencing programs and outline the final objective of LaGomiCs possible through broad international collaboration.