928 resultados para Whole exome sequencing


Relevância:

100.00% 100.00%

Publicador:

Resumo:

Background: Malaria caused by Plasmodium vivax is an experimentally neglected severe disease with a substantial burden on human health. Because of technical limitations, little is known about the biology of this important human pathogen. Whole genome analysis methods on patient-derived material are thus likely to have a substantial impact on our understanding of P. vivax pathogenesis and epidemiology. For example, it will allow study of the evolution and population biology of the parasite, allow parasite transmission patterns to be characterized, and may facilitate the identification of new drug resistance genes. Because parasitemias are typically low and the parasite cannot be readily cultured, on-site leukocyte depletion of blood samples is typically needed to remove human DNA that may be 1000X more abundant than parasite DNA. These features have precluded the analysis of archived blood samples and require the presence of laboratories in close proximity to the collection of field samples for optimal pre-cryopreservation sample preparation. Results: Here we show that in-solution hybridization capture can be used to extract P. vivax DNA from human contaminating DNA in the laboratory without the need for on-site leukocyte filtration. Using a whole genome capture method, we were able to enrich P. vivax DNA from bulk genomic DNA from less than 0.5% to a median of 55% (range 20%-80%). This level of enrichment allows for efficient analysis of the samples by whole genome sequencing and does not introduce any gross biases into the data. With this method, we obtained greater than 5X coverage across 93% of the P. vivax genome for four P. vivax strains from Iquitos, Peru, which is similar to our results using leukocyte filtration (greater than 5X coverage across 96% of the genome). Conclusion: The whole genome capture technique will enable more efficient whole genome analysis of P. vivax from a larger geographic region and from valuable archived sample collections.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The Saccharomyces cerevisiae strains widely used for industrial fuel-ethanol production have been developed by selection, but their underlying beneficial genetic polymorphisms remain unknown. Here, we report the draft whole-genome sequence of the S. cerevisiae strain CAT-1, which is a dominant fuel-ethanol fermentative strain from the sugarcane industry in Brazil. Our results indicate that strain CAT-1 is a highly heterozygous diploid yeast strain, and the similar to 12-Mb genome of CAT-1, when compared with the reference S228c genome, contains similar to 36,000 homozygous and similar to 30,000 heterozygous single nucleotide polymorphisms, exhibiting an uneven distribution among chromosomes due to large genomic regions of loss of heterozygosity (LOH). In total, 58 % of the 6,652 predicted protein-coding genes of the CAT-1 genome constitute different alleles when compared with the genes present in the reference S288c genome. The CAT-1 genome contains a reduced number of transposable elements, as well as several gene deletions and duplications, especially at telomeric regions, some correlated with several of the physiological characteristics of this industrial fuel-ethanol strain. Phylogenetic analyses revealed that some genes were likely associated with traits important for bioethanol production. Identifying and characterizing the allelic variations controlling traits relevant to industrial fermentation should provide the basis for a forward genetics approach for developing better fermenting yeast strains.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

BACKGROUND  Whole genome sequencing (WGS) is increasingly used in molecular-epidemiological investigations of bacterial pathogens, despite cost- and time-intensive analyses. We combined strain-specific single nucleotide polymorphism (SNP)-typing and targeted WGS to investigate a tuberculosis cluster spanning 21 years in Bern, Switzerland. METHODS  Based on genome sequences of three historical outbreak Mycobacterium tuberculosis isolates, we developed a strain-specific SNP-typing assay to identify further cases. We screened 1,642 patient isolates, and performed WGS on all identified cluster isolates. We extracted SNPs to construct genomic networks. Clinical and social data were retrospectively collected. RESULTS  We identified 68 patients associated with the outbreak strain. Most were diagnosed in 1991-1995, but cases were observed until 2011. Two thirds belonged to the homeless and substance abuser milieu. Targeted WGS revealed 133 variable SNP positions among outbreak isolates. Genomic network analyses suggested a single origin of the outbreak, with subsequent division into three sub-clusters. Isolates from patients with confirmed epidemiological links differed by 0-11 SNPs. CONCLUSIONS  Strain-specific SNP-genotyping allowed rapid and inexpensive identification of M. tuberculosis outbreak isolates in a population-based strain collection. Subsequent targeted WGS provided detailed insights into transmission dynamics. This combined approach could be applied to track bacterial pathogens in real-time and at high resolution.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Over 250 Mendelian traits and disorders, caused by rare alleles have been mapped in the canine genome. Although each disease is rare in the dog as a species, they are collectively common and have major impact on canine health. With SNP-based genotyping arrays, genome-wide association studies (GWAS) have proven to be a powerful method to map the genomic region of interest when 10-20 cases and 10-20 controls are available. However, to identify the genetic variant in associated regions, fine-mapping and targeted re-sequencing is required. Here we present a new approach using whole-genome sequencing (WGS) of a family trio without prior GWAS. As a proof-of-concept, we chose an autosomal recessive disease known as hereditary footpad hyperkeratosis (HFH) in Kromfohrl änder dogs. To our knowledge, this is the first time this family trio WGS-approach, has successfully been used to identify a genetic variant that perfectly segregates with a canine disorder. The sequencing of three Kromfohrl änder dogs from a family trio (an affected offspring and both its healthy parents) resulted in an average genome coverage of 9.2X per individual. After applying stringent filtering criteria for candidate causative coding variants, 527 single nucleotide variants (SNVs) and 15 indels were found to be homozygous in the affected offspring and heterozygous in the parents. Using the computer software packages ANNOVAR and SIFT to functionally annotate coding sequence differences and to predict their functional effect, resulted in seven candidate variants located in six different genes. Of these, only FAM83G:c155G>C (p.R52P) was found to be concordant in eight additional cases and 16 healthy Kromfohrl änder dogs.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The advent of next-generation sequencing, now nearing a decade in age, has enabled, among other capabilities, measurement of genome-wide sequence features at unprecedented scale and resolution.

In this dissertation, I describe work to understand the genetic underpinnings of non-Hodgkin’s lymphoma through exploration of the epigenetics of its cell of origin, initial characterization and interpretation of driver mutations, and finally, a larger-scale, population-level study that incorporates mutation interpretation with clinical outcome.

In the first research chapter, I describe genomic characteristics of lymphomas through the lens of their cells of origin. Just as many other cancers, such as breast cancer or lung cancer, are categorized based on their cell of origin, lymphoma subtypes can be examined through the context of their normal B Cells of origin, Naïve, Germinal Center, and post-Germinal Center. By applying integrative analysis of the epigenetics of normal B Cells of origin through chromatin-immunoprecipitation sequencing, we find that differences in normal B Cell subtypes are reflected in the mutational landscapes of the cancers that arise from them, namely Mantle Cell, Burkitt, and Diffuse Large B-Cell Lymphoma.

In the next research chapter, I describe our first endeavor into understanding the genetic heterogeneity of Diffuse Large B Cell Lymphoma, the most common form of non-Hodgkin’s lymphoma, which affects 100,000 patients in the world. Through whole-genome sequencing of 1 case as well as whole-exome sequencing of 94 cases, we characterize the most recurrent genetic features of DLBCL and lay the groundwork for a larger study.

In the last research chapter, I describe work to characterize and interpret the whole exomes of 1001 cases of DLBCL in the largest single-cancer study to date. This highly-powered study enabled sub-gene, gene-level, and gene-network level understanding of driver mutations within DLBCL. Moreover, matched genomic and clinical data enabled the connection of these driver mutations to clinical features such as treatment response or overall survival. As sequencing costs continue to drop, whole-exome sequencing will become a routine clinical assay, and another diagnostic dimension in addition to existing methods such as histology. However, to unlock the full utility of sequencing data, we must be able to interpret it. This study undertakes a first step in developing the understanding necessary to uncover the genomic signals of DLBCL hidden within its exomes. However, beyond the scope of this one disease, the experimental and analytical methods can be readily applied to other cancer sequencing studies.

Thus, this dissertation leverages next-generation sequencing analysis to understand the genetic underpinnings of lymphoma, both by examining its normal cells of origin as well as through a large-scale study to sensitively identify recurrently mutated genes and their relationship to clinical outcome.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Metastatic melanoma, a cancer historically refractory to chemotherapeutic strategies, has a poor prognosis and accounts for the majority of skin cancer related mortality. Although the recent approval of two new drugs combating this disease, Ipilimumab and Vemurafenib (PLX4032), has demonstrated for the first time in decades an improvement in overall survival; the clinical efficacy of these drugs has been marred by severe adverse immune reactions and acquired drug resistance in patients, respectively. Thus, understanding the etiology of metastatic melanoma will contribute to the improvement of current therapeutic strategies while leading to the development of novel drug approaches. In order to identify recurrently mutated genes of therapeutic relevance in metastatic melanoma, a panel of stage III local lymph node melanomas were extensively characterised using high-throughput genomic technologies. This led to the identification of mutations in TFG in 5% of melanomas from a candidate gene sequencing approach using SNP array analysis, 24% of melanomas with mutations in MAP3K5 or MAP3K9 though unbiased whole-exome sequencing strategies, and inactivating mutations in NF1 in BRAF/NRAS wild type tumours though pathway analysis. Lastly, this thesis describes the development of a melanoma specific mutation panel that can rapidly identify clinically relevant mutation profiles that could guide effective treatment strategies through a personalised therapeutic approach. These findings are discussed in respect to a number of important issues raised by this study including the current limitation of next-generation sequencing technology, the difficulty in identifying ‘driver’ mutations critical to the development of melanoma due to high carcinogenic exposure by UV radiation, and the ultimate application of mutation screening in a personalised therapeutic setting. In summary, a number novel genes involved in metastatic melanoma have been identified that may have relevance for current therapeutic strategies in treating this disease.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Focal segmental glomerulosclerosis (FSGS) is the consequence of a disease process that attacks the kidney's filtering system, causing serious scarring. More than half of FSGS patients develop chronic kidney failure within 10 years, ultimately requiring dialysis or renal transplantation. There are currently several genes known to cause the hereditary forms of FSGS (ACTN4, TRPC6, CD2AP, INF2, MYO1E and NPHS2). This study involves a large, unique, multigenerational Australian pedigree in which FSGS co-segregates with progressive heart block with apparent X-linked recessive inheritance. Through a classical combined approach of linkage and haplotype analysis, we identified a 21.19 cM interval implicated on the X chromosome. We then used a whole exome sequencing approach to identify two mutated genes, NXF5 and ALG13, which are located within this linkage interval. The two mutations NXF5-R113W and ALG13-T141L segregated perfectly with the disease phenotype in the pedigree and were not found in a large healthy control cohort. Analysis using bioinformatics tools predicted the R113W mutation in the NXF5 gene to be deleterious and cellular studies support a role in the stability and localization of the protein suggesting a causative role of this mutation in these co-morbid disorders. Further studies are now required to determine the functional consequence of these novel mutations to development of FSGS and heart block in this pedigree and to determine whether these mutations have implications for more common forms of these diseases in the general population.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The successful completion of the Human Genome Project (HGP) was an unprecedented scientific advance that has become an invaluable resource in the search for genes that cause monogenic and common (polygenic) diseases. Prior to the HGP, linkage analysis had successfully mapped many disease genes for monogenic disorders; however, the limitations of this approach were particularly evident for identifying causative genes in rare genetic disorders affecting lifespan and/or reproductive fitness, such as skeletal dysplasias. In this review, we illustrate the challenges of mapping disease genes in such conditions through the ultra-rare disorder fibrodysplasia ossificans progressiva (FOP) and we discuss the advances that are being made through current massively parallel (“next generation”) sequencing (MPS) technologies.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

ABSTRACT Idiopathic developmental disorders (DDs) affect ~1% of the population worldwide. This being a considerable amount, efforts are being made to elucidate the disease mechanisms. One or several genetic factors cause 30-40% of DDs, and only 10% are caused by environmental factors. The remaining 50% of DD patients go undiagnosed, mostly due to a lack of diagnostic techniques. The cause in most undiagnosed cases is though to be a genetic factor or a combination of genetic and environmental factors. Despite the surge of new technologies entering the market, their implementation into diagnostic laboratories is hampered by costs, lack of information about the expected diagnostic yield, and the wide range of selection. This study evaluates new microarray methods in diagnosing idiopathic DDs, providing information about their added diagnostic value. Study I analysed 150 patients by array comparative genomic hybridization (array CGH, 44K and 244K), with a subsequent 18% diagnostic yield. These results are supported by other studies, indicating an enourmous added diagnostic value of array CGH, compared with conventional cytogenetic analysis. Nevertheless, 80% of the patients remained undiagnosed in Study I. In an effort to diagnose more patients, in Study IV the resolution was increased from 8.9 Kb of the 244K CGH array to 0.7 Kb, by using a single-nucleotide polymorphism (SNP) array. However, no additional pathogenic changes were detected in the 35 patients assessed, and thus, for diagnostic purposes, an array platform with ca 9 Kb resolution appears adequate. The recent vast increase in reports of detected aberrations and associated phenotypes has enabled characterization of several new syndromes first based on a common aberration and thereafter by delineation of common clinical characteristics. In Study II, a familial deletion at 9q22.2q22.32 with variable penetrance was described. Despite several reports of aberrations in the adjacent area at 9q associated with Gorlin syndrome, the patients in this family had a unique phenotype and did not present with the syndrome. In Study III, a familial duplication of chromosome 6p22.2 was described. The duplication caused increased expression of an important enzyme of the γ-aminobutyric acid (GABA) degradation pathway, causing oxidative stress of the brain, and thus, very likely, the mild mental retardation of these patients. These two case studies attempted to pinpoint candidate genes and to resolve the pathogenic mechanism causing the clinical characteristics of the patients. Presenting rare genetic and clinical findings to the international science and medical community enables interpretation of similar findings in other patients. The added value of molecular karyotyping in patients with idiopathic DD is evident. As a first line of testing, arrays with a median resolution of at least 9 Kb should be considered and further characterization of detected aberrations undertaken when possible. Diagnostic whole-exome sequencing may be the best option for patients who remain undiagnosed after high-resolution array analysis.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Primary microcephaly is an autosomal recessive disorder characterized by smaller than normal brain size and mental retardation. It is genetically heterogeneous with seven loci: MCPH1-MCPH7. We have previously reported genetic analysis of 35 families, including the identification of the MCPH7 gene STIL. Of the 35 families, three families showed linkage to the MCPH2 locus. Recent whole-exome sequencing studies have shown that the WDR62 gene, located in the MCPH2 candidate region, is mutated in patients with severe brain malformations. We therefore sequenced the WDR62 gene in our MCPH2 families and identified two novel homozygous protein truncating mutations in two families. Affected individuals in the two families had pachygyria, microlissencephaly, band heterotopias, gyral thickening, and dysplastic cortex. Using immunofluorescence study, we showed that, as with other MCPH proteins, WDR62 localizes to centrosomes in A549, HepG2, and HaCaT cells. In addition, WDR62 was also localized to nucleoli. Bioinformatics analysis predicted two overlapping nuclear localization signals and multiple WD-40 repeats in WDR62. Two other groups have also recently identified WDR62 mutations in MCPH2 families. Our results therefore add further evidence that WDR62 is the MCPH2 gene. The present findings will be helpful in genetic diagnosis of patients linked to the MCPH2 locus.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Lymphomas comprise a diverse group of malignancies derived from immune cells. High throughput sequencing has recently emerged as a powerful and versatile method for analysis of the cancer genome and transcriptome. As these data continue to emerge, the crucial work lies in sorting through the wealth of information to hone in on the critical aspects that will give us a better understanding of biology and new insight for how to treat disease. Finding the important signals within these large data sets is one of the major challenges of next generation sequencing.

In this dissertation, I have developed several complementary strategies to describe the genetic underpinnings of lymphomas. I begin with developing a better method for RNA sequencing that enables strand-specific total RNA sequencing and alternative splicing profiling in the same analysis. I then combine this RNA sequencing technique with whole exome sequencing to better understand the global landscape of aberrations in these diseases. Finally, I use traditional cell and molecular biology techniques to define the consequences of major genetic alterations in lymphoma.

Through this analysis, I find recurrent silencing mutations in the G alpha binding protein GNA13 and associated focal adhesion proteins. I aim to describe how loss-of-function mutations in GNA13 can be oncogenic in the context of germinal center B cell biology. Using in vitro techniques including liquid chromatography-mass spectrometry and knockdown and overexpression of genes in B cell lymphoma cell lines, I determine protein binding partners and downstream effectors of GNA13. I also develop a transgenic mouse model to study the role of GNA13 in the germinal center in vivo to determine effects of GNA13 deletion on germinal center structure and cell migration.

Thus, I have developed complementary approaches that span the spectrum from discovery to context-dependent gene models that afford a better understanding of the biological function of aberrant events and ultimately result in a better understanding of disease.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

BACKGROUND: Mutations in podocin (NPHS2) are the most common cause of childhood onset autosomal recessive steroid-resistant nephrotic syndrome (SRNS). The disease is characterized by early-onset proteinuria, resistance to immunosuppressive therapy and rapid progression to end-stage renal disease. Compound heterozygous changes involving the podocin variant R229Q combined with another pathogenic mutation have been associated with a mild phenotype with disease onset often in adulthood. METHODS: We screened 19 families with early-onset SRNS for mutations in NPHS2 and WT1 and identified four disease-causing mutations (three in NPHS2 and one in WT1) prior to planned whole-exome sequencing. RESULTS: We describe two families with three individuals presenting in childhood who are compound heterozygous for R229Q and one other pathogenic NPHS2 mutation, either L327F or A297V. One child presented at age 4 years (A297V plus R229Q) and the other two at age 13 (L327F plus R229Q), one with steadily deteriorating renal function. CONCLUSIONS: These cases highlight the phenotypic variability associated with the NPHS2 R229Q variant plus pathogenic mutation. Individuals may present with early aggressive disease.