839 resultados para next generation sequencing
Resumo:
In stark contrast to its horticultural origins, modern genetics is an extremely technology-driven field. Almost all the major advances in the field over the past 20 years have followed technological developments that have permitted change in study designs. The development of PCR in the 1980s led to RFLP mapping of monogenic diseases. The development of fluorescent-tagged genotyping methods led to linkage mapping approaches for common diseases that dominated the 1990s. The development of microarray SNP genotyping has led to the genome-wide association study era of the new millennium. And now the development of next-generation sequencing technologies is about to open up a new era of gene-mapping, enabling many potential new study designs. This review aims to present the strengths and weaknesses of the current approaches, and present some new ideas about gene-mapping approaches that are likely to advance our knowledge of the genes involved in heritable bone traits such as bone mineral density (BMD) and fracture.
Resumo:
Scientists have injected endotoxin into animals to investigate and understand various pathologies and novel therapies for several decades. Recent observations have shown that there is selective susceptibility to Escherichia coli lipopolysaccharide (LPS) endotoxin in sheep, despite having similar breed characteristics. The reason behind this difference is unknown, and has prompted studies aiming to explain the variation by proteogenomic characterisation of circulating acute phase biomarkers. It is hypothesised that genetic trait, biochemical, immunological and inflammation marker patterns contribute in defining and predicting mammalian response to LPS. This review discusses the effects of endotoxin and host responses, genetic basis of innate defences, activation of the acute phase response (APR) following experimental LPS challenge, and the current approaches employed in detecting novel biomarkers including acute phase proteins (APP) and micro-ribonucleic acids (miRNAs) in serum or plasma. miRNAs are novel targets for elucidating molecular mechanisms of disease because of their differential expression during pathological, and in healthy states. Changes in miRNA profiles during a disease challenge may be reflected in plasma. Studies show that gel-based two-dimensional electrophoresis (2-DE) coupled with either matrix-assisted laser desorption/ionisation time-of-flight mass spectrometry (MALDI-TOF MS) or liquid chromatography-mass spectrometry (LC-MS/MS) are currently the most used methods for proteome characterisation. Further evidence suggests that proteomic investigations are preferentially shifting from 2-DE to non-gel based LC-MS/MS coupled with data extraction by sequential window acquisition of all theoretical fragment-ion spectra (SWATH) approaches that are able to identify a wider range of proteins. Enzyme-linked immunosorbent assay (ELISA), quantitative real-time polymerase chain reaction (qRT-PCR), and most recently proteomic methods have been used to quantify low abundance proteins such as cytokines. qRT-PCR and next generation sequencing (NGS) are used for the characterisation of miRNA. Proteogenomic approaches for detecting APP and novel miRNA profiling are essential in understanding the selective resistance to endotoxin in sheep. The results of these methods could help in understanding similar pathology in humans. It might also be helpful in the development of physiological and diagnostic screening assays for determining experimental inclusion and endpoints, and in clinical trials in future
Resumo:
The successful completion of the Human Genome Project (HGP) was an unprecedented scientific advance that has become an invaluable resource in the search for genes that cause monogenic and common (polygenic) diseases. Prior to the HGP, linkage analysis had successfully mapped many disease genes for monogenic disorders; however, the limitations of this approach were particularly evident for identifying causative genes in rare genetic disorders affecting lifespan and/or reproductive fitness, such as skeletal dysplasias. In this review, we illustrate the challenges of mapping disease genes in such conditions through the ultra-rare disorder fibrodysplasia ossificans progressiva (FOP) and we discuss the advances that are being made through current massively parallel (“next generation”) sequencing (MPS) technologies.
Resumo:
Single nucleotide polymorphisms (SNPs) have been classically used for dissecting various human complex disorders using candidate gene studies. During the last decade, large scale SNP analysis i.e. genome-wide association studies (GWAS) have provided an agnostic approach to identify possible genetic loci associated with heterogeneous disease such as cancer susceptibility, prognosis of survival or drug response. Further, the advent of new technologies, including microarray based genotyping as well as high throughput next generation sequencing has opened new avenues for SNPs to be used in clinical practice. It is speculated that the utility of SNPs to understand the mechanisms, biology of variable drug response and ultimately treatment individualization based on the individual’s genome composition will be indispensable in the near future. In the current review, we discuss the advantages and disadvantages of the clinical utility of genetic variants in disease risk-prediction, prognosis, clinical outcome and pharmacogenomics. The lessons and challenges for the utility of SNP based biomarkers are also discussed, including the need for additional functional validation studies.
Resumo:
In humans, congenital spinal defects occur with an incidence of 0.5-1 per 1000 live births. One of the most severe syndromes with such defects is spondylocostal dysostosis (SCD). Over the past decade, the genetic basis of several forms of autosomal recessive SCD cases has been solved with the identification of four causative genes (DLL3, MESP2, LFNG and HES7). Autosomal dominant forms of SCD have also been reported, but to date no genetic etiology has been described for these. Here, we have used exome capture and next-generation sequencing to identify a stoploss mutation in TBX6 that segregates with disease in two generations of one family. We show that this mutation has a deleterious effect on the transcriptional activation activity of the TBX6 protein, likely due to haploinsufficiency. In mouse, Tbx6 is essential for the patterning of the vertebral precursor tissues, somites; thus, mutation of TBX6 is likely to be causative of SCD in this family. This is the first identification of the genetic cause of an autosomal dominant form of SCD, and also demonstrates the potential of exome sequencing to identify genetic causes of dominant diseases even in small families with few affected individuals.
Resumo:
Improved sequencing technologies offer unprecedented opportunities for investigating the role of rare genetic variation in common disease. However, there are considerable challenges with respect to study design, data analysis and replication. Using pooled next-generation sequencing of 507 genes implicated in the repair of DNA in 1,150 samples, an analytical strategy focused on protein-truncating variants (PTVs) and a large-scale sequencing case-control replication experiment in 13,642 individuals, here we show that rare PTVs in the p53-inducible protein phosphatase PPM1D are associated with predisposition to breast cancer and ovarian cancer. PPM1D PTV mutations were present in 25 out of 7,781 cases versus 1 out of 5,861 controls (P = 1.12 × 10-5), including 18 mutations in 6,912 individuals with breast cancer (P = 2.42 × 10-4) and 12 mutations in 1,121 individuals with ovarian cancer (P = 3.10 × 10-9). Notably, all of the identified PPM1D PTVs were mosaic in lymphocyte DNA and clustered within a 370-base-pair region in the final exon of the gene, carboxy-terminal to the phosphatase catalytic domain. Functional studies demonstrate that the mutations result in enhanced suppression of p53 in response to ionizing radiation exposure, suggesting that the mutant alleles encode hyperactive PPM1D isoforms. Thus, although the mutations cause premature protein truncation, they do not result in the simple loss-of-function effect typically associated with this class of variant, but instead probably have a gain-of-function effect. Our results have implications for the detection and management of breast and ovarian cancer risk. More generally, these data provide new insights into the role of rare and of mosaic genetic variants in common conditions, and the use of sequencing in their identification.
Resumo:
Rapid advances in sequencing technologies (Next Generation Sequencing or NGS) have led to a vast increase in the quantity of bioinformatics data available, with this increasing scale presenting enormous challenges to researchers seeking to identify complex interactions. This paper is concerned with the domain of transcriptional regulation, and the use of visualisation to identify relationships between specific regulatory proteins (the transcription factors or TFs) and their associated target genes (TGs). We present preliminary work from an ongoing study which aims to determine the effectiveness of different visual representations and large scale displays in supporting discovery. Following an iterative process of implementation and evaluation, representations were tested by potential users in the bioinformatics domain to determine their efficacy, and to understand better the range of ad hoc practices among bioinformatics literate users. Results from two rounds of small scale user studies are considered with initial findings suggesting that bioinformaticians require richly detailed views of TF data, features to compare TF layouts between organisms quickly, and ways to keep track of interesting data points.
Resumo:
The silver gemfish Rexea solandri is an important economic resource but vulnerable to overfishing in Australian waters. The complete mitochondrial genome sequence is described from 1.6 million reads obtained via next generation sequencing. The total length of the mitogenome is 16,350 bp comprising 2 rRNA, 13 protein-coding genes, 22 tRNA and 2 non-coding regions. The mitogenome sequence was validated against sequences of PCR fragments and BLAST queries of Genbank. Gene order was equivalent to that found in marine fishes.
Resumo:
Background Next-generation sequencing technology is an important tool for the rapid, genome-wide identification of genetic variations. However, it is difficult to resolve the ‘signal’ of variations of interest and the ‘noise’ of stochastic sequencing and bioinformatic errors in the large datasets that are generated. We report a simple approach to identify regional linkage to a trait that requires only two pools of DNA to be sequenced from progeny of a defined genetic cross (i.e. bulk segregant analysis) at low coverage (<10×) and without parentage assignment of individual SNPs. The analysis relies on regional averaging of pooled SNP frequencies to rapidly scan polymorphisms across the genome for differential regional homozygosity, which is then displayed graphically. Results Progeny from defined genetic crosses of Tribolium castaneum (F4 and F19) segregating for the phosphine resistance trait were exposed to phosphine to select for the resistance trait while the remainders were left unexposed. Next generation sequencing was then carried out on the genomic DNA from each pool of selected and unselected insects from each generation. The reads were mapped against the annotated T. castaneum genome from NCBI (v3.0) and analysed for SNP variations. Since it is difficult to accurately call individual SNP frequencies when the depth of sequence coverage is low, variant frequencies were averaged across larger regions. Results from regional SNP frequency averaging identified two loci, tc_rph1 on chromosome 8 and tc_rph2 on chromosome 9, which together are responsible for high level resistance. Identification of the two loci was possible with only 5-7× average coverage of the genome per dataset. These loci were subsequently confirmed by direct SNP marker analysis and fine-scale mapping. Individually, homozygosity of tc_rph1 or tc_rph2 results in only weak resistance to phosphine (estimated at up to 1.5-2.5× and 3-5× respectively), whereas in combination they interact synergistically to provide a high-level resistance >200×. The tc_rph2 resistance allele resulted in a significant fitness cost relative to the wild type allele in unselected beetles over eighteen generations. Conclusion We have validated the technique of linkage mapping by low-coverage sequencing of progeny from a simple genetic cross. The approach relied on regional averaging of SNP frequencies and was used to successfully identify candidate gene loci for phosphine resistance in T. castaneum. This is a relatively simple and rapid approach to identifying genomic regions associated with traits in defined genetic crosses that does not require any specialised statistical analysis.
Resumo:
Increasing salinity levels in freshwater and coastal environments caused by sea level rise linked to climate change is now recognized to be a major factor that can impact fish growth negatively, especially for freshwater teleost species. Striped catfish (Pangasianodon hypophthalmus) is an important freshwater teleost that is now widely farmed across the Mekong River Delta in Vietnam. Understanding the basis for tolerance and adaptation to raised environmental salinity conditions can assist the regional culture industry to mitigate predicted impacts of climate change across this region. Attempt of next generation sequencing using the ion proton platform results in more than 174 million raw reads from three tissue libraries (gill, kidney and intestine). Reads were filtered and de novo assembled using a variety of assemblers and then clustered together to generate a combined reference transcriptome. Downstream analysis resulted in a final reference transcriptome that contained 60,585 transcripts with an N50 of 683 bp. This resource was further annotated using a variety of bioinformatics databases, followed by differential gene expression analysis that resulted in 3062 transcripts that were differentially expressed in catfish samples raised under two experimental conditions (0 and 15 ppt). A number of transcripts with a potential role in salinity tolerance were then classified into six different functional gene categories based on their gene ontology assignments. These included; energy metabolism, ion transportation, detoxification, signal transduction, structural organization and detoxification. Finally, we combined the data on functional salinity tolerance genes into a hypothetical schematic model that attempted to describe potential relationships and interactions among target genes to explain the molecular pathways that control adaptive salinity responses in P. hypophthalmus. Our results indicate that P. hypophthalmus exhibit predictable plastic regulatory responses to elevated salinity by means of characteristic gene expression patterns, providing numerous candidate genes for future investigations.
Resumo:
Two complete mitochondrial genomes of the black marlin Istiompax indica were assembled from approximately 3.5 and 2.5 million reads produced by Ion Torrent next generation sequencing. The complete genomes were 16,531 bp and 16,532 bp in length consisting of 2 rRNA, 13 protein-coding genes, 22tRNA and 2 coding regions. They demonstrated a similar A + T base (52.6%) to other teleosts. Intraspecific sequence variation was 99.5% for three I. indica mitogenomes and 99.7% for X. gladius. A lower value (85%) was found for the I. platypterus mitogenomes from genbank and accredited to inadvertent inclusion of gene regions from a con-familial species in one record, highlighting the need for cautious downstream use of genbank data. © 2014 Informa UK Ltd.
Resumo:
Interferon-induced transmembrane protein 5 or bone-restricted i ifitm-like gene (Bril) was first identified as a bone gene in 2008, although no in vivo role was identified at that time. A role in human bone has now been demonstrated with a number of recent studies identifying a single point mutation in Bril as the causative mutation in osteogenesis imperfecta type V (OI type V). Such a discovery suggests a key role for Bril in skeletal regulation, and the completely novel nature of the gene raises the possibility of a new regulatory pathway in bone. Furthermore, the phenotype of OI type V has unique and quite divergent features compared with other forms of OI involving defects in collagen biology. Currently it appears that the underlying genetic defect in OI type V may be unrelated to collagen regulation, which also raises interesting questions about the classification of this form of OI. This review will discuss current knowledge of OI type V, the function of Bril, and the implications of this recent discovery.
Resumo:
Introduction: Osteoporosis is the commonest metabolic bone disease worldwide. The clinical hallmark of osteoporosis is low trauma fracture, with the most devastating being hip fracture, resulting in significant effects on both morbidity and mortality. Sources of data: Data for this review have been gathered from the published literature and from a range of web resources. Areas of agreement: Genome-wide association studies in the field of osteoporosis have led to the identification of a number of loci associated with both bone mineral density and fracture risk and further increased our understanding of disease. Areas of controversy: The early strategies for mapping osteoporosis disease genes reported only isolated associations, with replication in independent cohorts proving difficult. Neither candidate gene or linkage studies showed association at genome-wide level of significance. Growing points: The advent of massive parallel sequencing technologies has proved extremely successful in mapping monogenic diseases and thus leading to the utilization of this new technology in complex disease genetics. Areas timely for developing research: The identification of novel genes and pathways will potentially lead to the identification of novel therapeutic options for patients with osteoporosis. © 2014 The Author.
Resumo:
The article by Meric-Bernstam et al1 that was recently published in Journal of Clinical Oncology raises important questions about the clinical application of large-scale genomic testing. We congratulate the authors for this ambitious study, which successfully profiled 2,000 consecutive patients with advanced cancer. The next-generation sequencing (NGS) platform was used for 1,749 of 2,000 patients (87.5%). Of 789 patients with potentially actionable mutations, 83 (11%, or 4% of screened population) were enrolled in a genomically matched clinical study. As the editorial2 accompanying the article by Meric-Bernstam et al1 pointed out, the 4% figure, albeit disappointing, may be an underestimate because cancers such as lung adenocarcinoma and melanoma, for which ≥ 50% of patients have actionable mutations, were under-represented. ...
Resumo:
The progressive myoclonic epilepsies (PMEs) are a clinically and etiologically heterogeneous group of symptomatic epilepsies characterized by myoclonus, tonic-clonic seizures, psychomotor regression and ataxia. Different disorders have been classified as PMEs. Of these, the group of neuronal ceroid lipofuscinoses (NCLs) comprise an entity that has onset in childhood, being the most common cause of neurodegeneration in children. The primary aim of this thesis was to dissect the molecular genetic background of patients with childhood onset PME by studying candidate genes and attempting to identify novel PME-associated genes. Another specific aim was to study the primary protein properties of the most recently identified member of the NCL-causing proteins, MFSD8. To dissect the genetic background of a cohort of Turkish patients with childhood onset PME, a screen of the NCL-associated genes PPT1, TPP1, CLN3, CLN5, CLN6, MFSD8, CLN8 and CTSD was performed. Altogether 49 novel mutations were identified, which together with 56 mutations found by collaborators raised the total number of known NCL mutations to 364. Fourteen of the novel mutations affect the recently identified MFSD8 gene, which had originally been identified in a subset of mainly Turkish patients as the underlying cause of CLN7 disease. To investigate the distribution of MFSD8 defects, a total of 211 patients of different ethnic origins were evaluated for mutations in the gene. Altogether 45 patients from nine different countries were provided with a CLN7 molecular diagnosis, denoting the wide geographical occurrence of MFSD8 defects. The mutations are private with only one having been established by a founder-effect in the Roma population from the former Czechoslovakia. All mutations identified except one are associated with the typical clinical picture of variant late-infantile NCL. To address the trafficking properties of MFSD8, lysosomal targeting of the protein was confirmed in both neuronal and non-neuronal cells. The major determinant for this lysosomal sorting was identified to be an N-terminal dileucine based signal (9-EQEPLL-14), recognized by heterotetrameric AP-1 adaptor proteins, suggesting that MFSD8 takes the direct trafficking pathway en route to the lysosomes. Expression studies revealed the neurons as the primary cell-type and the hippocampus and cerebellar granular cell layer as the predominant regions in which MFSD8 is expressed. To identify novel genes associated with childhood onset PME, a single nucleotide polymorphism (SNP) genomewide scan was performed in three small families and 18 sporadic patients followed by homozygosity mapping to determine the candidate loci. One of the families and a sporadic patient were positive for mutations in PLA2G6, a gene that had previously been shown to cause infantile neuroaxonal dystrophy. Application of next-generation sequencing of candidate regions in the remaining two families led to identification of a homozygous missense mutation in USP19 for the first and TXNDC6 for the second family. Analysis of the 18 sporadic cases mapped the best candidate interval in a 1.5 Mb region on chromosome 7q21. Screening of the positional candidate KCTD7 revealed six mutations in seven unrelated families. All patients with mutations in KCTD7 were reported to have early onset PME, rapid disease progression leading to dementia and no pathologic hallmarks. The identification of KCTD7 mutations in nine patients and the clinical delineation of their phenotype establish KCTD7 as a gene for early onset PME. The findings presented in this thesis denote MFSD8 and KCTD7 as genes commonly associated with childhood onset symptomatic epilepsy. The disease-associated role of TXNDC6 awaits verification through identification of additional mutations in patients with similar phenotypes. Completion of the genetic spectrum underlying childhood onset PMEs and understanding of the gene products functions will comprise important steps towards understanding the underlying pathogenetic mechanisms, and will possibly shed light on the general processes of neurodegeneration and nervous system regulation, facilitating the diagnosis, classification and possibly treatment of the affected cases.