30 resultados para whole exome sequencing
em Consorci de Serveis Universitaris de Catalunya (CSUC), Spain
Resumo:
Colorectal cancer (CRC) is the third most common cancer and the fourth leading cause of cancer death worldwide. About 85% of the cases of CRC are known to have chromosomal instability, an allelic imbalance at several chromosomal loci, and chromosome amplification and translocation. The aim of this study is to determine the recurrent copy number variant (CNV) regions present in stage II of CRC through whole exome sequencing, a rapidly developing targeted next-generation sequencing (NGS) technology that provides an accurate alternative approach for accessing genomic variations. 42 normal-tumor paired samples were sequenced by Illumina Genome Analyzer. Data was analyzed with Varscan2 and segmentation was performed with R package R-GADA. Summary of the segments across all samples was performed and the result was overlapped with DEG data of the same samples from a previous study in the group1. Major and more recurrent segments of CNV were: gain of chromosome 7pq(13%), 13q(31%) and 20q(75%) and loss of 8p(25%), 17p(23%), and 18pq(27%). This results are coincident with the known literature of CNV in CRC or other cancers, but our methodology should be validated by array comparative genomic hybridisation (aCGH) profiling, which is currently the gold standard for genetic diagnosis of CNV.
Resumo:
There is great scientific and popular interest in understanding the genetic history of populations in the Americas. We wish to understand when different regions of the continent were inhabited, where settlers came from, and how current inhabitants relate genetically to earlier populations. Recent studies unraveled parts of the genetic history of the continent using genotyping arrays and uniparental markers. The 1000 Genomes Project provides a unique opportunity for improving our understanding of population genetic history by providing over a hundred sequenced low coverage genomes and exomes from Colombian (CLM), Mexican-American (MXL), and Puerto Rican (PUR) populations. Here, we explore the genomic contributions of African, European, and especially Native American ancestry to these populations. Estimated Native American ancestry is 48% in MXL, 25% in CLM, and 13% in PUR. Native American ancestry in PUR is most closely related to populations surrounding the Orinoco River basin, confirming the Southern American ancestry of the Taíno people of the Caribbean. We present new methods to estimate the allele frequencies in the Native American fraction of the populations, and model their distribution using a demographic model for three ancestral Native American populations. These ancestral populations likely split in close succession: the most likely scenario, based on a peopling of the Americas 16 thousand years ago (kya), supports that the MXL Ancestors split 12.2kya, with a subsequent split of the ancestors to CLM and PUR 11.7kya. The model also features effective populations of 62,000 in Mexico, 8,700 in Colombia, and 1,900 in Puerto Rico. Modeling Identity-by-descent (IBD) and ancestry tract length, we show that post-contact populations also differ markedly in their effective sizes and migration patterns, with Puerto Rico showing the smallest effective size and the earlier migration from Europe. Finally, we compare IBD and ancestry assignments to find evidence for relatedness among European founders to the three populations.
Resumo:
Several studies over the last few years have shown that newly arising (de novo) mutations contribute to the genetics of schizophrenia (SZ), autism (ASD) and other developmental disorders. The strongest evidence comes from studies of de novo Copy Number Variation (CNV), where the rate of new mutations is shown to be increased in cases when compared to controls [23, 24]. Research on de novo point mutations and small insertion-deletions (indels) has been more limited, but with the development of next-generation sequencing (NGS) technology, such studies are beginning to provide preliminary evidence that de novo single-nucleotide mutations (SNVs) might also increase risk of SZ and ASD [25, 26] Advanced paternal age is a major source of new mutations in human beings [27] and could thus be associated with increased risk for developing SZ, ASD or other developmental disorders. Indeed, advanced paternal age is found to be a risk factor for developing SZ and ASD in the offspring [28, 29] and new mutations related to advanced paternal age have been implicated as a cause of sporadic cases in several autosomal dominant diseases, some neurodevelopmental diseases, including SZ and ASD, and social functioning. New single-base substitutions occur at higher rates at males compared to females and this difference increases with paternal age. This is due to the fact that sperm cells go through a much higher number of cell divisions (~840 by the age of 50), which increases the risk for DNA copy errors in the male germ line [30] . By contrast, the female eggs (oocytes) undergo only 24 cell divisions and all but the last occur during foetal life. The aim of my project is to determine the parent-of-origin of de novo SNVs, using large samples of parent-offspring trios affected with schizophrenia (SZ). From whole exome sequencing of 618 Bulgarian proband-offspring trios affected, nearly 1000 de novo (SNVs or small indels) have been identified and from these, the parent-of-origin of at least 60% of the mutations (N=600) can be established. This project is contained in a main one that consists on the determination of the parental origin of different types of de novo mutations (SNVs, small indels and large CNVs).
Resumo:
We investigated two siblings with granulomatous histiocytosis prominent in the nasal area, mimicking rhinoscleroma and Rosai-Dorfman syndrome. Genome-wide linkage analysis and whole-exome sequencing identified a homozygous frameshift deletion in SLC29A3, which encodes human equilibrative nucleoside transporter-3 (hENT3). Germline mutations in SLC29A3 have been reported in rare patients with a wide range of overlapping clinical features and inherited disorders including H syndrome, pigmented hypertrichosis with insulin-dependent diabetes, and Faisalabad histiocytosis. With the exception of insulin-dependent diabetes and mild finger and toe contractures in one sibling, the two patients with nasal granulomatous histiocytosis studied here displayed none of the many SLC29A3-associated phenotypes. This mild clinical phenotype probably results from a remarkable genetic mechanism. The SLC29A3 frameshift deletion prevents the expression of the normally coding transcripts. It instead leads to the translation, expression, and function of an otherwise noncoding, out-of-frame mRNA splice variant lacking exon 3 that is eliminated by nonsense-mediated mRNA decay (NMD) in healthy individuals. The mutated isoform differs from the wild-type hENT3 by the modification of 20 residues in exon 2 and the removal of another 28 amino acids in exon 3, which include the second transmembrane domain. As a result, this new isoform displays some functional activity. This mechanism probably accounts for the narrow and mild clinical phenotype of the patients. This study highlights the"rescue" role played by a normally noncoding mRNA splice variant of SLC29A3, uncovering a new mechanism by which frameshift mutations can be hypomorphic.
Resumo:
We summarize the progress in whole-genome sequencing and analyses of primate genomes. These emerging genome datasets have broadened our understanding of primate genome evolution revealing unexpected and complex patterns of evolutionary change. This includes the characterization of genome structural variation, episodic changes in the repeat landscape, differences in gene expression, new models regarding speciation, and the ephemeral nature of the recombination landscape. The functional characterization of genomic differences important in primate speciation and adaptation remains a significant challenge. Limited access to biological materials, the lack of detailed phenotypic data and the endangered status of many critical primate species have significantly attenuated research into the genetic basis of primate evolution. Next-generation sequencing technologies promise to greatly expand the number of available primate genome sequences; however, such draft genome sequences will likely miss critical genetic differences within complex genomic regions unless dedicated efforts are put forward to understand the full spectrum of genetic variation.
Resumo:
BACKGROUND: The only known albino gorilla, named Snowflake, was a male wild born individual from Equatorial Guinea who lived at the Barcelona Zoo for almost 40 years. He was diagnosed with non-syndromic oculocutaneous albinism, i.e. white hair, light eyes, pink skin, photophobia and reduced visual acuity. Despite previous efforts to explain the genetic cause, this is still unknown. Here, we study the genetic cause of his albinism and making use of whole genome sequencing data we find a higher inbreeding coefficient compared to other gorillas.RESULTS: We successfully identified the causal genetic variant for Snowflake's albinism, a non-synonymous single nucleotide variant located in a transmembrane region of SLC45A2. This transporter is known to be involved in oculocutaneous albinism type 4 (OCA4) in humans. We provide experimental evidence that shows that this amino acid replacement alters the membrane spanning capability of this transmembrane region. Finally, we provide a comprehensive study of genome-wide patterns of autozygogosity revealing that Snowflake's parents were related, being this the first report of inbreeding in a wild born Western lowland gorilla.CONCLUSIONS: In this study we demonstrate how the use of whole genome sequencing can be extended to link genotype and phenotype in non-model organisms and it can be a powerful tool in conservation genetics (e.g., inbreeding and genetic diversity) with the expected decrease in sequencing cost.
Resumo:
Next-generation sequencing techniques such as exome sequencing can successfully detect all genetic variants in a human exome and it has been useful together with the implementation of variant filters to identify causing-disease mutations. Two filters aremainly used for the mutations identification: low allele frequency and the computational annotation of the genetic variant. Bioinformatic tools to predict the effect of a givenvariant may have errors due to the existing bias in databases and sometimes show a limited coincidence among them. Advances in functional and comparative genomics are needed in order to properly annotate these variants.The goal of this study is to: first, functionally annotate Common Variable Immunodeficiency disease (CVID) variants with the available bioinformatic methods in order to assess the reliability of these strategies. Sencondly, as the development of new methods to reduce the number of candidate genetic variants is an active and necessary field of research, we are exploring the utility of gene function information at organism level as a filter for rare disease genes identification. Recently, it has been proposed that only 10-15% of human genes are essential and therefore we would expect that severe rare diseases are mostly caused by mutations on them. Our goal is to determine whether or not these rare and severe diseases are caused by deleterious mutations in these essential genes. If this hypothesis were true, taking into account essential genes as a filter would be an interesting parameter to identify causingdisease mutations.
Resumo:
In this note we study uncertainty sequencing situations, i.e., 1-machine sequencing situations in which no initial order is specified. We associate cooperative games with these sequencing situations, study their core, and provide links with the classic sequencing games introduced by Curiel et al. (1989). Moreover, we propose and characterize two simple cost allocation rules for uncertainty sequencing situations with equal processing times.
Resumo:
Recoveries after recent earthquakes in the U.S. and Japan have shown that large welfare gains can be achieved by reshaping current emergency plans as incentive-compatible contracts. We apply tools from the mechanisms design literature to show ways to integrate economic incentives into the management of natural disasters and discuss issues related to the application to seismic event recovery. The focus is on restoring lifeline services such as the water, gas, transportation, and electric power networks. We put forward decisional procedures that an uninformed planner could employ to set repair priorities and help to coordinate lifeline firms in the post-earthquake reconstruction.
Resumo:
In this paper we study a class of cooperative sequencing games that arise from one-machine sequencing situations in which chain precedence relations are imposed on the jobs. It is shown that these sequencing games are convex.
Resumo:
With the advent of High performance computing, it is now possible to achieve orders of magnitude performance and computation e ciency gains over conventional computer architectures. This thesis explores the potential of using high performance computing to accelerate whole genome alignment. A parallel technique is applied to an algorithm for whole genome alignment, this technique is explained and some experiments were carried out to test it. This technique is based in a fair usage of the available resource to execute genome alignment and how this can be used in HPC clusters. This work is a rst approximation to whole genome alignment and it shows the advantages of parallelism and some of the drawbacks that our technique has. This work describes the resource limitations of current WGA applications when dealing with large quantities of sequences. It proposes a parallel heuristic to distribute the load and to assure that alignment quality is mantained.
Resumo:
En aquest treball, es proposa un nou mètode per estimar en temps real la qualitat del producte final en processos per lot. Aquest mètode permet reduir el temps necessari per obtenir els resultats de qualitat de les anàlisi de laboratori. S'utiliza un model de anàlisi de componentes principals (PCA) construït amb dades històriques en condicions normals de funcionament per discernir si un lot finalizat és normal o no. Es calcula una signatura de falla pels lots anormals i es passa a través d'un model de classificació per la seva estimació. L'estudi proposa un mètode per utilitzar la informació de les gràfiques de contribució basat en les signatures de falla, on els indicadors representen el comportament de les variables al llarg del procés en les diferentes etapes. Un conjunt de dades compost per la signatura de falla dels lots anormals històrics es construeix per cercar els patrons i entrenar els models de classifcació per estimar els resultas dels lots futurs. La metodologia proposada s'ha aplicat a un reactor seqüencial per lots (SBR). Diversos algoritmes de classificació es proven per demostrar les possibilitats de la metodologia proposada.
Resumo:
This work shows the use of adaptation techniques involved in an e-learning system that considers students' learning styles and students' knowledge states. The mentioned e-learning system is built on a multiagent framework designed to examine opportunities to improve the teaching and to motivate the students to learn what they want in a user-friendly and assisted environment
Resumo:
The role of root systems in drought tolerance is a subject of very limited information compared with above-ground responses. Adjustments to the ability of roots to supply water relative to shoot transpiration demand is proposed as a major means for woody perennial plants to tolerate drought, and is often expressed as changes in the ratios of leaf to root area (AL:AR). Seasonal root proliferation in a directed manner could increase the water supply function of roots independent of total root area (AR) and represents a mechanism whereby water supply to demand could be increased. To address this issue, seasonal root proliferation, stomatal conductance (gs) and whole root system hydraulic conductance (kr) were investigated for a drought-tolerant grape root system (Vitis berlandieri×V. rupestris cv. 1103P) and a non-drought-tolerant root system (Vitis riparia×V. rupestris cv. 101-14Mgt), upon which had been grafted the same drought-sensitive clone of Vitis vinifera cv. Merlot. Leaf water potentials (ψL) for Merlot grafted onto the 1103P root system (–0.91±0.02 MPa) were +0.15 MPa higher than Merlot on 101-14Mgt (–1.06±0.03 MPa) during spring, but dropped by approximately –0.4 MPa from spring to autumn, and were significantly lower by –0.15 MPa (–1.43±0.02 MPa) than for Merlot on 101-14Mgt (at –1.28±0.02 MPa). Surprisingly, gs of Merlot on the drought-tolerant root system (1103P) was less down-regulated and canopies maintained evaporative fluxes ranging from 35–20 mmol vine−1 s−1 during the diurnal peak from spring to autumn, respectively, three times greater than those measured for Merlot on the drought-sensitive rootstock 101-14Mgt. The drought-tolerant root system grew more roots at depth during the warm summer dry period, and the whole root system conductance (kr) increased from 0.004 to 0.009 kg MPa−1 s−1 during that same time period. The changes in kr could not be explained by xylem anatomy or conductivity changes of individual root segments. Thus, the manner in which drought tolerance was conveyed to the drought-sensitive clone appeared to arise from deep root proliferation during the hottest and driest part of the season, rather than through changes in xylem structure, xylem density or stomatal regulation. This information can be useful to growers on a site-specific basis in selecting rootstocks for grape clonal material (scions) grafted to them.