27 results for Single Nucleotide Polymorphism
Abstract:
Background: Single Nucleotide Polymorphisms, among other types of sequence variants, constitute key elements in genetic epidemiology and pharmacogenomics. While sequence data about genetic variation are found in databases such as dbSNP, clues about the functional and phenotypic consequences of the variations are generally found in the biomedical literature. The identification of the relevant documents and the extraction of information from them are hampered by the large size of literature databases and the lack of a widely accepted standard notation for biomedical entities. Thus, automatic systems for the identification of citations of allelic variants of genes in biomedical texts are required. Results: Our group has previously reported the development of OSIRIS, a system aimed at the retrieval of literature about allelic variants of genes (http://ibi.imim.es/osirisform.html). Here we describe the development of a new version of OSIRIS (OSIRISv1.2, http://ibi.imim.es/OSIRISv1.2.html), which incorporates a new entity recognition module and is built on top of a local mirror of the MEDLINE collection and HgenetInfoDB, a database that collects data on human gene sequence variations. The new entity recognition module is based on a pattern-based search algorithm for the identification of variation terms in the texts and their mapping to dbSNP identifiers. The performance of OSIRISv1.2 was evaluated on a manually annotated corpus, resulting in 99% precision, 82% recall, and an F-score of 0.89. As an example, the application of the system for collecting literature citations for the allelic variants of genes related to the diseases intracranial aneurysm and breast cancer is presented. Conclusion: OSIRISv1.2 can be used to link literature references to dbSNP database entries with high accuracy, and is therefore suitable for collecting current knowledge on gene sequence variations and supporting the functional annotation of variation databases.
The application of OSIRISv1.2 in combination with controlled vocabularies like MeSH provides a way to identify associations of biomedical interest, such as those that relate SNPs with diseases.
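As a rough illustration of what a pattern-based search for variation terms and an F-score calculation can look like, the following Python sketch may help. The regular expressions and the function names `find_variation_terms` and `f_score` are assumptions made for this example; they are not the patterns or the code used by OSIRIS, and mapping the matches to dbSNP entries is not attempted here.

```python
import re

# Hypothetical patterns, NOT the ones used by OSIRIS: dbSNP rs identifiers
# and simple protein-level substitutions written like "Val158Met".
RS_ID = re.compile(r"\brs\d+\b")
AA = "Ala|Arg|Asn|Asp|Cys|Gln|Glu|Gly|His|Ile|Leu|Lys|Met|Phe|Pro|Ser|Thr|Trp|Tyr|Val"
AA_SUBSTITUTION = re.compile(rf"\b(?:{AA})\d+(?:{AA})\b")


def find_variation_terms(text):
    """Return rs identifiers followed by amino acid substitutions found in text."""
    return RS_ID.findall(text) + AA_SUBSTITUTION.findall(text)


def f_score(precision, recall):
    """Balanced F-score: the harmonic mean of precision and recall."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)
```

With the rounded figures quoted in the abstract, `f_score(0.99, 0.82)` evaluates to about 0.897, which is in line with the reported F-score once rounding of the published precision and recall is allowed for.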
Abstract:
BACKGROUND: The only known albino gorilla, named Snowflake, was a wild-born male from Equatorial Guinea who lived at the Barcelona Zoo for almost 40 years. He was diagnosed with non-syndromic oculocutaneous albinism, i.e. white hair, light eyes, pink skin, photophobia and reduced visual acuity. Despite previous efforts to explain it, the genetic cause is still unknown. Here, we study the genetic cause of his albinism and, making use of whole genome sequencing data, find a higher inbreeding coefficient than in other gorillas. RESULTS: We successfully identified the causal genetic variant for Snowflake's albinism, a non-synonymous single nucleotide variant located in a transmembrane region of SLC45A2. This transporter is known to be involved in oculocutaneous albinism type 4 (OCA4) in humans. We provide experimental evidence that this amino acid replacement alters the membrane-spanning capability of this transmembrane region. Finally, we provide a comprehensive study of genome-wide patterns of autozygosity revealing that Snowflake's parents were related, making this the first report of inbreeding in a wild-born Western lowland gorilla. CONCLUSIONS: In this study we demonstrate how whole genome sequencing can be extended to link genotype and phenotype in non-model organisms, and how, with the expected decrease in sequencing cost, it can be a powerful tool in conservation genetics (e.g., inbreeding and genetic diversity).
Abstract:
High-throughput prioritization of cancer-causing mutations (drivers) is a key challenge of cancer genome projects, due to the number of somatic variants detected in tumors. One important step in this task is to assess the functional impact of tumor somatic mutations. A number of computational methods have been employed for that purpose, although most were originally developed to distinguish disease-related nonsynonymous single nucleotide variants (nsSNVs) from polymorphisms. Our new method, transformed Functional Impact score for Cancer (transFIC), improves the assessment of the functional impact of tumor nsSNVs by taking into account the baseline tolerance of genes to functional variants.
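One way to picture "taking into account the baseline tolerance" is to rescale a raw functional impact score against the scores of a baseline set of variants in comparable genes. The Python sketch below does this with a simple z-score. This choice of transformation, the function name `transformed_score` and the example inputs are assumptions for illustration only; the abstract does not specify how transFIC is computed.

```python
from statistics import mean, stdev


def transformed_score(raw_score, baseline_scores):
    """Express raw_score in standard deviations from the baseline mean.

    baseline_scores is a hypothetical list of impact scores for variants
    that are tolerated in genes comparable to the one being assessed.
    """
    if len(baseline_scores) < 2:
        raise ValueError("need at least two baseline scores")
    spread = stdev(baseline_scores)  # sample standard deviation
    if spread == 0:
        raise ValueError("baseline scores have zero variance")
    return (raw_score - mean(baseline_scores)) / spread
```

Under this sketch, the same raw score yields a larger transformed value in a gene group whose baseline variants score low and vary little than in a group that routinely tolerates high-scoring variants.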
Abstract:
Several studies over the last few years have shown that newly arising (de novo) mutations contribute to the genetics of schizophrenia (SZ), autism (ASD) and other developmental disorders. The strongest evidence comes from studies of de novo Copy Number Variation (CNV), where the rate of new mutations is shown to be increased in cases compared with controls [23, 24]. Research on de novo point mutations and small insertion-deletions (indels) has been more limited, but with the development of next-generation sequencing (NGS) technology, such studies are beginning to provide preliminary evidence that de novo single-nucleotide mutations (SNVs) might also increase the risk of SZ and ASD [25, 26]. Advanced paternal age is a major source of new mutations in human beings [27] and could thus be associated with increased risk of developing SZ, ASD or other developmental disorders. Indeed, advanced paternal age is found to be a risk factor for developing SZ and ASD in the offspring [28, 29], and new mutations related to advanced paternal age have been implicated as a cause of sporadic cases in several autosomal dominant diseases, some neurodevelopmental diseases, including SZ and ASD, and social functioning. New single-base substitutions occur at higher rates in males than in females, and this difference increases with paternal age. This is because sperm cells go through a much higher number of cell divisions (~840 by the age of 50), which increases the risk of DNA copy errors in the male germ line [30]. By contrast, the female eggs (oocytes) undergo only 24 cell divisions, and all but the last occur during foetal life. The aim of my project is to determine the parent-of-origin of de novo SNVs, using large samples of parent-offspring trios affected with schizophrenia (SZ).
From whole exome sequencing of 618 affected Bulgarian parent-offspring trios, nearly 1000 de novo mutations (SNVs or small indels) have been identified, and the parent-of-origin can be established for at least 60% of them (N=600). This project is part of a larger one that aims to determine the parental origin of different types of de novo mutations (SNVs, small indels and large CNVs).
Abstract:
Background: In recent years, microRNA (miRNA) pathways have emerged as a crucial system for the regulation of tumorigenesis. miR-SNPs are a novel class of single nucleotide polymorphisms that can affect miRNA pathways. Design and Methods: We analyzed eight miR-SNPs by allelic discrimination in 141 patients with Hodgkin lymphoma and correlated the results with treatment-related toxicity, response, disease-free survival (DFS) and overall survival (OS). Results: The KRT81 (rs3660) GG genotype was associated with an increased risk of neurological toxicity (P=0.016), while patients with XPO5 (rs11077) AA or CC genotypes had a higher rate of bleomycin-associated pulmonary toxicity (P=0.048). Both miR-SNPs emerged as independent factors in the multivariate analysis. The XPO5 AA and CC genotypes were also associated with a lower response rate (P=0.036). XPO5 (P=0.039) and TRBP (rs784567) (P=0.022) genotypes emerged as prognostic markers for DFS, and XPO5 was also associated with OS (P=0.033). In the multivariate analysis, only XPO5 emerged as an independent prognostic factor for DFS (HR: 2.622; 95% CI 1.039-6.620; P=0.041). Given the influence of XPO5 and TRBP as individual markers, we then investigated the combined effect of these miR-SNPs. Patients with both the XPO5 AA/CC and TRBP TT/TC genotypes had the shortest DFS (P=0.008) and OS (P=0.008). Conclusion: miR-SNPs can add useful prognostic information on treatment-related toxicity and clinical outcome in Hodgkin lymphoma and can be used to identify patients likely to be chemoresistant or to relapse.
Abstract:
BACKGROUND: Genetic factors play a role in chronic obstructive pulmonary disease (COPD) but are poorly understood. A number of candidate genes have been proposed on the basis of the pathogenesis of COPD. These include the matrix metalloproteinase (MMP) genes, which play a role in tissue remodelling and fit in with the protease-antiprotease imbalance theory for the cause of COPD. Previous genetic studies of MMPs in COPD have had inadequate coverage of the genes, and have reported conflicting associations of both single nucleotide polymorphisms (SNPs) and SNP haplotypes, plausibly due to under-powered studies. METHODS: To address these issues we genotyped 26 SNPs, providing comprehensive coverage of reported SNP variation, in MMP-1, MMP-9 and MMP-12 from 977 COPD patients and 876 non-diseased smokers of European descent and evaluated their association with disease singly and in haplotype combinations. We used logistic regression to adjust for age, gender, centre and smoking history. RESULTS: Haplotypes of two SNPs in MMP-12 (rs652438 and rs2276109) showed an association with severe/very severe disease, corresponding to GOLD Stages III and IV. CONCLUSIONS: Those with the common A-A haplotype for these two SNPs were at greater risk of developing severe/very severe disease (p = 0.0039), while possession of the minor G variants at either SNP locus had a protective effect (adjusted odds ratio of 0.76; 95% CI 0.61-0.94). The A-A haplotype was also associated with significantly lower predicted FEV1 (42.62% versus 44.79%; p = 0.0129). This implicates haplotypes of MMP-12 as modifiers of disease severity.
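For readers unfamiliar with how an odds ratio and its 95% confidence interval are obtained, the following Python sketch computes the unadjusted odds ratio with a Wald interval from a 2x2 table of counts. This is a simplification: the study reports an odds ratio adjusted by logistic regression, which this sketch does not reproduce, and the function name `odds_ratio_ci` and any counts passed to it are hypothetical.

```python
import math


def odds_ratio_ci(a, b, c, d, z=1.96):
    """Unadjusted odds ratio and Wald confidence interval for a 2x2 table.

    a: carriers among cases,    b: non-carriers among cases,
    c: carriers among controls, d: non-carriers among controls.
    z = 1.96 gives an approximate 95% interval.
    """
    if min(a, b, c, d) <= 0:
        raise ValueError("all cell counts must be positive")
    odds_ratio = (a * d) / (b * c)
    # Standard error of log(OR) under the usual large-sample approximation.
    se_log_or = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    lower = math.exp(math.log(odds_ratio) - z * se_log_or)
    upper = math.exp(math.log(odds_ratio) + z * se_log_or)
    return odds_ratio, lower, upper
```

An interval that lies entirely below 1, as in the reported 0.61-0.94, is what indicates a protective effect at the chosen confidence level.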
Abstract:
The shrimp production industry is currently one of the fastest-growing aquaculture industries. Studies to identify genetic markers are very effective for improving shrimp traits and are of great interest to shrimp producers. In this work, six individuals from a population of Litopenaeus vannamei were used, in which four single nucleotide polymorphisms (SNPs) were found in the 5HT1R gene (5-hydroxytryptamine receptor 1) and one SNP in the STAT gene (signal transducer and activator of transcription). However, the polymorphism in the STAT gene proved to be homozygous in a different population used for association analysis. The present analyses revealed that the C allele at two SNPs (C109T and C395G) of the 5HT1R gene tends to be associated with increased body weight. We consider that further studies are needed using a larger and more diverse sample of the population in question.
Abstract:
There is growing public concern about reducing saturated fat intake. Stearoyl-CoA desaturase (SCD) is the lipogenic enzyme responsible for the biosynthesis of oleic acid (18:1) by desaturating stearic acid (18:0). Here we describe a total of 18 mutations in the promoter and 3′ non-coding region of the pig SCD gene and provide evidence that allele T at AY487830:g.2228T>C in the promoter region enhances fat desaturation (the ratio 18:1/18:0 in muscle increases from 3.78 to 4.43 in opposite homozygotes) without affecting fat content (18:0+18:1, intramuscular fat content, and backfat thickness). No mutations that could affect the functionality of the protein were found in the coding region. First, we proved in a purebred Duroc line that the C-T-A haplotype of the 3 single nucleotide polymorphisms (SNPs) (g.2108C>T; g.2228T>C; g.2281A>G) of the promoter region was additively associated with enhanced 18:1/18:0 both in muscle and subcutaneous fat, but not in liver. We show that this association was consistent over a 10-year period of overlapping generations and, in line with these results, that the C-T-A haplotype displayed greater SCD mRNA expression in muscle. The effect of this haplotype was validated both internally, by comparing opposite homozygote siblings, and externally, by using experimental Duroc-based crossbreds. Second, the g.2281A>G and the g.2108C>T SNPs were excluded as causative mutations using new and previously published data, restricting the causality to the g.2228T>C SNP, the last source of genetic variation within the haplotype. This mutation is positioned in the core sequence of several putative transcription factor binding sites, so there are several plausible mechanisms by which allele T enhances 18:1/18:0 and, consequently, the proportion of monounsaturated to saturated fat.
Abstract:
Background: Toll-like receptors (TLRs) are critical components for host pathogen recognition, and variants in genes participating in this response influence susceptibility to infections. Recently, TLR1 gene polymorphisms have been found to correlate with whole blood hyper-inflammatory responses to pathogen-associated molecules and to be associated with sepsis-associated multiorgan dysfunction and acute lung injury (ALI). We examined the association of common variants of the TLR1 gene with sepsis-derived complications in an independent study and with serum levels of four inflammatory biomarkers among septic patients. Methodology/Principal Findings: Seven tagging single nucleotide polymorphisms of the TLR1 gene were genotyped in samples from a prospective multicenter case-only study of patients with severe sepsis admitted into a network of intensive care units and followed for disease severity. Interleukin (IL)-1β, IL-6, IL-10, and C-reactive protein (CRP) serum levels were measured at study entry, at 48 h and at day 7. Alleles -7202G and 248Ser, and the 248Ser-602Ile haplotype, were associated with circulatory dysfunction among severe septic patients (0.001 ≤ p ≤ 0.022), and with reduced IL-10 (0.012 ≤ p ≤ 0.047) and elevated CRP (0.011 ≤ p ≤ 0.036) serum levels during the first week of sepsis development. Additionally, the -7202GG genotype was found to be associated with hospital mortality (p = 0.017) and ALI (p = 0.050) in a combined analysis with European Americans, suggesting common risk effects among studies. Conclusions/Significance: These results partially replicate and extend previous findings, supporting the view that variants of the TLR1 gene are determinants of severe complications during sepsis.
Abstract:
Introduction. Genetic epidemiology is focused on the study of the genetic causes that determine health and disease in populations. To achieve this goal, a common strategy is to explore differences in genetic variability between diseased and nondiseased individuals. The usual markers of genetic variability are single nucleotide polymorphisms (SNPs), which are changes in just one base in the genome. The usual statistical approach in genetic epidemiology studies is a marginal analysis, where each SNP is analyzed separately for association with the phenotype. Motivation. It has been observed that, for common diseases, single-SNP analysis is not very powerful for detecting causal genetic variants. In this work, we consider Gene Set Analysis (GSA) as an alternative to standard marginal association approaches. GSA aims to assess the overall association of a set of genetic variants with a phenotype and has the potential to detect subtle effects of variants in a gene or a pathway that might be missed when assessed individually. Objective. We present a new optimized implementation of a pair of gene set analysis methodologies for analyzing the individual evidence of SNPs in biological pathways. We perform a simulation study to explore the power of the proposed methodologies in a set of scenarios with different numbers of causal SNPs under different effect sizes. In addition, we compare the results with the usual single-SNP analysis method. Moreover, we show the advantage of using the proposed gene set approaches in the context of an Alzheimer's disease case-control study where we explore the Reelin signal pathway.
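To make the idea of pooling single-SNP evidence over a gene set more concrete, the Python sketch below combines per-SNP p-values with Fisher's combined probability test. This is a generic stand-in chosen for illustration: the abstract does not name the two methodologies it implements, and the function name `fisher_combined_p` is hypothetical. Fisher's method also assumes independent tests, which SNPs in linkage disequilibrium generally violate, so in practice significance would usually be assessed by permutation.

```python
import math


def fisher_combined_p(p_values):
    """Combine independent p-values with Fisher's method.

    The statistic X = -2 * sum(ln p_i) follows a chi-square distribution
    with 2k degrees of freedom under the null, where k = len(p_values).
    """
    if not p_values or any(not 0 < p <= 1 for p in p_values):
        raise ValueError("p-values must lie in (0, 1]")
    k = len(p_values)
    half_stat = -sum(math.log(p) for p in p_values)  # equals X / 2
    # Closed-form chi-square survival function for an even number (2k) of
    # degrees of freedom: exp(-X/2) * sum_{i=0}^{k-1} (X/2)^i / i!
    term, total = 1.0, 1.0
    for i in range(1, k):
        term *= half_stat / i
        total += term
    return math.exp(-half_stat) * total
```

The point of such a combination is that several SNPs with individually unremarkable p-values can still yield a small set-level p-value, which is the kind of subtle joint effect that a marginal analysis may miss.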
Influence of M. tuberculosis lineage variability within a clinical trial for pulmonary tuberculosis.
Abstract:
Recent studies suggest that M. tuberculosis lineage and host genetics interact to affect how active tuberculosis presents clinically. We determined the phylogenetic lineages of M. tuberculosis isolates from participants enrolled in the Tuberculosis Trials Consortium Study 28, conducted in Brazil, Canada, South Africa, Spain, Uganda and the United States, and secondarily explored the relationship between lineage, clinical presentation and response to treatment. Large sequence polymorphisms and single nucleotide polymorphisms were analyzed to determine the lineage and sublineage of isolates. Of 306 isolates genotyped, 246 (80.4%) belonged to the Euro-American lineage, with sublineage 724 predominating at African sites (99/192, 51.5%), and Euro-American strains other than 724 predominating at non-African sites (89/114, 78.1%). Uneven distribution of lineages across regions limited our ability to discern significant associations; nonetheless, in univariate analyses, Euro-American sublineage 724 was associated with more severe disease at baseline and, along with the East Asian lineage, with lower bacteriologic conversion after 8 weeks of treatment. Disease presentation and response to drug treatment varied by lineage, but these associations were no longer statistically significant after adjustment for other variables associated with week-8 culture status.
Abstract:
Nontypable Haemophilus influenzae (NTHi) has emerged as an important opportunistic pathogen causing infection in adults suffering from obstructive lung diseases. Existing evidence associates chronic infection by NTHi with the progression of chronic respiratory disease, but specific features of NTHi associated with persistence have not been comprehensively addressed. To provide clues about adaptive strategies adopted by NTHi during persistent infection, we compared sequential persistent isolates with newly acquired isolates in sputa from six patients with chronic obstructive lung disease. Pulsed-field gel electrophoresis (PFGE) identified three patients with consecutive persistent strains and three with new strains. Phenotypic characterisation included infection of respiratory epithelial cells, bacterial self-aggregation, biofilm formation and resistance to antimicrobial peptides (AMP). Persistent isolates differed from new strains in showing low epithelial adhesion and inability to form biofilms when grown under continuous-flow culture conditions in microfermenters. Self-aggregation clustered the strains by patient, not by persistence. Increasing resistance to AMPs was observed for each series of persistent isolates; this was not associated with lipooligosaccharide decoration with phosphorylcholine or with lipid A acylation. Variation was further analyzed for the series of three persistent isolates recovered from patient 1. These isolates displayed comparable growth rate, natural transformation frequency and murine pulmonary infection. Genome sequencing of these three isolates revealed sequential acquisition of single-nucleotide variants in the AMP permease sapC, the heme acquisition systems hgpB, hgpC, hup and hxuC, the 3-deoxy-D-manno-octulosonic acid kinase kdkA, the long-chain fatty acid transporter ompP1, and the phosphoribosylamine glycine ligase purD.
Collectively, we frame a range of pathogenic traits and a repertoire of genetic variants in the context of persistent infection by NTHi.