921 resultados para SINGLE-NUCLEOTIDE POLYMORPHISMS
Resumo:
The growing accessibility to genomic resources using next-generation sequencing (NGS) technologies has revolutionized the application of molecular genetic tools to ecology and evolutionary studies in non-model organisms. Here we present the case study of the European hake (Merluccius merluccius), one of the most important demersal resources of European fisheries. Two sequencing platforms, the Roche 454 FLX (454) and the Illumina Genome Analyzer (GAII), were used for Single Nucleotide Polymorphisms (SNPs) discovery in the hake muscle transcriptome. De novo transcriptome assembly into unique contigs, annotation, and in silico SNP detection were carried out in parallel for 454 and GAII sequence data. High-throughput genotyping using the Illumina GoldenGate assay was performed for validating 1,536 putative SNPs. Validation results were analysed to compare the performances of 454 and GAII methods and to evaluate the role of several variables (e.g. sequencing depth, intron-exon structure, sequence quality and annotation). Despite well-known differences in sequence length and throughput, the two approaches showed similar assay conversion rates (approximately 43%) and percentages of polymorphic loci (67.5% and 63.3% for GAII and 454, respectively). Both NGS platforms therefore demonstrated to be suitable for large scale identification of SNPs in transcribed regions of non-model species, although the lack of a reference genome profoundly affects the genotyping success rate. The overall efficiency, however, can be improved using strict quality and filtering criteria for SNP selection (sequence quality, intron-exon structure, target region score).
Resumo:
Recent improvements in the speed, cost and accuracy of next generation sequencing are revolutionizing the discovery of single nucleotide polymorphisms (SNPs). SNPs are increasingly being used as an addition to the molecular ecology toolkit in nonmodel organisms, but their efficient use remains challenging. Here, we discuss common issues when employing SNP markers, including the high numbers of markers typically employed, the effects of ascertainment bias and the inclusion of nonneutral loci in a marker panel. We provide a critique of considerations specifically associated with the application and population genetic analysis of SNPs in nonmodel taxa, focusing specifically on some of the most commonly applied methods.
Resumo:
Regulations on the exploitation of populations of commercially important fish species and the ensuing consumer interest in sustainable products have increased the need to accurately identify the population of origin of fish and fish products. Although genomics-based tools have proven highly useful, there are relatively few examples in marine fish displaying accurate origin assignment. We synthesize data for 156 single-nucleotide polymorphisms typed in 1039 herring, Clupea harengus L., spanning the Northeast Atlantic to develop a tool that allows assignment of individual herring to their regional origin. We show the method's suitability to address specific biological questions, as well as management applications. We analyse temporally replicated collections from two areas, the Skagerrak (n = 81, 84, 66) and the western Baltic (n = 52, 52). Both areas harbour heavily fished mixed-origin stocks, complicating management issues. We report novel genetic evidence that herring from the Baltic Sea contribute to catches in the North Sea, and find support that western Baltic feeding aggregations mainly constitute herring from the western Baltic with contributions from the Eastern Baltic. Our study describes a general approach and outlines a database allowing individual assignment and traceability of herring across a large part of its East Atlantic distribution.
Resumo:
Introduction: The chromosome 9p21 locus has been identified as a marker of coronary artery disease. In this locus studies have focused on variations in the ANRIL gene that has also been identified as a strong candidate for association with aggressive periodontitis (AgP).
Objective: To investigate possible associations between gene variants of ANRIL and AgP in European and African populations.
Methods: European AgP cases (n= 213) and age-matched periodontally healthy controls (n= 81) were recruited from centres in the United Kingdom (Belfast, Glasgow, Newcastle and London). African AgP cases (n= 95) and controls (n= 105) were recruited in Khartoum, Sudan. Five single nucleotide polymorphisms (SNPs) in ANRIL were genotyped using Sequenom and analysed using Haploview with permutation testing to correct for multiple candidates. Odds ratios (OR) and 95% confidence intervals (95%CI) were calculated.
Results: In the European subjects there was a significant association between rs518394 (p=0.0013; OR = 1.81, 95%CI 1.26-2.61) and rs1333049 (p=0.0028; OR = 1.75, 95%CI 1.21-2.52) and AgP. These associations remained significant after permutation testing. In addition there was an association between rs 1360590 (p=0.035) and AgP in females. In the African subjects there was a significant association between only one SNP rs1537415 and AgP (p=0.036; OR = 1.59, 95%CI 1.04-2.43), however, this was not significant following permutation testing. There were no significant associations with rs3217992 in either population.
Conclusions: SNP variants in the ANRIL locus were shown to be significantly associated with AgP in a European population and for the first time in an African population confirming this as the best replicated locus for aggressive periodontitis.
Resumo:
Seafloor massive sulfide (SMS) mining will likely occur at hydrothermal systems in the near future. Alongside their mineral wealth, SMS deposits also have considerable biological value. Active SMS deposits host endemic hydrothermal vent communities, whilst inactive deposits support communities of deep water corals and other suspension feeders. Mining activities are expected to remove all large organisms and suitable habitat in the immediate area, making vent endemic organisms particularly at risk from habitat loss and localised extinction. As part of environmental management strategies designed to mitigate the effects of mining, areas of seabed need to be protected to preserve biodiversity that is lost at the mine site and to preserve communities that support connectivity among populations of vent animals in the surrounding region. These "set-aside" areas need to be biologically similar to the mine site and be suitably connected, mostly by transport of larvae, to neighbouring sites to ensure exchange of genetic material among remaining populations. Establishing suitable set-asides can be a formidable task for environmental managers, however the application of genetic approaches can aid set-aside identification, suitability assessment and monitoring. There are many genetic tools available, including analysis of mitochondrial DNA (mtDNA) sequences (e.g. COI or other suitable mtDNA genes) and appropriate nuclear DNA markers (e.g. microsatellites, single nucleotide polymorphisms), environmental DNA (eDNA) techniques and microbial metagenomics. When used in concert with traditional biological survey techniques, these tools can help to identify species, assess the genetic connectivity among populations and assess the diversity of communities. How these techniques can be applied to set-aside decision making is discussed and recommendations are made for the genetic characteristics of set-aside sites. A checklist for environmental regulators forms a guide to aid decision making on the suitability of set-aside design and assessment using genetic tools. This non-technical primer document represents the views of participants in the VentBase 2014 workshop.
Resumo:
Herring, Clupea harengus, is one of the ecologically and commercially most important species in European northern seas, where two distinct ecotypes have been described based on spawning time; spring and autumn. To date, it is unknown if these spring and autumn spawning herring constitute genetically distinct units. We assessed levels of genetic divergence between spring and autumn spawning herring in the Baltic Sea using two types of DNA markers, microsatellites and Single Nucleotide Polymorphisms, and compared the results with data for autumn spawning North Sea herring. Temporally replicated analyses reveal clear genetic differences between ecotypes and hence support reproductive isolation. Loci showing non-neutral behaviour, so-called outlier loci, show convergence between autumn spawning herring from demographically disjoint populations, potentially reflecting selective processes associated with autumn spawning ecotypes. The abundance and
exploitation of the two ecotypes have varied strongly over space and time in the Baltic Sea, where autumn spawners have faced strong depression for decades. The results therefore have practical implications by highlighting the need for specific management of these co-occurring ecotypes to meet requirements for sustainable exploitation and ensure optimal livelihood for coastal communities.
Resumo:
The genetic code is not universal. Alterations to its standard form have been discovered in both prokaryotes and eukaryotes and demolished the dogma of an immutable code. For instance, several Candida species translate the standard leucine CUG codon as serine. In the case of the human pathogen Candida albicans, a serine tRNA (tRNACAGSer) incorporates in vivo 97% of serine and 3% of leucine in proteins at CUG sites. Such ambiguity is flexible and the level of leucine incorporation increases significantly in response to environmental stress. To elucidate the function of such ambiguity and clarify whether the identity of the CUG codon could be reverted from serine back to leucine, we have developed a forced evolution strategy to increase leucine incorporation at CUGs and a fluorescent reporter system to monitor such incorporation in vivo. Leucine misincorporation increased from 3% up to nearly 100%, reverting CUG identity from serine back to leucine. Growth assays showed that increasing leucine incorporation produced impressive arrays of phenotypes of high adaptive potential. In particular, strains with high levels of leucine misincorporation exhibited novel phenotypes and high level of tolerance to antifungals. Whole genome re-sequencing revealed that increasing levels of leucine incorporation were associated with accumulation of single nucleotide polymorphisms (SNPs) and loss of heterozygozity (LOH) in the higher misincorporating strains. SNPs accumulated preferentially in genes involved in cell adhesion, filamentous growth and biofilm formation, indicating that C. albicans uses its natural CUG ambiguity to increase genetic diversity in pathogenesis and drug resistance related processes. The overall data provided evidence for unantecipated flexibility of the C. albicans genetic code and highlighted new roles of codon ambiguity on the evolution of genetic and phenotypic diversity.
Resumo:
Tese de mestrado. Biologia (Biologia Humana e Ambiente). Universidade de Lisboa, Faculdade de Ciências, 2014
Resumo:
BACKGROUND: Data for multiple common susceptibility alleles for breast cancer may be combined to identify women at different levels of breast cancer risk. Such stratification could guide preventive and screening strategies. However, empirical evidence for genetic risk stratification is lacking. METHODS: We investigated the value of using 77 breast cancer-associated single nucleotide polymorphisms (SNPs) for risk stratification, in a study of 33 673 breast cancer cases and 33 381 control women of European origin. We tested all possible pair-wise multiplicative interactions and constructed a 77-SNP polygenic risk score (PRS) for breast cancer overall and by estrogen receptor (ER) status. Absolute risks of breast cancer by PRS were derived from relative risk estimates and UK incidence and mortality rates. RESULTS: There was no strong evidence for departure from a multiplicative model for any SNP pair. Women in the highest 1% of the PRS had a three-fold increased risk of developing breast cancer compared with women in the middle quintile (odds ratio [OR] = 3.36, 95% confidence interval [CI] = 2.95 to 3.83). The ORs for ER-positive and ER-negative disease were 3.73 (95% CI = 3.24 to 4.30) and 2.80 (95% CI = 2.26 to 3.46), respectively. Lifetime risk of breast cancer for women in the lowest and highest quintiles of the PRS were 5.2% and 16.6% for a woman without family history, and 8.6% and 24.4% for a woman with a first-degree family history of breast cancer. CONCLUSIONS: The PRS stratifies breast cancer risk in women both with and without a family history of breast cancer. The observed level of risk discrimination could inform targeted screening and prevention strategies. Further discrimination may be achievable through combining the PRS with lifestyle/environmental factors, although these were not considered in this report.
Resumo:
Mycobacterium avium Complex (MAC) comprises microorganisms that affect a wide range of animals including humans. The most relevant are Mycobacterium avium subspecies hominissuis (Mah) with a high impact on public health affecting mainly immunocompromised individuals and Mycobacterium avium subspecies paratuberculosis (Map) causing paratuberculosis in animals with a high economic impact worldwide. In this work, we characterized 28 human and 67 porcine Mah isolates and evaluated the relationship among them by Multiple-Locus Variable number tandem repeat Analysis (MLVA). We concluded that Mah population presented a high genetic diversity and no correlations were inferred based on geographical origin, host or biological sample. For the first time in Portugal Map strains, from asymptomatic bovine faecal samples were isolated highlighting the need of more reliable and rapid diagnostic methods for Map direct detection. Therefore, we developed an IS900 nested real time PCR with high sensitivity and specificity associated with optimized DNA extraction methodologies for faecal and milk samples. We detected 83% of 155 faecal samples from goats, cattle and sheep, and 26% of 98 milk samples from cattle, positive for Map IS900 nested real time PCR. A novel SNPs (single nucleotide polymorphisms) assay to Map characterization based on a Whole Genome Sequencing analysis was developed to elucidate the genetic relationship between strains. Based on sequential detection of 14 SNPs and on a decision tree we were able to differentiate 14 phylogenetic groups with a higher discriminatory power compared to other typing methods. A pigmented Map strain was isolated and characterized evidencing for the first time to our knowledge the existence of pigmented Type C strains. With this work, we intended to improve the ante mortem direct molecular detection of Map, to conscientiously aware for the existence of Map animal infections widespread in Portugal and to contribute to the improvement of Map and Mah epidemiological studies.
Resumo:
Restriction site-associated DNA sequencing (RADseq) provides researchers with the ability to record genetic polymorphism across thousands of loci for nonmodel organisms, potentially revolutionizing the field of molecular ecology. However, as with other genotyping methods, RADseq is prone to a number of sources of error that may have consequential effects for population genetic inferences, and these have received only limited attention in terms of the estimation and reporting of genotyping error rates. Here we use individual sample replicates, under the expectation of identical genotypes, to quantify genotyping error in the absence of a reference genome. We then use sample replicates to (i) optimize de novo assembly parameters within the program Stacks, by minimizing error and maximizing the retrieval of informative loci; and (ii) quantify error rates for loci, alleles and single-nucleotide polymorphisms. As an empirical example, we use a double-digest RAD data set of a nonmodel plant species, Berberis alpina, collected from high-altitude mountains in Mexico.
Resumo:
Background: Therapy of chronic hepatitis C (CHC) with pegIFNa/ribavirin achieves sustained virologic response (SVR) in ~55%. Pre-activation of the endogenous interferon system in the liver is associated non-response (NR). Recently, genome-wide association studies described associations of allelic variants near the IL28B (IFNλ3) gene with treatment response and with spontaneous clearance of the virus. We investigated if the IL28B genotype determines the constitutive expression of IFN stimulated genes (ISGs) in the liver of patients with CHC. Methods: We genotyped 93 patients with CHC for 3 IL28B single nucleotide polymorphisms (SNPs, rs12979860, rs8099917, rs12980275), extracted RNA from their liver biopsies and quantified the expression of IL28B and of 8 previously identified classifier genes which discriminate between SVR and NR (IFI44L, RSAD2, ISG15, IFI22, LAMP3, OAS3, LGALS3BP and HTATIP2). Decision tree ensembles in the form of a random forest classifier were used to calculate the relative predictive power of these different variables in a multivariate analysis. Results: The minor IL28B allele (bad risk for treatment response) was significantly associated with increased expression of ISGs, and, unexpectedly, with decreased expression of IL28B. Stratification of the patients into SVR and NR revealed that ISG expression was conditionally independent from the IL28B genotype, i.e. there was an increased expression of ISGs in NR compared to SVR irrespective of the IL28B genotype. The random forest feature score (RFFS) identified IFI27 (RFFS = 2.93), RSAD2 (1.88) and HTATIP2 (1.50) expression and the HCV genotype (1.62) as the strongest predictors of treatment response. ROC curves of the IL28B SNPs showed an AUC of 0.66 with an error rate (ERR) of 0.38. A classifier with the 3 best classifying genes showed an excellent test performance with an AUC of 0.94 and ERR of 0.15. The addition of IL28B genotype information did not improve the predictive power of the 3-gene classifier. Conclusions: IL28B genotype and hepatic ISG expression are conditionally independent predictors of treatment response in CHC. There is no direct link between altered IFNλ3 expression and pre-activation of the endogenous system in the liver. Hepatic ISG expression is by far the better predictor for treatment response than IL28B genotype.
Resumo:
AIMS/HYPOTHESIS: Several susceptibility genes for type 2 diabetes have been discovered recently. Individually, these genes increase the disease risk only minimally. The goals of the present study were to determine, at the population level, the risk of diabetes in individuals who carry risk alleles within several susceptibility genes for the disease and the added value of this genetic information over the clinical predictors. METHODS: We constructed an additive genetic score using the most replicated single-nucleotide polymorphisms (SNPs) within 15 type 2 diabetes-susceptibility genes, weighting each SNP with its reported effect. We tested this score in the extensively phenotyped population-based cross-sectional CoLaus Study in Lausanne, Switzerland (n = 5,360), involving 356 diabetic individuals. RESULTS: The clinical predictors of prevalent diabetes were age, BMI, family history of diabetes, WHR, and triacylglycerol/HDL-cholesterol ratio. After adjustment for these variables, the risk of diabetes was 2.7 (95% CI 1.8-4.0, p = 0.000006) for individuals with a genetic score within the top quintile, compared with the bottom quintile. Adding the genetic score to the clinical covariates improved the area under the receiver operating characteristic curve slightly (from 0.86 to 0.87), yet significantly (p = 0.002). BMI was similar in these two extreme quintiles. CONCLUSIONS/INTERPRETATION: In this population, a simple weighted 15 SNP-based genetic score provides additional information over clinical predictors of prevalent diabetes. At this stage, however, the clinical benefit of this genetic information is limited.
Resumo:
Metabolic traits are molecular phenotypes that can drive clinical phenotypes and may predict disease progression. Here, we report results from a metabolome- and genome-wide association study on (1)H-NMR urine metabolic profiles. The study was conducted within an untargeted approach, employing a novel method for compound identification. From our discovery cohort of 835 Caucasian individuals who participated in the CoLaus study, we identified 139 suggestively significant (P<5×10(-8)) and independent associations between single nucleotide polymorphisms (SNP) and metabolome features. Fifty-six of these associations replicated in the TasteSensomics cohort, comprising 601 individuals from São Paulo of vastly diverse ethnic background. They correspond to eleven gene-metabolite associations, six of which had been previously identified in the urine metabolome and three in the serum metabolome. Our key novel findings are the associations of two SNPs with NMR spectral signatures pointing to fucose (rs492602, P = 6.9×10(-44)) and lysine (rs8101881, P = 1.2×10(-33)), respectively. Fine-mapping of the first locus pinpointed the FUT2 gene, which encodes a fucosyltransferase enzyme and has previously been associated with Crohn's disease. This implicates fucose as a potential prognostic disease marker, for which there is already published evidence from a mouse model. The second SNP lies within the SLC7A9 gene, rare mutations of which have been linked to severe kidney damage. The replication of previous associations and our new discoveries demonstrate the potential of untargeted metabolomics GWAS to robustly identify molecular disease markers.
Resumo:
BACKGROUND & AIMS: Hepatitis C virus (HCV) induces chronic infection in 50% to 80% of infected persons; approximately 50% of these do not respond to therapy. We performed a genome-wide association study to screen for host genetic determinants of HCV persistence and response to therapy. METHODS: The analysis included 1362 individuals: 1015 with chronic hepatitis C and 347 who spontaneously cleared the virus (448 were coinfected with human immunodeficiency virus [HIV]). Responses to pegylated interferon alfa and ribavirin were assessed in 465 individuals. Associations between more than 500,000 single nucleotide polymorphisms (SNPs) and outcomes were assessed by multivariate logistic regression. RESULTS: Chronic hepatitis C was associated with SNPs in the IL28B locus, which encodes the antiviral cytokine interferon lambda. The rs8099917 minor allele was associated with progression to chronic HCV infection (odds ratio [OR], 2.31; 95% confidence interval [CI], 1.74-3.06; P = 6.07 x 10(-9)). The association was observed in HCV mono-infected (OR, 2.49; 95% CI, 1.64-3.79; P = 1.96 x 10(-5)) and HCV/HIV coinfected individuals (OR, 2.16; 95% CI, 1.47-3.18; P = 8.24 x 10(-5)). rs8099917 was also associated with failure to respond to therapy (OR, 5.19; 95% CI, 2.90-9.30; P = 3.11 x 10(-8)), with the strongest effects in patients with HCV genotype 1 or 4. This risk allele was identified in 24% of individuals with spontaneous HCV clearance, 32% of chronically infected patients who responded to therapy, and 58% who did not respond (P = 3.2 x 10(-10)). Resequencing of IL28B identified distinct haplotypes that were associated with the clinical phenotype. CONCLUSIONS: The association of the IL28B locus with natural and treatment-associated control of HCV indicates the importance of innate immunity and interferon lambda in the pathogenesis of HCV infection.