937 resultados para Single-nucleotide polymorphism
Resumo:
Both polygenicity (many small genetic effects) and confounding biases, such as cryptic relatedness and population stratification, can yield an inflated distribution of test statistics in genome-wide association studies (GWAS). However, current methods cannot distinguish between inflation from a true polygenic signal and bias. We have developed an approach, LD Score regression, that quantifies the contribution of each by examining the relationship between test statistics and linkage disequilibrium (LD). The LD Score regression intercept can be used to estimate a more powerful and accurate correction factor than genomic control. We find strong evidence that polygenicity accounts for the majority of the inflation in test statistics in many GWAS of large sample size.
Resumo:
Shallow population structure is generally reported for most marine fish and explained as a consequence of high dispersal, connectivity and large population size. Targeted gene analyses and more recently genome-wide studies have challenged such view, suggesting that adaptive divergence might occur even when neutral markers provide genetic homogeneity across populations. Here, 381 SNPs located in transcribed regions were used to assess large- and fine-scale population structure in the European hake (Merluccius merluccius), a widely distributed demersal species of high priority for the European fishery. Analysis of 850 individuals from 19 locations across the entire distribution range showed evidence for several outlier loci, with significantly higher resolving power. While 299 putatively neutral SNPs confirmed the genetic break between basins (F(CT) = 0.016) and weak differentiation within basins, outlier loci revealed a dramatic divergence between Atlantic and Mediterranean populations (F(CT) range 0.275-0.705) and fine-scale significant population structure. Outlier loci separated North Sea and Northern Portugal populations from all other Atlantic samples and revealed a strong differentiation among Western, Central and Eastern Mediterranean geographical samples. Significant correlation of allele frequencies at outlier loci with seawater surface temperature and salinity supported the hypothesis that populations might be adapted to local conditions. Such evidence highlights the importance of integrating information from neutral and adaptive evolutionary patterns towards a better assessment of genetic diversity. Accordingly, the generated outlier SNP data could be used for tackling illegal practices in hake fishing and commercialization as well as to develop explicit spatial models for defining management units and stock boundaries.
Resumo:
The introduction of Next Generation Sequencing (NGS) has revolutionised population genetics, providing studies of non-model species with unprecedented genomic coverage, allowing evolutionary biologists to address questions previously far beyond the reach of available resources. Furthermore, the simple mutation model of Single Nucleotide Polymorphisms (SNPs) permits cost-effective high-throughput genotyping in thousands of individuals simultaneously. Genomic resources are scarce for the Atlantic herring (Clupea harengus), a small pelagic species that sustains high revenue fisheries. This paper details the development of 578 SNPs using a combined NGS and high-throughput genotyping approach. Eight individuals covering the species distribution in the eastern Atlantic were bar-coded and multiplexed into a single cDNA library and sequenced using the 454 GS FLX platform. SNP discovery was performed by de novo sequence clustering and contig assembly, followed by the mapping of reads against consensus contig sequences. Selection of candidate SNPs for genotyping was conducted using an in silico approach. SNP validation and genotyping were performed simultaneously using an Illumina 1,536 GoldenGate assay. Although the conversion rate of candidate SNPs in the genotyping assay cannot be predicted in advance, this approach has the potential to maximise cost and time efficiencies by avoiding expensive and time-consuming laboratory stages of SNP validation. Additionally, the in silico approach leads to lower ascertainment bias in the resulting SNP panel as marker selection is based only on the ability to design primers and the predicted presence of intron-exon boundaries. Consequently SNPs with a wider spectrum of minor allele frequencies (MAFs) will be genotyped in the final panel. The genomic resources presented here represent a valuable multi-purpose resource for developing informative marker panels for population discrimination, microarray development and for population genomic studies in the wild.
Resumo:
High gene flow is considered the norm for most marine organisms and is expected to limit their ability to adapt to local environments. Few studies have directly compared the patterns of differentiation at neutral and selected gene loci in marine organisms. We analysed a transcriptome-derived panel of 281 SNPs in Atlantic herring (Clupea harengus), a highly migratory small pelagic fish, for elucidating neutral and selected genetic variation among populations and to identify candidate genes for environmental adaptation. We analysed 607 individuals from 18 spawning locations in the northeast Atlantic, including two temperature clines (5-12 °C) and two salinity clines (5-35‰). By combining genome scan and landscape genetic analyses, four genetically distinct groups of herring were identified: Baltic Sea, Baltic-North Sea transition area, North Sea/British Isles and North Atlantic; notably, samples exhibited divergent clustering patterns for neutral and selected loci. We found statistically strong evidence for divergent selection at 16 outlier loci on a global scale, and significant correlations with temperature and salinity at nine loci. On regional scales, we identified two outlier loci with parallel patterns across temperature clines and five loci associated with temperature in the North Sea/North Atlantic. Likewise, we found seven replicated outliers, of which five were significantly associated with low salinity across both salinity clines. Our results reveal a complex pattern of varying spatial genetic variation among outlier loci, likely reflecting adaptations to local environments. In addition to disclosing the fine scale of local adaptation in a highly vagile species, our data emphasize the need to preserve functionally important biodiversity.
Resumo:
The growing accessibility to genomic resources using next-generation sequencing (NGS) technologies has revolutionized the application of molecular genetic tools to ecology and evolutionary studies in non-model organisms. Here we present the case study of the European hake (Merluccius merluccius), one of the most important demersal resources of European fisheries. Two sequencing platforms, the Roche 454 FLX (454) and the Illumina Genome Analyzer (GAII), were used for Single Nucleotide Polymorphisms (SNPs) discovery in the hake muscle transcriptome. De novo transcriptome assembly into unique contigs, annotation, and in silico SNP detection were carried out in parallel for 454 and GAII sequence data. High-throughput genotyping using the Illumina GoldenGate assay was performed for validating 1,536 putative SNPs. Validation results were analysed to compare the performances of 454 and GAII methods and to evaluate the role of several variables (e.g. sequencing depth, intron-exon structure, sequence quality and annotation). Despite well-known differences in sequence length and throughput, the two approaches showed similar assay conversion rates (approximately 43%) and percentages of polymorphic loci (67.5% and 63.3% for GAII and 454, respectively). Both NGS platforms therefore demonstrated to be suitable for large scale identification of SNPs in transcribed regions of non-model species, although the lack of a reference genome profoundly affects the genotyping success rate. The overall efficiency, however, can be improved using strict quality and filtering criteria for SNP selection (sequence quality, intron-exon structure, target region score).
Resumo:
Recent improvements in the speed, cost and accuracy of next generation sequencing are revolutionizing the discovery of single nucleotide polymorphisms (SNPs). SNPs are increasingly being used as an addition to the molecular ecology toolkit in nonmodel organisms, but their efficient use remains challenging. Here, we discuss common issues when employing SNP markers, including the high numbers of markers typically employed, the effects of ascertainment bias and the inclusion of nonneutral loci in a marker panel. We provide a critique of considerations specifically associated with the application and population genetic analysis of SNPs in nonmodel taxa, focusing specifically on some of the most commonly applied methods.
Resumo:
Schizophrenia is a heritable brain illness with unknown pathogenic mechanisms. Schizophrenia's strongest genetic association at a population level involves variation in the major histocompatibility complex (MHC) locus, but the genes and molecular mechanisms accounting for this have been challenging to identify. Here we show that this association arises in part from many structurally diverse alleles of the complement component 4 (C4) genes. We found that these alleles generated widely varying levels of C4A and C4B expression in the brain, with each common C4 allele associating with schizophrenia in proportion to its tendency to generate greater expression of C4A. Human C4 protein localized to neuronal synapses, dendrites, axons, and cell bodies. In mice, C4 mediated synapse elimination during postnatal development. These results implicate excessive complement activity in the development of schizophrenia and may help explain the reduced numbers of synapses in the brains of individuals with schizophrenia.
Resumo:
Restriction site-associated DNA sequencing (RADseq) provides researchers with the ability to record genetic polymorphism across thousands of loci for nonmodel organisms, potentially revolutionizing the field of molecular ecology. However, as with other genotyping methods, RADseq is prone to a number of sources of error that may have consequential effects for population genetic inferences, and these have received only limited attention in terms of the estimation and reporting of genotyping error rates. Here we use individual sample replicates, under the expectation of identical genotypes, to quantify genotyping error in the absence of a reference genome. We then use sample replicates to (i) optimize de novo assembly parameters within the program Stacks, by minimizing error and maximizing the retrieval of informative loci; and (ii) quantify error rates for loci, alleles and single-nucleotide polymorphisms. As an empirical example, we use a double-digest RAD data set of a nonmodel plant species, Berberis alpina, collected from high-altitude mountains in Mexico.
Resumo:
The first extensive catalog of structural human variation was recently released. It showed that large stretches of genomic DNA that vary considerably in copy number were extremely abundant. Thus it is conceivable that they play a major role in functional variation. Consistently, genomic insertions and deletions were shown to contribute to phenotypic differences by modifying not only the expression levels of genes within the aneuploid segments but also of normal copy-number neighboring genes. In this report, we review the possible mechanisms behind this latter effect.
Resumo:
BACKGROUND & AIMS: Hepatitis C virus (HCV) induces chronic infection in 50% to 80% of infected persons; approximately 50% of these do not respond to therapy. We performed a genome-wide association study to screen for host genetic determinants of HCV persistence and response to therapy. METHODS: The analysis included 1362 individuals: 1015 with chronic hepatitis C and 347 who spontaneously cleared the virus (448 were coinfected with human immunodeficiency virus [HIV]). Responses to pegylated interferon alfa and ribavirin were assessed in 465 individuals. Associations between more than 500,000 single nucleotide polymorphisms (SNPs) and outcomes were assessed by multivariate logistic regression. RESULTS: Chronic hepatitis C was associated with SNPs in the IL28B locus, which encodes the antiviral cytokine interferon lambda. The rs8099917 minor allele was associated with progression to chronic HCV infection (odds ratio [OR], 2.31; 95% confidence interval [CI], 1.74-3.06; P = 6.07 x 10(-9)). The association was observed in HCV mono-infected (OR, 2.49; 95% CI, 1.64-3.79; P = 1.96 x 10(-5)) and HCV/HIV coinfected individuals (OR, 2.16; 95% CI, 1.47-3.18; P = 8.24 x 10(-5)). rs8099917 was also associated with failure to respond to therapy (OR, 5.19; 95% CI, 2.90-9.30; P = 3.11 x 10(-8)), with the strongest effects in patients with HCV genotype 1 or 4. This risk allele was identified in 24% of individuals with spontaneous HCV clearance, 32% of chronically infected patients who responded to therapy, and 58% who did not respond (P = 3.2 x 10(-10)). Resequencing of IL28B identified distinct haplotypes that were associated with the clinical phenotype. CONCLUSIONS: The association of the IL28B locus with natural and treatment-associated control of HCV indicates the importance of innate immunity and interferon lambda in the pathogenesis of HCV infection.
Resumo:
β-Arrestin2 (ARRB2) is a component of the G-protein-coupled receptor complex and is involved in μ-opioid and dopamine D(2) receptor signaling, two central processes in methadone signal transduction. We analyzed 238 patients in methadone maintenance treatment (MMT) and identified a haplotype block (rs34230287, rs3786047, rs1045280 and rs2036657) spanning almost the entire ARRB2 locus. Although none of these single nucleotide polymorphisms (SNPs) leads to a change in amino-acid sequence, we found that for all the SNPs analyzed, with exception of rs34230287, homozygosity for the variant allele confers a nonresponding phenotype (n=73; rs1045280C and rs2036657G: OR=3.1, 95% CI=1.5-6.3, P=0.004; rs3786047A: OR=2.5, 95% CI=1.2-5.1, P=0.02) also illustrated by a 12-fold shorter period of negative urine screening (P=0.01). The ARRB2 genotype may thus contribute to the interindividual variability in the response to MMT and help to predict response to treatment.
Resumo:
Complete achromatopsia is a rare autosomal recessive disease associated with CNGA3, CNGB3, GNAT2 and PDE6C mutations. This retinal disorder is characterized by complete loss of color discrimination due to the absence or alteration of the cones function. The purpose of the present study was the clinical and the genetic characterization of achromatopsia in a large consanguineous Tunisian family. Ophthalmic evaluation included a full clinical examination, color vision testing and electroretinography. Linkage analysis using microsatellite markers flanking CNGA3, CNGB3, GNAT2 and PDE6C genes was performed. Mutations were screened by direct sequencing. A total of 12 individuals were diagnosed with congenital complete achromatopsia. They are members of six nuclear consanguineous families belonging to the same large consanguineous family. Linkage analysis revealed linkage to GNAT2. Mutational screening of GNAT2 revealed three intronic variations c.119-69G>C, c.161+66A>T and c.875-31G>C that co-segregated with a novel mutation p.R313X. An identical GNAT2 haplotype segregating with this mutation was identified, indicating a founder mutation. All patients were homozygous for the p.R313X mutation. This is the first report of the clinical and genetic investigation of complete achromatopsia in North Africa and the largest family with recessive achromatopsia involving GNAT2; thus, providing a unique opportunity for genotype-phenotype correlation for this extremely rare condition.
Resumo:
BACKGROUND: The FTO gene harbors the strongest known susceptibility locus for obesity. While many individual studies have suggested that physical activity (PA) may attenuate the effect of FTO on obesity risk, other studies have not been able to confirm this interaction. To confirm or refute unambiguously whether PA attenuates the association of FTO with obesity risk, we meta-analyzed data from 45 studies of adults (n = 218,166) and nine studies of children and adolescents (n = 19,268). METHODS AND FINDINGS: All studies identified to have data on the FTO rs9939609 variant (or any proxy [r(2)>0.8]) and PA were invited to participate, regardless of ethnicity or age of the participants. PA was standardized by categorizing it into a dichotomous variable (physically inactive versus active) in each study. Overall, 25% of adults and 13% of children were categorized as inactive. Interaction analyses were performed within each study by including the FTO×PA interaction term in an additive model, adjusting for age and sex. Subsequently, random effects meta-analysis was used to pool the interaction terms. In adults, the minor (A-) allele of rs9939609 increased the odds of obesity by 1.23-fold/allele (95% CI 1.20-1.26), but PA attenuated this effect (p(interaction) = 0.001). More specifically, the minor allele of rs9939609 increased the odds of obesity less in the physically active group (odds ratio = 1.22/allele, 95% CI 1.19-1.25) than in the inactive group (odds ratio = 1.30/allele, 95% CI 1.24-1.36). No such interaction was found in children and adolescents. CONCLUSIONS: The association of the FTO risk allele with the odds of obesity is attenuated by 27% in physically active adults, highlighting the importance of PA in particular in those genetically predisposed to obesity.
Resumo:
Hypertension is a common, modifiable and heritable cardiovascular risk factor. Some rare monogenic forms of hypertension have been described, but the majority of patients suffer from "essential" hypertension, for whom the underlying pathophysiological mechanism is not clear. Essential hypertension is a complex trait, involving multiple genes and environmental factors. Recently, progress in the identification of common genetic variants associated with blood pressure and hypertension has been made thanks to large-scale international collaborative projects involving geneticists, epidemiologists, statisticians and clinicians. In this article, we review some basic genetic concepts and the main research methods used to study the genetics of hypertension, as well as selected recent findings in this field.