54 results for ALLELE FREQUENCIES
at Université de Lausanne, Switzerland
Abstract:
Next-generation sequencing (NGS) technologies have become the standard for data generation in studies of population genomics, such as the 1000 Genomes Project (1000G). However, these techniques are known to be problematic when applied to highly polymorphic genomic regions, such as the human leukocyte antigen (HLA) genes. Because accurate genotype calls and allele frequency estimations are crucial to population genomics analyses, it is important to assess the reliability of NGS data. Here, we evaluate the reliability of genotype calls and allele frequency estimates of the single-nucleotide polymorphisms (SNPs) reported by 1000G (phase I) at five HLA genes (HLA-A, -B, -C, -DRB1, and -DQB1). We take advantage of the availability of HLA Sanger sequencing of 930 of the 1092 1000G samples and use this as a gold standard to benchmark the 1000G data. We document that 18.6% of SNP genotype calls in HLA genes are incorrect and that allele frequencies are estimated with an error greater than ±0.1 at approximately 25% of the SNPs in HLA genes. We found a bias toward overestimation of reference allele frequency for the 1000G data, indicating mapping bias is an important cause of error in frequency estimation in this dataset. We provide a list of sites that have poor allele frequency estimates and discuss the outcomes of including those sites in different kinds of analyses. Because the HLA region is the most polymorphic in the human genome, our results provide insights into the challenges of using NGS data in other genomic regions of high diversity.
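The benchmarking idea in this abstract (comparing NGS genotype calls against a gold standard at each SNP) can be illustrated with a minimal sketch. The function names, the 0/1/2 alternate-allele coding, and the toy genotypes below are assumptions made for illustration; they are not the authors' pipeline or data.

```python
def allele_frequency(genotypes):
    """Alternate-allele frequency from diploid genotypes coded as
    alternate-allele counts (0, 1 or 2) per sample."""
    return sum(genotypes) / (2 * len(genotypes))


def benchmark_site(test_calls, gold_calls):
    """Return (fraction of discordant genotype calls, signed error of the
    alternate-allele frequency) for one SNP, test set versus gold standard."""
    if not gold_calls or len(test_calls) != len(gold_calls):
        raise ValueError("call sets must be non-empty and of equal length")
    mismatches = sum(t != g for t, g in zip(test_calls, gold_calls))
    error = allele_frequency(test_calls) - allele_frequency(gold_calls)
    return mismatches / len(gold_calls), error


# Hypothetical calls for eight samples at a single SNP.
gold = [0, 1, 2, 1, 0, 2, 1, 1]  # e.g. Sanger-based calls
ngs = [0, 0, 2, 1, 0, 1, 1, 0]   # e.g. NGS-based calls
discordance, freq_error = benchmark_site(ngs, gold)
print(discordance, freq_error)
```

In this toy case the alternate-allele frequency is underestimated, which is equivalent to overestimating the reference allele frequency, the direction of bias reported in the abstract.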
Abstract:
Population genetic differentiation characterizes the distribution of alleles among populations. It is commonly thought that genetic differentiation measures, such as GST and D, should be near zero when allele frequencies are close to their expected value in panmictic populations, and close to one when they are close to their expected value in isolated populations. To analyse those properties, we first derive analytically a reference function f of known parameters that describes how close important features of genetic differentiation (e.g. gene diversity, proportion of private alleles, frequency of the most common allele) are to their expected values under panmixia and isolation. We find that the behaviour of function f differs according to three distinct mutation regimes defined by the scaled mutation rate and the number of populations. Then, we compare GST and D to f, and demonstrate that their signal of differentiation strongly depends on the mutation regime. In particular, we show that D captures the variations of genetic diversity well when mutation is weak; otherwise, it overestimates them when panmixia is not met. GST detects population differentiation when mutation is intermediate but has a low sensitivity to the variations of genetic diversity when mutation is weak. When mutation is strong, the domains of sensitivity of both measures are altered. Finally, we also point out the importance of the number of populations on genetic differentiation measures, and provide recommendations for the use of GST and D.
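To make the two measures concrete, here is a minimal sketch that assumes GST and D refer to Nei's GST and Jost's D, computed from known population allele frequencies with equal population weights and no sampling-bias correction. The function names and example frequencies are hypothetical and are not taken from the paper.

```python
def heterozygosity(freqs):
    """Expected heterozygosity (gene diversity): 1 - sum of squared frequencies."""
    return 1.0 - sum(p * p for p in freqs)


def gst_and_d(pops):
    """pops: one allele-frequency list per population, all over the same alleles.
    Returns (Nei's G_ST, Jost's D) without sampling corrections."""
    n = len(pops)
    if n < 2:
        raise ValueError("need at least two populations")
    h_s = sum(heterozygosity(p) for p in pops) / n           # mean within-population diversity
    mean_freqs = [sum(col) / n for col in zip(*pops)]         # pooled allele frequencies
    h_t = heterozygosity(mean_freqs)                          # total diversity
    if h_t == 0.0:
        raise ValueError("locus is monomorphic; measures are undefined")
    gst = (h_t - h_s) / h_t
    d = (h_t - h_s) / (1.0 - h_s) * n / (n - 1)
    return gst, d


# Two populations fixed for different alleles: both measures equal 1.
gst_fixed, d_fixed = gst_and_d([[1.0, 0.0], [0.0, 1.0]])

# Two diverse populations sharing no alleles (four private alleles each):
# D stays at 1 while G_ST is pulled down by high within-population diversity.
pop_a = [0.25] * 4 + [0.0] * 4
pop_b = [0.0] * 4 + [0.25] * 4
gst_private, d_private = gst_and_d([pop_a, pop_b])
print(gst_fixed, d_fixed, gst_private, d_private)
```

The second example shows why the two measures can disagree when within-population diversity is high, which is the kind of regime dependence the abstract discusses.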
Abstract:
Major depressive disorder (MDD) is a highly prevalent disorder with substantial heritability. Heritability has been shown to be substantial and higher in the variant of MDD characterized by recurrent episodes of depression. Genetic studies have thus far failed to identify clear and consistent evidence of genetic risk factors for MDD. We conducted a genome-wide association study (GWAS) in two independent datasets. The first GWAS was performed on 1022 recurrent MDD patients and 1000 controls genotyped on the Illumina 550 platform. The second was conducted on 492 recurrent MDD patients and 1052 controls selected from a population-based collection, genotyped on the Affymetrix 5.0 platform. Neither GWAS identified any SNP that achieved GWAS significance. We obtained imputed genotypes at the Illumina loci for the individuals genotyped on the Affymetrix platform, and performed a meta-analysis of the two GWASs for this common set of approximately half a million SNPs. The meta-analysis did not yield genome-wide significant results either. The results from our study suggest that SNPs with substantial odds ratio are unlikely to exist for MDD, at least in our datasets and among the relatively common SNPs genotyped or tagged by the half-million-loci arrays. Meta-analysis of larger datasets is warranted to identify SNPs with smaller effects or with rarer allele frequencies that contribute to the risk of MDD.
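The abstract does not state which meta-analysis model was used. As an illustration only, the sketch below combines per-study effect estimates for one SNP with an inverse-variance weighted fixed-effect model, a common choice for GWAS meta-analysis. The function name and the input values (log odds ratios and standard errors) are hypothetical.

```python
import math


def fixed_effect_meta(betas, ses):
    """Inverse-variance weighted fixed-effect meta-analysis for one SNP.
    betas: per-study effect estimates (e.g. log odds ratios)
    ses:   their standard errors
    Returns (pooled beta, pooled standard error, two-sided p-value)."""
    weights = [1.0 / se ** 2 for se in ses]
    total = sum(weights)
    beta = sum(w * b for w, b in zip(weights, betas)) / total
    se = math.sqrt(1.0 / total)
    z = beta / se
    p = math.erfc(abs(z) / math.sqrt(2.0))  # two-sided normal tail probability
    return beta, se, p


# Hypothetical estimates for the same SNP in two studies.
beta, se, p = fixed_effect_meta([0.10, 0.10], [0.05, 0.05])
genome_wide_significant = p < 5e-8  # conventional GWAS threshold
print(beta, se, p, genome_wide_significant)
```

With these made-up inputs the pooled estimate is nominally significant but far from the genome-wide threshold, which mirrors the situation where neither study nor their combination reaches genome-wide significance.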
Abstract:
The male-to-female sex ratio at birth is constant across world populations with an average of 1.06 (106 male to 100 female live births) for populations of European descent. The sex ratio is considered to be affected by numerous biological and environmental factors and to have a heritable component. The aim of this study was to investigate whether common alleles of modest effect at autosomal and X-chromosome variants could explain the observed sex ratio at birth. We conducted a large-scale genome-wide association scan (GWAS) meta-analysis across 51 studies, comprising overall 114 863 individuals (61 094 women and 53 769 men) of European ancestry and 2 623 828 common (minor allele frequency >0.05) single-nucleotide polymorphisms (SNPs). Allele frequencies were compared between men and women for directly-typed and imputed variants within each study. Forward-time simulations for unlinked, neutral, autosomal, common loci were performed under the demographic model for European populations with a fixed sex ratio and a random mating scheme to assess the probability of detecting significant allele frequency differences. We do not detect any genome-wide significant (P < 5 × 10⁻⁸) common SNP differences between men and women in this well-powered meta-analysis. The simulated data provided results entirely consistent with these findings. This large-scale investigation across ~115 000 individuals shows no detectable contribution from common genetic variants to the observed skew in the sex ratio. The absence of sex-specific differences is useful in guiding genetic association study design, for example when using mixed controls for sex-biased traits.
Abstract:
Chronic exposure to food of low quality may exert conflicting selection pressures on foraging behaviour. On the one hand, more active search behaviour may allow the animal to find patches with slightly better, or more, food; on the other hand, such active foraging is energetically costly, and thus may be opposed by selection for energetic efficiency. Here, we test these alternative hypotheses in Drosophila larvae. We show that populations which experimentally evolved improved tolerance to larval chronic malnutrition have shorter foraging path length than unselected control populations. A behavioural polymorphism in foraging path length (the rover-sitter polymorphism) exists in nature and is attributed to the foraging locus (for). We show that a sitter strain (for(s2)) survives better on the poor food than the rover strain (for(R)), confirming that the sitter foraging strategy is advantageous under malnutrition. Larvae of the selected and control populations did not differ in global for expression. However, a quantitative complementation test suggests that the for locus may have contributed to the adaptation to poor food in one of the selected populations, either through a change in for allele frequencies, or by interacting epistatically with alleles at other loci. Irrespective of its genetic basis, our results provide two independent lines of evidence that sitter-like foraging behaviour is favoured under chronic larval malnutrition.
Abstract:
Arbuscular mycorrhizal fungi (AMF) are highly successful plant symbionts. They reproduce clonally producing multinucleate spores. It has been suggested that some AMF harbor genetically different nuclei. However, recent advances in sequencing the Glomus irregulare genome have indicated very low within-fungus polymorphism. We tested the null hypothesis that, with no genetic differences among nuclei, no significant genetic or phenotypic variation would occur among clonal single spore lines generated from one initial AMF spore. Furthermore, no additional variation would be expected in the following generations of single spore lines. Genetic diversity contained in one initial spore repeatedly gave rise to genetically different variants of the fungus with novel phenotypes. The genetic changes represented quantitative changes in allele frequencies, most probably as a result of changes in the frequency of genetic variation partitioned on different nuclei. The genetic and phenotypic variation is remarkable, given that it arose repeatedly from one clonal individual. Our results highlight the dynamic nature of AMF genetics. Even though within-fungus genetic variation is low, some is probably partitioned among nuclei and potentially causes changes in the phenotype. Our results are important for understanding AMF genetics, as well as for researchers and biotechnologists hoping to use AMF genetic diversity for the improvement of AMF inoculum.
Abstract:
We improved, evaluated, and used Sanger sequencing for quantification of single nucleotide polymorphism (SNP) variants in transcripts and gDNA samples. This improved assay resulted in highly reproducible relative allele frequencies (e.g., for a heterozygous gDNA 50.0 ± 1.4%, and for a missense mutation-bearing transcript 46.9 ± 3.7%) with a lower detection limit of 3-9%. It provided excellent accuracy and linear correlation between expected and observed relative allele frequencies. This sequencing assay, which can also be used for the quantification of copy number variations (CNVs), methylations, mosaicisms, and DNA pools, enabled us to analyze transcripts of the FBN1 gene in fibroblasts and blood samples of patients with suspected Marfan syndrome not only qualitatively but also quantitatively. We report a total of 18 novel and 19 known FBN1 sequence variants leading to a premature termination codon (PTC), 26 of which we analyzed by quantitative sequencing both at gDNA and cDNA levels. The relative amounts of PTC-containing FBN1 transcripts in fresh and PAXgene-stabilized blood samples were significantly higher (33.0 ± 3.9% to 80.0 ± 7.2%) than those detected in affected fibroblasts with inhibition of nonsense-mediated mRNA decay (NMD) (11.0 ± 2.1% to 25.0 ± 1.8%), whereas in fibroblasts without NMD inhibition no mutant alleles could be detected. These results provide evidence for incomplete NMD in leukocytes and have particular importance for RNA-based analyses not only in FBN1 but also in other genes.
Abstract:
Sib matings increase homozygosity and, hence, the frequency of detrimental phenotypes caused by recessive deleterious alleles. However, many species have evolved adaptations that prevent the genetic costs associated with inbreeding. We discovered that the highly invasive longhorn crazy ant, Paratrechina longicornis, has evolved an unusual mode of reproduction whereby sib mating does not result in inbreeding. A population genetic study of P. longicornis revealed dramatic differences in allele frequencies between queens, males and workers. Mother-offspring analyses demonstrated that these allele frequency differences resulted from the fact that the three castes were all produced through different means. Workers developed through normal sexual reproduction between queens and males. However, queens were produced clonally and, thus, were genetically identical to their mothers. In contrast, males never inherited maternal alleles and were genetically identical to their fathers. The outcome of this system is that genetic inbreeding is impossible because queen and male genomes remain completely separate. Moreover, the sexually produced worker offspring retain the same genotype, combining alleles from both the maternal and paternal lineage over generations. Thus, queens may mate with their brothers in the parental nest, yet their offspring are no more homozygous than if the queen mated with a male randomly chosen from the population. The complete segregation of the male and female gene pools allows the queens to circumvent the costs associated with inbreeding and therefore may act as an important pre-adaptation for the crazy ant's tremendous invasive success.
Abstract:
Allele frequencies and forensically relevant population statistics of 16 STR loci, including the new European Standard Set (ESS) loci, were estimated from 668 unrelated individuals of Caucasian appearance living in different parts of Switzerland. The samples were amplified with a combination of the following three kits: AmpFlSTR® NGM SElect™, PowerPlex® ESI 17 and PowerPlex® ESX 17. All loci were highly polymorphic and no significant departure from Hardy-Weinberg equilibrium or linkage equilibrium was detected after correction for sampling.
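As a minimal illustration of how allele frequencies and basic diversity statistics are derived from genotype data at one STR locus, the sketch below computes allele frequencies, observed heterozygosity, and the heterozygosity expected under Hardy-Weinberg proportions. The function name and the four genotypes are hypothetical; the study's own statistics and tests are not reproduced here.

```python
from collections import Counter


def locus_summary(genotypes):
    """genotypes: list of (allele, allele) pairs for one STR locus.
    Returns (allele frequencies, observed heterozygosity,
    heterozygosity expected under Hardy-Weinberg proportions)."""
    n = len(genotypes)
    if n == 0:
        raise ValueError("no genotypes supplied")
    counts = Counter(allele for pair in genotypes for allele in pair)
    freqs = {allele: c / (2 * n) for allele, c in counts.items()}
    h_obs = sum(a != b for a, b in genotypes) / n
    h_exp = 1.0 - sum(p * p for p in freqs.values())
    return freqs, h_obs, h_exp


# Hypothetical repeat-number genotypes for four individuals.
sample = [(12, 13), (12, 12), (13, 14), (12, 14)]
freqs, h_obs, h_exp = locus_summary(sample)
print(freqs, h_obs, h_exp)
```

A large gap between observed and expected heterozygosity across many individuals would be one informal hint of departure from Hardy-Weinberg equilibrium; a formal assessment would use an appropriate statistical test.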
Abstract:
OBJECTIVE: Inflammatory bowel diseases (IBDs), namely Crohn's disease and ulcerative colitis (UC), are multifactorial disorders characterized by chronic inflammation of the intestine. A number of genetic components have been proposed to contribute to IBD pathogenesis. In this case-control study, we investigated the association between two common vitamin D-binding protein (DBP) genetic variants and IBD susceptibility. These two single nucleotide polymorphisms (SNPs) in exon 11 of the DBP gene, at codons 416 (GAT>GAG; Asp>Glu) and 420 (ACG>AAG; Thr>Lys), have been previously suggested to play roles in the etiology of other autoimmune diseases. METHODS: Using TaqMan SNP technology, we have genotyped 884 individuals (636 IBD cases and 248 non-IBD controls) for the two DBP variants. RESULTS: On statistical analysis, we observed that the DBP 420 variant Lys is less frequent in IBD cases than in non-IBD controls (allele frequencies, P=0.034; homozygous carrier genotype frequencies, P=0.006). This inverse association between the DBP 420 Lys and the disease remained significant when non-IBD participants were compared with UC (homozygous carrier genotype frequencies, P=0.022) or Crohn's disease (homozygous carrier genotype frequencies, P=0.016) patients separately. Although the DBP position 416 alone was not found to be significantly associated with IBD, the haplotype DBP_2, consisting of 416 Asp and 420 Lys, was more frequent in the non-IBD population, most notably when compared with the UC group (Odds ratio, 4.390). CONCLUSION: Our study adds DBP to the list of potential genes that contribute to the complex genetic etiology of IBD, and further emphasizes the association between vitamin D homeostasis and intestinal inflammation.
Abstract:
BACKGROUND: Three non-synonymous single nucleotide polymorphisms (Q223R, K109R and K656N) of the leptin receptor gene (LEPR) have been tested for association with obesity-related outcomes in multiple studies, showing inconclusive results. We performed a systematic review and meta-analysis on the association of the three LEPR variants with BMI. In addition, we analysed 15 SNPs within the LEPR gene in the CoLaus study, assessing the interaction of the variants with sex. METHODOLOGY/PRINCIPAL FINDINGS: We searched electronic databases, including population-based studies that investigated the association between LEPR variants Q223R, K109R and K656N and obesity-related phenotypes in healthy, unrelated subjects. We furthermore performed meta-analyses of the genotype and allele frequencies in case-control studies. Results were stratified by SNP and by potential effect modifiers. CoLaus data were analysed by logistic and linear regressions and tested for interaction with sex. The meta-analysis of published data did not show an overall association between any of the tested LEPR variants and overweight. However, the choice of a BMI cut-off value to distinguish cases from controls was crucial to explain heterogeneity in Q223R. Differences in allele frequencies across ethnic groups are compatible with natural selection of derived alleles in Q223R and K109R and of the ancestral allele in K656N in Asians. In CoLaus, the rs10128072, rs3790438 and rs3790437 variants showed interaction with sex for their association with overweight, waist circumference and fat mass in linear regressions. CONCLUSIONS: Our systematic review and analysis of primary data from the CoLaus study did not show an overall association between LEPR SNPs and overweight. Most studies were underpowered to detect small effect sizes. A potential effect modification by sex, population stratification, as well as the role of natural selection should be addressed in future genetic association studies.
Abstract:
OBJECTIVE: To report the study of a multigenerational Swiss family with dopa-responsive dystonia (DRD). METHODS: Clinical investigation was made of available family members, including historical and chart reviews. Subject examinations were video recorded. Genetic analysis included a genome-wide linkage study with microsatellite markers (STR), GTP cyclohydrolase I (GCH1) gene sequencing, and dosage analysis. RESULTS: We evaluated 32 individuals, of whom 6 were clinically diagnosed with DRD, with childhood-onset progressive foot dystonia, later generalizing, followed by parkinsonism in the two older patients. The response to levodopa was very good. Two additional patients had late onset dopa-responsive parkinsonism. Three other subjects had DRD symptoms on historical grounds. We found suggestive linkage to the previously reported DYT14 locus, which excluded GCH1. However, further study with more stringent criteria for disease status attribution showed linkage to a larger region, which included GCH1. No mutation was found in GCH1 by gene sequencing but dosage methods identified a novel heterozygous deletion of exons 3 to 6 of GCH1. The mutation was found in seven subjects. One of the patients with dystonia represented a phenocopy. CONCLUSIONS: This study rules out the previously reported DYT14 locus as a cause of disease, as a novel multiexonic deletion was identified in GCH1. This work highlights the necessity of an accurate clinical diagnosis in linkage studies as well as the need for appropriate allele frequencies, penetrance, and phenocopy estimates. Comprehensive sequencing and dosage analysis of known genes is recommended prior to genome-wide linkage analysis.
Abstract:
Identifying adaptive genetic variation is a challenging task, in particular in non-model species for which genomic information is still limited or absent. Here, we studied distribution patterns of amplified fragment length polymorphisms (AFLPs) in response to environmental variation, in 13 alpine plant species consistently sampled across the entire European Alps. Multiple linear regressions were performed between AFLP allele frequencies per site as dependent variables and two categories of independent variables, namely Moran's eigenvector map (MEM) variables (to account for spatial and unaccounted environmental variation, and historical demographic processes) and environmental variables. These associations allowed the identification of 153 loci of ecological relevance. Univariate regressions between allele frequency and each environmental factor further showed that loci of ecological relevance were mainly correlated with MEM variables. We found that precipitation and temperature were the best environmental predictors, whereas topographic factors were rarely involved in environmental associations. Climatic factors, subject to rapid variation as a result of the current global warming, are known to strongly influence the fate of alpine plants. Our study shows, for the first time for a large number of species, that the same environmental variables are drivers of plant adaptation at the scale of a whole biome, here the European Alps.
Abstract:
Occasional XY recombination is a proposed explanation for the sex-chromosome homomorphy in European tree frogs. Numerous laboratory crosses, however, failed to detect any event of male recombination, and a detailed survey of NW-European Hyla arborea populations identified male-specific alleles at sex-linked loci, pointing to the absence of XY recombination in their recent history. Here, we address this paradox in a phylogeographic framework by genotyping sex-linked microsatellite markers in populations and sibships from the entire species range. Contrasting with postglacial populations of NW Europe, which display complete absence of XY recombination and strong sex-chromosome differentiation, refugial populations of the southern Balkans and Adriatic coast show limited XY recombination and large overlaps in allele frequencies. Geographically and historically intermediate populations of the Pannonian Basin show intermediate patterns of XY differentiation. Even in populations where X and Y occasionally recombine, the genetic diversity of Y haplotypes is reduced below the levels expected from the fourfold drop in copy numbers. This study is the first in which X and Y haplotypes could be phased over the distribution range in a species with homomorphic sex chromosomes; it shows that XY-recombination patterns may differ strikingly between conspecific populations, and that recombination arrest may evolve rapidly (<5000 generations).