8 results for Allele frequency data
at Université de Lausanne, Switzerland
Abstract:
Next-generation sequencing (NGS) technologies have become the standard for data generation in studies of population genomics, such as the 1000 Genomes Project (1000G). However, these techniques are known to be problematic when applied to highly polymorphic genomic regions, such as the human leukocyte antigen (HLA) genes. Because accurate genotype calls and allele frequency estimations are crucial to population genomics analyses, it is important to assess the reliability of NGS data. Here, we evaluate the reliability of genotype calls and allele frequency estimates of the single-nucleotide polymorphisms (SNPs) reported by 1000G (phase I) at five HLA genes (HLA-A, -B, -C, -DRB1, and -DQB1). We take advantage of the availability of HLA Sanger sequencing of 930 of the 1092 1000G samples and use this as a gold standard to benchmark the 1000G data. We document that 18.6% of SNP genotype calls in HLA genes are incorrect and that allele frequencies are estimated with an error greater than ±0.1 at approximately 25% of the SNPs in HLA genes. We found a bias toward overestimation of reference allele frequency for the 1000G data, indicating mapping bias is an important cause of error in frequency estimation in this dataset. We provide a list of sites that have poor allele frequency estimates and discuss the outcomes of including those sites in different kinds of analyses. Because the HLA region is the most polymorphic in the human genome, our results provide insights into the challenges of using NGS data in other genomic regions of high diversity.
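The kind of benchmark described in this abstract can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the authors' pipeline: genotypes are coded as the number of reference alleles (0, 1 or 2), the function names `ref_allele_freq` and `benchmark` are hypothetical, and the two toy SNPs are made up rather than taken from 1000G.

```python
# Toy benchmark of NGS genotype calls against a gold standard.
# Genotypes are coded as the number of reference alleles (0, 1 or 2).
# All names and data here are hypothetical, for illustration only.

def ref_allele_freq(genotypes):
    """Reference allele frequency from diploid genotypes coded 0/1/2."""
    return sum(genotypes) / (2 * len(genotypes))

def benchmark(ngs, gold, threshold=0.1):
    """Compare calls per SNP; both dicts map a SNP id to a list of 0/1/2."""
    mismatches = 0
    total = 0
    freq_errors = {}
    for snp, gold_calls in gold.items():
        ngs_calls = ngs[snp]
        mismatches += sum(a != b for a, b in zip(ngs_calls, gold_calls))
        total += len(gold_calls)
        # A positive error means NGS overestimates the reference allele.
        freq_errors[snp] = ref_allele_freq(ngs_calls) - ref_allele_freq(gold_calls)
    # Sites whose frequency error exceeds the threshold (0.1 in the abstract).
    poor_sites = [s for s, e in freq_errors.items() if abs(e) > threshold]
    return mismatches / total, freq_errors, poor_sites

# Made-up calls for four samples at two SNPs.
gold = {"snp1": [0, 1, 1, 2], "snp2": [0, 0, 1, 2]}
ngs = {"snp1": [0, 1, 1, 2], "snp2": [1, 1, 2, 2]}

discordance, errors, poor = benchmark(ngs, gold)
print(discordance, errors, poor)
```

Keeping the sign of the frequency error, rather than only its magnitude, is what would reveal a systematic bias toward the reference allele of the kind the abstract reports.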
Abstract:
BACKGROUND: LDL cholesterol has a causal role in the development of cardiovascular disease. Improved understanding of the biological mechanisms that underlie the metabolism and regulation of LDL cholesterol might help to identify novel therapeutic targets. We therefore did a genome-wide association study of LDL-cholesterol concentrations. METHODS: We used genome-wide association data from up to 11,685 participants with measures of circulating LDL-cholesterol concentrations across five studies, including data for 293,461 autosomal single nucleotide polymorphisms (SNPs) with a minor allele frequency of 5% or more that passed our quality control criteria. We also used data from a second genome-wide array in up to 4337 participants from three of these five studies, with data for 290,140 SNPs. We did replication studies in two independent populations consisting of up to 4979 participants. Statistical approaches, including meta-analysis and linkage disequilibrium plots, were used to refine association signals; we analysed pooled data from all seven populations to determine the effect of each SNP on variations in circulating LDL-cholesterol concentrations. FINDINGS: In our initial scan, we found two SNPs (rs599839 [p = 1.7 × 10^-15] and rs4970834 [p = 3.0 × 10^-11]) that showed genome-wide statistical association with LDL cholesterol at chromosomal locus 1p13.3. The second genome screen found a third statistically associated SNP at the same locus (rs646776 [p = 4.3 × 10^-9]). Meta-analysis of data from all studies showed an association of SNPs rs599839 (combined p = 1.2 × 10^-33) and rs646776 (p = 4.8 × 10^-20) with LDL-cholesterol concentrations. SNPs rs599839 and rs646776 both explained around 1% of the variation in circulating LDL-cholesterol concentrations and were associated with about 15% of an SD change in LDL cholesterol per allele, assuming an SD of 1 mmol/L. INTERPRETATION: We found evidence for a novel locus for LDL cholesterol on chromosome 1p13.3. These results potentially provide insight into the biological mechanisms that underlie the regulation of LDL cholesterol and might help in the discovery of novel therapeutic targets for cardiovascular disease.
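The two effect-size figures in this abstract (about 15% of an SD per allele, around 1% of variance explained) are linked by a standard relation. The sketch below assumes an additive model and Hardy-Weinberg proportions, under which the variance explained is approximately 2p(1 − p)β². The abstract does not report the allele frequency, so the value `freq=0.2` is a hypothetical placeholder, and `variance_explained` is an illustrative name.

```python
def variance_explained(freq, beta_sd):
    """Approximate fraction of trait variance explained by one biallelic SNP.

    Assumes an additive model and Hardy-Weinberg proportions, so the variance
    of the allele count is 2 * freq * (1 - freq). beta_sd is the per-allele
    effect expressed in units of the trait's standard deviation.
    """
    return 2 * freq * (1 - freq) * beta_sd ** 2

# beta_sd = 0.15 is taken from the abstract; freq = 0.2 is a hypothetical
# allele frequency chosen only to illustrate the order of magnitude.
r2 = variance_explained(freq=0.2, beta_sd=0.15)
print(f"{r2:.4f}")
```

With these inputs the result is a little under 1%, the same order of magnitude as the figure quoted in the abstract.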
Abstract:
To identify previously unknown genetic loci associated with fasting glucose concentrations, we examined the leading association signals in ten genome-wide association scans involving a total of 36,610 individuals of European descent. Variants in the gene encoding melatonin receptor 1B (MTNR1B) were consistently associated with fasting glucose across all ten studies. The strongest signal was observed at rs10830963, where each G allele (frequency 0.30 in HapMap CEU) was associated with an increase of 0.07 (95% CI = 0.06-0.08) mmol/l in fasting glucose levels (P = 3.2 × 10^-50) and reduced beta-cell function as measured by homeostasis model assessment (HOMA-B, P = 1.1 × 10^-15). The same allele was associated with an increased risk of type 2 diabetes (odds ratio = 1.09 (1.05-1.12), per G allele P = 3.3 × 10^-7) in a meta-analysis of 13 case-control studies totaling 18,236 cases and 64,453 controls. Our analyses also confirm previous associations of fasting glucose with variants at the G6PC2 (rs560887, P = 1.1 × 10^-57) and GCK (rs4607517, P = 1.0 × 10^-25) loci.
Abstract:
Identifying adaptive genetic variation is a challenging task, particularly in non-model species for which genomic information is still limited or absent. Here, we studied distribution patterns of amplified fragment length polymorphisms (AFLPs) in response to environmental variation, in 13 alpine plant species consistently sampled across the entire European Alps. Multiple linear regressions were performed between AFLP allele frequencies per site as dependent variables and two categories of independent variables, namely Moran's eigenvector map (MEM) variables (to account for spatial and unaccounted environmental variation, and historical demographic processes) and environmental variables. These associations allowed the identification of 153 loci of ecological relevance. Univariate regressions between allele frequency and each environmental factor further showed that loci of ecological relevance were mainly correlated with MEM variables. We found that precipitation and temperature were the best environmental predictors, whereas topographic factors were rarely involved in environmental associations. Climatic factors, subject to rapid variation as a result of the current global warming, are known to strongly influence the fate of alpine plants. Our study shows, for the first time for a large number of species, that the same environmental variables are drivers of plant adaptation at the scale of a whole biome, here the European Alps.
Abstract:
Seizures associated with fever are a common pediatric problem, affecting about 2-7% of children between 3 months and 5 years of age. Differentiation of febrile seizures from acute symptomatic seizures secondary to central nervous system infections or seizures associated with fever in children with epilepsy is essential to provide appropriate treatment and follow-up care. Here, we tested the hypothesis that children who exhibit simple febrile seizures during early childhood, but do not develop epileptic seizures later in life, might preferentially carry the ApoE2 allele of the gene coding for apolipoprotein E. We did not find any differences in the distribution of ApoE alleles or genotypes between individuals who exhibited simple febrile seizures (n = 93) and age-matched, typically developing subjects (n = 80). We found that the observed allele and genotype frequencies did not deviate from Hardy-Weinberg equilibrium, which suggests that the frequencies of ApoE alleles and genotypes are stable in the Swiss population from which our samples were derived. Across both groups of subjects (n = 173), we found an ApoE2 allele frequency of 0.064, an ApoE3 frequency of 0.829 and an ApoE4 frequency of 0.107. Our findings are consistent with previous reports of the distribution of ApoE polymorphism for European subjects free of any neurological disorders, and show that the different alleles of the gene coding for apolipoprotein E are not associated with the occurrence of simple febrile seizures.
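A Hardy-Weinberg check of the kind mentioned in this abstract can be sketched for a three-allele locus as follows. The abstract reports only allele frequencies, not genotype counts, so the counts below are invented (173 hypothetical subjects) and are not the study's data. The function names are illustrative, and a Pearson chi-square statistic is assumed here, which may differ from the test the authors used.

```python
from itertools import combinations_with_replacement

def allele_frequencies(genotype_counts):
    """genotype_counts maps sorted (allele, allele) tuples to observed counts."""
    n_alleles = 2 * sum(genotype_counts.values())
    counts = {}
    for (a, b), n in genotype_counts.items():
        # A homozygote contributes two copies of the same allele.
        counts[a] = counts.get(a, 0) + n
        counts[b] = counts.get(b, 0) + n
    return {allele: c / n_alleles for allele, c in counts.items()}

def hwe_chi_square(genotype_counts):
    """Pearson chi-square statistic against Hardy-Weinberg expectations."""
    n = sum(genotype_counts.values())
    freqs = allele_frequencies(genotype_counts)
    chi2 = 0.0
    for a, b in combinations_with_replacement(sorted(freqs), 2):
        # Expected count: n * p^2 for homozygotes, n * 2pq for heterozygotes.
        expected = n * freqs[a] * freqs[b] * (1 if a == b else 2)
        observed = genotype_counts.get((a, b), 0)
        chi2 += (observed - expected) ** 2 / expected
    return chi2

# Hypothetical genotype counts, NOT taken from the study.
counts = {
    ("e2", "e2"): 1, ("e2", "e3"): 18, ("e2", "e4"): 2,
    ("e3", "e3"): 120, ("e3", "e4"): 29, ("e4", "e4"): 3,
}
freqs = allele_frequencies(counts)
chi2 = hwe_chi_square(counts)

# Three alleles give 6 genotype classes and 6 - 1 - 2 = 3 degrees of freedom;
# 7.815 is the 5% critical value of the chi-square distribution with 3 df.
print(freqs, chi2, chi2 < 7.815)
```

With expected counts this small for the rare homozygotes, the chi-square approximation is rough, and an exact test would usually be preferred in practice.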
Abstract:
Waveform tomographic imaging of crosshole georadar data is a powerful method to investigate the shallow subsurface because of its ability to provide images of pertinent petrophysical parameters with extremely high spatial resolution. All current crosshole georadar waveform inversion strategies are based on the assumption of frequency-independent electromagnetic constitutive parameters. However, in reality, these parameters are known to be frequency-dependent and complex and thus recorded georadar data may show significant dispersive behavior. In this paper, we evaluate synthetically the reconstruction limits of a recently published crosshole georadar waveform inversion scheme in the presence of varying degrees of dielectric dispersion. Our results indicate that, when combined with a source wavelet estimation procedure that provides a means of partially accounting for the frequency-dependent effects through an "effective" wavelet, the inversion algorithm performs remarkably well in weakly to moderately dispersive environments and has the ability to provide adequate tomographic reconstructions.
Abstract:
It is widely accepted that antibody responses against the human parasitic pathogen Plasmodium falciparum protect the host from the rigors of severe malaria and death. However, there is a continuing need for the development of in vitro correlate assays of immune protection. To this end, the capacity of human monoclonal and polyclonal antibodies in eliciting phagocytosis and parasite growth inhibition via Fcγ receptor-dependent mechanisms was explored. In examining the extent to which sequence diversity in merozoite surface protein 2 (MSP2) results in the evasion of antibody responses, an unexpectedly high level of heterologous function was measured for allele-specific human antibodies. The dependence on Fcγ receptors for opsonic phagocytosis and monocyte-mediated antibody-dependent parasite inhibition was demonstrated by the mutation of the Fc domain of monoclonal antibodies against both MSP2 and a novel vaccine candidate, peptide 27 from the gene PFF0165c. The described flow cytometry-based functional assays are expected to be useful for assessing immunity in naturally infected and vaccinated individuals and for prioritizing among blood-stage antigens for inclusion in blood-stage vaccines.
Abstract:
Assessment of eating habits in young children from multicultural backgrounds has seldom been conducted. Our objectives were to study the reproducibility and the results of a food frequency questionnaire (FFQ) developed to assess changes in eating habits of preschool children with a high migrant population, in the context of a multidisciplinary multilevel lifestyle intervention. Three kindergarten classes (53% from migrant backgrounds) in French-speaking Switzerland were randomly selected and included 16 girls and 28 boys (mean age ± SD, 5.4 ± 0.7 years). The FFQ was filled out twice within a 4-week interval by the parents. Spearman rank correlations between the first and the second FFQ for the 39 items of the food questions were as follows: low (r < 0.50) for 8 (7 P < .05 and 1 nonsignificant), moderate (0.50 ≤ r < 0.70) for 22 (all P < .01), and high (r ≥ 0.70) for 9 (all P < .01). In addition, 28 of 39 intraclass correlation coefficients were high (>0.50, all P < .01). Eighty-six percent of the children ate breakfast at home daily, but only 67% had lunch at home. The percentages of children eating at least once a week in front of the TV were as follows: 50% for breakfast, 33% for lunch, 38% for dinner, and 48% for snacks. Forty percent of children asked their parents to buy food previously seen in advertisements and ate fast food between once a week and once a month. Children generally consumed foods with a high-energy content. The FFQ yielded good test-retest reproducibility for most items of the food questions and gave relevant findings about the eating habits of preschool children in areas with a high migrant population.
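The test-retest classification used in this abstract can be illustrated with a small, dependency-free sketch. The cut-offs (0.50 and 0.70) come from the abstract. The function names and the five made-up answers per administration are assumptions for illustration, and Spearman's coefficient is computed here as the Pearson correlation of average ranks.

```python
def average_ranks(values):
    """Ranks starting at 1; tied values share the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks

def pearson(x, y):
    """Pearson correlation; undefined (division by zero) for constant input."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    var_x = sum((a - mx) ** 2 for a in x)
    var_y = sum((b - my) ** 2 for b in y)
    return cov / (var_x * var_y) ** 0.5

def spearman(x, y):
    """Spearman rank correlation as the Pearson correlation of the ranks."""
    return pearson(average_ranks(x), average_ranks(y))

def reproducibility(r):
    """Classify a correlation with the cut-offs quoted in the abstract."""
    if r < 0.50:
        return "low"
    if r < 0.70:
        return "moderate"
    return "high"

# Hypothetical answers to one FFQ item from five children, given twice.
first = [1, 2, 3, 4, 5]
second = [1, 3, 2, 4, 5]
rho = spearman(first, second)
print(rho, reproducibility(rho))
```

Questionnaire answers are usually ordinal with many ties, which is why the ranking step averages tied ranks instead of breaking ties arbitrarily.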