996 resultados para POPULATION ADMIXTURE
Resumo:
Introduction: Interethnic admixture is a source of cryptic population structure that may lead to spurious genotype-phenotype associations in pharmacogenomic studies. We studied the impact of population stratification on the distribution of ABCB1 polymorphisms (1236C > T, 2677G > T/A and 3435C > T) among Brazilians, a highly admixed population with Amerindian, European and African ancestral roots. Methods: Individual DNA from 320 healthy adults was genotyped with a panel of ancestry informative markers, and the proportions of African component of ancestry (ACA) were estimated. ABCB1 genotypes were determined by the single base extension/termination method. We describe the association between ABCB1 polymorphisms and ACA by fitting a linear proportional odds logistic regression model to the data. Results: The distribution of the ABCB1 2677G > T/A and 3435C > T, but not the 1236C > T, SNPs displayed a significant trend for decreasing frequency of the T alleles and TT genotypes from White to Intermediate to Black individuals. The same trend was observed in the frequency of the T/nonG/T haplotype at the 1236, 2677 and 3435 loci. When the population sample was proportioned in quartiles, according to the individual ACA estimates, the frequency of the T allele and TT genotype at each locus declined progressively from the lowest (< 0.25 ACA) to the highest (> 0.75 ACA) quartile. Linear proportional odds logistic regression analysis confirmed that the odds of having the T allele at each locus decreases in a continuous manner with the increase of the ACA, throughout the ACA range (0.13-0.94) observed in the overall population sample. A significant association was also detected between the individual ACA estimates and the presence of the T/nonG/T haplotype in the overall population. Conclusion: Self-identification according to the racial/color categories proposed by the Brazilian Census is insufficient to properly control for population stratification in pharmacogenomic studies of ABCB1.
Resumo:
Inferring population admixture from genetic data and quantifying it is a difficult but crucial task in evolutionary and conservation biology. Unfortunately state-of-the-art probabilistic approaches are computationally demanding. Effectively exploiting the computational power of modern multiprocessor systems can thus have a positive impact to Monte Carlo-based simulation of admixture modeling. A novel parallel approach is briefly described and promising results on its message passing interface (MPI)-based C implementation are reported.
Resumo:
Background: Before the arrival of Europeans to Cuba, the island was inhabited by two Native American groups, the Tainos and the Ciboneys. Most of the present archaeological, linguistic and ancient DNA evidence indicates a South American origin for these populations. In colonial times, Cuban Native American people were replaced by European settlers and slaves from Africa. It is still unknown however, to what extent their genetic pool intermingled with and was 'diluted' by the arrival of newcomers. In order to investigate the demographic processes that gave rise to the current Cuban population, we analyzed the hypervariable region I (HVS-I) and five single nucleotide polymorphisms (SNPs) in the mitochondrial DNA (mtDNA) coding region in 245 individuals, and 40 Y-chromosome SNPs in 132 male individuals. Results: The Native American contribution to present-day Cubans accounted for 33% of the maternal lineages, whereas Africa and Eurasia contributed 45% and 22% of the lineages, respectively. This Native American substrate in Cuba cannot be traced back to a single origin within the American continent, as previously suggested by ancient DNA analyses. Strikingly, no Native American lineages were found for the Y-chromosome, for which the Eurasian and African contributions were around 80% and 20%, respectively. Conclusion: While the ancestral Native American substrate is still appreciable in the maternal lineages, the extensive process of population admixture in Cuba has left no trace of the paternal Native American lineages, mirroring the strong sexual bias in the admixture processes taking place during colonial times.
Resumo:
The impact of biogeographical ancestry, self-reported 'race/color' and geographical origin on the frequency distribution of 10 CYP2C functional polymorphisms (CYP2C8*2, *3, *4, CYP2C9*2, *3, *5, *11, CYP2C19*2, *3 and *17) and their haplotypes was assessed in a representative cohort of the Brazilian population (n = 1034). TaqMan assays were used for allele discrimination at each CYP2C locus investigated. Individual proportions of European, African and Amerindian biogeographical ancestry were estimated using a panel of insertion-deletion polymorphisms. Multinomial log-linear models were applied to infer the statistical association between the CYP2C alleles and haplotypes (response variables), and biogeographical ancestry, self-reported Color and geographical origin (explanatory variables). The results showed that CYP2C19*3, CYP2C9*5 and CYP2C9*11 were rare alleles (<1%), the frequency of other variants ranged from 3.4% (CYP2C8*4) to 17.3% (CYP2C19*17). Two distinct haplotype blocks were identified: block 1 consists of three single nucleotide polymorphisms (SNPs) (CYP2C19*17, CYP2C19*2 and CYP2C9*2) and block 2 of six SNPs (CYP2C9*11, CYP2C9*3, CYP2C9*5, CYP2C8*2, CYP2C8*4 and CYP2C8*3). Diplotype analysis generated 41 haplotypes, of which eight had frequencies greater than 1% and together accounted for 96.4% of the overall genetic diversity. The distribution of CYP2C8 and CYP2C9 (but not CYP2C19) alleles, and of CYP2C haplotypes was significantly associated with self-reported Color and with the individual proportions of European and African genetic ancestry, irrespective of Color self-identification. The individual odds of having alleles CYP2C8*2, CYP2C8*3, CYP2C9*2 and CYP2C9*3, and haplotypes including these alleles, varied continuously as the proportion of European ancestry increased. Collectively, these data strongly suggest that the intrinsic heterogeneity of the Brazilian population must be acknowledged in the design and interpretation of pharmacogenomic studies of the CYP2C cluster in order to avoid spurious conclusions based on improper matching of study cohorts. This conclusion extends to other polymorphic pharmacogenes among Brazilians, and most likely to other admixed populations of the Americas. The Pharmacogenomics Journal (2012) 12, 267-276; doi: 10.1038/tpj.2010.89; published online 21 December 2010
Resumo:
This dissertation has three separate parts: the first part deals with the general pedigree association testing incorporating continuous covariates; the second part deals with the association tests under population stratification using the conditional likelihood tests; the third part deals with the genome-wide association studies based on the real rheumatoid arthritis (RA) disease data sets from Genetic Analysis Workshop 16 (GAW16) problem 1. Many statistical tests are developed to test the linkage and association using either case-control status or phenotype covariates for family data structure, separately. Those univariate analyses might not use all the information coming from the family members in practical studies. On the other hand, the human complex disease do not have a clear inheritance pattern, there might exist the gene interactions or act independently. In part I, the new proposed approach MPDT is focused on how to use both the case control information as well as the phenotype covariates. This approach can be applied to detect multiple marker effects. Based on the two existing popular statistics in family studies for case-control and quantitative traits respectively, the new approach could be used in the simple family structure data set as well as general pedigree structure. The combined statistics are calculated using the two statistics; A permutation procedure is applied for assessing the p-value with adjustment from the Bonferroni for the multiple markers. We use simulation studies to evaluate the type I error rates and the powers of the proposed approach. Our results show that the combined test using both case-control information and phenotype covariates not only has the correct type I error rates but also is more powerful than the other existing methods. For multiple marker interactions, our proposed method is also very powerful. Selective genotyping is an economical strategy in detecting and mapping quantitative trait loci in the genetic dissection of complex disease. When the samples arise from different ethnic groups or an admixture population, all the existing selective genotyping methods may result in spurious association due to different ancestry distributions. The problem can be more serious when the sample size is large, a general requirement to obtain sufficient power to detect modest genetic effects for most complex traits. In part II, I describe a useful strategy in selective genotyping while population stratification is present. Our procedure used a principal component based approach to eliminate any effect of population stratification. The paper evaluates the performance of our procedure using both simulated data from an early study data sets and also the HapMap data sets in a variety of population admixture models generated from empirical data. There are one binary trait and two continuous traits in the rheumatoid arthritis dataset of Problem 1 in the Genetic Analysis Workshop 16 (GAW16): RA status, AntiCCP and IgM. To allow multiple traits, we suggest a set of SNP-level F statistics by the concept of multiple-correlation to measure the genetic association between multiple trait values and SNP-specific genotypic scores and obtain their null distributions. Hereby, we perform 6 genome-wide association analyses using the novel one- and two-stage approaches which are based on single, double and triple traits. Incorporating all these 6 analyses, we successfully validate the SNPs which have been identified to be responsible for rheumatoid arthritis in the literature and detect more disease susceptibility SNPs for follow-up studies in the future. Except for chromosome 13 and 18, each of the others is found to harbour susceptible genetic regions for rheumatoid arthritis or related diseases, i.e., lupus erythematosus. This topic is discussed in part III.
Resumo:
Aims: The relationship between variants in SLCO1B1 and SLCO2B1 genes and lipid-lowering response to atorvastatin was investigated. Material and Methods: One-hundred-thirty-six unrelated individuals with hypercholesterolemia were selected and treated with atorvastatin (10 mg/day/4 weeks). They were genotyped with a panel of ancestry informative markers for individual African component of ancestry (ACA) estimation by SNaPshot (R) and SLCO1B1 (c.388A>G, c.463C>A and c.521T>C) and SLCO2B1 (-71T>C) gene polymorphisms were identified by TaqMan (R) Real-time PCR. Results: Subjects carrying SLCO1B1 c.388GG genotype exhibited significantly high low-density lipoprotein (LDL) cholesterol reduction relative to c.388AA+c.388AG carriers (41 vs. 37%, p = 0.034). Haplotype analysis revealed that homozygous of SLCO1B1*15 (c.521C and c.388G) variant had similar response to statin relative to heterozygous and non-carriers. A multivariate logistic regression analysis confirmed that c.388GG genotype was associated with higher LDL cholesterol reduction in the study population (OR: 3.2, CI95%: 1.3-8.0, p < 0.05). Conclusion: SLCO1B1 c.388A>G polymorphism causes significant increase in atorvastatin response and may be an important marker for predicting efficacy of lipid-lowering therapy.
Resumo:
Aims: The heterogeneity of the Brazilian population renders the extrapolation of pharmacogenomic data derived from well-defined ethnic groups inappropriate. We investigated the influence of self-reported `race/color`, geographical origin and genetic ancestry on the distribution of four VKORC1 SNPs and haplotypes in Brazilians. Comparative data were obtained from two major ancestral roots of Brazilians: Portuguese and Africans from former Portuguese colonies. Materials & methods: A total of 1037 healthy adults Brazilians, recruited at four different geographical regions and self identified as white, brown or black (race/color categories), 89 Portuguese and 216 Africans from Angola and Mozambique were genotyped for the VKORC1 3673G>A (rs9923231), 5808T>G (rs2884737), 6853G>C (rs8050894) and 9041G>A (rs7294) polymorphisms using TaqMan (R) (Applied Biosystems, CA, USA) assays. VKORC1 haplotypes were statistically inferred using the haplo.stats software. We inferred the statistical association between the distribution of the VKORC1 polymorphisms among Brazilians and self-reported color, geographical region and genetic ancestry by fitting multinomial log linear models via neural networks. Individual proportions of European and African ancestry were used to assess the impact of genetic admixture on the frequency distribution of VKORC1 polymorphisms among Brazilians, and for the comparison of Brazilians with Portuguese and Africans. Results: The frequency distribution of the 3673G>A and 5808T>G polymorphisms, and VKORC1 haplotypes among Brazilians varies across geographical regions, within self-reported color categories and according to the individual proportions of European and African genetic ancestry. Notably, the frequency of the warfarin sensitive VKORC1 3673A allele and the distribution of VKORC1 haplotypes varied continuously as the individual proportion of European ancestry increased in the entire cohort, independently of race/color categorization and geographical origin. Brazilians with more than 80% African ancestry differ significantly from Angolans and Mozambicans in frequency of the 3673G>A, 5808T>G and 6853G>C polymorphisms and haplotype distribution, whereas no such differences are observed between Brazilians with more than 90% European ancestry and Portuguese individuals. Conclusion: The diversity of the Brazilian population, evident in the distribution of VKORC1 polymorphisms, must be taken into account in the design of pharmacogenetic clinical trials and dealt with as a continuous variable. Warfarin dosing algorithms that include `race` terms defined for other populations are clearly not applicable to the heterogeneous and extensively admixed Brazilian population.
Resumo:
Analyses of mitochondrial DNA (mtDNA) control region polymorphism and of variation at 10 nuclear microsatellite loci were used to investigate the mechanisms and genetic consequences of postglacial expansion of Myotis myotis in Europe. Initial sampling consisted of 480 bats genotyped in 24 nursery colonies arranged along a transect of approximately 3000 km. The phylogeographical survey based on mtDNA sequences revealed the existence of major genetic subdivisions across this area, with several suture zones between haplogroups. Such zones of secondary contact were found in the Alps and Rhodopes, whereas other potential barriers to gene flow, like the Pyrenees, did not coincide with genetic discontinuities. Areas of population admixture increased locally the genetic diversity of colonies, which confounded the northward decrease in nucleotide diversity predicted using classical models of postglacial range expansion. However, when analyses were restricted to a subset of 15 nurseries originating from a single presumed glacial refugium, mtDNA polymorphism did indeed support a northwards decrease in diversity. Populations were also highly structured (PhiST = 0.384). Conversely, the same subset of colonies showed no significant latitudinal decrease in microsatellite diversity and much less population structure (FST = 0.010), but pairwise genetic differentiation at these nuclear markers was strongly correlated with increasing geographical distance. Together, this evidence suggests that alleles carried via male bats have maintained enough nuclear gene flow to counteract the effects of recurrent bottlenecks generally associated with recolonization processes. As females are highly philopatric, we argue that the maternally transmitted mtDNA marker better reflects the situation of past, historical gene flow, whereas current levels of gene flow are better reflected by microsatellite markers.
Resumo:
The advent of simple and affordable tools for molecular identification of novel insect invaders and assessment of population diversity has changed the face of invasion biology in recent years. The widespread application of these tools has brought with it an emerging understanding that patterns in biogeography, introduction history and subsequent movement and spread of many invasive alien insects are far more complex than previously thought. We reviewed the literature and found that for a number of invasive insects, there is strong and growing evidence that multiple introductions, complex global movement, and population admixture in the invaded range are commonplace. Additionally, historical paradigms related to species and strain identities and origins of common invaders are in many cases being challenged. This has major consequences for our understanding of basic biology and ecology of invasive insects and impacts quarantine, management and biocontrol programs. In addition, we found that founder effects rarely limit fitness in invasive insects and may benefit populations (by purging harmful alleles or increasing additive genetic variance). Also, while phenotypic plasticity appears important post-establishment, genetic diversity in invasive insects is often higher than expected and increases over time via multiple introductions. Further, connectivity among disjunct regions of global invasive ranges is generally far higher than expected and is often asymmetric, with some populations contributing disproportionately to global spread. We argue that the role of connectivity in driving the ecology and evolution of introduced species with multiple invasive ranges has been historically underestimated and that such species are often best understood in a global context.
Resumo:
A dosing algorithm including genetic (VKORC1 and CYP2C9 genotypes) and nongenetic factors (age, weight, therapeutic indication, and cotreatment with amiodarone or simvastatin) explained 51% of the variance in stable weekly warfarin doses in 390 patients attending an anticoagulant clinic in a Brazilian public hospital. The VKORC1 3673G>A genotype was the most important predictor of warfarin dose, with a partial R(2) value of 23.9%. Replacing the VKORC1 3673G>A genotype with VKORC1 diplotype did not increase the algorithm`s predictive power. We suggest that three other single-nucleotide polymorphisms (SNPs) (5808T>G, 6853G>C, and 9041G>A) that are in strong linkage disequilibrium (LD) with 3673G>A would be equally good predictors of the warfarin dose requirement. The algorithm`s predictive power was similar across the self-identified ""race/color"" subsets. ""Race/color"" was not associated with stable warfarin dose in the multiple regression model, although the required warfarin dose was significantly lower (P = 0.006) in white (29 +/- 13 mg/week, n = 196) than in black patients (35 +/- 15 mg/week, n = 76).
Resumo:
We genotyped 15 microsatellite loci in order to evaluate the effects of habitat fragmentation, caused by flooding of the Tucuruí reservoir, on the genetic structure of Alouatta belzebul in eastern Amazonia. The analysis included two populations sampled in 1984, representing both margins of the Tocantins river, and three populations sampled 18 years later. Minimal differences in the diversity levels between present-day (Ho = 0.62-0.69 and AR = 6.07-7.21) and pre-flooding (Ho = 0.60-0.62 and AR = 6.27-6.77) populations indicated there was no significant loss of genetic variability, possibly because of successful management strategies applied during the flooding. The changes observed were limited to shifts in the composition of alleles, which presumably reflect the admixture of subpopulations during flooding. Given this, there were significant differences in the Rst values (p = 0.05) in all but one between-site comparison. Both present-day and original populations showed a deficit of heterozygotes, which suggests that this may be typical of the species, at least at a local level, perhaps because of specific ecological characteristics. The relatively large number of private alleles recorded in all populations may be a consequence of the Wahlund effect resulting from population admixture or a process of expansion rather than the loss of rare alleles through genetic drift. Additionally, the levels of genetic variability observed in this study were higher than those reported for other species of Neotropical primates, suggesting good fitness levels in these A. belzebul populations. Regular genetic monitoring of remnant populations, especially on islands, should nevertheless be an integral component of long-term management strategies.
Resumo:
We propose robust and e±cient tests and estimators for gene-environment/gene-drug interactions in family-based association studies. The methodology is designed for studies in which haplotypes, quantitative pheno- types and complex exposure/treatment variables are analyzed. Using causal inference methodology, we derive family-based association tests and estimators for the genetic main effects and the interactions. The tests and estimators are robust against population admixture and strati¯cation without requiring adjustment for confounding variables. We illustrate the practical relevance of our approach by an application to a COPD study. The data analysis suggests a gene-environment interaction between a SNP in the Serpine gene and smok- ing status/pack years of smoking that reduces the FEV1 volume by about 0.02 liter per pack year of smoking. Simulation studies show that the pro- posed methodology is su±ciently powered for realistic sample sizes and that it provides valid tests and effect size estimators in the presence of admixture and stratification.
Resumo:
Linkage disequilibrium methods can be used to find genes influencing quantitative trait variation in humans. Linkage disequilibrium methods can require smaller sample sizes than linkage equilibrium methods, such as the variance component approach to find loci with a specific effect size. The increase in power is at the expense of requiring more markers to be typed to scan the entire genome. This thesis compares different linkage disequilibrium methods to determine which factors influence the power to detect disequilibrium. The costs of disequilibrium and equilibrium tests were compared to determine whether the savings in phenotyping costs when using disequilibrium methods outweigh the additional genotyping costs.^ Nine linkage disequilibrium tests were examined by simulation. Five tests involve selecting isolated unrelated individuals while four involved the selection of parent child trios (TDT). All nine tests were found to be able to identify disequilibrium with the correct significance level in Hardy-Weinberg populations. Increasing linked genetic variance and trait allele frequency were found to increase the power to detect disequilibrium, while increasing the number of generations and distance between marker and trait loci decreased the power to detect disequilibrium. Discordant sampling was used for several of the tests. It was found that the more stringent the sampling, the greater the power to detect disequilibrium in a sample of given size. The power to detect disequilibrium was not affected by the presence of polygenic effects.^ When the trait locus had more than two trait alleles, the power of the tests maximized to less than one. For the simulation methods used here, when there were more than two-trait alleles there was a probability equal to 1-heterozygosity of the marker locus that both trait alleles were in disequilibrium with the same marker allele, resulting in the marker being uninformative for disequilibrium.^ The five tests using isolated unrelated individuals were found to have excess error rates when there was disequilibrium due to population admixture. Increased error rates also resulted from increased unlinked major gene effects, discordant trait allele frequency, and increased disequilibrium. Polygenic effects did not affect the error rates. The TDT, Transmission Disequilibrium Test, based tests were not liable to any increase in error rates.^ For all sample ascertainment costs, for recent mutations ($<$100 generations) linkage disequilibrium tests were less expensive than the variance component test to carry out. Candidate gene scans saved even more money. The use of recently admixed populations also decreased the cost of performing a linkage disequilibrium test. ^
Resumo:
Currently there is no general method to study the impact of population admixture within families on the assumptions of random mating and consequently, Hardy-Weinberg equilibrium (HWE) and linkage equilibrium (LE) and on the inference obtained from traditional linkage analysis. ^ First, through simulation, the effect of admixture of two populations on the log of the odds (LOD) score was assessed, using Prostate Cancer as the typical disease model. Comparisons between simulated mixed and homogeneous families were performed. LOD scores under both models of admixture (within families and within a data set of homogeneous families) were closest to the homogeneous family scores of the population having the highest mixing proportion. Random sampling of families or ascertainment of families with disease affection status did not affect this observation, nor did the mode of inheritance (dominant/recessive) or sample size. ^ Second, after establishing the effect of admixture on the LOD score and inference for linkage, the presence of induced disequilibria by population admixture within families was studied and an adjustment procedure was developed. The adjustment did not force all disequilibria to disappear but because the families were adjusted for the population admixture, those replicates where the disequilibria exist are no longer affected by the disequilibria in terms of maximization for linkage. Furthermore, the adjustment was able to exclude uninformative families or families that had such a high departure from HWE and/or LE that their LOD scores were not reliable. ^ Together these observations imply that the presence of families of mixed population ancestry impacts linkage analysis in terms of the LOD score and the estimate of the recombination fraction. ^
Resumo:
Objectives. In this study, we aimed to identify ancestry informative haplotypes and make interethnic admixture estimates using X-chromosome markers. Methods. A significant sample (461 individuals) of European, African, and Native American populations was analyzed, and four linkage groups were identified. The data obtained were used to describe the ancestral contribution of populations from four different geographical regions of Brazil (745 individuals). Results. The global interethnic admixture estimates of the four mixed populations under investigation were calculated applying all the 24 insertion/deletion (INDEL) markers. In the North region, a larger Native Americans ancestry was observed (42%). The Northeast and Southeast regions had smaller Native American contribution (27% in both of them). In the South region, there was a large European contribution (46%). Conclusions. The estimates obtained are compatible with expectations for a colonization model with biased admixture between European men (one X chromosome) and Native American and African women (two X chromosomes), so the 24 X-INDEL panel described here can be a useful to make admixture interethnic estimates in Brazilian populations. Am. J. Hum. Biol. 22:849-852,2010. (C) 2010 Wiley-Liss, Inc.