11 results for Population genetics
in DigitalCommons@The Texas Medical Center
Abstract:
Variable number of tandem repeat (VNTR) loci are genetic loci at which short sequence motifs are repeated different numbers of times among chromosomes. To explore the potential utility of VNTR loci in evolutionary studies, I have conducted a series of studies to address the following questions: (1) What are the population genetic properties of these loci? (2) What are the mutational mechanisms of repeat number change at these loci? (3) Can DNA profiles be used to measure the relatedness between a pair of individuals? (4) Can DNA fingerprints be used to measure the relatedness between populations in evolutionary studies? (5) Can microsatellite and short tandem repeat (STR) loci, which mutate in a stepwise manner, be used in evolutionary analyses?
A large number of VNTR loci typed in many populations were studied by means of recently developed statistical methods. The results of this work indicate that there is no significant departure from Hardy-Weinberg expectation (HWE) at VNTR loci in most of the human populations examined, and that the departures from HWE at some VNTR loci are not solely caused by the presence of population sub-structure.
A statistical procedure is developed to investigate the mutational mechanisms of VNTR loci by studying the allele frequency distributions of these loci. Comparisons of frequency distribution data on several hundred VNTR loci with the predictions of two mutation models demonstrated that there are differences among VNTR loci grouped by repeat unit size.
By extending the ITO method, I derived the distribution of the number of shared bands between individuals with any kinship relationship. A maximum likelihood estimation procedure is proposed to estimate the relatedness between individuals from the observed number of shared bands between them.
It was believed that classical measures of genetic distance are not applicable to the analysis of DNA fingerprints, which reveal many minisatellite loci in the genome simultaneously, because the information regarding underlying alleles and loci is not available. I proposed a new measure of genetic distance based on band sharing between individuals that is applicable to DNA fingerprint data.
To address the concern that microsatellite and STR loci may not be useful for evolutionary studies because of the convergent nature of their mutation mechanisms, I carried out a theoretical study as well as computer simulations. I conclude that the possible bias caused by convergent mutations can be corrected, and I suggest a novel measure of genetic distance that makes the correction. In summary, I conclude that hypervariable VNTR loci are useful in evolutionary studies of closely related populations or species, especially in the study of human evolution and the history of geographic dispersal of Homo sapiens. (Abstract shortened by UMI.)
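The shared-band count that this abstract builds on can be illustrated with a minimal sketch. It uses the widely used Dice-style band-sharing index, which is not necessarily the distance measure the author proposes. The function names and the band positions are hypothetical, and real fingerprint bands would be matched within a size tolerance rather than by exact equality.

```python
def shared_bands(profile_a, profile_b):
    """Count bands (fragment positions) present in both profiles."""
    return len(set(profile_a) & set(profile_b))


def band_sharing_index(profile_a, profile_b):
    """Dice-style similarity: 2 * shared / (bands in A + bands in B)."""
    a, b = set(profile_a), set(profile_b)
    if not a and not b:
        raise ValueError("both profiles are empty")
    return 2 * len(a & b) / (len(a) + len(b))


# Hypothetical band positions (arbitrary units), for illustration only.
individual_1 = [2.1, 3.4, 5.0, 6.2, 8.8]
individual_2 = [2.1, 3.4, 4.7, 6.2, 9.5, 10.1]

n_shared = shared_bands(individual_1, individual_2)
similarity = band_sharing_index(individual_1, individual_2)
print(n_shared, round(similarity, 3))
```

An index near 1 means the two profiles share almost all bands; a relatedness estimate would additionally need the population band frequencies, which this sketch does not model.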
Abstract:
Enterococcus faecium has emerged as an important nosocomial pathogen worldwide, and this trend has been associated with the dissemination of a genetic lineage designated clonal cluster 17 (CC17). Enterococcal isolates were collected prospectively (2006 to 2008) from 32 hospitals in Colombia, Ecuador, Perú, and Venezuela and subjected to antimicrobial susceptibility testing. Genotyping was performed with all vancomycin-resistant E. faecium (VREfm) isolates by pulsed-field gel electrophoresis (PFGE) and multilocus sequence typing. All VREfm isolates were evaluated for the presence of 16 putative virulence genes (14 fms genes, the esp gene of E. faecium [espEfm], and the hyl gene of E. faecium [hylEfm]) and plasmids carrying the fms20-fms21 (pilA), hylEfm, and vanA genes. Of 723 enterococcal isolates recovered, E. faecalis was the most common (78%). Vancomycin resistance was detected in 6% of the isolates (74% of which were E. faecium). Eleven distinct PFGE types were found among the VREfm isolates, with most belonging to sequence types 412 and 18. The ebpAEfm-ebpBEfm-ebpCEfm (pilB) and fms11-fms19-fms16 clusters were detected in all VREfm isolates from the region, whereas espEfm and hylEfm were detected in 69% and 23% of the isolates, respectively. The fms20-fms21 (pilA) cluster, which encodes a putative pilus-like protein, was found on plasmids from almost all VREfm isolates and was sometimes found to coexist with hylEfm and the vanA gene cluster. The population genetics of VREfm in South America appear to resemble those of such strains in the United States in the early years of the CC17 epidemic. The overwhelming presence of plasmids encoding putative virulence factors and vanA genes suggests that E. faecium from the CC17 genogroup may disseminate in the region in the coming years.
Abstract:
Colorectal cancer is the fourth most commonly diagnosed cancer in the United States. Every year about 147,000 people are diagnosed with colorectal cancer and about 56,000 people die of the disease. Most hereditary nonpolyposis colorectal cancers (HNPCC) and 12% of sporadic colorectal cancers show microsatellite instability. Colorectal cancer is a multistep progressive disease. It starts from a mutation in a normal colorectal cell, which grows into a clone of cells that accumulates further mutations and finally develops into a malignant tumor. In terms of molecular evolution, the process of colorectal tumor progression represents the acquisition of sequential mutations.
Clinical studies use biomarkers such as microsatellites or single nucleotide polymorphisms (SNPs) to study mutation frequencies in colorectal cancer. Microsatellite data obtained from single genome equivalent PCR or small pool PCR can be used to infer tumor progression. Since tumor progression is similar to population evolution, we used the coalescent, an approach well established in population genetics, to analyze this type of data. Coalescent theory can be used to infer a sample's evolutionary path through the analysis of microsatellite data.
The simulation results indicate that constant population size and rapid tumor growth produce different patterns of genetic polymorphism. The simulation results were compared with experimental data collected from HNPCC patients. Preliminary results show that the mutation rate in six HNPCC patients ranges from 0.001 to 0.01. The patients' polymorphic patterns are similar to the constant population size pattern, which implies that tumor progression occurs through multilineage persistence rather than clonal sequential evolution. The results should be further verified using a larger dataset.
Abstract:
Coalescent theory represents the most significant progress in theoretical population genetics in the past three decades. The theory states that all genes or alleles in a given population are ultimately inherited from a single ancestor shared by all members of the population, known as the most recent common ancestor. It is now widely recognized as a cornerstone of rigorous statistical analysis of molecular data from populations [1]. Researchers have developed a large number of coalescent models and methods [2,3,4,5,6], which are applied not only in coalescent analysis itself but also in current population genetics and genome studies, and even in public health. This thesis aims to build a computer-based statistical framework for coalescent analysis. The framework provides a large number of coalescent models and statistical methods to assist students and researchers in coalescent analysis, and it presents results in various formats, including text, graphics, and printed pages. In particular, it also supports the creation of new coalescent models and statistical methods.
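The idea of tracing lineages back to a most recent common ancestor can be sketched with the standard Kingman coalescent. This is a minimal illustration, not the framework described in the thesis. It assumes a constant-size, neutrally evolving population, with time measured in coalescent units (2N generations for a diploid population of size N). The function names are hypothetical.

```python
import random


def expected_tmrca(n):
    """Expected time to the most recent common ancestor of n lineages.

    While k lineages remain, the waiting time to the next coalescence is
    exponential with rate k*(k-1)/2, so its mean is 2/(k*(k-1)).
    Summing over k = 2..n gives 2*(1 - 1/n).
    """
    if n < 2:
        raise ValueError("need at least two lineages")
    return sum(2.0 / (k * (k - 1)) for k in range(2, n + 1))


def simulate_tmrca(n, rng):
    """Draw one TMRCA by adding up successive coalescence waiting times."""
    total = 0.0
    for k in range(n, 1, -1):
        rate = k * (k - 1) / 2.0
        total += rng.expovariate(rate)
    return total


rng = random.Random(0)  # fixed seed so the run is reproducible
draws = [simulate_tmrca(10, rng) for _ in range(20000)]
mean_tmrca = sum(draws) / len(draws)
print(expected_tmrca(10), mean_tmrca)
```

The simulated mean should fall close to the analytical value; models with population growth or subdivision would change the coalescence rates and are not covered here.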
Abstract:
Native peoples of the New World, including Amerindians and admixed Latin Americans such as Mexican-Americans, are highly susceptible to diseases of the gallbladder. These include cholesterol cholelithiasis (gallstones) and its complications, as well as cancer of the gallbladder. Although there is clearly some necessary dietary or other environmental risk factor involved, the pattern of disease prevalence is geographically associated with the distribution of genes of aboriginal Amerindian origin, and levels of risk generally correspond to the degree of Amerindian admixture. This pattern differs from that generally associated with Westernization, which suggests a gene-environment interaction, and that within an admixed population there is a subset whose risk is underestimated when admixture is ignored. The risk that an individual of a susceptible New World genotype will undergo a cholecystectomy by age 85 can approach 40% in Mexican-American females, and their risk of gallbladder cancer can reach several percent. These are heretofore unrecognized levels of risk, especially of the latter, because previous studies have not accounted for admixture or for the loss of at-risk individuals due to cholecystectomy. A genetic susceptibility may, thus, be as "carcinogenic" in New World peoples as any known major environmental exposure; yet, while the risk has a genetic basis, its expression as gallbladder cancer is so delayed as to lead only very rarely to multiply-affected families. Estimates in this paper are derived in part from two studies of Mexican-Americans in Starr County and Laredo, Texas.
Abstract:
The interpretation of data on genetic variation with regard to the relative roles of different evolutionary factors that produce and maintain genetic variation depends critically on our assumptions concerning effective population size and the level of migration between neighboring populations. In humans, recent population growth and movements of specific ethnic groups across wide geographic areas mean that any theory based on assumptions of constant population size and absence of substructure is generally untenable. We examine the effects of population subdivision on the pattern of protein genetic variation in a total sample drawn from an artificial agglomerate of 12 tribal populations of Central and South America, analyzing the pooled sample as though it were a single population. Several striking findings emerge. (1) Mean heterozygosity is not sensitive to agglomeration, but the number of different alleles (allele count) is inflated, relative to neutral mutation/drift/equilibrium expectation. (2) The inflation is most serious for rare alleles, especially those which originally occurred as tribally restricted "private" polymorphisms. (3) The degree of inflation is an increasing function of both the number of populations encompassed by the sample and of the genetic divergence among them. (4) Treating an agglomerated population as though it were a panmictic unit of long standing can lead to serious biases in estimates of mutation rates, selection pressures, and effective population sizes. Current DNA studies indicate the presence of numerous genetic variants in human populations. The findings and conclusions of this paper are all fully applicable to the study of genetic variation at the DNA level as well.
Abstract:
BACKGROUND: Meningomyelocele (MM) is a common human birth defect. MM is a disorder of neural development caused by contributions from genes and environmental factors that result in a neural tube defect (NTD) and lead to a spectrum of physical and neurocognitive phenotypes. METHODS: A multidisciplinary approach has been taken to develop a comprehensive understanding of MM through collaborative efforts from investigators specializing in genetics, development, brain imaging, and neurocognitive outcome. Patients have been recruited from five different sites: Houston and the Texas-Mexico border area; Toronto, Canada; Los Angeles, California; and Lexington, Kentucky. Genetic risk factors for MM have been assessed by genotyping and association testing using the transmission disequilibrium test. RESULTS: A total of 509 affected child/parent trios and 309 affected child/parent duos have been enrolled to date for genetic association studies. Subsets of the patients have also been enrolled for studies assessing development, brain imaging, and neurocognitive outcomes. The study recruited two major ethnic groups, with 45.9% Hispanics of Mexican descent and 36.2% North American Caucasians of European descent. The remaining patients are African-American, South and Central American, Native American, and Asian. Studies of this group of patients have already discovered distinct corpus callosum morphology and neurocognitive deficits that are associated with MM. We have identified the maternal MTHFR 667T allele as a risk factor for MM. In addition, we found that several genes for glucose transport and metabolism are potential risk factors for MM. CONCLUSIONS: The enrolled patient population provides a valuable resource for elucidating the disease characteristics and mechanisms of MM development.
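The transmission disequilibrium test named in the methods can be sketched in its basic biallelic form. This is a minimal illustration of the standard McNemar-type statistic, not the study's analysis pipeline. The counts are hypothetical, and the function names are assumptions made for the example.

```python
import math


def tdt_statistic(transmitted, untransmitted):
    """TDT chi-square (1 degree of freedom).

    Counts come from heterozygous parents only: how often the candidate
    allele was transmitted to the affected child versus not transmitted.
    """
    total = transmitted + untransmitted
    if total == 0:
        raise ValueError("no informative (heterozygous) parents")
    return (transmitted - untransmitted) ** 2 / total


def chi2_1df_pvalue(statistic):
    """Upper-tail p-value of a chi-square variable with 1 degree of freedom."""
    return math.erfc(math.sqrt(statistic / 2.0))


# Hypothetical counts, for illustration only.
stat = tdt_statistic(transmitted=60, untransmitted=40)
p_value = chi2_1df_pvalue(stat)
print(stat, round(p_value, 4))
```

Under the null hypothesis of no linkage or association, each heterozygous parent transmits the allele with probability 1/2, so a large imbalance between the two counts yields a large statistic.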
Abstract:
Human pigmentation is a complex trait, with the observed variation caused by the varied production of eumelanin (brown/black melanins) and phaeomelanin (red/yellow melanins) by the melanocytes. The melanocortin 1 receptor (MC1R), a G protein-coupled receptor expressed in the melanocytes, is a regulator of eu- and phaeomelanin synthesis, and MC1R mutations causing skin and coat color changes are known in many mammals. To understand the role of MC1R in human pigmentation variation, I have sequenced the MC1R gene in 121 individuals sampled from world populations. In addition, I have sequenced the MC1R gene in common and pygmy chimpanzees, gorilla, orangutan, and baboon to study the evolution of MC1R and to infer the ancestral human MC1R sequence. The ancestral MC1R sequence is observed in all 25 African individuals studied, but at lower frequencies in the other populations examined, especially in East and Southeast Asians. The Arg163Gln variant is absent in the Africans studied, almost absent in Europeans, and at a low frequency in Indians, but is at an exceptionally high frequency (70%) in East and Southeast Asians. To further evaluate the role of MC1R variants in human pigmentation variation, I have combined these molecular evolution and population studies with functional assays on MC1R variants and primate MC1Rs.
Abstract:
The genetic etiology of stroke likely reflects the influence of multiple loci with small effects, each modulating different pathophysiological processes. This research project used three analytical strategies to address the paucity of information on the identification and characterization of genetic variation associated with stroke in the general population.
First, the general contribution of familial factors to stroke susceptibility was evaluated in a population-based sample of unrelated individuals. Increased risk of subclinical cerebral infarction was observed among individuals with a positive parental history of stroke. This association did not appear to be mediated by established stroke risk factors, specifically blood pressure levels or hypertension status.
The need to identify specific gene variation associated with stroke in the general population was addressed by evaluating seven candidate gene polymorphisms in a population-based sample of unrelated individuals. Three polymorphisms were significantly associated with increased risk of subclinical cerebral infarction or incident clinical ischemic stroke. These relationships include the G-protein β3 subunit 825C/T polymorphism and clinical stroke in Whites, the lipoprotein lipase S/X447 polymorphism and subclinical and clinical stroke in men, and the angiotensin I-converting enzyme Ins/Del polymorphism and subclinical stroke in White men. These associations did not appear to be obscured by the stroke risk factors adjusted for in the analysis models, specifically blood pressure levels or anti-hypertensive medication use.
The final research strategy considered, on a genome-wide scale, the idea that genetic variation may contribute to the occurrence of hypertension or stroke through a common etiologic pathway. Genomic regions were identified for which significant evidence of heterogeneity was observed among hypertensive sibpairs stratified by family history of stroke. Regions identified on chromosome 15 in African Americans, and on chromosome 13 in Whites and African Americans, suggest the presence of genes influencing hypertension and stroke susceptibility.
Insight into the role of genetics in stroke is useful for the potential early identification of individuals at increased risk for stroke and for improved understanding of the etiology of the disease. The ultimate goal of these endeavors is to guide the development of therapeutic intervention and informed prevention to provide a lasting and positive impact on public health.
Abstract:
Normal humans have one red and at least one green visual pigment gene. These genes are tightly linked as tandem repeats on the X chromosome and each of them has six exons. There is only one X-linked visual pigment gene in New World monkeys (NWMs), but the locus has three polymorphic alleles encoding red, yellow and green visual pigments, respectively. The spectral properties of the squirrel monkey and the marmoset (both NWMs) have been studied and partial sequences of the three alleles are available. To study the evolutionary history of these X-linked opsin genes in humans and NWMs, coding and intron sequences of the three squirrel monkey alleles and the three marmoset alleles were amplified by PCR followed by subcloning and sequencing. Introns 2 and 4 of the human red and green pigment genes were also sequenced. The results obtained are as follows: (1) The sequences of introns 2 and 4 of the human red and green opsin genes are significantly more similar between the two genes than are coding sequences, contrary to the usual situation where coding regions are better conserved in evolution than are introns. The high similarities in the two introns are probably due to recent gene conversion events during evolution of the human lineage. (2) Phylogenetic analysis of both intron and exon sequences indicates that the phylogenetic tree of the available primate opsin genes is the same as the species tree. The two human genes were derived from a gene duplication event after the divergence of the human and NWM lineages. The three alleles in each of the two NWM species diverged after the split of the two NWMs but have persisted in the population for at least 5 million years. (3) Allelic gene conversion might have occurred between the three squirrel monkey alleles. (4) A model of additive effect of hydroxyl-bearing amino acids on spectral tuning is proposed by treating some unknown variables as groups. Under the assumption that some residues have no effect, it is found that at least five amino acid residues, at positions 178 (3 nm), 180 (5 nm), 230 (-4 nm), 277 (9 nm) and 285 (13 nm), have linear spectral tuning effects. (5) Adaptive evolution of the opsin genes to different spectral peaks was observed at four residues that are important for spectral tuning.
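If the per-site effects are additive, as the model in finding (4) proposes, the predicted shift for any combination of these sites is the sum of the individual values. The sketch below uses the five values reported in the abstract. The function name is hypothetical, and the sign convention (which residue at each site counts as the shifted state) is an assumption, since the abstract does not state it.

```python
# Per-site spectral tuning effects in nm, as reported in the abstract.
TUNING_EFFECTS_NM = {178: 3, 180: 5, 230: -4, 277: 9, 285: 13}


def predicted_shift_nm(substituted_positions):
    """Sum the additive per-site effects for the given residue positions."""
    positions = set(substituted_positions)
    unknown = positions - set(TUNING_EFFECTS_NM)
    if unknown:
        raise KeyError(f"no tuning effect listed for positions {sorted(unknown)}")
    return sum(TUNING_EFFECTS_NM[p] for p in positions)


shift_two_sites = predicted_shift_nm([277, 285])
shift_all_sites = predicted_shift_nm(TUNING_EFFECTS_NM)
print(shift_two_sites, shift_all_sites)
```

Positions not listed in the abstract raise an error rather than being treated as zero, so the sketch does not silently extend the model beyond the five reported sites.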
Abstract:
The purpose of this research is to develop a new statistical method to determine the minimum set of rows (R) in an R x C contingency table of discrete data that explains the dependence of observations. The statistical power of the method will be determined empirically by computer simulation to judge its efficiency relative to existing methods. The method will be applied to data on DNA fragment length variation at six VNTR loci in over 72 populations from five major human racial groups (total sample size is over 15,000 individuals, with each sample having at least 50 individuals). DNA fragment lengths grouped in bins will form the basis for studying inter-population DNA variation within the racial groups; if such variation is significant, the method will provide a rigorous re-binning procedure for forensic computation of DNA profile frequencies that takes into account intra-racial DNA variation among populations.
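As a baseline for the kind of dependence this abstract refers to, the standard Pearson chi-square test of independence for an R x C table can be sketched as follows. This is the classical test, not the new row-selection method the abstract proposes. The table values are hypothetical (rows as populations, columns as fragment-length bins).

```python
def pearson_chi_square(table):
    """Return (statistic, degrees_of_freedom) for an R x C count table."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    grand_total = sum(row_totals)
    if grand_total == 0 or 0 in row_totals or 0 in col_totals:
        raise ValueError("rows and columns must have non-zero totals")

    statistic = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            # Expected count under independence of rows and columns.
            expected = row_totals[i] * col_totals[j] / grand_total
            statistic += (observed - expected) ** 2 / expected

    dof = (len(table) - 1) * (len(table[0]) - 1)
    return statistic, dof


# Hypothetical counts: 3 populations (rows) x 2 fragment-length bins (columns).
counts = [
    [10, 20],
    [20, 10],
    [15, 15],
]
chi2, dof = pearson_chi_square(counts)
print(round(chi2, 4), dof)
```

A significant statistic only says that some rows differ; identifying the minimum set of rows responsible is the additional step the proposed method targets.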