16 results for Human population genetics
in DigitalCommons@The Texas Medical Center
Abstract:
Variable number of tandem repeats (VNTRs) are genetic loci at which short sequence motifs are found repeated different numbers of times among chromosomes. To explore the potential utility of VNTR loci in evolutionary studies, I have conducted a series of studies to address the following questions: (1) What are the population genetic properties of these loci? (2) What are the mutational mechanisms of repeat number change at these loci? (3) Can DNA profiles be used to measure the relatedness between a pair of individuals? (4) Can DNA fingerprints be used to measure the relatedness between populations in evolutionary studies? (5) Can microsatellite and short tandem repeat (STR) loci, which mutate in a stepwise manner, be used in evolutionary analyses? A large number of VNTR loci typed in many populations were studied by means of recently developed statistical methods. The results of this work indicate that there is no significant departure from Hardy-Weinberg expectation (HWE) at VNTR loci in most of the human populations examined, and that the departures from HWE at some VNTR loci are not solely caused by the presence of population substructure. A statistical procedure is developed to investigate the mutational mechanisms of VNTR loci by studying the allele frequency distributions of these loci. Comparisons of frequency distribution data on several hundred VNTR loci with the predictions of two mutation models demonstrated that there are differences among VNTR loci grouped by repeat unit size. By extending the ITO method, I derived the distribution of the number of shared bands between individuals with any kinship relationship.
A maximum likelihood estimation procedure is proposed to estimate the relatedness between individuals from the observed number of shared bands between them. It was believed that classical measures of genetic distance are not applicable to the analysis of DNA fingerprints, which reveal many minisatellite loci simultaneously in the genome, because the information regarding underlying alleles and loci is not available. I proposed a new measure of genetic distance based on band sharing between individuals that is applicable to DNA fingerprint data. To address the concern that microsatellite and STR loci may not be useful for evolutionary studies because of the convergent nature of their mutation mechanisms, I carried out a theoretical study and computer simulations. I conclude that the possible bias caused by convergent mutations can be corrected, and I suggest a novel measure of genetic distance that makes the correction. In summary, I conclude that hypervariable VNTR loci are useful in evolutionary studies of closely related populations or species, especially in the study of human evolution and the history of geographic dispersal of Homo sapiens. (Abstract shortened by UMI.)
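As an illustration of the Hardy-Weinberg test mentioned in the abstract above, here is a minimal Python sketch of a chi-square goodness-of-fit test for a single locus. It is not the author's procedure: the function name hwe_chi_square and the genotype counts are hypothetical. For hypervariable loci with many rare alleles, an exact or permutation test is usually preferred because expected counts become small.

```python
from collections import Counter
from itertools import combinations_with_replacement


def hwe_chi_square(genotype_counts):
    """Return (chi2, degrees_of_freedom) for one locus.

    genotype_counts maps a tuple (allele_a, allele_b) to the number of
    individuals observed with that genotype. Hypothetical helper, for
    illustration only.
    """
    # Treat (a, b) and (b, a) as the same unordered genotype.
    observed = Counter()
    for (a, b), n in genotype_counts.items():
        observed[tuple(sorted((a, b)))] += n
    n_individuals = sum(observed.values())

    # Allele frequencies by gene counting: each individual carries two alleles.
    allele_counts = Counter()
    for (a, b), n in observed.items():
        allele_counts[a] += n
        allele_counts[b] += n
    freqs = {a: c / (2 * n_individuals) for a, c in allele_counts.items()}

    alleles = sorted(freqs)
    chi2 = 0.0
    for a, b in combinations_with_replacement(alleles, 2):
        # Expected proportion: p^2 for homozygotes, 2pq for heterozygotes.
        prop = freqs[a] ** 2 if a == b else 2 * freqs[a] * freqs[b]
        expected = prop * n_individuals
        chi2 += (observed[(a, b)] - expected) ** 2 / expected

    k = len(alleles)
    # Genotype classes minus number of alleles: k(k+1)/2 - k.
    df = k * (k - 1) // 2
    return chi2, df


# Made-up counts with a heterozygote deficit relative to HWE.
chi2, df = hwe_chi_square({("A", "A"): 30, ("A", "B"): 40, ("B", "B"): 30})
print(chi2, df)
```

With these made-up counts the allele frequencies are both 0.5, so the expected genotype counts are 25, 50, and 25, and the statistic is compared against a chi-square distribution with one degree of freedom.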
Abstract:
Naturally occurring genetic variants confer susceptibility to disease in the human population, including in testicular germ cell tumor development. Disease susceptibility loci for testicular germ cell tumors have been identified by genetic mapping in humans and mice. However, the identity of many of the susceptibility genes remains unclear. My study utilized a chromosome substitution strain, the 129.MOLF-Chr 19 (or M19 strain), to identify candidate testicular germ cell tumor susceptibility genes. Males of this strain have a high incidence of germ cell tumors in the testes. By forward genetic approaches, five susceptibility loci were fine-mapped and the genetic interactions were dissected. In addition, I identified three protein-coding genes and one micro-RNA as testicular tumor susceptibility genes by genomic screening. Using reverse genetic approaches, I verified one of the candidates, Splicing factor 1, as a modifier of testicular tumor. Deficiency of SF1 significantly reduces the incidence of testicular tumors in mice. This study highlights the advantage of the 129.MOLF-Chr 19 consomic strain in disease gene identification and validation. It also sets the stage to elucidate the molecular mechanisms of tumorigenesis in the testis.
Abstract:
Enterococcus faecium has emerged as an important nosocomial pathogen worldwide, and this trend has been associated with the dissemination of a genetic lineage designated clonal cluster 17 (CC17). Enterococcal isolates were collected prospectively (2006 to 2008) from 32 hospitals in Colombia, Ecuador, Perú, and Venezuela and subjected to antimicrobial susceptibility testing. Genotyping was performed with all vancomycin-resistant E. faecium (VREfm) isolates by pulsed-field gel electrophoresis (PFGE) and multilocus sequence typing. All VREfm isolates were evaluated for the presence of 16 putative virulence genes (14 fms genes, the esp gene of E. faecium [espEfm], and the hyl gene of E. faecium [hylEfm]) and plasmids carrying the fms20-fms21 (pilA), hylEfm, and vanA genes. Of 723 enterococcal isolates recovered, E. faecalis was the most common (78%). Vancomycin resistance was detected in 6% of the isolates (74% of which were E. faecium). Eleven distinct PFGE types were found among the VREfm isolates, with most belonging to sequence types 412 and 18. The ebpAEfm-ebpBEfm-ebpCEfm (pilB) and fms11-fms19-fms16 clusters were detected in all VREfm isolates from the region, whereas espEfm and hylEfm were detected in 69% and 23% of the isolates, respectively. The fms20-fms21 (pilA) cluster, which encodes a putative pilus-like protein, was found on plasmids from almost all VREfm isolates and was sometimes found to coexist with hylEfm and the vanA gene cluster. The population genetics of VREfm in South America appear to resemble those of such strains in the United States in the early years of the CC17 epidemic. The overwhelming presence of plasmids encoding putative virulence factors and vanA genes suggests that E. faecium from the CC17 genogroup may disseminate in the region in the coming years.
Abstract:
There is evidence that ultraviolet radiation (UVR) is increasing over certain locations on the Earth's surface. Of primary concern is the annual pattern of ozone depletion over Antarctica and the Southern Ocean. Reduction of ozone concentration selectively limits absorption of solar UV-B (290–320 nm), resulting in higher irradiance at the Earth's surface. The effects of ozone depletion on the human population and natural ecosystems, particularly the marine environment, are a matter of considerable concern. Indeed, marine plankton may serve as sensitive indicators of ozone depletion and UV-B fluctuations. Direct biological effects of UVR result from absorption of UV-B by DNA. Once absorbed, energy is dissipated by a variety of pathways, including covalent chemical reactions leading to the formation of photoproducts. The major types of photoproduct formed are the cyclobutyl pyrimidine dimer (CPD) and the pyrimidine(6-4)pyrimidone dimer [(6-4)PD]. Marine plankton repair these photoproducts using light-dependent photoenzymatic repair or nucleotide excision repair. The studies here show that fluctuations in CPD concentrations in the marine environment at Palmer Station, Antarctica, correlate well with ozone concentration and UV-B irradiance at the Earth's surface. A comparison of photoproduct levels in marine plankton and DNA dosimeters shows that bacterioplankton display higher resistance to solar UVR than phytoplankton in an ozone-depleted environment. DNA damage in marine microorganisms was investigated during two separate latitudinal transects that covered a total range of 140°. We observed the same pattern of change in DNA damage levels in dosimeters and marine plankton as measured using two distinct quantitative techniques. Results from the transects show that differences in photosensitivity exist in marine plankton collected under varying UVR environments.
Laboratory studies of Antarctic bacterial isolates confirm that marine bacterioplankton possess differences in survival, DNA damage induction, and repair following exposure to UVR. Results from DNA damage measurements during ozone season, along a latitudinal gradient, and in marine bacterial isolates suggest that changes in environmental UVR correlate with changes in UV-B induced DNA damage in marine microorganisms. Differences in the ability to tolerate UVR stress under different environmental conditions may determine the composition of the microbial communities inhabiting those environments.
Abstract:
Colorectal cancer is the fourth most commonly diagnosed cancer in the United States. Every year about 147,000 people are diagnosed with colorectal cancer and 56,000 people lose their lives to this disease. Most hereditary nonpolyposis colorectal cancer (HNPCC) cases and 12% of sporadic colorectal cancer cases show microsatellite instability. Colorectal cancer is a multistep progressive disease. It starts from a mutation in a normal colorectal cell and grows into a clone of cells that further accumulates mutations and finally develops into a malignant tumor. In terms of molecular evolution, the process of colorectal tumor progression represents the acquisition of sequential mutations. Clinical studies use biomarkers such as microsatellites or single nucleotide polymorphisms (SNPs) to study mutation frequencies in colorectal cancer. Microsatellite data obtained from single genome equivalent PCR or small pool PCR can be used to infer tumor progression. Since tumor progression is similar to population evolution, we used an approach known as the coalescent, which is well established in population genetics, to analyze this type of data. Coalescent theory has been used to infer a sample's evolutionary path through the analysis of microsatellite data. The simulation results indicate that the constant population size pattern and the rapid tumor growth pattern have different genetic polymorphic patterns. The simulation results were compared with experimental data collected from HNPCC patients. The preliminary result shows that the mutation rate in 6 HNPCC patients ranges from 0.001 to 0.01. The patients' polymorphic patterns are similar to the constant population size pattern, which implies that tumor progression occurs through multilineage persistence rather than clonal sequential evolution. The results should be further verified using a larger dataset.
Abstract:
Coalescent theory represents the most significant progress in theoretical population genetics in the past three decades. The theory states that all genes or alleles in a given population are ultimately inherited from a single ancestor shared by all members of the population, known as the most recent common ancestor. It is now widely recognized as a cornerstone for rigorous statistical analyses of molecular data from populations [1]. Researchers have developed a large number of coalescent models and methods [2,3,4,5,6], which are applied not only in coalescent analysis but also in today’s population genetics and genome studies, and even in public health. This thesis aims to complete a computer-based statistical framework for coalescent analysis. The framework provides a large number of coalescent models and statistical methods to assist students and researchers in coalescent analysis, and presents results in various formats, including text, graphics, and printed pages. In particular, it also supports the creation of new coalescent models and statistical methods.
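The idea of tracing lineages back to a most recent common ancestor, described in the abstract above, can be sketched with the standard constant-size (Kingman) coalescent. This is a minimal illustration, not part of the thesis framework: the function names are hypothetical, and time is assumed to be measured in units of 2N generations for a diploid population of size N, so that each pair of lineages coalesces at rate 1.

```python
import random


def simulate_tmrca(n_samples, rng):
    """Draw one time to the most recent common ancestor (coalescent units)."""
    t = 0.0
    k = n_samples
    while k > 1:
        # With k lineages there are k*(k-1)/2 pairs that can coalesce.
        rate = k * (k - 1) / 2
        t += rng.expovariate(rate)
        k -= 1  # one coalescence merges two lineages into one
    return t


def expected_tmrca(n_samples):
    """Closed-form expectation under the same model: 2 * (1 - 1/n)."""
    return 2.0 * (1.0 - 1.0 / n_samples)


rng = random.Random(1)  # fixed seed so the run is repeatable
draws = [simulate_tmrca(10, rng) for _ in range(20000)]
print(sum(draws) / len(draws), expected_tmrca(10))
```

The simulated mean should land close to the closed-form value, which is a quick sanity check before adding mutation or changing population size to the model.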
Abstract:
Human pigmentation is a complex trait with the observed variation caused by the varied production of eumelanin (brown/black melanins) and phaeomelanin (red/yellow melanins) by the melanocytes. The melanocortin 1 receptor (MC1R), a G protein-coupled receptor expressed in the melanocytes, is a regulator of eu- and phaeomelanin synthesis, and MC1R mutations causing skin and coat color changes are known in many mammals. To understand the role of MC1R in human pigmentation variation, I have sequenced the MC1R gene in 121 individuals sampled from world populations. In addition, I have sequenced the MC1R gene in common and pygmy chimpanzees, gorilla, orangutan, and baboon to study the evolution of MC1R and to infer the ancestral human MC1R sequence. The ancestral MC1R sequence is observed in all 25 African individuals studied, but at lower frequencies in the other populations examined, especially in East and Southeast Asians. The Arg163Gln variant is absent in the Africans studied, almost absent in Europeans, and at a low frequency in Indians, but is at an exceptionally high frequency (70%) in East and Southeast Asians. To further evaluate the role of MC1R variants in human pigmentation variation, I have combined these molecular evolution and population studies with functional assays on MC1R variants and primate MC1Rs.
Abstract:
The interpretation of data on genetic variation with regard to the relative roles of different evolutionary factors that produce and maintain genetic variation depends critically on our assumptions concerning effective population size and the level of migration between neighboring populations. In humans, recent population growth and movements of specific ethnic groups across wide geographic areas mean that any theory based on assumptions of constant population size and absence of substructure is generally untenable. We examine the effects of population subdivision on the pattern of protein genetic variation in a total sample drawn from an artificial agglomerate of 12 tribal populations of Central and South America, analyzing the pooled sample as though it were a single population. Several striking findings emerge. (1) Mean heterozygosity is not sensitive to agglomeration, but the number of different alleles (allele count) is inflated, relative to neutral mutation/drift/equilibrium expectation. (2) The inflation is most serious for rare alleles, especially those which originally occurred as tribally restricted "private" polymorphisms. (3) The degree of inflation is an increasing function of both the number of populations encompassed by the sample and of the genetic divergence among them. (4) Treating an agglomerated population as though it were a panmictic unit of long standing can lead to serious biases in estimates of mutation rates, selection pressures, and effective population sizes. Current DNA studies indicate the presence of numerous genetic variants in human populations. The findings and conclusions of this paper are all fully applicable to the study of genetic variation at the DNA level as well.
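Findings (1) and (2) in the abstract above can be illustrated with a toy calculation. The sketch below is only an assumption-laden example, not the paper's analysis: the three "tribes", their allele frequencies, and the function names are all made up, and the populations are deliberately given identical common-allele frequencies so that only the effect of rare private alleles is visible.

```python
def expected_heterozygosity(freqs):
    """Expected heterozygosity, 1 - sum(p_i^2), from allele frequencies."""
    return 1.0 - sum(p * p for p in freqs.values())


def pool(populations):
    """Average allele frequencies over equally sized populations."""
    alleles = set().union(*populations)
    n = len(populations)
    return {a: sum(pop.get(a, 0.0) for pop in populations) / n for a in alleles}


# Hypothetical tribes: same common alleles, one private rare allele each.
tribes = [
    {"A": 0.60, "B": 0.39, "P1": 0.01},
    {"A": 0.60, "B": 0.39, "P2": 0.01},
    {"A": 0.60, "B": 0.39, "P3": 0.01},
]
pooled = pool(tribes)

for i, tribe in enumerate(tribes, start=1):
    print("tribe", i, len(tribe), "alleles, H =", expected_heterozygosity(tribe))
print("pooled", len(pooled), "alleles, H =", expected_heterozygosity(pooled))
```

In this toy data each tribe carries three alleles while the pooled sample carries five, yet expected heterozygosity barely moves, because rare alleles contribute almost nothing to the sum of squared frequencies.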
A pure population of lung alveolar epithelial type II cells derived from human embryonic stem cells.
Abstract:
Alveolar epithelial type II (ATII) cells are small, cuboidal cells that constitute approximately 60% of the pulmonary alveolar epithelium. These cells are crucial for repair of the injured alveolus by differentiating into alveolar epithelial type I cells. ATII cells derived from human ES (hES) cells are a promising source of cells that could be used therapeutically to treat distal lung diseases. We have developed a reliable transfection and culture procedure, which facilitates, via genetic selection, the differentiation of hES cells into an essentially pure (>99%) population of ATII cells (hES-ATII). Purity, as well as biological features and morphological characteristics of normal ATII cells, was demonstrated for the hES-ATII cells, including lamellar body formation, expression of surfactant proteins A, B, and C, alpha-1-antitrypsin, and the cystic fibrosis transmembrane conductance receptor, as well as the synthesis and secretion of complement proteins C3 and C5. Collectively, these data document the successful generation of a pure population of ATII cells derived from hES cells, providing a practical source of ATII cells to explore in disease models their potential in the regeneration and repair of the injured alveolus and in the therapeutic treatment of genetic diseases affecting the lung.
Abstract:
BACKGROUND: Meningomyelocele (MM) is a common human birth defect. MM is a disorder of neural development caused by contributions from genes and environmental factors that result in the NTD and lead to a spectrum of physical and neurocognitive phenotypes. METHODS: A multidisciplinary approach has been taken to develop a comprehensive understanding of MM through collaborative efforts from investigators specializing in genetics, development, brain imaging, and neurocognitive outcome. Patients have been recruited from five different sites: Houston and the Texas-Mexico border area; Toronto, Canada; Los Angeles, California; and Lexington, Kentucky. Genetic risk factors for MM have been assessed by genotyping and association testing using the transmission disequilibrium test. RESULTS: A total of 509 affected child/parent trios and 309 affected child/parent duos have been enrolled to date for genetic association studies. Subsets of the patients have also been enrolled for studies assessing development, brain imaging, and neurocognitive outcomes. The study recruited two major ethnic groups, with 45.9% Hispanics of Mexican descent and 36.2% North American Caucasians of European descent. The remaining patients are African-American, South and Central American, Native American, and Asian. Studies of this group of patients have already discovered distinct corpus callosum morphology and neurocognitive deficits that associate with MM. We have identified maternal MTHFR 667T allele as a risk factor for MM. In addition, we also found that several genes for glucose transport and metabolism are potential risk factors for MM. CONCLUSIONS: The enrolled patient population provides a valuable resource for elucidating the disease characteristics and mechanisms for MM development.
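For readers unfamiliar with the transmission disequilibrium test named in the abstract above, the following Python sketch shows its basic form for child/parent trios. It is a textbook version, not the study's pipeline: the function names and the transmission counts are hypothetical, and the handling of child/parent duos, which requires additional assumptions, is not covered.

```python
import math


def tdt_statistic(transmitted, untransmitted):
    """McNemar-type TDT statistic for one allele.

    transmitted:   times heterozygous parents passed the allele to the
                   affected child
    untransmitted: times heterozygous parents passed the other allele
    """
    total = transmitted + untransmitted
    if total == 0:
        raise ValueError("no informative (heterozygous) parents")
    return (transmitted - untransmitted) ** 2 / total


def chi_square_1df_pvalue(statistic):
    """Upper-tail probability of a chi-square variable with 1 degree of freedom."""
    return math.erfc(math.sqrt(statistic / 2.0))


# Made-up counts: 60 transmissions versus 40 non-transmissions.
stat = tdt_statistic(60, 40)
print(stat, chi_square_1df_pvalue(stat))
```

Only heterozygous parents are informative, which is why the test is robust to population stratification: each parent serves as its own matched control.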
Abstract:
CYP4F (Cytochrome P4504F) enzymes metabolize endogenous molecules including leukotrienes, prostaglandins and arachidonic acid. The involvement of these endogenous compounds in inflammation has led to the hypothesis that changes in the inflamed tissue environment may affect the expression of CYP4Fs during the pro-inflammatory state, which in turn may modulate inflammatory conditions during the anti-inflammatory state. We demonstrated that, in a number of human samples, inflamed tissues have CYP4F isoform expression profiles that differ from those of the average population. The CYP4F isoform expression levels change with the degree of inflammation present in tissue. Further investigation in cell culture studies revealed that inflammatory cytokines, in particular TNF-α, play a role in regulating the expression of the CYP4F family. One of the isoforms, CYP4F11, had characteristics different from those of the other five CYP4F family members. CYP4F11 metabolizes xenobiotics while the other isoforms metabolize endogenous compounds with higher affinity. CYP4F11 also was expressed at high quantities in the brain, and was up-regulated by TNF-α, while the other isoforms were not expressed at high quantities in the brain and were down-regulated by TNF-α. We identified the AP-1 protein of the JNK pathway as the signaling protein that causes a significant increase in CYP4F11 expression. Since TNF-α stimulation causes a simultaneous activation of both the JNK pathway and NF-κB signaling, we investigated further the role that NF-κB plays in expression of the CYP4F11 gene. We concluded that although there is a significant increase in CYP4F11 expression in the presence of TNF-α, the activation of NF-κB signaling inhibits CYP4F11 expression in a time-dependent manner. The expression of CYP4F11 is only significantly increased after 24 hours of treatment with TNF-α; at shorter time points NF-κB signaling overpowers the JNK pathway activation.
We believe that these findings may in the future lead to improved drug design for modulating inflammation.
Abstract:
DNA sequence variation is currently a major source of data for studying human origins, evolution, and demographic history, and for detecting linkage association of complex diseases. In this dissertation, I investigated DNA variation in worldwide populations from two ∼10 kb autosomal regions on 22q11.2 (noncoding) and 1q24 (introns). A total of 75 variant sites were found among 128 human sequences in the 22q11.2 region, yielding an estimate of 0.088% for nucleotide diversity (π), and a total of 52 variant sites were found among 122 human sequences in the 1q24 region with an estimated π value of 0.057%. The data from these two regions and a 10 kb noncoding region on Xq13.3 all show a strong excess of low-frequency variants in comparison to that expected from an equilibrium population, indicating a relatively recent population expansion. The effective population sizes estimated from the three regions were 11,000, 12,700, and 8,600, respectively, which are close to the commonly used value of 10,000. In each of the two autosomal regions, the age of the most recent common ancestor (MRCA) was estimated to be older than 1 million years among all the sequences and ∼600,000 years among non-African sequences, providing the first evidence from autosomal noncoding or intronic regions for a genetic history of humans much more ancient than the emergence of modern humans. The ancient genetic history of humans indicates no severe bottleneck during the evolution of humans in the last half million years; otherwise, much of the ancient genetic history would have been lost during a severe bottleneck. This study strongly suggests that both the “out of Africa” and the multiregional models are too simple for explaining the evolution of modern humans. A compilation of genome-wide data revealed that nucleotide diversity is highest in autosomal regions, intermediate in X-linked regions, and lowest in Y-linked regions.
The data suggest the existence of background selection or selective sweep on Y-linked loci. In general, the nucleotide diversity in humans is low compared to that in chimpanzee and Drosophila populations.
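The nucleotide diversity (π) reported in the abstract above is the average number of pairwise differences per site. The Python sketch below shows that calculation on a toy alignment, together with the standard neutral-equilibrium relation π ≈ 4·Ne·μ for autosomal loci. The function names, the sequences, and the mutation rate are assumptions for illustration; the dissertation's own mutation rates and estimators are not given here.

```python
from itertools import combinations


def nucleotide_diversity(sequences):
    """Average pairwise differences per site for aligned sequences."""
    length = len(sequences[0])
    if any(len(s) != length for s in sequences):
        raise ValueError("sequences must be aligned to equal length")
    pairs = list(combinations(sequences, 2))
    # Count mismatching positions for every pair of sequences.
    diffs = sum(sum(a != b for a, b in zip(s1, s2)) for s1, s2 in pairs)
    return diffs / (len(pairs) * length)


def effective_size_autosomal(pi, mu_per_site_per_generation):
    """Solve pi = 4 * Ne * mu for Ne (autosomal, neutral equilibrium)."""
    return pi / (4.0 * mu_per_site_per_generation)


# Toy alignment of three 4-bp sequences (not real data).
toy = ["AAAA", "AAAT", "AATT"]
pi = nucleotide_diversity(toy)
print(pi)

# Hypothetical mutation rate; the source does not state the value it used.
print(effective_size_autosomal(0.001, 2.5e-8))
```

Because the estimate of Ne scales inversely with the assumed mutation rate, any comparison across regions depends on using a rate appropriate to each region.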
Abstract:
Linkage disequilibrium (LD) is defined as the nonrandom association of alleles at two or more loci in a population and may be a useful tool in a diverse array of applications including disease gene mapping, elucidating the demographic history of populations, and testing hypotheses of human evolution. However, the successful application of LD-based approaches to pertinent genetic questions is hampered by a lack of understanding about the forces that mediate the genome-wide distribution of LD within and between human populations. Delineating the genomic patterns of LD is a complex task that will require interdisciplinary research that transcends traditional scientific boundaries. The research presented in this dissertation is predicated upon the need for interdisciplinary studies and both theoretical and experimental projects were pursued. In the theoretical studies, I have investigated the effect of genotyping errors and SNP identification strategies on estimates of LD. The primary importance of these two chapters is that they provide important insights and guidance for the design of future empirical LD studies. Furthermore, I analyzed the allele frequency distribution of 26,530 single nucleotide polymorphisms (SNPs) in three populations and generated the first-generation natural selection map of the human genome, which will be an important resource for explaining and understanding genomic patterns of LD. Finally, in the experimental study, I describe a novel and simple, low-cost, and high-throughput SNP genotyping method. The theoretical analyses and experimental tools developed in this dissertation will facilitate a more complete understanding of patterns of LD in human populations.
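The definition of LD in the abstract above corresponds to two widely used summary statistics for a pair of biallelic loci: the coefficient D and the squared correlation r². The Python sketch below computes both from phased haplotype counts. The function name and the counts are hypothetical, and the sketch assumes haplotype phase is known, which real genotype data often do not provide directly.

```python
def ld_statistics(n_AB, n_Ab, n_aB, n_ab):
    """Return (D, r_squared) from counts of the four two-locus haplotypes."""
    n = n_AB + n_Ab + n_aB + n_ab
    p_AB = n_AB / n
    p_A = (n_AB + n_Ab) / n  # frequency of allele A at the first locus
    p_B = (n_AB + n_aB) / n  # frequency of allele B at the second locus

    # D measures the departure of the haplotype frequency from independence.
    d = p_AB - p_A * p_B

    denom = p_A * (1 - p_A) * p_B * (1 - p_B)
    if denom == 0:
        raise ValueError("both loci must be polymorphic")
    return d, d * d / denom


# Made-up counts: complete association, then complete independence.
print(ld_statistics(50, 0, 0, 50))
print(ld_statistics(25, 25, 25, 25))
```

Genotyping errors of the kind studied in the dissertation would shift these haplotype counts, which is one way to see why error rates matter for LD estimates.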
Abstract:
The development of dentition is a fascinating process that involves a complex series of epithelial-mesenchymal signaling interactions. That such a precise process frequently goes awry is not surprising. Indeed, tooth agenesis is one of the most commonly inherited disorders in humans, affecting up to twenty percent of the population and imposing significant functional, emotional and financial burdens on patients. Mutations in the paired box domain containing transcription factor PAX9 result in autosomal dominant tooth agenesis that primarily involves posterior dentition. Despite these advances, little is known about how PAX9 mediates key signaling actions in tooth development and how aberrations in PAX9 functions lead to tooth agenesis. As an initial step towards providing evidence for the pathogenic role of mutant PAX9 proteins, I performed a series of molecular genetic analyses aimed at resolving the structural and functional defects produced by a number of PAX9 mutations causing non-syndromic posterior tooth agenesis. It is likely that the pathogenic mechanism underlying tooth agenesis for the first two mutations studied (219InsG and Ile87Phe) is haploinsufficiency. For the six paired domain missense mutations studied, the lack of functional defects observed for three of the mutant proteins suggests that these mutations altered PAX9 function through alternate mechanisms. Next, I explored further the nature of the partnership between Pax9 and the Msx1 homeoprotein and their role in the expression of a downstream effector molecule, Bmp4. When viewed in the context of events occurring in dental mesenchyme, the results of these studies indicate that the Pax9-Msx1 protein interaction involves the localized up-regulation of Bmp4 activity that is mediated by synergistic interactions between the two transcription factors.
Importantly, these assays corroborate in vivo data from mouse genetic studies and support reports of Pax9-dependent expression of Bmp4 in dental mesenchyme. Taken together, these results suggest that PAX9 mutations cause an early developmental defect due to an inability to maintain the inductive potential of dental mesenchyme through involvement in a pathway involving Msx1 and Bmp4.