21 results for Norovirus, Molecular evolution, Genetic variation, Immunocompromised host, Gastroenteritis outbreaks, Next generation sequencing

in DigitalCommons@The Texas Medical Center


Relevance:

100.00% 100.00%

Publisher:

Abstract:

Human pigmentation is a complex trait, with the observed variation caused by the varied production of eumelanin (brown/black melanins) and phaeomelanin (red/yellow melanins) by the melanocytes. The melanocortin 1 receptor (MC1R), a G protein-coupled receptor expressed in the melanocytes, is a regulator of eu- and phaeomelanin synthesis, and MC1R mutations causing skin and coat color changes are known in many mammals. To understand the role of MC1R in human pigmentation variation, I have sequenced the MC1R gene in 121 individuals sampled from world populations. In addition, I have sequenced the MC1R gene in common and pygmy chimpanzees, gorilla, orangutan, and baboon to study the evolution of MC1R and to infer the ancestral human MC1R sequence. The ancestral MC1R sequence is observed in all 25 African individuals studied, but at lower frequencies in the other populations examined, especially in East and Southeast Asians. The Arg163Gln variant is absent in the Africans studied, almost absent in Europeans, and at a low frequency in Indians, but is at an exceptionally high frequency (70%) in East and Southeast Asians. To further evaluate the role of MC1R variants in human pigmentation variation, I have combined these molecular evolution and population studies with functional assays on MC1R variants and primate MC1Rs.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

The interpretation of data on genetic variation with regard to the relative roles of different evolutionary factors that produce and maintain genetic variation depends critically on our assumptions concerning effective population size and the level of migration between neighboring populations. In humans, recent population growth and movements of specific ethnic groups across wide geographic areas mean that any theory based on assumptions of constant population size and absence of substructure is generally untenable. We examine the effects of population subdivision on the pattern of protein genetic variation in a total sample drawn from an artificial agglomerate of 12 tribal populations of Central and South America, analyzing the pooled sample as though it were a single population. Several striking findings emerge. (1) Mean heterozygosity is not sensitive to agglomeration, but the number of different alleles (allele count) is inflated, relative to neutral mutation/drift/equilibrium expectation. (2) The inflation is most serious for rare alleles, especially those which originally occurred as tribally restricted "private" polymorphisms. (3) The degree of inflation is an increasing function of both the number of populations encompassed by the sample and of the genetic divergence among them. (4) Treating an agglomerated population as though it were a panmictic unit of long standing can lead to serious biases in estimates of mutation rates, selection pressures, and effective population sizes. Current DNA studies indicate the presence of numerous genetic variants in human populations. The findings and conclusions of this paper are all fully applicable to the study of genetic variation at the DNA level as well.
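
The "neutral mutation/drift equilibrium expectation" for allele count in finding (1) can be illustrated with a minimal sketch. It assumes the textbook infinite-alleles results (equilibrium heterozygosity H = θ/(1+θ) and Ewens' expected number of alleles in a sample of n genes); the abstract does not state which formulas the authors used, and the function names and input values below are hypothetical.

```python
def theta_from_heterozygosity(h):
    """Infinite-alleles equilibrium: H = theta / (1 + theta), so theta = H / (1 - H)."""
    if not 0.0 <= h < 1.0:
        raise ValueError("heterozygosity must lie in [0, 1)")
    return h / (1.0 - h)


def expected_allele_count(theta, n_genes):
    """Ewens' expected number of distinct alleles in a sample of n_genes gene copies."""
    return sum(theta / (theta + i) for i in range(n_genes))


def allele_excess(observed_alleles, heterozygosity, n_genes):
    """Observed allele count minus the neutral expectation implied by heterozygosity.

    A clearly positive value is the kind of inflation the abstract describes
    for a sample pooled across subdivided populations.
    """
    theta = theta_from_heterozygosity(heterozygosity)
    return observed_alleles - expected_allele_count(theta, n_genes)


if __name__ == "__main__":
    # Hypothetical pooled sample: H = 0.5, 4 gene copies, 3 alleles observed.
    print(allele_excess(3, 0.5, 4))
```

Because heterozygosity is dominated by common alleles, θ estimated this way barely moves when populations are pooled, while the observed allele count picks up every tribally restricted rare allele; the excess therefore grows with the number of populations pooled.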

Relevance:

100.00% 100.00%

Publisher:

Abstract:

Individuals with Lynch syndrome are predisposed to cancer due to an inherited DNA mismatch repair gene mutation. However, there is significant variability observed in disease expression likely due to the influence of other environmental, lifestyle, or genetic factors. Polymorphisms in genes encoding xenobiotic-metabolizing enzymes may modify cancer risk by influencing the metabolism and clearance of potential carcinogens from the body. In this retrospective analysis, we examined key candidate gene polymorphisms in CYP1A1, EPHX1, GSTT1, GSTM1, and GSTP1 as modifiers of age at onset of colorectal cancer among 257 individuals with Lynch syndrome. We found that subjects heterozygous for CYP1A1 I462V (c.1384A>G) developed colorectal cancer 4 years earlier than those with the homozygous wild-type genotype (median ages, 39 and 43 years, respectively; log-rank test P = 0.018). Furthermore, being heterozygous for the CYP1A1 polymorphisms, I462V and Msp1 (g.6235T>C), was associated with an increased risk for developing colorectal cancer [adjusted hazard ratio for AG relative to AA, 1.78; 95% confidence interval, 1.16-2.74; P = 0.008; hazard ratio for TC relative to TT, 1.53; 95% confidence interval, 1.06-2.22; P = 0.02]. Because homozygous variants for both CYP1A1 polymorphisms were rare, risk estimates were imprecise. None of the other gene polymorphisms examined were associated with an earlier onset age for colorectal cancer. Our results suggest that the I462V and Msp1 polymorphisms in CYP1A1 may be an additional susceptibility factor for disease expression in Lynch syndrome because they modify the age of colorectal cancer onset by up to 4 years.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

The myocyte enhancer factor (MEF)-2 family of transcription factors has been implicated in the regulation of muscle transcription in vertebrates, but the precise position of these regulators within the genetic hierarchy leading to myogenesis is unclear. The MEF2 proteins bind to a conserved A/T-rich DNA sequence present in numerous muscle-specific genes, and they are expressed in the cells of the developing somites and in the embryonic heart at the onset of muscle formation in mammals. The MEF2 genes belong to the MADS box family of transcription factors, which control specific programs of gene expression in species ranging from yeast to humans. Each MEF2 family member contains two highly conserved protein motifs, the MADS domain and the MEF2-specific domain, which together provide the MEF2 factors with their unique DNA binding and dimerization properties. In an effort to further define the function of the MEF2 proteins, and to evaluate the degree of conservation shared among these factors and the phylogenetic pathways that they regulate, we sought to identify MEF2 family members in other species. In Drosophila, a homolog of the vertebrate MEF2 genes was identified and termed D-mef2. The D-MEF2 protein binds to the consensus MEF2 element and can activate transcription through tandem copies of that site. During Drosophila embryogenesis, D-MEF2 is specific to the mesoderm germ layer of the developing embryo and becomes expressed in all muscle cell types within the embryo. The role of D-mef2 in Drosophila embryogenesis was examined by generating a loss-of-function mutation in the D-mef2 gene. In embryos homozygous for this mutant allele, somatic, cardiac, and visceral muscles fail to differentiate, but precursors of these myogenic lineages are normally specified and positioned. 
These results demonstrate that different muscle cell types share a common myogenic differentiation program controlled by MEF2 and suggest that this program has been conserved from Drosophila to mammals.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

Primate immunodeficiency viruses, or lentiviruses (HIV-1, HIV-2, and SIV), and hepatitis delta virus (HDV) are RNA viruses characterized by rapid evolution. Infection by primate immunodeficiency viruses usually results in the development of acquired immunodeficiency syndrome (AIDS) in humans and AIDS-like illnesses in Asian macaques. Similarly, hepatitis delta virus infection causes hepatitis and liver cancer in humans. These viruses are heterogeneous within an infected patient and among individuals. Substitution rates in the virus genomes are high and vary in different lineages and among sites. Methods of phylogenetic analysis were applied to study the evolution of primate lentiviruses and the hepatitis delta virus. The following results have been obtained: (1) The substitution rate varies among sites of primate lentivirus genes according to the two parameter gamma distribution, with the shape parameter $\alpha$ being close to 1. (2) Primate immunodeficiency viruses fall into species-specific lineages. Therefore, viral transmissions across primate species are not as frequent as suggested by previous authors. (3) Primate lentiviruses have acquired or lost their pathogenicity several times in the course of evolution. (4) Evidence was provided for multiple infections of a North American patient by distinct HIV-1 strains of the B subtype. (5) Computer simulations indicate that the probability of committing an error in testing HIV transmission depends on the number of virus sequences and their length, the divergence times among sequences, and the model of nucleotide substitution. (6) For future investigations of HIV-1 transmissions, using longer virus sequences and avoiding the use of distant outgroups is recommended. (7) Hepatitis delta virus strains are usually related according to the geographic region of isolation. 
(8) Evolution of HDV is characterized by the rate of synonymous substitution being lower than the nonsynonymous substitution rate and the rate of evolution of the noncoding region. (9) There is a strong preference for G and C nucleotides at the third codon positions of the HDV coding region.
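
Result (1) above, gamma-distributed substitution rates with shape parameter α close to 1, can be made concrete with a small simulation. This is a sketch under stated assumptions, not the method used in the thesis: it assumes the usual convention that relative rates have mean 1 (scale = 1/α), and the function names and sample size are hypothetical.

```python
import random


def sample_site_rates(alpha, n_sites, seed=0):
    """Draw relative substitution rates for n_sites from a gamma distribution.

    Assumption: rates are scaled to mean 1, so scale = 1 / alpha.
    """
    rng = random.Random(seed)
    return [rng.gammavariate(alpha, 1.0 / alpha) for _ in range(n_sites)]


def coefficient_of_variation(values):
    """Standard deviation divided by the mean."""
    mean = sum(values) / len(values)
    variance = sum((v - mean) ** 2 for v in values) / len(values)
    return variance ** 0.5 / mean


if __name__ == "__main__":
    rates = sample_site_rates(alpha=1.0, n_sites=20000)
    # For a gamma distribution the theoretical CV is 1 / sqrt(alpha).
    print(sum(rates) / len(rates), coefficient_of_variation(rates))
```

With α = 1 the gamma distribution reduces to an exponential distribution, so many sites evolve slowly while a minority evolve several times faster than average; smaller α values would make the heterogeneity more extreme.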

Relevance:

100.00% 100.00%

Publisher:

Abstract:

(1) A mathematical theory for computing the probabilities of various nucleotide configurations is developed, and the probability of obtaining the correct phylogenetic tree (model tree) from sequence data is evaluated for six phylogenetic tree-making methods (UPGMA, distance Wagner method, transformed distance method, Fitch-Margoliash's method, maximum parsimony method, and compatibility method). The number of nucleotides (m*) necessary to obtain the correct tree with a probability of 95% is estimated with special reference to the human, chimpanzee, and gorilla divergence. m* is at least 4,200, but the availability of outgroup species greatly reduces m* for all methods except UPGMA. m* increases if transitions occur more frequently than transversions, as in the case of mitochondrial DNA. (2) A new tree-making method called the neighbor-joining method is proposed. This method is applicable to either distance data or character state data. Computer simulation has shown that the neighbor-joining method is generally better than UPGMA, Farris' method, Li's method, and the modified Farris method in recovering the true topology when distance data are used. A related method, the simultaneous partitioning method, is also discussed. (3) The maximum likelihood (ML) method for phylogeny reconstruction under the assumption of both constant and varying evolutionary rates is studied, and a new algorithm for obtaining the ML tree is presented. This method gives a tree similar to that obtained by UPGMA when a constant evolutionary rate is assumed, whereas it gives a tree similar to that obtained by the maximum parsimony method and the neighbor-joining method when a varying evolutionary rate is assumed.
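
For readers unfamiliar with the neighbor-joining method named in point (2), the following is a minimal sketch of the algorithm in its commonly published Q-criterion form for distance data. It is an assumption that this matches the formulation in the thesis, which the abstract does not spell out; the function name, data layout, and the five-taxon distance matrix are illustrative only.

```python
from itertools import combinations


def neighbor_joining(names, dist):
    """Return the sequence of joins as (node_a, node_b, branch_a, branch_b).

    names: list of taxon labels.
    dist:  dict mapping frozenset({x, y}) to the distance between x and y.
    Internal nodes are represented as tuples (node_a, node_b).
    """
    nodes = list(names)
    d = dict(dist)
    joins = []
    while len(nodes) > 2:
        n = len(nodes)
        # Sum of distances from each node to all other current nodes.
        total = {a: sum(d[frozenset((a, b))] for b in nodes if b != a) for a in nodes}

        def q(pair):
            a, b = pair
            return (n - 2) * d[frozenset((a, b))] - total[a] - total[b]

        # Join the pair that minimises the Q criterion.
        a, b = min(combinations(nodes, 2), key=q)
        d_ab = d[frozenset((a, b))]
        branch_a = 0.5 * d_ab + (total[a] - total[b]) / (2 * (n - 2))
        branch_b = d_ab - branch_a
        new = (a, b)
        # Distances from the new internal node to every remaining node.
        for c in nodes:
            if c != a and c != b:
                d[frozenset((new, c))] = 0.5 * (
                    d[frozenset((a, c))] + d[frozenset((b, c))] - d_ab
                )
        nodes = [c for c in nodes if c != a and c != b] + [new]
        joins.append((a, b, branch_a, branch_b))
    return joins


# Illustrative additive distance matrix for five hypothetical taxa.
_rows = {
    ("a", "b"): 5, ("a", "c"): 9, ("a", "d"): 9, ("a", "e"): 8,
    ("b", "c"): 10, ("b", "d"): 10, ("b", "e"): 9,
    ("c", "d"): 8, ("c", "e"): 7, ("d", "e"): 3,
}
EXAMPLE_DIST = {frozenset(k): float(v) for k, v in _rows.items()}

if __name__ == "__main__":
    for join in neighbor_joining(["a", "b", "c", "d", "e"], EXAMPLE_DIST):
        print(join)
```

Unlike UPGMA, the Q criterion corrects each pairwise distance by the average divergence of the two nodes from all others, which is why the method does not require a constant evolutionary rate.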

Relevance:

100.00% 100.00%

Publisher:

Abstract:

Epithelial-mesenchymal tissue interactions regulate the development of derivatives of the caudal pharyngeal arches (PAs) to govern the ultimate morphogenesis of the aortic arch and outflow tract (OFT) of the heart. Disruption of these signaling pathways is thought to contribute to the pathology of a significant proportion of congenital cardiovascular defects in humans. In this study, I tested whether Fibroblast Growth Factor 15 (Fgf15), a secreted signaling molecule expressed within the PAs, is an extracellular mediator of tissue interactions during PA and OFT development. Analyses of Fgf15−/− mouse embryonic hearts revealed abnormalities primarily localized to the OFT, correlating with aberrant cardiac neural crest cell behavior. The T-box-containing transcription factor Tbx1 has been implicated in the cardiovascular defects associated with the human 22q11 Deletion Syndromes, and regulates the expression of other Fgf family members within the mouse PAs. However, expression and genetic interaction studies incorporating mice deficient for Tbx1, its upstream regulator, Sonic Hedgehog (Shh), or its putative downstream effector, Fgf8, indicated that Fgf15 functions during OFT development in a manner independent of these factors. Rather, analyses of compound mutant mice indicated that Fgf15 and Fgf9, an additional Fgf family member expressed within the PAs, genetically interact, providing insight into the factors acting in conjunction with Fgf15 during OFT development. Finally, in an effort to further characterize this Fgf15-mediated developmental pathway, promoter deletion analyses were employed to isolate a 415bp sequence 7.1Kb 5′ to the Fgf15 transcription start site both necessary and sufficient to drive reporter gene expression within the epithelium of the PAs. Sequence comparisons among multiple mammalian species facilitated the identification of evolutionarily conserved potential trans-acting factor binding sites within this fragment. 
Subsequent studies will investigate the molecular pathway(s) through which Fgf15 functions via identification of factors that bind to this element to govern Fgf15 gene expression. Furthermore, targeted deletion of this element will establish the developmental requirement for pharyngeal epithelium-derived Fgf15 signaling function. Taken as a whole, these data demonstrate that Fgf15 is a component of a novel, Tbx1-independent molecular pathway, functioning within the PAs in a manner cooperative with Fgf9, required for proper development of the cardiac OFT.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

Apolipoprotein E (ApoE) plays a major role in the metabolism of high density and low density lipoproteins (HDL and LDL). Its common protein isoforms (E2, E3, E4) are risk factors for coronary artery disease (CAD) and explain between 16 and 23% of the inter-individual variation in plasma apoE levels. Linkage analysis has been completed for plasma apoE levels in the GENOA study (Genetic Epidemiology Network of Atherosclerosis). After stratification of the population by lipoprotein levels and body mass index (BMI) to create more homogeneity with regard to biological context for apoE levels, Hispanic families showed significant linkage on chromosome 17q for two strata (LOD=2.93 at 104 cM for a low cholesterol group, LOD=3.04 at 111 cM for a low cholesterol, high HDLC group). Replication of 17q linkage was observed for apoB and apoE levels in the unstratified Hispanic and African-American populations, and for apoE levels in African-American families. Replication of this 17q linkage in different populations and strata provides strong support for the presence of gene(s) in this region with significant roles in the determination of inter-individual variation in plasma apoE levels. Through a positional and functional candidate gene approach, ten genes were identified in the 17q linked region, and 62 polymorphisms in these genes were genotyped in the GENOA families. Association analysis was performed with FBAT, GEE, and variance-component based tests followed by conditional linkage analysis. Association studies with partial coverage of TagSNPs in the gene coding for apolipoprotein H (APOH) were performed, and significant results were found for 2 SNPs (APOH_20951 and APOH_05407) in the Hispanic low cholesterol strata, accounting for 3.49% of the inter-individual variation in plasma apoE levels. 
Among the other candidate genes, we identified a haplotype block in the ACE1 gene that contains two major haplotypes associated with apoE levels as well as total cholesterol, apoB and LDLC levels in the unstratified Hispanic population. Identifying genes responsible for the remaining 60% of inter-individual variation in plasma apoE levels will yield new insights into the genetic interactions involved in lipid metabolism and a more precise understanding of the risk factors leading to CAD.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

To identify more mutations that can affect the early development of Myxococcus xanthus, the synthetic transposon TnT41 was designed and constructed. By virtue of its special features, it can greatly facilitate the processes of mutation screening/selection, mapping, cloning and DNA sequencing. In addition, it allows for the systematic discovery of genes in regulatory hierarchies using their target promoters. In this study, the minimal regulatory region of the early developmentally regulated gene 4521 was used as a reporter in the TnT41 mutagenesis. Both positive (P) mutations and negative (N) mutations were isolated based on their effects on 4521 expression. Four of these mutations (N1, N2, P52 and P54) were analyzed in detail. Mutations N1 and N2 are insertion mutations in a gene designated sasB. The sasB gene was also identified in this study by genetic and molecular analysis of five UV-generated 4521 suppressor mutations. The sasB gene encodes a protein without meaningful homology in the databases. The sasB gene negatively regulates 4521 expression, possibly through the SasS-SasR two-component system. A wild-type sasB gene is required for normal M. xanthus fruiting body formation and sporulation. Cloning and sequencing analysis of the P52 mutation led to the identification of an operon that encodes the M. xanthus high-affinity branched-chain amino acid transporter system. This liv operon consists of five genes designated livK, livH, livM, livC, and livF, respectively. The Liv proteins are highly similar to their counterparts from other bacteria in amino acid sequence, functional motifs and predicted secondary structure. This system is required for development, since liv null mutations cause abnormal fruiting body formation and a 100-fold decrease in sporulation efficiency. Mutation P54 is a TnT41 insertion in the sscM gene of the ssc chemotaxis system, which has been independently identified by Dr. Shi's lab. 
The sscM gene encodes an MCP (methyl-accepting chemotaxis protein) homologue. The SscM protein is predicted to contain two transmembrane domains, a signaling domain and at least one putative methylation site. Null mutations of this gene abolish the aggregation of starving cells at a very early stage, though the sporulation levels of the mutant can reach 10% of that of wild-type cells.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

Next-generation DNA sequencing platforms can effectively detect the entire spectrum of genomic variation and are emerging as a major tool for systematic exploration of the universe of variants and interactions in the entire genome. However, the data produced by next-generation sequencing technologies suffer from three basic problems: sequence errors, assembly errors, and missing data. Current statistical methods for genetic analysis are well suited for detecting the association of common variants, but are less suitable for rare variants. This poses a great challenge for sequence-based genetic studies of complex diseases. This dissertation used the genome continuum model as a general principle, and stochastic calculus and functional data analysis as tools, to develop novel and powerful statistical methods for the next generation of association studies of both qualitative and quantitative traits in the context of sequencing data, shifting the paradigm of association analysis from the current locus-by-locus analysis to the collective analysis of genome regions. In this project, functional principal component (FPC) methods coupled with high-dimensional data reduction techniques were used to develop novel and powerful methods for testing the association of the entire spectrum of genetic variation within a segment of the genome or a gene, regardless of whether the variants are common or rare. Classical quantitative genetic methods suffer from high type I error rates and low power for rare variants. To overcome these limitations for resequencing data, this project used functional linear models with scalar response to develop statistics for identifying quantitative trait loci (QTLs) for both common and rare variants. To illustrate their application, the functional linear models were applied to five quantitative traits in the Framingham Heart Study. 
This project also proposed a novel concept of gene-gene co-association, in which a gene or a genomic region is taken as the unit of association analysis, and used stochastic calculus to develop a unified framework for testing the association of multiple genes or genomic regions for both common and rare alleles. The proposed methods were applied to gene-gene co-association analysis of psoriasis in two independent GWAS datasets, which led to the discovery of networks significantly associated with psoriasis.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

It is well accepted that tumorigenesis is a multi-step process involving aberrant functioning of genes regulating cell proliferation, differentiation, apoptosis, genome stability, angiogenesis and motility. To obtain a full understanding of tumorigenesis, it is necessary to collect information on all aspects of cell activity. Recent advances in high-throughput technologies allow biologists to generate massive amounts of data, more than might have been imagined decades ago. These advances have made it possible to launch comprehensive projects such as The Cancer Genome Atlas (TCGA) and the International Cancer Genome Consortium (ICGC), which systematically characterize the molecular fingerprints of cancer cells using gene expression, methylation, copy number, microRNA and SNP microarrays as well as next-generation sequencing assays interrogating somatic mutations, insertions, deletions, translocations and structural rearrangements. Given the massive amount of data, a major challenge is to integrate information from multiple sources and formulate testable hypotheses. This thesis focuses on developing methodologies for integrative analyses of genomic assays profiled on the same set of samples. We have developed several novel methods for integrative biomarker identification and cancer classification. We introduce a regression-based approach to identify biomarkers predictive of therapy response or survival by integrating multiple assays, including gene expression, methylation and copy number data, through penalized regression. To identify key cancer-specific genes accounting for multiple mechanisms of regulation, we have developed the integIRTy software, which provides robust and reliable inferences about gene alteration by automatically adjusting for sample heterogeneity as well as technical artifacts using Item Response Theory. 
To cope with the increasing need for accurate cancer diagnosis and individualized therapy, we have developed a robust and powerful algorithm called SIBER to systematically identify bimodally expressed genes using next-generation RNAseq data. We have shown that prediction models built from these bimodal genes have the same accuracy as models built from all genes. Further, prediction models with dichotomized gene expression measurements based on their bimodal shapes still perform well. The effectiveness of outcome prediction using discretized signals paves the way for more accurate and interpretable cancer classification by integrating signals from multiple sources.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

The genomic era brought about by recent advances in next-generation sequencing technology makes genome-wide scans of natural selection a reality. Currently, almost all statistical tests and analytical methods for identifying genes under selection are performed on an individual-gene basis. Although these methods have the power to identify genes subject to strong selection, they have limited power to discover genes targeted by moderate or weak selection forces, which are crucial for understanding the molecular mechanisms of complex phenotypes and diseases. The recent availability and rapid completion of many gene network and protein-protein interaction databases accompanying the genomic era open avenues for enhancing the power to discover genes under natural selection. The aim of the thesis is to explore and develop normal mixture model based methods for leveraging gene network information to enhance the power of natural selection target gene discovery. The results show that the developed statistical method, which combines the posterior log odds of the standard normal mixture model and the Guilt-By-Association score of the gene network in a naïve Bayes framework, has the power to discover genes under moderate or weak selection that bridge the genes under strong selection, and that it helps our understanding of the biology underlying complex diseases and related natural selection phenotypes.
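
To make the idea of combining the two evidence sources concrete, here is a minimal sketch of how log odds add under a naïve Bayes independence assumption. The abstract gives no formulas, so everything below is an assumption for illustration: the two-component mixture (null N(0, 1) versus a shifted "selected" component), the parameter values, the treatment of the Guilt-By-Association score as a log-likelihood ratio, and all function names are hypothetical.

```python
import math


def normal_logpdf(x, mu, sigma):
    """Log density of a normal distribution with mean mu and s.d. sigma."""
    return -0.5 * math.log(2 * math.pi) - math.log(sigma) - 0.5 * ((x - mu) / sigma) ** 2


def mixture_log_odds(z, pi_sel, mu_sel, sigma_sel):
    """Posterior log odds of 'selected' versus 'null' for a gene-level score z.

    Hypothetical model: null component N(0, 1), selected component
    N(mu_sel, sigma_sel), mixing proportion pi_sel for the selected component.
    """
    log_selected = math.log(pi_sel) + normal_logpdf(z, mu_sel, sigma_sel)
    log_null = math.log(1.0 - pi_sel) + normal_logpdf(z, 0.0, 1.0)
    return log_selected - log_null


def combined_log_odds(z, gba_log_lr, pi_sel=0.1, mu_sel=2.0, sigma_sel=1.0):
    """Naive Bayes combination: independent evidence adds on the log-odds scale.

    gba_log_lr is assumed to be a network-derived log-likelihood ratio.
    """
    return mixture_log_odds(z, pi_sel, mu_sel, sigma_sel) + gba_log_lr


def posterior_probability(log_odds):
    """Convert log odds to a probability."""
    return 1.0 / (1.0 + math.exp(-log_odds))


if __name__ == "__main__":
    # A gene with a modest score but strongly selected network neighbours.
    print(posterior_probability(combined_log_odds(z=1.5, gba_log_lr=2.0)))
```

The sketch shows why network information can rescue weak signals: a gene whose own score is unconvincing can still reach a high posterior probability if its neighbours contribute a large positive term.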

Relevance:

100.00% 100.00%

Publisher:

Abstract:

I have undertaken measurements of the genetic (or inherited) and nongenetic (or noninherited) components of the variability of metastasis formation and tumor diameter doubling time in more than 100 metastatic lines from each of three murine tumors (sarcoma SANH, sarcoma SA4020, and hepatocarcinoma HCA-I) syngeneic to C3Hf/Kam mice. These lines were isolated twice from lung metastases and analysed immediately thereafter to obtain the variance in spontaneous lung metastasis and tumor diameter doubling time. Additional studies utilized cells obtained from within 4 passages of isolation. Under the assumption that no genetic differences in metastasis formation or diameter doubling time existed among the cells of a given line, the variance within a line would estimate nongenetic variation. The variability derived from differences between lines would represent genetic origin. The estimates of the genetic contribution to the variation of metastasis and tumor diameter doubling time were significantly greater than zero, but only in the metastatic lines of tumor SANH was genetic variation the major source of metastatic variability (contributing 53% of the variability). In the tumor cell lines of SA4020 and HCA-I, however, the contribution of nongenetic factors predominated over genetic factors in the variability of the number of metastases and tumor diameter doubling time. A number of other parameters examined, such as DNA content, karyotype, and selection and variance analysis with passage in vivo, indicated that genetic differences existed within the cell lines and that these differences were probably created by genetic instability. The mean metastatic propensity of the lines may have increased somewhat during their isolation and isotransplantation, but the variance was only slightly affected, if at all. 
Analysis of the DNA profiles of the metastatic lines of SA4020 and HCA-I revealed differences between these lines and their primary parent tumors, but not among the SANH lines and their parent tumor. Furthermore, there was a direct correlation between the extent of genetic influence on metastasis formation and the ability of the tumor cells to develop resistance to cisplatinum. Thus, although nongenetic factors might predominate in contributing to metastasis formation, it is probably genetic variation and genetic instability that cause the progression of tumor cells to a more metastatic phenotype and lead to the emergence of drug resistance.
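
The partition described above (within-line variance as the nongenetic component, between-line variance as the genetic component) can be sketched as a balanced one-way random-effects analysis of variance. This is an assumption about the general approach, not the thesis's actual procedure, which the abstract does not detail; the function names and the toy data are hypothetical.

```python
def variance_components(lines):
    """Estimate (between_line, within_line) variance components.

    lines: list of equally sized lists, one list of measurements per metastatic line.
    Uses the balanced one-way random-effects ANOVA estimators.
    """
    k = len(lines)
    n = len(lines[0])
    if k < 2 or n < 2 or any(len(line) != n for line in lines):
        raise ValueError("need at least 2 lines, each with the same number (>= 2) of measurements")
    means = [sum(line) / n for line in lines]
    grand_mean = sum(means) / k
    # Mean square between lines and mean square within lines.
    ms_between = n * sum((m - grand_mean) ** 2 for m in means) / (k - 1)
    ms_within = sum((x - m) ** 2 for line, m in zip(lines, means) for x in line) / (k * (n - 1))
    # The between-line component cannot be negative; truncate at zero.
    between = max(0.0, (ms_between - ms_within) / n)
    return between, ms_within


def genetic_fraction(lines):
    """Share of total variance attributed to differences between lines."""
    between, within = variance_components(lines)
    total = between + within
    return between / total if total > 0 else 0.0


if __name__ == "__main__":
    # Toy data: two lines, two measurements each (e.g. metastasis counts).
    print(genetic_fraction([[1.0, 3.0], [5.0, 7.0]]))
```

A fraction above 0.5 would correspond to the SANH situation reported in the abstract, where genetic variation was the major source of metastatic variability; values below 0.5 correspond to SA4020 and HCA-I.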

Relevance:

100.00% 100.00%

Publisher:

Abstract:

Two molecular epidemiological studies were conducted to examine associations between genetic variation and risk of squamous cell carcinoma of the head and neck (SCCHN). In the first study, we hypothesized that genetic variation in p53 response elements (REs) may play roles in the etiology of SCCHN. We selected and genotyped five polymorphic p53 REs as well as the most frequently studied p53 codon 72 (Arg72Pro, rs1042522) polymorphism in 1,100 non-Hispanic White SCCHN patients and 1,122 age- and sex-matched cancer-free controls recruited at The University of Texas M. D. Anderson Cancer Center. In multivariate logistic regression analysis with adjustment for age, sex, smoking and drinking status, marital status and education level, we observed that the EOMES rs3806624 CC genotype had a significant protective effect against SCCHN risk (adjusted odds ratio = 0.79, 95% confidence interval = 0.64–0.98), compared with the -838TT+CT genotypes. Moreover, a significantly increased risk associated with the combined genotypes of p53 codon 72CC and EOMES -838TT+CT was observed, especially in the subgroup of non-oropharyngeal cancer patients. The values of false-positive report probability were also calculated for significant findings. In the second study, we assessed the association between SCCHN risk and four potential regulatory single nucleotide polymorphisms (SNPs) of the DEC1 (deleted in esophageal cancer 1) gene, a candidate tumor suppressor gene for esophageal cancer. After adjustment for age, sex, and smoking and drinking status, the variant -606CC (i.e., -249CC) homozygotes had a significantly reduced SCCHN risk (adjusted odds ratio = 0.71, 95% confidence interval = 0.52–0.99), compared with the -606TT homozygotes. 
Stratification analyses showed that the reduced risk associated with the -606CC genotype was more pronounced in subgroups of non-smokers, non-drinkers, younger subjects (defined as ≤ 57 years), carriers of the TP53 Arg/Arg (rs1042522) genotype, and patients with oropharyngeal cancer or late-stage SCCHN. Further in silico analysis revealed that the -249 T-to-C change led to a gain of a transcription factor binding site. Additional functional analysis showed that the -249 T-to-C change significantly enhanced transcriptional activity of the DEC1 promoter and the DNA-protein binding activity. We conclude that the DEC1 promoter -249 T>C (rs2012775) polymorphism is functional, modulating susceptibility to SCCHN among non-Hispanic Whites. Additional large-scale, preferably population-based studies are needed to validate our findings.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

Isolated clubfoot, a common birth defect occurring in more than 135,000 livebirths worldwide each year, is associated with significant health care and financial burdens. Clubfoot is defined by forefoot adduction, hindfoot varus, midfoot cavus and hindfoot equinus. Isolated clubfoot, which is the focus of these studies, is distinct from syndromic clubfoot because there are no other associated malformations. Population, family, twin and segregation analysis studies provide evidence that genetic and environmental factors play an etiologic role in isolated clubfoot. The studies described in this thesis were performed to define the role of genetic variation in isolated clubfoot. Interrogation of a deletion region associated with syndromic clubfoot suggested that CASP8 and CASP10, two apoptotic genes, play a role in isolated clubfoot. To explore the role of apoptotic genes in clubfoot, SNPs spanning genes involved in the apoptotic pathway in the six chromosomal deletion regions, and the limb patterning genes HOXD and HOXA, were interrogated. SNPs in mitochondrial-mediated apoptotic genes and several SNPs in HOXA and HOXD genes were modestly associated with clubfoot, with the most significant SNP, rs3801776, located in the basal promoter of HOXA9. Several significant associations were found with SNPs in NFAT2 and TNIP2. Significant gene interactions were detected between SNPs in HOX and apoptotic genes. These findings suggest a model for clubfoot in which variation in one gene is not sufficient to cause the malformation; instead, variation in several genes is required to perturb protein expression sufficiently to alter muscle and foot development. These results significantly impact our knowledge base by delineating underlying mechanisms causing clubfoot.