371 resultados para Genetic Analyses
Resumo:
The present study examined polymorphisms of genes that might be involved in the onset of essential hypertension (HT). These included the (i) growth hormone gene (GH1), whose locus has recently been linked to elevated blood pressure (BP) in the stroke-prone SHR, although recent sib-pair analysis of a polymorphism near the human chorionic somatomammotropin gene (a member of the GH cluster) was unable to show linkage with HT; (ii) renal kallikrein gene (KLK1); and (iii) atrial natriuretic factor gene (ANF), where a primary defect in production or activity of kallikrein or ANF could cause NaCl retention and vasoconstriction. Association analyses were conducted to compare restriction fragment length polymorphisms (RFLPs) of each gene in 85 HT and 95 normotensive (NT) Caucasian subjects whose parents had a similar BP status at age ≥50 years. The frequency of the minor allele of (i) a RsaI RFLP in the promoter of GH1, amplified from leukocyte DNA by the polymerase chain reaction, was 0.15 in the HT group and 0.14 in the NT group (χ1=0.34, P=0.55); (ii) a TaqI RFLP for KLK1 was 0.035 in the HT group and 0.015 in the NT group (χ2=1.5, P=0.21); and (iii) a XhoI RFLP for ANF was 0.50 in HTs and 0.46 in NTs (χ2=0.20, P=0.65). Studies of HT pedigrees found one family in which the ANF locus and HT were not linked, owing to an obligate recombinant. The present data thus provide no evidence for involvement of the growth hormone, renal kallikrein, nor ANF gene in the causation of essential hypertension.
Resumo:
INTRODUCTION Although the high heritability of BMD variation has long been established, few genes have been conclusively shown to affect the variation of BMD in the general population. Extreme truncate selection has been proposed as a more powerful alternative to unselected cohort designs in quantitative trait association studies. We sought to test these theoretical predictions in studies of the bone densitometry measures BMD, BMC, and femoral neck area, by investigating their association with members of the Wnt pathway, some of which have previously been shown to be associated with BMD in much larger cohorts, in a moderate-sized extreme truncate selected cohort (absolute value BMD Z-scores = 1.5-4.0; n = 344). MATERIALS AND METHODS Ninety-six tag-single nucleotide polymorphism (SNPs) lying in 13 Wnt signaling pathway genes were selected to tag common genetic variation (minor allele frequency [MAF] > 5% with an r(2) > 0.8) within 5 kb of all exons of 13 Wnt signaling pathway genes. The genes studied included LRP1, LRP5, LRP6, Wnt3a, Wnt7b, Wnt10b, SFRP1, SFRP2, DKK1, DKK2, FZD7, WISP3, and SOST. Three hundred forty-four cases with either high or low BMD were genotyped by Illumina Goldengate microarray SNP genotyping methods. Association was tested either by Cochrane-Armitage test for dichotomous variables or by linear regression for quantitative traits. RESULTS Strong association was shown with LRP5, polymorphisms of which have previously been shown to influence total hip BMD (minimum p = 0.0006). In addition, polymorphisms of the Wnt antagonist, SFRP1, were significantly associated with BMD and BMC (minimum p = 0.00042). Previously reported associations of LRP1, LRP6, and SOST with BMD were confirmed. Two other Wnt pathway genes, Wnt3a and DKK2, also showed nominal association with BMD. CONCLUSIONS This study shows that polymorphisms of multiple members of the Wnt pathway are associated with BMD variation. Furthermore, this study shows in a practical trial that study designs involving extreme truncate selection and moderate sample sizes can robustly identify genes of relevant effect sizes involved in BMD variation in the general population. This has implications for the design of future genome-wide studies of quantitative bone phenotypes relevant to osteoporosis.
Resumo:
Combining datasets across independent studies can boost statistical power by increasing the numbers of observations and can achieve more accurate estimates of effect sizes. This is especially important for genetic studies where a large number of observations are required to obtain sufficient power to detect and replicate genetic effects. There is a need to develop and evaluate methods for joint-analytical analyses of rich datasets collected in imaging genetics studies. The ENIGMA-DTI consortium is developing and evaluating approaches for obtaining pooled estimates of heritability through meta-and mega-genetic analytical approaches, to estimate the general additive genetic contributions to the intersubject variance in fractional anisotropy (FA) measured from diffusion tensor imaging (DTI). We used the ENIGMA-DTI data harmonization protocol for uniform processing of DTI data from multiple sites. We evaluated this protocol in five family-based cohorts providing data from a total of 2248 children and adults (ages: 9-85) collected with various imaging protocols. We used the imaging genetics analysis tool, SOLAR-Eclipse, to combine twin and family data from Dutch, Australian and Mexican-American cohorts into one large "mega-family". We showed that heritability estimates may vary from one cohort to another. We used two meta-analytical (the sample-size and standard-error weighted) approaches and a mega-genetic analysis to calculate heritability estimates across-population. We performed leave-one-out analysis of the joint estimates of heritability, removing a different cohort each time to understand the estimate variability. Overall, meta- and mega-genetic analyses of heritability produced robust estimates of heritability.
Resumo:
Latent class and genetic analyses were used to identify subgroups of migraine sufferers in a community sample of 6,265 Australian twins (55% female) aged 25-36 who had completed an interview based on International Headache Society (IHS) criteria. Consistent with prevalence rates from other population-based studies, 703 (20%) female and 250 (9%) male twins satisfied the IHS criteria for migraine without aura (MO), and of these, 432 (13%) female and 166 (6%) male twins satisfied the criteria for migraine with aura (MA) as indicated by visual symptoms. Latent class analysis (LCA) of IHS symptoms identified three major symptomatic classes, representing 1) a mild form of recurrent nonmigrainous headache, 2) a moderately severe form of migraine, typically without visual aura symptoms (although 40% of individuals in this class were positive for aura), and 3) a severe form of migraine typically with visual aura symptoms (although 24% of individuals were negative for aura). Using the LCA classification, many more individuals were considered affected to some degree than when using IHS criteria (35% vs. 13%). Furthermore, genetic model fitting indicated a greater genetic contribution to migraine using the LCA classification (heritability, h(2)=0.40; 95% CI, 0.29-0.46) compared with the IHS classification (h(2)=0.36; 95% CI, 0.22-0.42). Exploratory latent class modeling, fitting up to 10 classes, did not identify classes corresponding to either the IHS MO or MA classification. Our data indicate the existence of a continuum of severity, with MA more severe but not etiologically distinct from MO. In searching for predisposing genes, we should therefore expect to find some genes that may underlie all major recurrent headache subtypes, with modifying genetic or environmental factors that may lead to differential expression of the liability for migraine.
Resumo:
Polygenic profiling has been proposed for elite endurance performance, using an additive model determining the proportion of optimal alleles in endurance athletes. To investigate this model’s utility for elite triathletes, we genotyped seven polymorphisms previously associated with an endurance polygenic profile (ACE Ins/Del, ACTN3 Arg577Ter, AMPD1 Gln12Ter, CKMM 1170bp/985+185bp, HFE His63Asp, GDF8 Lys153Arg and PPARGC1A Gly482Ser) in a cohort of 196 elite athletes who participated in the 2008 Kona Ironman championship triathlon. Mean performance time (PT) was not significantly different in individual marker analysis. Age, sex, and continent of origin had a significant influence on PT and were adjusted for. Only the AMPD1 endurance-optimal Gln allele was found to be significantly associated with an improvement in PT (model p=5.79 x 10-17, AMPD1 genotype p=0.01). Individual genotypes were combined into a total genotype score (TGS); TGS distribution ranged from 28.6 to 92.9, concordant with prior studies in endurance athletes (mean±SD: 60.75±12.95). TGS distribution was shifted toward higher TGS in the top 10% of athletes, though the mean TGS was not significantly different (p=0.164) and not significantly associated with PT even when adjusted for age, sex, and origin. Receiver operating characteristic curve analysis determined that TGS alone could not significantly predict athlete finishing time with discriminating sensitivity and specificity for three outcomes (less than median PT, less than mean PT, or in the top 10%), though models with the age, sex, continent of origin, and either TGS or AMPD1 genotype could. These results suggest three things: that more sophisticated genetic models may be necessary to accurately predict athlete finishing time in endurance events; that non-genetic factors such as training are hugely influential and should be included in genetic analyses to prevent confounding; and that large collaborations may be necessary to obtain sufficient sample sizes for powerful and complex analyses of endurance performance.
Resumo:
Definition of disease phenotype is a necessary preliminary to research into genetic causes of a complex disease. Clinical diagnosis of migraine is currently based on diagnostic criteria developed by the International Headache Society. Previously, we examined the natural clustering of these diagnostic symptoms using latent class analysis (LCA) and found that a four-class model was preferred. However, the classes can be ordered such that all symptoms progressively intensify, suggesting that a single continuous variable representing disease severity may provide a better model. Here, we compare two models: item response theory and LCA, each constructed within a Bayesian context. A deviance information criterion is used to assess model fit. We phenotyped our population sample using these models, estimated heritability and conducted genome-wide linkage analysis using Merlin-qtl. LCA with four classes was again preferred. After transformation, phenotypic trait values derived from both models are highly correlated (correlation = 0.99) and consequently results from subsequent genetic analyses were similar. Heritability was estimated at 0.37, while multipoint linkage analysis produced genome-wide significant linkage to chromosome 7q31-q33 and suggestive linkage to chromosomes 1 and 2. We argue that such continuous measures are a powerful tool for identifying genes contributing to migraine susceptibility.
Resumo:
Starting from the study at the beginning of the East German "Heterosisfeldversuch", where PANICKE et al. (1975) considered the possibilities of a targeted use of inbreeding and heterotic effects, we show and discuss results of inbreeding studies in the USA dairy cattle breeding. Several research groups worldwide presented effective tools for managing inbreeding in dairy cattle. Their efforts underline the need of inbreeding studies. Contemplating inbreeding is necessary for any breeding decision to avoid inbreeding depression and for improved genetic analyses, e.g. in QTL- estimation. A novel methodology (HERNANDEZ-SANCHEZ et al., 2004a and b) is suggested for estimating inbreeding at the three levels of population, individual and locus.
Resumo:
Common variants in the hepatocyte nuclear factor 1 homeobox B (HNF1B) gene are associated with the risk of Type II diabetes and multiple cancers. Evidence to date indicates that cancer risk may be mediated via genetic or epigenetic effects on HNF1B gene expression. We previously found single-nucleotide polymorphisms (SNPs) at the HNF1B locus to be associated with endometrial cancer, and now report extensive fine-mapping and in silico and laboratory analyses of this locus. Analysis of 1184 genotyped and imputed SNPs in 6608 Caucasian cases and 37 925 controls, and 895 Asian cases and 1968 controls, revealed the best signal of association for SNP rs11263763 (P = 8.4 × 10−14, odds ratio = 0.86, 95% confidence interval = 0.82–0.89), located within HNF1B intron 1. Haplotype analysis and conditional analyses provide no evidence of further independent endometrial cancer risk variants at this locus. SNP rs11263763 genotype was associated with HNF1B mRNA expression but not with HNF1B methylation in endometrial tumor samples from The Cancer Genome Atlas. Genetic analyses prioritized rs11263763 and four other SNPs in high-to-moderate linkage disequilibrium as the most likely causal SNPs. Three of these SNPs map to the extended HNF1B promoter based on chromatin marks extending from the minimal promoter region. Reporter assays demonstrated that this extended region reduces activity in combination with the minimal HNF1B promoter, and that the minor alleles of rs11263763 or rs8064454 are associated with decreased HNF1B promoter activity. Our findings provide evidence for a single signal associated with endometrial cancer risk at the HNF1B locus, and that risk is likely mediated via altered HNF1B gene expression.
Resumo:
Context: Identifying susceptibility genes for schizophrenia may be complicated by phenotypic heterogeneity, with some evidence suggesting that phenotypic heterogeneity reflects genetic heterogeneity. Objective: To evaluate the heritability and conduct genetic linkage analyses of empirically derived, clinically homogeneous schizophrenia subtypes. Design: Latent class and linkage analysis. Setting: Taiwanese field research centers. Participants: The latent class analysis included 1236 Han Chinese individuals with DSM-IV schizophrenia. These individuals were members of a large affected-sibling-pair sample of schizophrenia (606 ascertained families), original linkage analyses of which detected a maximum logarithm of odds (LOD) of 1.8 (z = 2.88) on chromosome 10q22.3. Main Outcome Measures: Multipoint exponential LOD scores by latent class assignment and parametric heterogeneity LOD scores. Results: Latent class analyses identified 4 classes, with 2 demonstrating familial aggregation. The first (LC2) described a group with severe negative symptoms, disorganization, and pronounced functional impairment, resembling “deficit schizophrenia.” The second (LC3) described a group with minimal functional impairment, mild or absent negative symptoms, and low disorganization. Using the negative/deficit subtype, we detected genome-wide significant linkage to 1q23-25 (LOD = 3.78, empiric genome-wide P = .01). This region was not detected using the DSM-IV schizophrenia diagnosis, but has been strongly implicated in schizophrenia pathogenesis by previous linkage and association studies.Variants in the 1q region may specifically increase risk for a negative/deficit schizophrenia subtype. Alternatively, these results may reflect increased familiality/heritability of the negative class, the presence of multiple 1q schizophrenia risk genes, or a pleiotropic 1q risk locus or loci, with stronger genotype-phenotype correlation with negative/deficit symptoms. Using the second familial latent class, we identified nominally significant linkage to the original 10q peak region. Conclusion: Genetic analyses of heritable, homogeneous phenotypes may improve the power of linkage and association studies of schizophrenia and thus have relevance to the design and analysis of genome-wide association studies.
Resumo:
Anatomical brain networks change throughout life and with diseases. Genetic analysis of these networks may help identify processes giving rise to heritable brain disorders, but we do not yet know which network measures are promising for genetic analyses. Many factors affect the downstream results, such as the tractography algorithm used to define structural connectivity. We tested nine different tractography algorithms and four normalization methods to compute brain networks for 853 young healthy adults (twins and their siblings). We fitted genetic structural equation models to all nine network measures, after a normalization step to increase network consistency across tractography algorithms. Probabilistic tractography algorithms with global optimization (such as Probtrackx and Hough) yielded higher heritability statistics than 'greedy' algorithms (such as FACT) which process small neighborhoods at each step. Some global network measures (probtrackx-derived GLOB and ST) showed significant genetic effects, making them attractive targets for genome-wide association studies.
Resumo:
Several studies have demonstrated an association between polycystic ovary syndrome (PCOS) and the dinucleotide repeat microsatellite marker D19S884, which is located in intron 55 of the fibrillin-3 (FBN3) gene. Fibrillins, including FBN1 and 2, interact with latent transforming growth factor (TGF)-β-binding proteins (LTBP) and thereby control the bioactivity of TGFβs. TGFβs stimulate fibroblast replication and collagen production. The PCOS ovarian phenotype includes increased stromal collagen and expansion of the ovarian cortex, features feasibly influenced by abnormal fibrillin expression. To examine a possible role of fibrillins in PCOS, particularly FBN3, we undertook tagging and functional single nucleotide polymorphism (SNP) analysis (32 SNPs including 10 that generate non-synonymous amino acid changes) using DNA from 173 PCOS patients and 194 controls. No SNP showed a significant association with PCOS and alleles of most SNPs showed almost identical population frequencies between PCOS and control subjects. No significant differences were observed for microsatellite D19S884. In human PCO stroma/cortex (n = 4) and non-PCO ovarian stroma (n = 9), follicles (n = 3) and corpora lutea (n = 3) and in human ovarian cancer cell lines (KGN, SKOV-3, OVCAR-3, OVCAR-5), FBN1 mRNA levels were approximately 100 times greater than FBN2 and 200–1000-fold greater than FBN3. Expression of LTBP-1 mRNA was 3-fold greater than LTBP-2. We conclude that FBN3 appears to have little involvement in PCOS but cannot rule out that other markers in the region of chromosome 19p13.2 are associated with PCOS or that FBN3 expression occurs in other organs and that this may be influencing the PCOS phenotype.
Resumo:
The Enhancing NeuroImaging Genetics through Meta-Analysis (ENIGMA) Consortium is a collaborative network of researchers working together on a range of large-scale studies that integrate data from 70 institutions worldwide. Organized into Working Groups that tackle questions in neuroscience, genetics, and medicine, ENIGMA studies have analyzed neuroimaging data from over 12,826 subjects. In addition, data from 12,171 individuals were provided by the CHARGE consortium for replication of findings, in a total of 24,997 subjects. By meta-analyzing results from many sites, ENIGMA has detected factors that affect the brain that no individual site could detect on its own, and that require larger numbers of subjects than any individual neuroimaging study has currently collected. ENIGMA's first project was a genome-wide association study identifying common variants in the genome associated with hippocampal volume or intracranial volume. Continuing work is exploring genetic associations with subcortical volumes (ENIGMA2) and white matter microstructure (ENIGMA-DTI). Working groups also focus on understanding how schizophrenia, bipolar illness, major depression and attention deficit/hyperactivity disorder (ADHD) affect the brain. We review the current progress of the ENIGMA Consortium, along with challenges and unexpected discoveries made on the way.
Resumo:
This work is concerned with the genetic basis of normal human pigmentation variation. Specifically, the role of polymorphisms within the solute carrier family 45 member 2 (SLC45A2 or membrane associated transporter protein; MATP) gene were investigated with respect to variation in hair, skin and eye colour ― both between and within populations. SLC45A2 is an important regulator of melanin production and mutations in the gene underly the most recently identified form of oculocutaneous albinism. There is evidence to suggest that non-synonymous polymorphisms in SLC45A2 are associated with normal pigmentation variation between populations. Therefore, the underlying hypothesis of this thesis is that polymorphisms in SLC45A2 will alter the function or regulation of the protein, thereby altering the important role it plays in melanogenesis and providing a mechanism for normal pigmentation variation. In order to investigate the role that SLC45A2 polymorphisms play in human pigmentation variation, a DNA database was established which collected pigmentation phenotypic information and blood samples of more than 700 individuals. This database was used as the foundation for two association studies outlined in this thesis, the first of which involved genotyping two previously-described non-synonymous polymorphisms, p.Glu272Lys and p.Phe374Leu, in four different population groups. For both polymorphisms, allele frequencies were significantly different between population groups and the 272Lys and 374Leu alleles were strongly associated with black hair, brown eyes and olive skin colour in Caucasians. This was the first report to show that SLC45A2 polymorphisms were associated with normal human intra-population pigmentation variation. The second association study involved genotyping several SLC45A2 promoter polymorphisms to determine if they also played a role in pigmentation variation. Firstly, the transcription start site (TSS), and hence putative proximal promoter region, was identified using 5' RNA ligase mediated rapid amplification of cDNA ends (RLM-RACE). Two alternate TSSs were identified and the putative promoter region was screened for novel polymorphisms using denaturing high performance liquid chromatography (dHPLC). A novel duplication (c.–1176_–1174dupAAT) was identified along with other previously described single nucleotide polymorphisms (c.–1721C>G and c.–1169G>A). Strong linkage disequilibrium ensured that all three polymorphisms were associated with skin colour such that the –1721G, +dup and –1169A alleles were associated with olive skin in Caucasians. No linkage disequilibrium was observed between the promoter and coding region polymorphisms, suggesting independent effects. The association analyses were complemented with functional data, showing that the –1721G, +dup and –1169A alleles significantly decreased SLC45A2 transcriptional activity. Based on in silico bioinformatic analysis that showed these alleles remove a microphthalmia-associated transcription factor (MITF) binding site, and that MITF is a known regulator of SLC45A2 (Baxter and Pavan, 2002; Du and Fisher, 2002), it was postulated that SLC45A2 promoter polymorphisms could contribute to the regulation of pigmentation by altering MITF binding affinity. Further characterisation of the SLC45A2 promoter was carried out using luciferase reporter assays to determine the transcriptional activity of different regions of the promoter. Five constructs were designed of increasing length and their promoter activity evaluated. Constitutive promoter activity was observed within the first ~200 bp and promoter activity increased as the construct size increased. The functional impact of the –1721G, +dup and –1169A alleles, which removed a MITF consensus binding site, were assessed using electrophoretic mobility shift assays (EMSA) and expression analysis of genotyped melanoblast and melanocyte cell lines. EMSA results confirmed that the promoter polymorphisms affected DNA-protein binding. Interestingly, however, the protein/s involved were not MITF, or at least MITF was not the protein directly binding to the DNA. In an effort to more thoroughly characterise the functional consequences of SLC45A2 promoter polymorphisms, the mRNA expression levels of SLC45A2 and MITF were determined in melanocyte/melanoblast cell lines. Based on SLC45A2’s role in processing and trafficking TYRP1 from the trans-Golgi network to stage 2 melanosmes, the mRNA expression of TYRP1 was also investigated. Expression results suggested a coordinated expression of pigmentation genes. This thesis has substantially contributed to the field of pigmentation by showing that SLC45A2 polymorphisms not only show allele frequency differences between population groups, but also contribute to normal pigmentation variation within a Caucasian population. In addition, promoter polymorphisms have been shown to have functional consequences for SLC45A2 transcription and the expression of other pigmentation genes. Combined, the data presented in this work supports the notion that SLC45A2 is an important contributor to normal pigmentation variation and should be the target of further research to elucidate its role in determining pigmentation phenotypes. Understanding SLC45A2’s function may lead to the development of therapeutic interventions for oculocutaneous albinism and other disorders of pigmentation. It may also help in our understanding of skin cancer susceptibility and evolutionary adaptation to different UV environments, and contribute to the forensic application of pigmentation phenotype prediction.
Resumo:
Habitat fragmentation can have an impact on a wide variety of biological processes including abundance, life history strategies, mating system, inbreeding and genetic diversity levels of individual species. Although fragmented populations have received much attention, ecological and genetic responses of species to fragmentation have still not been fully resolved. The current study investigated the ecological factors that may influence the demographic and genetic structure of the giant white-tailed rat (Uromys caudimaculatus) within fragmented tropical rainforests. It is the first study to examine relationships between food resources, vegetation attributes and Uromys demography in a quantitative manner. Giant white-tailed rat densities were strongly correlated with specific suites of food resources rather than forest structure or other factors linked to fragmentation (i.e. fragment size). Several demographic parameters including the density of resident adults and juvenile recruitment showed similar patterns. Although data were limited, high quality food resources appear to initiate breeding in female Uromys. Where data were sufficient, influx of juveniles was significantly related to the density of high quality food resources that had fallen in the previous three months. Thus, availability of high quality food resources appear to be more important than either vegetation structure or fragment size in influencing giant white-tailed rat demography. These results support the suggestion that a species’ response to fragmentation can be related to their specific habitat requirements and can vary in response to local ecological conditions. In contrast to demographic data, genetic data revealed a significant negative effect of habitat fragmentation on genetic diversity and effective population size in U. caudimaculatus. All three fragments showed lower levels of allelic richness, number of private alleles and expected heterozygosity compared with the unfragmented continuous rainforest site. Populations at all sites were significantly differentiated, suggesting restricted among population gene flow. The combined effects of reduced genetic diversity, lower effective population size and restricted gene flow suggest that long-term viability of small fragmented populations may be at risk, unless effective management is employed in the future. A diverse range of genetic reproductive behaviours and sex-biased dispersal patterns were evident within U. caudimaculatus populations. Genetic paternity analyses revealed that the major mating system in U. caudimaculatus appeared to be polygyny at sites P1, P3 and C1. Evidence of genetic monogamy, however, was also found in the three fragmented sites, and was the dominant mating system in the remaining low density, small fragment (P2). High variability in reproductive skew and reproductive success was also found but was less pronounced when only resident Uromys were considered. Male body condition predicted which males sired offspring, however, neither body condition nor heterozygosity levels were accurate predictors of the number of offspring assigned to individual males or females. Genetic spatial autocorrelation analyses provided evidence for increased philopatry among females at site P1, but increased philopatry among males at site P3. This suggests that male-biased dispersal occurs at site P1 and female-biased dispersal at site P3, implying that in addition to mating systems, Uromys may also be able to adjust their dispersal behaviour to suit local ecological conditions. This study highlights the importance of examining the mechanisms that underlie population-level responses to habitat fragmentation using a combined ecological and genetic approach. The ecological data suggested that habitat quality (i.e. high quality food resources) rather than habitat quantity (i.e. fragment size) was relatively more important in influencing giant white-tailed rat demographics, at least for the populations studied here . Conversely, genetic data showed strong evidence that Uromys populations were affected adversely by habitat fragmentation and that management of isolated populations may be required for long-term viability of populations within isolated rainforest fragments.
Resumo:
Background Chlamydia pneumoniae is a widespread pathogen causing upper and lower respiratory tract infections in addition to a range of other diseases in humans and animals. Previous whole genome analyses have focused on four essentially clonal (> 99% identity) C. pneumoniae human genomes (AR39, CWL029, J138 and TW183), providing relatively little insight into strain diversity and evolution of this species. Results We performed individual gene-by-gene comparisons of the recently sequenced C. pneumoniae koala genome and four C. pneumoniae human genomes to identify species-specific genes, and more importantly, to gain an insight into the genetic diversity and evolution of the species. We selected genes dispersed throughout the chromosome, representing genes that were specific to C. pneumoniae, genes with a demonstrated role in chlamydial biology and/or pathogenicity (n = 49), genes encoding nucleotide salvage or amino acid biosynthesis proteins (n = 6), and extrachromosomal elements (9 plasmid and 2 bacteriophage genes). Conclusions We have identified strain-specific differences and targets for detection of C. pneumoniae isolates from both human and animal origin. Such characterisation is necessary for an improved understanding of disease transmission and intervention.