323 resultados para candidate gene prediction
Resumo:
Ankylosing spondylitis (AS), the prototypic seronegative arthropathy, is known to be highly heritable, with >90% of the risk of developing the disease determined genetically. As with most common heritable diseases, progress in identifying the genes involved using family-based or candidate gene approaches has been slow. The recent development of the genome-wide association study approach has revolutionized genetic studies of such diseases. Early studies in ankylosing spondylitis have produced two major breakthroughs in the identification of genes contributing roughly one third of the population attributable risk of the disease, and pointing directly to a potential therapy. These exciting findings highlight the potential of future more comprehensive genetic studies of determinants of disease risk and clinical manifestations, and are the biggest advance in our understanding of the causation of the disease since the discovery of the association with HLA-B27.
Resumo:
Candidate gene studies have reported CYP19A1 variants to be associated with endometrial cancer and with estradiol (E2) concentrations. We analyzed 2937 single nucleotide polymorphisms (SNPs) in 6608 endometrial cancer cases and 37 925 controls and report the first genome wide-significant association between endometrial cancer and a CYP19A1 SNP (rs727479 in intron 2, P=4.8x10(-11)). SNP rs727479 was also among those most strongly associated with circulating E2 concentrations in 2767 post-menopausal controls (P=7.4x10(-8)). The observed endometrial cancer odds ratio per rs727479 A-allele (1.15, CI=1.11-1.21) is compatible with that predicted by the observed effect on E2 concentrations (1.09, CI=1.03-1.21), consistent with the hypothesis that endometrial cancer risk is driven by E2. From 28 candidate-causal SNPs, 12 co-located with three putative gene-regulatory elements and their risk alleles associated with higher CYP19A1 expression in bioinformatical analyses. For both phenotypes, the associations with rs727479 were stronger among women with a higher BMI (Pinteraction=0.034 and 0.066 respectively), suggesting a biologically plausible gene-environment interaction.
Resumo:
Endometriosis is a complex disease that affects 6-10% of women in their reproductive years and 20-50% of women with infertility. Genome-wide and candidate-gene association studies for endometriosis have identified 10 independent risk loci, and of these, nine (rs7521902, rs13394619, rs4141819, rs6542095, rs1519761, rs7739264, rs12700667, rs1537377, and rs10859871) are polymorphic in European populations. Here we investigate the replication of nine SNP loci in 998 laparoscopically and histologically confirmed endometriosis cases and 783 disease-free controls from Belgium. SNPs rs7521902, rs13394619, and rs6542095 show nominally significant (p <.05) associations with endometriosis, while the directions of effect for seven SNPs are consistent with the original reports. Association of rs6542095 at the IL1A locus with 'All' (p =.066) and 'Grade-B' (p =.01) endometriosis is noteworthy because this is the first successful replication in an independent population. Meta-analysis with the published results yields genome-wide significant evidence for rs7521902, rs13394619, rs6542095, rs12700667, rs7739264, and rs1537377. Notably, three coding variants in GREB1 (near rs13394619) and CDKN2B-AS1 (near rs1537377) also showed nominally significant associations with endometriosis. Overall, this study provides important replication in a uniquely characterized independent population, and indicates that the majority of the original genome-wide association findings are not due to chance alone. © The Author(s) 2015.
Resumo:
The pathogenesis of androgenetic alopecia (AGA, male-pattern baldness) is driven by androgens, and genetic predisposition is the major prerequisite. Candidate gene and genome-wide association studies have reported that single-nucleotide polymorphisms (SNPs) at eight different genomic loci are associated with AGA development. However, a significant fraction of the overall heritable risk still awaits identification. Furthermore, the understanding of the pathophysiology of AGA is incomplete, and each newly associated locus may provide novel insights into contributing biological pathways. The aim of this study was to identify unknown AGA risk loci by replicating SNPs at the 12 genomic loci that showed suggestive association (5 x 10(-8)
genetic evidence supporting an involvement of WNT signaling in AGA development.
Resumo:
Association mapping seeks to identify marker alleles present at significantly different frequencies in cases carrying a particular disease or trait compared with controls. Genome-wide association studies are increasingly replacing candidate gene-based association studies for complex diseases, where a number of loci are likely to contribute to disease risk and the effect size of each particular risk allele is typically modest or low. Good study design is essential to the success of an association study, and factors such as the heritability of the disease under investigation, the choice of controls, statistical power, multiple testing and whether the association can be replicated need to be considered before beginning. Likewise, thorough quality control of the genotype data needs to be undertaken prior to running any association analyses. Finally, it should be kept in mind that a significant genetic association is not proof positive that a particular genetic locus causes a disease, but rather an important first step in discovering the genetic variants underlying a complex disease.
Resumo:
OBJECTIVE: The study of ethnically homogeneous populations may help to identify schizophrenia risk loci. The authors conducted a genomewide linkage scan for schizophrenia in an Indian population. METHOD: Participants were 441 individuals (262 affected probands and siblings) who were recruited primarily from one ethnically homogeneous group, the Tamil Brahmin caste, although individuals from other geographically proximal castes also participated. Genotyping of 124 affected sibling pair pedigrees was performed with 402 short tandem repeat polymorphisms. Linkage analyses were conducted using nonparametric exponential LOD (logarithm of the odds ratio for linkage) scores and parametric heterogeneity LOD scores. Parametric heterogeneity scores were calculated using simple dominant and recessive models, correcting for multiple statistics. The data were examined for evidence of consanguinity. Genomewide significance levels were determined using 10,000 gene dropping simulations. RESULTS: These findings revealed genomewide significant linkage to chromosome 1p31.1, through the use of both exponential and heterogeneity LOD scores, incorporating correction for multiple statistics and mild consanguinity. The estimated sibling recurrence risk associated with this putative locus was 1.95. Analysis for heterogeneity LOD scores also detected suggestive linkage to chromosomes 13q22.1 and 16q12.2. Using 117 tag single nucleotide polymorphisms (SNPs), family-based association analyses of phosphodiesterase 4B (PDE4B), the closest schizophrenia candidate gene, detected no convincing evidence of association, suggesting that the chromosome 1 peak represents a novel risk locus. CONCLUSIONS: This is the first study-to the authors' knowledge-to report significant linkage of schizophrenia to chromosome 1p31.1. Further investigation of this chromosome region in diverse populations is warranted to identify underlying sequence variants.
Resumo:
BACKGROUND: Genetic variation contributes to the risk of developing endometriosis. This review summarizes gene mapping studies in endometriosis and the prospects of finding gene pathways contributing to disease using the latest genome-wide strategies. METHODS: To identify candidate-gene association studies of endometriosis, a systematic literature search was conducted in PubMed of publications up to 1 April 2008, using the search terms 'endometriosis' plus 'allele' or 'polymorphism' or 'gene'. Papers included were those with information on both case and control selection, showed allelic and/or genotypic results for named germ-line polymorphisms and were published in the English language. RESULTS: Genetic variants in 76 genes have been examined for association, but none shows convincing evidence of replication in multiple studies. There is evidence for genetic linkage to chromosomes 7 and 10, but the genes (or variants) in these regions contributing to disease risk have yet to be identified. Genome-wide association is a powerful method that has been successful in locating genetic variants contributing to a range of common diseases. Several groups are planning these studies in endometriosis. For this to be successful, the endometriosis research community must work together to genotype sufficient cases, using clearly defined disease classifications, and conduct the necessary replication studies in several thousands of cases and controls. CONCLUSIONS: Genes with convincing evidence for association with endometriosis are likely to be identified in large genome-wide studies. This will provide a starting point for functional and biological studies to develop better diagnosis and treatment for this debilitating disease.
Resumo:
Introduction: Osteoporosis is the commonest metabolic bone disease worldwide. The clinical hallmark of osteoporosis is low trauma fracture, with the most devastating being hip fracture, resulting in significant effects on both morbidity and mortality. Sources of data: Data for this review have been gathered from the published literature and from a range of web resources. Areas of agreement: Genome-wide association studies in the field of osteoporosis have led to the identification of a number of loci associated with both bone mineral density and fracture risk and further increased our understanding of disease. Areas of controversy: The early strategies for mapping osteoporosis disease genes reported only isolated associations, with replication in independent cohorts proving difficult. Neither candidate gene or linkage studies showed association at genome-wide level of significance. Growing points: The advent of massive parallel sequencing technologies has proved extremely successful in mapping monogenic diseases and thus leading to the utilization of this new technology in complex disease genetics. Areas timely for developing research: The identification of novel genes and pathways will potentially lead to the identification of novel therapeutic options for patients with osteoporosis. © 2014 The Author.
Resumo:
Exponential growth of genomic data in the last two decades has made manual analyses impractical for all but trial studies. As genomic analyses have become more sophisticated, and move toward comparisons across large datasets, computational approaches have become essential. One of the most important biological questions is to understand the mechanisms underlying gene regulation. Genetic regulation is commonly investigated and modelled through the use of transcriptional regulatory network (TRN) structures. These model the regulatory interactions between two key components: transcription factors (TFs) and the target genes (TGs) they regulate. Transcriptional regulatory networks have proven to be invaluable scientific tools in Bioinformatics. When used in conjunction with comparative genomics, they have provided substantial insights into the evolution of regulatory interactions. Current approaches to regulatory network inference, however, omit two additional key entities: promoters and transcription factor binding sites (TFBSs). In this study, we attempted to explore the relationships among these regulatory components in bacteria. Our primary goal was to identify relationships that can assist in reducing the high false positive rates associated with transcription factor binding site predictions and thereupon enhance the reliability of the inferred transcription regulatory networks. In our preliminary exploration of relationships between the key regulatory components in Escherichia coli transcription, we discovered a number of potentially useful features. The combination of location score and sequence dissimilarity scores increased de novo binding site prediction accuracy by 13.6%. Another important observation made was with regards to the relationship between transcription factors grouped by their regulatory role and corresponding promoter strength. Our study of E.coli ��70 promoters, found support at the 0.1 significance level for our hypothesis | that weak promoters are preferentially associated with activator binding sites to enhance gene expression, whilst strong promoters have more repressor binding sites to repress or inhibit gene transcription. Although the observations were specific to �70, they nevertheless strongly encourage additional investigations when more experimentally confirmed data are available. In our preliminary exploration of relationships between the key regulatory components in E.coli transcription, we discovered a number of potentially useful features { some of which proved successful in reducing the number of false positives when applied to re-evaluate binding site predictions. Of chief interest was the relationship observed between promoter strength and TFs with respect to their regulatory role. Based on the common assumption, where promoter homology positively correlates with transcription rate, we hypothesised that weak promoters would have more transcription factors that enhance gene expression, whilst strong promoters would have more repressor binding sites. The t-tests assessed for E.coli �70 promoters returned a p-value of 0.072, which at 0.1 significance level suggested support for our (alternative) hypothesis; albeit this trend may only be present for promoters where corresponding TFBSs are either all repressors or all activators. Nevertheless, such suggestive results strongly encourage additional investigations when more experimentally confirmed data will become available. Much of the remainder of the thesis concerns a machine learning study of binding site prediction, using the SVM and kernel methods, principally the spectrum kernel. Spectrum kernels have been successfully applied in previous studies of protein classification [91, 92], as well as the related problem of promoter predictions [59], and we have here successfully applied the technique to refining TFBS predictions. The advantages provided by the SVM classifier were best seen in `moderately'-conserved transcription factor binding sites as represented by our E.coli CRP case study. Inclusion of additional position feature attributes further increased accuracy by 9.1% but more notable was the considerable decrease in false positive rate from 0.8 to 0.5 while retaining 0.9 sensitivity. Improved prediction of transcription factor binding sites is in turn extremely valuable in improving inference of regulatory relationships, a problem notoriously prone to false positive predictions. Here, the number of false regulatory interactions inferred using the conventional two-component model was substantially reduced when we integrated de novo transcription factor binding site predictions as an additional criterion for acceptance in a case study of inference in the Fur regulon. This initial work was extended to a comparative study of the iron regulatory system across 20 Yersinia strains. This work revealed interesting, strain-specific difierences, especially between pathogenic and non-pathogenic strains. Such difierences were made clear through interactive visualisations using the TRNDifi software developed as part of this work, and would have remained undetected using conventional methods. This approach led to the nomination of the Yfe iron-uptake system as a candidate for further wet-lab experimentation due to its potential active functionality in non-pathogens and its known participation in full virulence of the bubonic plague strain. Building on this work, we introduced novel structures we have labelled as `regulatory trees', inspired by the phylogenetic tree concept. Instead of using gene or protein sequence similarity, the regulatory trees were constructed based on the number of similar regulatory interactions. While the common phylogentic trees convey information regarding changes in gene repertoire, which we might regard being analogous to `hardware', the regulatory tree informs us of the changes in regulatory circuitry, in some respects analogous to `software'. In this context, we explored the `pan-regulatory network' for the Fur system, the entire set of regulatory interactions found for the Fur transcription factor across a group of genomes. In the pan-regulatory network, emphasis is placed on how the regulatory network for each target genome is inferred from multiple sources instead of a single source, as is the common approach. The benefit of using multiple reference networks, is a more comprehensive survey of the relationships, and increased confidence in the regulatory interactions predicted. In the present study, we distinguish between relationships found across the full set of genomes as the `core-regulatory-set', and interactions found only in a subset of genomes explored as the `sub-regulatory-set'. We found nine Fur target gene clusters present across the four genomes studied, this core set potentially identifying basic regulatory processes essential for survival. Species level difierences are seen at the sub-regulatory-set level; for example the known virulence factors, YbtA and PchR were found in Y.pestis and P.aerguinosa respectively, but were not present in both E.coli and B.subtilis. Such factors and the iron-uptake systems they regulate, are ideal candidates for wet-lab investigation to determine whether or not they are pathogenic specific. In this study, we employed a broad range of approaches to address our goals and assessed these methods using the Fur regulon as our initial case study. We identified a set of promising feature attributes; demonstrated their success in increasing transcription factor binding site prediction specificity while retaining sensitivity, and showed the importance of binding site predictions in enhancing the reliability of regulatory interaction inferences. Most importantly, these outcomes led to the introduction of a range of visualisations and techniques, which are applicable across the entire bacterial spectrum and can be utilised in studies beyond the understanding of transcriptional regulatory networks.
Resumo:
Previous studies have enabled exact prediction of probabilities of identity-by-descent (IBD) in randommating populations for a few loci (up to four or so), with extension to more using approximate regression methods. Here we present a precise predictor of multiple-locus IBD using simple formulas based on exact results for two loci. In particular, the probability of non-IBD X ABC at each of ordered loci A, B, and C can be well approximated by XABC = XABXBC/XB and generalizes to X123. . .k = X12X23. . .Xk-1,k/ Xk-2, where X is the probability of non-IBD at each locus. Predictions from this chain rule are very precise with population bottlenecks and migration, but are rather poorer in the presence of mutation. From these coefficients, the probabilities of multilocus IBD and non-IBD can also be computed for genomic regions as functions of population size, time, and map distances. An approximate but simple recurrence formula is also developed, which generally is less accurate than the chain rule but is more robust with mutation. Used together with the chain rule it leads to explicit equations for non-IBD in a region. The results can be applied to detection of quantitative trait loci (QTL) by computing the probability of IBD at candidate loci in terms of identity-by-state at neighboring markers.
Resumo:
A novel multiple regression method (RM) is developed to predict identity-by-descent probabilities at a locus L (IBDL), among individuals without pedigree, given information on surrounding markers and population history. These IBDL probabilities are a function of the increase in linkage disequilibrium (LD) generated by drift in a homogeneous population over generations. Three parameters are sufficient to describe population history: effective population size (Ne), number of generations since foundation (T), and marker allele frequencies among founders (p). IBD L are used in a simulation study to map a quantitative trait locus (QTL) via variance component estimation. RM is compared to a coalescent method (CM) in terms of power and robustness of QTL detection. Differences between RM and CM are small but significant. For example, RM is more powerful than CM in dioecious populations, but not in monoecious populations. Moreover, RM is more robust than CM when marker phases are unknown or when there is complete LD among founders or Ne is wrong, and less robust when p is wrong. CM utilises all marker haplotype information, whereas RM utilises information contained in each individual marker and all possible marker pairs but not in higher order interactions. RM consists of a family of models encompassing four different population structures, and two ways of using marker information, which contrasts with the single model that must cater for all possible evolutionary scenarios in CM.
Resumo:
Typical migraine is a complex neurological disorder comprised of two main subtypes: migraine with (MA) and without aura (MO). The disease etiology is still unclear, but family studies provide strong evidence that defective genes play an important role. Familial hemiplegic migraine (FHM) is a very rare and severe subtype of MA. It has been proposed that FHM and MA may have a similar genetic etiology. Therefore, genetic studies on FHM provide a useful model for investigating the more prevalent types of typical migraine. FHM in some families has been shown to be caused by mutations in a brain-specific P/Q-type calcium channel alpha1 subunit gene (CACNA1A) on chromosome 19p13. There has also been a report of a CACNA1A mutation being associated with MA in a patient from a family with predominant FHM. We have previously demonstrated suggestive linkage of typical migraine in a large Australian family to the FHM region on chromosome 19p13. These findings suggest that CACNA1A may also be implicated in the etiology of typical migraine in this pedigree. To investigate this possibility, we sequenced two patients carrying the critical susceptibility haplotype surrounding CACNA1A. No disease-causing mutations or polymorphisms were revealed in any of the 47 exons screened. To determine whether the CACNA1A gene was implicated in typical migraine susceptibility in the general Caucasian population, we also analyzed 82 independent pedigrees and a large case control group. We did not detect any linkage or association in these groups and conclude that if CACNA1A plays a role in typical migraine, it does not confer a major effect on the disease.
Resumo:
Alcohol consumption is a moderately heritable trait, but the genetic basis in humans is largely unknown, despite its clinical and societal importance. We report a genome-wide association study meta-analysis of approximately 2.5 million directly genotyped or imputed SNPs with alcohol consumption (gram per day per kilogram body weight) among 12 population-based samples of European ancestry, comprising 26,316 individuals, with replication genotyping in an additional 21,185 individuals. SNP rs6943555 in autism susceptibility candidate 2 gene (AUTS2) was associated with alcohol consumption at genome-wide significance (P = 4 x 10(-8) to P = 4 x 10(-9)). We found a genotype-specific expression of AUTS2 in 96 human prefrontal cortex samples (P = 0.026) and significant (P < 0.017) differences in expression of AUTS2 in whole-brain extracts of mice selected for differences in voluntary alcohol consumption. Down-regulation of an AUTS2 homolog caused reduced alcohol sensitivity in Drosophila (P < 0.001). Our finding of a regulator of alcohol consumption adds knowledge to our understanding of genetic mechanisms influencing alcohol drinking behavior.
Resumo:
This work is concerned with the genetic basis of normal human pigmentation variation. Specifically, the role of polymorphisms within the solute carrier family 45 member 2 (SLC45A2 or membrane associated transporter protein; MATP) gene were investigated with respect to variation in hair, skin and eye colour ― both between and within populations. SLC45A2 is an important regulator of melanin production and mutations in the gene underly the most recently identified form of oculocutaneous albinism. There is evidence to suggest that non-synonymous polymorphisms in SLC45A2 are associated with normal pigmentation variation between populations. Therefore, the underlying hypothesis of this thesis is that polymorphisms in SLC45A2 will alter the function or regulation of the protein, thereby altering the important role it plays in melanogenesis and providing a mechanism for normal pigmentation variation. In order to investigate the role that SLC45A2 polymorphisms play in human pigmentation variation, a DNA database was established which collected pigmentation phenotypic information and blood samples of more than 700 individuals. This database was used as the foundation for two association studies outlined in this thesis, the first of which involved genotyping two previously-described non-synonymous polymorphisms, p.Glu272Lys and p.Phe374Leu, in four different population groups. For both polymorphisms, allele frequencies were significantly different between population groups and the 272Lys and 374Leu alleles were strongly associated with black hair, brown eyes and olive skin colour in Caucasians. This was the first report to show that SLC45A2 polymorphisms were associated with normal human intra-population pigmentation variation. The second association study involved genotyping several SLC45A2 promoter polymorphisms to determine if they also played a role in pigmentation variation. Firstly, the transcription start site (TSS), and hence putative proximal promoter region, was identified using 5' RNA ligase mediated rapid amplification of cDNA ends (RLM-RACE). Two alternate TSSs were identified and the putative promoter region was screened for novel polymorphisms using denaturing high performance liquid chromatography (dHPLC). A novel duplication (c.–1176_–1174dupAAT) was identified along with other previously described single nucleotide polymorphisms (c.–1721C>G and c.–1169G>A). Strong linkage disequilibrium ensured that all three polymorphisms were associated with skin colour such that the –1721G, +dup and –1169A alleles were associated with olive skin in Caucasians. No linkage disequilibrium was observed between the promoter and coding region polymorphisms, suggesting independent effects. The association analyses were complemented with functional data, showing that the –1721G, +dup and –1169A alleles significantly decreased SLC45A2 transcriptional activity. Based on in silico bioinformatic analysis that showed these alleles remove a microphthalmia-associated transcription factor (MITF) binding site, and that MITF is a known regulator of SLC45A2 (Baxter and Pavan, 2002; Du and Fisher, 2002), it was postulated that SLC45A2 promoter polymorphisms could contribute to the regulation of pigmentation by altering MITF binding affinity. Further characterisation of the SLC45A2 promoter was carried out using luciferase reporter assays to determine the transcriptional activity of different regions of the promoter. Five constructs were designed of increasing length and their promoter activity evaluated. Constitutive promoter activity was observed within the first ~200 bp and promoter activity increased as the construct size increased. The functional impact of the –1721G, +dup and –1169A alleles, which removed a MITF consensus binding site, were assessed using electrophoretic mobility shift assays (EMSA) and expression analysis of genotyped melanoblast and melanocyte cell lines. EMSA results confirmed that the promoter polymorphisms affected DNA-protein binding. Interestingly, however, the protein/s involved were not MITF, or at least MITF was not the protein directly binding to the DNA. In an effort to more thoroughly characterise the functional consequences of SLC45A2 promoter polymorphisms, the mRNA expression levels of SLC45A2 and MITF were determined in melanocyte/melanoblast cell lines. Based on SLC45A2’s role in processing and trafficking TYRP1 from the trans-Golgi network to stage 2 melanosmes, the mRNA expression of TYRP1 was also investigated. Expression results suggested a coordinated expression of pigmentation genes. This thesis has substantially contributed to the field of pigmentation by showing that SLC45A2 polymorphisms not only show allele frequency differences between population groups, but also contribute to normal pigmentation variation within a Caucasian population. In addition, promoter polymorphisms have been shown to have functional consequences for SLC45A2 transcription and the expression of other pigmentation genes. Combined, the data presented in this work supports the notion that SLC45A2 is an important contributor to normal pigmentation variation and should be the target of further research to elucidate its role in determining pigmentation phenotypes. Understanding SLC45A2’s function may lead to the development of therapeutic interventions for oculocutaneous albinism and other disorders of pigmentation. It may also help in our understanding of skin cancer susceptibility and evolutionary adaptation to different UV environments, and contribute to the forensic application of pigmentation phenotype prediction.
Resumo:
BACKGROUND Endometriosis is a polygenic disease with a complex and multifactorial aetiology that affects 8-10% of women of reproductive age. Epidemiological data support a link between endometriosis and cancers of the reproductive tract. Fibroblast growth factor receptor 2 (FGFR2) has recently been implicated in both endometrial and breast cancer. Our previous studies on endometriosis identified significant linkage to a novel susceptibility locus on chromosome 10q26 and the FGFR2 gene maps within this linkage region. We therefore hypothesized that variation in FGFR2 may contribute to the risk of endometriosis. METHODS We genotyped 13 single nucleotide polymorphisms (SNPs) densely covering a 27 kb region within intron 2 of FGFR2 including two SNPs (rs2981582 and rs1219648) significantly associated with breast cancer and a total 40 tagSNPs across 150 kb of the FGFR2 gene. SNPs were genotyped in 958 endometriosis cases and 959 unrelated controls. RESULTS We found no evidence for association between endometriosis and FGFR2 intron 2 SNPs or SNP haplotypes and no evidence for association between endometriosis and variation across the FGFR2 gene. CONCLUSIONS Common variation in the breast-cancer implicated intron 2 and other highly plausible causative candidate regions of FGFR2 do not appear to be a major contributor to endometriosis susceptibility in our large Australian sample.