884 resultados para genome-wide association study
Resumo:
Background Leucocyte telomere length (LTL), which is fashioned by multiple genes, has been linked to a host of human diseases, including sporadic melanoma. A number of genes associated with LTL have already been identified through genome-wide association studies. The main aim of this study was to establish whether DCAF4 (DDB1 and CUL4-associated factor 4) is associated with LTL. In addition, using ingenuity pathway analysis (IPA), we examined whether LTL-associated genes in the general population might partially explain the inherently longer LTL in patients with sporadic melanoma, the risk for which is increased with ultraviolet radiation (UVR). Results Genome-wide association (GWA) meta-analysis and de novo genotyping of 20 022 individuals revealed a novel association (p=6.4×10−10) between LTL and rs2535913, which lies within DCAF4. Notably, eQTL analysis showed that rs2535913 is associated with decline in DCAF4 expressions in both lymphoblastoid cells and sun-exposed skin (p=4.1×10−3 and 2×10−3, respectively). Moreover, IPA revealed that LTL-associated genes, derived from GWA meta-analysis (N=9190), are over-represented among genes engaged in melanoma pathways. Meeting increasingly stringent p value thresholds (p<0.05, <0.01, <0.005, <0.001) in the LTL-GWA meta-analysis, these genes were jointly over-represented for melanoma at p values ranging from 1.97×10−169 to 3.42×10−24. Conclusions We uncovered a new locus associated with LTL in the general population. We also provided preliminary findings that suggest a link of LTL through genetic mechanisms with UVR and melanoma in the general population.
Resumo:
Migraine is a debilitating neurological disorder affecting around 1 in 7 people worldwide, but its molecular mechanisms remain poorly understood. Some debate exists over whether migraine is a disease of vascular dysfunction, or a result of neuronal dysfunction with secondary vascular changes. Genome-wide association (GWA) studies have thus far identified 13 independent loci associated with migraine. To identify new susceptibility loci, we performed the largest genetic study of migraine to date, comprising 59,674 cases and 316,078 controls from 22 GWA studies. We identified 45 independent single nucleotide polymorphisms (SNPs) significantly associated with migraine risk (P < 5 x 10-8) that map to 38 distinct genomic loci, including 28 loci not previously reported and the first locus identified on chromosome X. Furthermore, a subset analysis for migraine without aura (MO) identified seven of the same loci as from the full sample, whereas no loci reached genome-wide significance in the migraine with aura (MA) subset. In subsequent computational analyzes, the identified loci showed enrichment for genes expressed in vascular and smooth muscle tissues, consistent with a predominant theory of migraine that highlights vascular etiologies.
Resumo:
STUDY QUESTION: Do DNA variants in the growth regulation by estrogen in breast cancer 1 (GREB1) region regulate endometrial GREB1 expression and increase the risk of developing endometriosis in women? SUMMARY ANSWER: We identified new single nucleotide polymorphisms (SNPs) with strong association with endometriosis at the GREB1 locus although we did not detect altered GREB1 expression in endometriosis patients with defined genotypes. WHAT IS ALREADY KNOWN: Genome-wide association studies have identified the GREB1 region on chromosome 2p25.1 for increasing endometriosis risk. The differential expression of GREB1 has also been reported by others in association with endometriosis disease phenotype. STUDY DESIGN, SIZE, DURATION: Fine mapping studies comprehensively evaluated SNPs within the GREB1 region in a large-scale data set (>2500 cases and >4000 controls). Publicly available bioinformatics tools were employed to functionally annotate SNPs showing the strongest association signal with endometriosis risk. Endometrial GREB1 mRNA and protein expression was studied with respect to phases of the menstrual cycle (n = 2-45 per cycle stage) and expression quantitative trait loci (eQTL) analysis for significant SNPs were undertaken for GREB1 [mRNA (n = 94) and protein (n = 44) in endometrium]. PARTICIPANTS/MATERIALS, SETTING, METHODS: Participants in this study are females who provided blood and/or endometrial tissue samples in a hospital setting. The key SNPs were genotyped using Sequenom MassARRAY. The functional roles and regulatory annotations for identified SNPs are predicted by various publicly available bioinformatics tools. Endometrial GREB1 expression work employed qRT-PCR, western blotting and immunohistochemistry studies. MAIN RESULTS AND THE ROLE OF CHANCE: Fine mapping results identified a number of SNPs showing stronger association (0.004 < P < 0.032) with endometriosis risk than the original GWAS SNP (rs13394619) (P = 0.034). Some of these SNPs were predicted to have functional roles, for example, interaction with transcription factor motifs. The haplotype (a combination of alleles) formed by the risk alleles from two common SNPs showed significant association (P = 0.026) with endometriosis and epistasis analysis showed no evidence for interaction between the two SNPs, suggesting an additive effect of SNPs on endometriosis risk. In normal human endometrium, GREB1 protein expression was altered depending on the cycle stage (significantly different in late proliferative versus late secretory, P < 0.05) and cell type (glandular epithelium, not stromal cells). However, GREB1 expression in endometriosis cases versus controls and eQTL analyses did not reveal any significant changes. LIMITATIONS, REASONS FOR CAUTION: In silico prediction tools are generally based on cell lines different to our tissue and disease of interest. Functional annotations drawn from these analyses should be considered with this limitation in mind. We identified cell-specific and hormone-specific changes in GREB1 protein expression. The lack of a significant difference observed following our GREB1 expression studies may be the result of moderate power on mixed cell populations in the endometrial tissue samples. WIDER IMPLICATIONS OF THE FINDINGS: This study further implicates the GREB1 region on chromosome 2p25.1 and the GREB1 gene with involvement in endometriosis risk. More detailed functional studies are required to determine the role of the novel GREB1 transcripts in endometriosis pathophysiology. STUDY FUNDING/COMPETING INTERESTS: Funding for this work was provided by NHMRC Project Grants APP1012245, APP1026033, APP1049472 and APP1046880. There are no competing interests.
Inference of the genetic architecture underlying BMI and height with the use of 20,240 sibling pairs
Resumo:
Evidence that complex traits are highly polygenic has been presented by population-based genome-wide association studies (GWASs) through the identification of many significant variants, as well as by family-based de novo sequencing studies indicating that several traits have a large mutational target size. Here, using a third study design, we show results consistent with extreme polygenicity for body mass index (BMI) and height. On a sample of 20,240 siblings (from 9,570 nuclear families), we used a within-family method to obtain narrow-sense heritability estimates of 0.42 (SE = 0.17, p = 0.01) and 0.69 (SE = 0.14, p = 6 x 10(-)(7)) for BMI and height, respectively, after adjusting for covariates. The genomic inflation factors from locus-specific linkage analysis were 1.69 (SE = 0.21, p = 0.04) for BMI and 2.18 (SE = 0.21, p = 2 x 10(-10)) for height. This inflation is free of confounding and congruent with polygenicity, consistent with observations of ever-increasing genomic-inflation factors from GWASs with large sample sizes, implying that those signals are due to true genetic signals across the genome rather than population stratification. We also demonstrate that the distribution of the observed test statistics is consistent with both rare and common variants underlying a polygenic architecture and that previous reports of linkage signals in complex traits are probably a consequence of polygenic architecture rather than the segregation of variants with large effects. The convergent empirical evidence from GWASs, de novo studies, and within-family segregation implies that family-based sequencing studies for complex traits require very large sample sizes because the effects of causal variants are small on average.
Resumo:
Obesity increases the risk for several conditions, including type 2 diabetes mellitus, cardiovascular disease, hypertension, osteoarthirits and certain types of cancer. Twin- and family studies have shown that there is a major genetic component in the determination of body mass. In recent years several technological and scientific advance have been made in obesity research. For instance, novel replicated loci have been revealed by a number of genome wide association studies. This thesis aimed to investigate the association of genetic factors and obesity-related quantitative traits. The first study investigated the role of the lactase gene in anthropometric traits. We genetically defined lactose persistence by genotyping 31 720 individuals of European descent. We found that lactase persistence was significantly correlated with weight and body mass index but not with height. In the second study we performed the largest whole genome linkage scan for body mass index to date. The sample consisted of 4401 twin families and 10 535 individuals from six European countries. We found supporting evidence for two loci (3q29 and 7q36). We observed that the heritability estimate increased substantially when additional family members were removed from the analyses, which suggests reduced environmental variance in the twin sample. In the third study we assessed metabonomic, transcriptomic and genomic variation in a Finnish population cohort of 518 individuals. We formed gene expression networks to portray pathways and showed that a set of highly correlated genes of an inflammatory pathway associated with 80 serum metabolites (of 134 quantified measures). Strong association was found, for example, with several lipoprotein subclasses. We inferred causality by using genetic variation as anchors. The expression of the network genes was found to be dependent on the circulatory metabolite concentrations.
Resumo:
Colorectal cancer (CRC) is one of the most frequent malignancies in Western countries. Inherited factors have been suggested to be involved in 35% of CRCs. The hereditary CRC syndromes explain only ~6% of all CRCs, indicating that a large proportion of the inherited susceptibility is still unexplained. Much of the remaining genetic predisposition for CRC is probably due to undiscovered low-penetrance variations. This study was conducted to identify germline and somatic changes that contribute to CRC predisposition and tumorigenesis. MLH1 and MSH2, that underlie Hereditary non-polyposis colorectal cancer (HNPCC) are considered to be tumor suppressor genes; the first hit is inherited in the germline and somatic inactivation of the wild type allele is required for tumor initiation. In a recent study, frequent loss of the mutant allele in HNPCC tumors was detected and a new model, arguing against the two-hit hypothesis, was proposed for somatic HNPCC tumorigenesis. We tested this hypothesis by conducting LOH analysis on 25 colorectal HNPCC tumors with a known germline mutation in the MLH1 or MSH2 genes. LOH was detected in 56% of the tumors. All the losses targeted the wild type allele supporting the classical two-hit model for HNPCC tumorigenesis. The variants 3020insC, R702W and G908R in NOD2 predispose to Crohn s disease. Contribution of NOD2 to CRC predisposition has been examined in several case-control series, with conflicting results. We have previously shown that 3020insC does not predispose to CRC in Finnish CRC patients. To expand our previous study the variants R702W and G908R were genotyped in a population-based series of 1042 Finnish CRC patients and 508 healthy controls. Association analyses did not show significant evidence for association of the variants with CRC. Single nucleotide polymorphism (SNP) rs6983267 at chromosome 8q24 was the first CRC susceptibility variant identified through genome-wide association studies. To characterize the role of rs6983267 in CRC predisposition in the Finnish population, we genotyped the SNP in the case-control material of 1042 cases and 1012 controls and showed that G allele of rs6983267 is associated with the increased risk of CRC (OR 1.22; P=0.0018). Examination of allelic imbalance in the tumors heterozygous for rs6983267 revealed that copy number increase affected 22% of the tumors and interestingly, it favored the G allele. By utilizing a computer algorithm, Enhancer Element Locator (EEL), an evolutionary conserved regulatory motif containing rs6983267 was identified. The SNP affected the binding site of TCF4, a transcription factor that mediates Wnt signaling in cells, and has proven to be crucial in colorectal neoplasia. The preferential binding of TCF4 to the risk allele G was showed in vitro and in vivo. The element drove lacZ marker gene expression in mouse embryos in a pattern that is consistent with genes regulated by the Wnt signaling pathway. These results suggest that rs6983267 at 8q24 exerts its effect in CRC predisposition by regulating gene expression. The most obvious target gene for the enhancer element is MYC, residing ~335 kb downstream, however further studies are required to establish the transcriptional target(s) of the predicted enhancer element.
Resumo:
Schizophrenia is a severe psychotic disorder affecting 0.5-1 % of the population. The disorder is characterized by hallucinations; delusions; disorganized behavior and speech; avolition; anhedonia; flattened affect and cognitive deficits. The etiology of the disorder is complex with evidence for multiple genes contributing to the onset of the disorder along with environmental factors. DISC1 is one of the most promising candidate genes for schizophrenia. It codes for a protein which takes part in numerous molecular interactions along several pathways. This network, termed as the DISC1 pathway, is evidently important for the development and maturation of the central nervous system from the embryo until young adulthood. Disruption at these pathways is thought to predispose schizophrenia. In the present study, we have studied the DISC1 pathway in the etiology of schizophrenia in the Finnish population. We have utilized large Finnish samples; the schizophrenia family sample where DISC1 was originally shown to associate with schizophrenia and the Northern Finland birth cohort 1966 (NFBC66). Several DISC1 binding partners displayed evidence for association in the family sample along with DISC1. Through a genome-wide linkage study, we found a significant linkage signal to a locus where a DISC1 binding partner NDE1 is located at the carriers of a certain DISC1 risk variant. In a follow-up study, genetic markers in NDE1 displayed significant evidence for association with schizophrenia. Further exploration of association between 11 genes of the DISC1 pathway and schizophrenia led to recognition of novel variants in NDEL1, PDE4B and PDE4D that significantly either increased or decreased the risk for schizophrenia. Further, we found evidence that DISC1 itself has a significant role in the human mental functioning even in the healthy population. Variants in DISC1 had a significant effect on anhedonia which is a trait present at everybody but is in its severe form one of the main symptoms of schizophrenia and correlates with the risk of developing the disorder. Further, utilizing genome-wide marker data, we recognized three genes; MIR620; CCDC141 and LCT; that are closely related to the DISC1 pathway but which effects on anhedonia were observable only at the individuals who carried these specific DISC1 variants. Our findings significantly add up to the previous evidence for the involvement of DISC1 and the DISC1 pathway in the etiology of schizophrenia and psychosis. Our results support the concept of a number of DISC1 pathway related genes contributing in the etiology of schizophrenia along with DISC1 and provide new candidates for the studies of schizophrenia. Our findings also significantly increase the importance of DISC1 itself as having a role in psychological functioning in the general population.
Resumo:
The ratio of the lengths of an individual's second to fourth digit (2D:4D) is commonly used as a noninvasive retrospective biomarker for prenatal androgen exposure. In order to identify the genetic determinants of 2D:4D, we applied a genome-wide association approach to 1507 11-year-old children from the Avon Longitudinal Study of Parents and Children (ALSPAC) in whom 2D:4D ratio had been measured, as well as a sample of 1382 12- to 16-year-olds from the Brisbane Adolescent Twin Study. A meta-analysis of the two scans identified a single variant in the LIN28B gene that was strongly associated with 2D:4D (rs314277: p = 4.1 x 10(-8)) and was subsequently independently replicated in an additional 3659 children from the ALSPAC cohort (p = 1.53 x 10(-6)). The minor allele of the rs314277 variant has previously been linked to increased height and delayed age at menarche, but in our study it was associated with increased 2D:4D in the direction opposite to that of previous reports on the correlation between 2D:4D and age at menarche. Our findings call into question the validity of 2D:4D as a simplistic retrospective biomarker for prenatal testosterone exposure.
Resumo:
OBJECTIVE: To further investigate a common variant (rs9939609) in the fat mass- and obesity-associated gene (FTO), which recent genome-wide association studies have shown to be associated with body mass index (BMI) and obesity. DESIGN: We examined the effect of this FTO variant on BMI in 3353 Australian adult male and female twins. RESULTS: The minor A allele of rs9939609 was associated with an increased BMI (P=0.0007). Each additional copy of the A allele was associated with a mean BMI increase of approximately 1.04 kg/m(2) (approximately 3.71 kg). Using variance components decomposition, we estimate that this single-nucleotide polymorphism accounts for approximately 3% of the genetic variance in BMI in our sample (approximately 2% of the total variance). By comparing intrapair variances of monozygotic twins of different genotypes we were able to perform a direct test of gene by environment (G x E) interaction in both sexes and gene by parity (G x P) interaction in women, but no evidence was found for either. CONCLUSIONS: In addition to supporting earlier findings that the rs9939609 variant in the FTO gene is associated with an increased BMI, our results indicate that the associated genetic effect does not interact with environment or parity.
Resumo:
Positional cloning has enabled hypothesis-free, genome-wide scans for genetic factors contributing to disorders or traits. Traditionally linkage analysis has been used to identify regions of interest, followed by meticulous fine mapping and candidate gene screening using association methods and finally sequencing of regions of interest. More recently, genome-wide association analysis has enabled a more direct approach to identify specific genetic variants explaining a part of the variance of the phenotype of interest. Autism spectrum disorders (ASDs) are a group of childhood onset neuropsychiatric disorders with shared core symptoms but varying severity. Although a strong genetic component has been established in ASDs, genetic susceptibility factors have largely eluded characterization. Here, we have utilized modern molecular genetic methods combined with the advantages provided by the special population structure in Finland to identify genetic risk factors for ASDs. The results of this study show that numerous genetic risk factors exist for ASDs even within a population isolate. Stratification based on clinical phenotype resulted in encouraging results, as previously identified linkage to 3p14-p24 was replicated in an independent family set of families with Asperger syndrome, but no other ASDs. Fine-mapping of the previously identified linkage peak for ASDs at 3q25-q27 revealed association between autism and a subunit of the 5-hydroxytryptamine receptor 3C (HTR3C). We also used dense, genome-wide single nucleotide polymorphism (SNP) data to characterize the population structure of Finns. We observed significant population substructure which correlates with the known history of multiple consecutive bottle-necks experienced by the Finnish population. We used this information to ascertain a genetically homogenous subset of autism families to identify possible rare, enriched risk variants using genome-wide SNP data. No rare enriched genetic risk factors were identified in this dataset, although a subset of families could be genealogically linked to form two extended pedigrees. The lack of founder mutations in this isolated population suggests that the majority of genetic risk factors are rare, de novo mutations unique to individual nuclear families. The results of this study are consistent with others in the field. The underlying genetic architecture for this group of disorders appears highly heterogeneous, with common variants accounting for only a subset of genetic risk. The majority of identified risk factors have turned out to be exceedingly rare, and only explain a subset of the genetic risk in the general population in spite of their high penetrance within individual families. The results of this study, together with other results obtained in this field, indicate that family specific linkage, homozygosity mapping and resequencing efforts are needed to identify these rare genetic risk factors.
Resumo:
Latent class analysis was performed on migraine symptom data collected in a Dutch population sample (N = 12,210, 59% female) in order to obtain empirical groupings of individuals suffering from symptoms of migraine headache. Based on these heritable groupings (h(2) = 0.49, 95% CI: 0.41-0.57) individuals were classified as affected (migrainous headache) or unaffected. Genome-wide linkage analysis was performed using genotype data from 105 families with at least 2 affected siblings. In addition to this primary phenotype, linkage analyses were performed for the individual migraine symptoms. Significance levels, corrected for the analysis of multiple traits, were determined empirically via a novel simulation approach. Suggestive linkage for migrainous headache was found on chromosomes 1 (LOD = 1.63; pointwise P = 0.0031), 13 (LOD = 1.63; P = 0.0031), and 20 (LOD = 1.85; P = 0.0018). Interestingly, the chromosome 1 peak was located close to the ATP1A2 gene, associated with familial hemiplegic migraine type 2 (FHM2). Individual symptom analysis produced a LOD score of 1.97 (P = 0.0013) on chromosome 5 (photo/phonophobia), a LOD score of 2.13 (P = 0.0009) on chromosome 10 (moderate/severe pain intensity) and a near significant LOD score of 3.31 (P = 0.00005) on chromosome 13 (pulsating headache). These peaks were all located near regions previously reported in migraine linkage studies. Our results provide important replication and support for the presence of migraine susceptibility genes within these regions, and further support the utility of an LCA-based phenotyping approach and analysis of individual symptoms in migraine genetic research. Additionally, our novel "2-step" analysis and simulation approach provides a powerful means to investigate linkage to individual trait components.
Resumo:
Numerous studies have reported association between variants in the dystrobrevin binding protein 1 (dysbindin) gene (DTNBP1) and schizophrenia. However, the pattern of results is complex and to date, no specific risk marker or haplotype has been consistently identified. The number of single nucleotide polymorphisms (SNPs) tested in these studies has ranged from 5 to 20. We attempted to replicate previous findings by testing 16 SNPs in samples of 41 Australian pedigrees, 194 Australian cases and 180 controls, and 197 Indian pedigrees. No globally significant evidence for association was observed in any sample, despite power calculations indicating sufficient power to replicate several previous findings. Possible explanations for our results include sample differences in background linkage disequilibrium and/or risk allele effect size, the presence of multiple risk alleles upon different haplotypes, or the presence of a single risk allele upon multiple haplotypes. Some previous associations may also represent false positives. Examination of Caucasian HapMap phase II genotype data spanning the DTNBP1 region indicates upwards of 40 SNPs are required to satisfactorily assess all nonredundant variation within DTNBP1 and its potential regulatory regions for association with schizophrenia. More comprehensive studies in multiple samples will be required to determine whether specific DTNBP1 variants function as risk factors for schizophrenia.
Resumo:
Background Next-generation sequencing technology is an important tool for the rapid, genome-wide identification of genetic variations. However, it is difficult to resolve the ‘signal’ of variations of interest and the ‘noise’ of stochastic sequencing and bioinformatic errors in the large datasets that are generated. We report a simple approach to identify regional linkage to a trait that requires only two pools of DNA to be sequenced from progeny of a defined genetic cross (i.e. bulk segregant analysis) at low coverage (<10×) and without parentage assignment of individual SNPs. The analysis relies on regional averaging of pooled SNP frequencies to rapidly scan polymorphisms across the genome for differential regional homozygosity, which is then displayed graphically. Results Progeny from defined genetic crosses of Tribolium castaneum (F4 and F19) segregating for the phosphine resistance trait were exposed to phosphine to select for the resistance trait while the remainders were left unexposed. Next generation sequencing was then carried out on the genomic DNA from each pool of selected and unselected insects from each generation. The reads were mapped against the annotated T. castaneum genome from NCBI (v3.0) and analysed for SNP variations. Since it is difficult to accurately call individual SNP frequencies when the depth of sequence coverage is low, variant frequencies were averaged across larger regions. Results from regional SNP frequency averaging identified two loci, tc_rph1 on chromosome 8 and tc_rph2 on chromosome 9, which together are responsible for high level resistance. Identification of the two loci was possible with only 5-7× average coverage of the genome per dataset. These loci were subsequently confirmed by direct SNP marker analysis and fine-scale mapping. Individually, homozygosity of tc_rph1 or tc_rph2 results in only weak resistance to phosphine (estimated at up to 1.5-2.5× and 3-5× respectively), whereas in combination they interact synergistically to provide a high-level resistance >200×. The tc_rph2 resistance allele resulted in a significant fitness cost relative to the wild type allele in unselected beetles over eighteen generations. Conclusion We have validated the technique of linkage mapping by low-coverage sequencing of progeny from a simple genetic cross. The approach relied on regional averaging of SNP frequencies and was used to successfully identify candidate gene loci for phosphine resistance in T. castaneum. This is a relatively simple and rapid approach to identifying genomic regions associated with traits in defined genetic crosses that does not require any specialised statistical analysis.
Resumo:
Bipolar disorder (BP) is a complex psychiatric disorder characterized by episodes of mania and depression. BP affects approximately 1% of the world’s population and shows no difference in lifetime prevalence between males and females. BP arises from complex interactions among genetic, developmental and environmental factors, and it is likely that several predisposing genes are involved in BP. The genetic background of BP is still poorly understood, although intensive and long-lasting research has identified several chromosomal regions and genes involved in susceptibility to BP. This thesis work aims to identify the genetic variants that influence bipolar disorder in the Finnish population by candidate gene and genome-wide linkage analyses in families with many BP cases. In addition to diagnosis-based phenotypes, neuropsychological traits that can be seen as potential endophenotypes or intermediate traits for BP were analyzed. In the first part of the thesis, we examined the role of the allelic variants of the TSNAX/DISC1 gene cluster to psychotic and bipolar spectrum disorders and found association of distinct allelic haplotypes with these two groups of disorders. The haplotype at the 5’ end of the Disrupted-in-Schizophrenia-1 gene (DISC1) was over-transmitted to males with psychotic disorder (p = 0.008; for an extended haplotype p = 0.0007 with both genders), whereas haplotypes at the 3’ end of DISC1 associated with bipolar spectrum disorder (p = 0.0002; for an extended haplotype p = 0.0001). The variants of these haplotypes also showed association with different cognitive traits. The haplotypes at the 5’ end associated with perseverations and auditory attention, while the variants at the 3’ end associated with several cognitive traits including verbal fluency and psychomotor processing speed. Second, in our complete set of BP families with 723 individuals we studied six functional candidate genes from three distinct signalling systems: serotonin-related genes (SLC6A4 and TPH2), BDNF -related genes (BDNF, CREB1 and NTRK2) and one gene related to the inflammation and cytokine system (P2RX7). We replicated association of the functional variant Val66Met of BDNF with BP and better performance in retention. The variants at the 5’ end of SLC6A4 also showed some evidence of association among males (p = 0.004), but the widely studied functional variants did not yield any significant results. A protective four-variant haplotype on P2RX7 showed evidence of association with BP and executive functions: semantic and phonemic fluency (p = 0.006 and p = 0.0003, respectively). Third, we analyzed 23 bipolar families originating from the North-Eastern region of Finland. A genome-wide scan was performed using the 6K single nucleotide polymorphism (SNP) array. We identified susceptibility loci at chromosomes 7q31 with a LOD score of 3.20 and at 9p13.1 with a LOD score of 4.02. We followed up both linkage findings in the complete set of 179 Finnish bipolar families. The finding on chromosome 9p13 was supported (maximum LOD score of 3.02), but the susceptibility gene itself remains unclarified. In the fourth part of the thesis, we wanted to test the role of the allelic variants that have associated with bipolar disorder in recent genome-wide association studies (GWAS). We could confirm findings for the DFNB31, SORCS2, SCL39A3, and DGKH genes. The best signal in this study comes from DFNB31, which remained significant after multiple testing corrections. Two variants of SORCS2 were allelic replications and presented the same signal as the haplotype analysis. However, no association was detected with the PALB2 gene, which was the most significantly associated region in the previous GWAS. Our results indicate that BP is heterogeneous and its genetic background may accordingly vary in different populations. In order to fully understand the allelic heterogeneity that underlies common diseases such as BP, complete genome sequencing for many individuals with and without the disease is required. Identification of the specific risk variants will help us better understand the pathophysiology underlying BP and will lead to the development of treatments with specific biochemical targets. In addition, it will further facilitate the identification of environmental factors that alter risk, which will potentially provide improved occupational, social and psychological advice for individuals with high risk of BP.
Genome-wide analysis and experimentation of plant serine/threonine/tyrosine-specific protein kinases
Resumo:
Protein tyrosine phosphorylation plays an important role in cell growth, development and oncogenesis. No classical protein tyrosine kinase has hitherto been cloned from plants. Does protein tyrosine kinase exist in plants? To address this, we have performed a genomic survey of protein tyrosine kinase motifs in plants using the delineated tyrosine phosphorylation motifs from the animal system. The Arabidopsis thaliana genome encodes 57 different protein kinases that have tyrosine kinase motifs. Animal non-receptor tyrosine kinases, SRC, ABL, LYN, FES, SEK, KIN and RAS have structural relationship with putative plant tyrosine kinases. In an extended analysis, animal receptor and non-receptor kinases, Raf and Ras kinases, mixed lineage kinases and plant serine/threonine/tyrosine (STY) protein kinases, form a well-supported group sharing a common origin within the superfamily of STY kinases. We report that plants lack bona fide tyrosine kinases, which raise an intriguing possibility that tyrosine phosphorylation is carried out by dual-specificity STY protein kinases in plants. The distribution pattern of STY protein kinase families on Arabidopsis chromosomes indicates that this gene family is partly a consequence of duplication and reshuffling of the Arabidopsis genome and of the generation of tandem repeats. Genome-wide analysis is supported by the functional expression and characterization of At2g24360 and phosphoproteomics of Arabidopsis. Evidence for tyrosine phosphorylated proteins is provided by alkaline hydrolysis, anti-phosphotyrosine immunoblotting, phosphoamino acid analysis and peptide mass fingerprinting. These results report the first comprehensive survey of genome-wide and tyrosine phosphoproteome analysis of plant STY protein kinases.