39 resultados para histone H2A variant
Resumo:
Congenital lactase deficiency (CLD) (MIM 223000) is a rare autosomal recessive gastrointestinal disorder characterized by watery diarrhea in infants fed with breast milk or other lactose-containing formulas. The CLD locus was previously assigned by linkage and linkage disequilibrium analyses on 2q21 in 19 Finnish families. In this study, the molecular background of this disorder is reported. The CLD locus was refined in 32 CLD patients in 24 families by using microsatellite and single nucleotide polymorphism (SNP) haplotypes. Mutation analyses were performed by direct sequencing. We identified 5 distinct mutations in the lactase (LCT) gene, encoding the enzyme that hydrolyzes lactose in the intestinal lumen. These findings facilitate genetic testing of CLD in clinical practice and enable genetic counseling. The present data also provide the basis for detailed characterization of the molecular pathogenesis of this disorder. Adult-type hypolactasia (MIM 223100) (lactase non-persistence, lactose intolerance) is an autosomal recessive gastrointestinal condition that is a result of a decline in the activity of lactase in the intestinal lumen after weaning. Adult-type hypolactasia is considered to be a normal phenomenon among mammals and symptoms are remarkably milder than experienced in CLD. Recently, a variant C/T-13910 was shown to associate with the adult-type hypolactasia trait, locating 13.9 kb upstream of the LCT gene. In this study, the functional significance of the C/T-13910 variant was determined by studying the LCT mRNA levels in intestinal biopsy samples in children and adults with different genotypes. RT-PCR followed by solid-phase minisequencing was applied to determine the relative expression levels of the LCT alleles using an informative SNP located in exon 1. In children, the C-13910 allele was observed to be downregulated after five years of age in parallel with lactase enzyme activity. The expression of the LCT mRNA in the intestinal mucosa in individuals with the T-13910 A-22018 alleles was 11.5 times higher than that found in individuals with the C-13910, G-22018 alleles. These findings suggest that the C/T-13910 associated with adult-type hypolactasia is associated with the transcriptional regulation of the LCT gene. The presence of the T-13910 A-22018 allele also showed significant elevation lactase activity. Galactose, the hydrolysing product of the milk sugar lactose, has been hypothesized to be poisonous to ovarian epithelial cells. Hence, consumption of dairy products and lactase persistence has been proposed to be a risk factor for ovarian carcinoma. To investigate whether lactase persistence is related to the risk of ovarian carcinoma the C/T-13910 genotype was determined in a cohort of 782 women with ovarian carcinoma 1331 individuals serving as controls. Lactase persistence did not associate significantly with the risk for ovarian carcinoma in the Finnish, in the Polish or in the Swedish populations. The findings do not support the hypothesis that lactase persistence increases the risk for ovarian carcinoma.
Resumo:
Colorectal cancer (CRC) is one of the most frequent malignancies in Western countries. Inherited factors have been suggested to be involved in 35% of CRCs. The hereditary CRC syndromes explain only ~6% of all CRCs, indicating that a large proportion of the inherited susceptibility is still unexplained. Much of the remaining genetic predisposition for CRC is probably due to undiscovered low-penetrance variations. This study was conducted to identify germline and somatic changes that contribute to CRC predisposition and tumorigenesis. MLH1 and MSH2, that underlie Hereditary non-polyposis colorectal cancer (HNPCC) are considered to be tumor suppressor genes; the first hit is inherited in the germline and somatic inactivation of the wild type allele is required for tumor initiation. In a recent study, frequent loss of the mutant allele in HNPCC tumors was detected and a new model, arguing against the two-hit hypothesis, was proposed for somatic HNPCC tumorigenesis. We tested this hypothesis by conducting LOH analysis on 25 colorectal HNPCC tumors with a known germline mutation in the MLH1 or MSH2 genes. LOH was detected in 56% of the tumors. All the losses targeted the wild type allele supporting the classical two-hit model for HNPCC tumorigenesis. The variants 3020insC, R702W and G908R in NOD2 predispose to Crohn s disease. Contribution of NOD2 to CRC predisposition has been examined in several case-control series, with conflicting results. We have previously shown that 3020insC does not predispose to CRC in Finnish CRC patients. To expand our previous study the variants R702W and G908R were genotyped in a population-based series of 1042 Finnish CRC patients and 508 healthy controls. Association analyses did not show significant evidence for association of the variants with CRC. Single nucleotide polymorphism (SNP) rs6983267 at chromosome 8q24 was the first CRC susceptibility variant identified through genome-wide association studies. To characterize the role of rs6983267 in CRC predisposition in the Finnish population, we genotyped the SNP in the case-control material of 1042 cases and 1012 controls and showed that G allele of rs6983267 is associated with the increased risk of CRC (OR 1.22; P=0.0018). Examination of allelic imbalance in the tumors heterozygous for rs6983267 revealed that copy number increase affected 22% of the tumors and interestingly, it favored the G allele. By utilizing a computer algorithm, Enhancer Element Locator (EEL), an evolutionary conserved regulatory motif containing rs6983267 was identified. The SNP affected the binding site of TCF4, a transcription factor that mediates Wnt signaling in cells, and has proven to be crucial in colorectal neoplasia. The preferential binding of TCF4 to the risk allele G was showed in vitro and in vivo. The element drove lacZ marker gene expression in mouse embryos in a pattern that is consistent with genes regulated by the Wnt signaling pathway. These results suggest that rs6983267 at 8q24 exerts its effect in CRC predisposition by regulating gene expression. The most obvious target gene for the enhancer element is MYC, residing ~335 kb downstream, however further studies are required to establish the transcriptional target(s) of the predicted enhancer element.
Resumo:
Human growth and attained height are determined by a combination of genetic and environmental effects and in modern Western societies > 80% of the observed variation in height is determined by genetic factors. Height is a fundamental human trait that is associated with many socioeconomic and psychosocial factors and health measures, however little is known of the identity of the specific genes that influence height variation in the general population. This thesis work aimed to identify the genetic variants that influence height in the general population by genome-wide linkage analysis utilizing large family samples. The study focused on analysis of three separate sets of families consisting of: 1) 1,417 individuals from 277 Finnish families (FinnHeight), 2) 8,450 individuals from 3,817 families from Australia and Europe (EUHeight) and 3) 9,306 individuals from 3,302 families from the United States (USHeight). The most significant finding in this study was found in the Finnish family sample where we a locus in the chromosomal region 1p21 was linked to adult height. Several regions showed evidence for linkage in the Australian, European and US families with 8q21 and 15q25 being the most significant. The region on 1p21 was followed up with further studies and we were able to show that the collagen 11-alpha-1 gene (COL11A1) residing at this location was associated with adult height. This association was also confirmed in an independent Finnish population cohort (Health 2000) consisting of 6,542 individuals. From this population sample, we estimated that homozygous males and females for this gene variant were 1.1 and 0.6 cm taller than the respective controls. In this thesis work we identified a gene variant in the COL11A1 gene that influences human height, although this variant alone explains only 0.1% of height variation in the Finnish population. We also demonstrated in this study that special stratification strategies such as performing sex-limited analyses, focusing on dizygous twin pairs, analyzing ethnic groups within a population separately and utilizing homogenous populations such as the Finns can improve the statistical power of finding QTL significantly. Also, we concluded from the results of this study that even though genetic effects explain a great proportion of height variance, it is likely that there are tens or even hundreds of genes with small individual effects underlying the genetic architecture of height.
Resumo:
Human herpesvirus 6 (HHV-6) was identified from patients with HIV and lymphoproliferative diseases in 1986. It is a β-herpesvirus and is divided into two subgroups, variants A and B. HHV-6 variant B is the cause of exanthema subitum, while variant A has not yet definitely proven to cause any disease. HHV-6, especially variant A, is a highly neurotropic virus and has been associated with many diseases of the central nervous system (CNS) such as encephalitis and multiple sclerosis (MS). The present studies were aimed to elucidate the role of HHV-6 and its two variants in neurological infections. Special attention was given to study the possible role of HHV-6 in the pathogenesis of MS. We studied the expression of HHV-6 antigens using immunohistochemistry in brain autopsy samples from patients with MS and controls. HHV-6 antigen was identified in 70% of MS specimens whereas 30% of control specimens expressed HHV-6 antigen. Serum and cerebrospinal fluid (CSF) samples were collected from patients with MS and patients with other neurological diseases (OND) from patients visiting Helsinki University Central Hospital Neurological Outpatient Clinic during the years 2003 and 2004. In addition, we studied 53 children with suspected encephalitis. We developed an immunofluorescence IgG-avidity assay for the detection of primary HHV-6A and HHV-6B infection. For HHV-6B antibodies, no differences were observed between patients with MS and OND. For HHV-6A both seroprevalence and mean titers were significantly higher in MS compared to OND. HHV-6A low-avidity IgG antibodies, suggestive of primary infection, were found in serum of two, three and one patient with definite MS, possible MS and OND, respectively. From pediatric patients with suspected encephalitis, six serum samples (11.3%) contained low-avidity antibodies, indicating a temporal association between HHV-6A infection and onset of encephalitis. Three out of 26 patients with CDMS and four out of 19 patients with CPMS had HHV-6 antibodies in their CSF compared to none of the patients with OND (p=0.06 and p=0.01, respectively). Two patients with CDMS and three patients with CPMS appeared to have specific intrathecal synthesis of HHV-6A antibodies. In addition, oligoclonal bands (OCB) were observed in the CSF of five out of nine MS patients tested, and in two the OCBs reacted specifically with HHV-6 antigen, which is a novel finding. These results indicate HHV-6 specific antibody production in the CNS and suggest that there is a subset of MS patients with an active or chronic HHV-6A infection in the CNS that might be involved in the pathogenesis of MS. Our studies suggest that HHV-6 is an important causative or associated virus in some neurological infections, such as encephalitis and it might contribute to the development of MS, at least in some cases. In conclusion, HHV-6 is a neurotropic virus that should be taken into consideration when studying acute and chronic CNS diseases of unknown origin.
Defects in tricarboxylic acid cycle enzymes Fumarate hydratase and Succinate dehydrogenase in cancer
Resumo:
Hereditary leiomyomatosis and renal cell cancer (HLRCC) is a recently characterized cancer syndrome which predisposes to cutaneous and uterine leiomyomas as well as renal cell carcinoma (RCC). Uterine leiomyosarcoma (ULMS) has also been observed in certain Finnish HLRCC families. The predisposing gene for this syndrome, fumarate hydratase (FH), was identified in 2002. The well-known function of FH is in the tricarboxylic acid cycle (TCAC) in the energy metabolism of cells. As FH is a novel cancer gene, the role of FH mutations in tumours is in general unknown. Similarly, the mechanisms through which defective FH is associated with tumourigenesis are unclear. The loss of a wild type allele has been observed in virtually all HLRCC patients tumours and the FH enzyme activities are either totally lost or remarkably reduced in the tissues of mutation carrier patients. Therefore, FH is assumed to function as a tumour suppressor. Mutations in genes encoding subunits of other TCAC enzyme SDH have also been reported recently in tumours: mutations in SDHB, SDHC, and SDHD genes predispose to paraganglioma and pheochromocytoma. In the present study, mutations in the SDHB gene were observed to predispose to RCC. This was the first time that mutations in SDHB have been detected in extra-paraganglial tumours. Two different SDHB mutations were observed in two unrelated families. In the first family, the index patient was diagnosed with RCC at the age of 24 years. Additionally, his mother with a paraganglioma (PGL) of the heart and his maternal uncle with lung cancer were both carriers of the mutation. The RCC of the index patient and the PGL of his mother showed LOH. In the other family, an SDHB mutation was detected in two siblings who were both diagnosed with RCC at the ages of 24 and 26 years. Both of the siblings also suffered PGL. All these tumours showed LOH. Therefore, we concluded that mutations in SDHB predispose also for RCC in certain families. Several tumour types were analysed for FH mutations to define the role of FH mutations in these tumour types. In addition, patients with a putative cancer phenotype were analysed to identify new HLRCC families. Three FH variants were detected, of which two were novel. One of the variants was observed in a patient diagnosed with ULMS at the age of 41 years. However, LOH was not detected in the tumour tissue. The FH enzyme activity of the mutated protein was clearly reduced, being 43% of the activity of the normal protein. Together with the results from an earlier study we calculated that the prevalence of FH mutations in Finnish non-syndromic ULMS is around 2.4%. Therefore, FH mutations seem to have a minor role in the pathogenesis on non-syndromic ULMS. Two other germline variants were detected in a novel tumour type, ovarian mucinous cystadenoma. However, tumour tissues of the patients were not available for LOH studies and therefore LOH status remained unclear. Therefore, it is possible that FH mutations predispose also for ovarian tumours but further studies are needed to verify this result. A novel variant form of the FH gene (FHv) was identified and characterized in more detail. FHv contains an alternative first exon (1b), which appeared to function as 5 UTR sequence. The translation of FHv is initiated in vitro from exons two and three. The localization of FHv is both cytosolic and nuclear, in contrast to the localization of FH in mitochondria. FHv is expressed at low levels in all human tissues. Interestingly, the expression was induced after heat shock treatment and in chronic hypoxia. Therefore, FHv might have a role e.g. in the adaptation to unfavourable growth conditions. However, this remains to be elucidated.
Resumo:
Schizophrenia is a severe psychotic disorder affecting 0.5-1 % of the population. The disorder is characterized by hallucinations; delusions; disorganized behavior and speech; avolition; anhedonia; flattened affect and cognitive deficits. The etiology of the disorder is complex with evidence for multiple genes contributing to the onset of the disorder along with environmental factors. DISC1 is one of the most promising candidate genes for schizophrenia. It codes for a protein which takes part in numerous molecular interactions along several pathways. This network, termed as the DISC1 pathway, is evidently important for the development and maturation of the central nervous system from the embryo until young adulthood. Disruption at these pathways is thought to predispose schizophrenia. In the present study, we have studied the DISC1 pathway in the etiology of schizophrenia in the Finnish population. We have utilized large Finnish samples; the schizophrenia family sample where DISC1 was originally shown to associate with schizophrenia and the Northern Finland birth cohort 1966 (NFBC66). Several DISC1 binding partners displayed evidence for association in the family sample along with DISC1. Through a genome-wide linkage study, we found a significant linkage signal to a locus where a DISC1 binding partner NDE1 is located at the carriers of a certain DISC1 risk variant. In a follow-up study, genetic markers in NDE1 displayed significant evidence for association with schizophrenia. Further exploration of association between 11 genes of the DISC1 pathway and schizophrenia led to recognition of novel variants in NDEL1, PDE4B and PDE4D that significantly either increased or decreased the risk for schizophrenia. Further, we found evidence that DISC1 itself has a significant role in the human mental functioning even in the healthy population. Variants in DISC1 had a significant effect on anhedonia which is a trait present at everybody but is in its severe form one of the main symptoms of schizophrenia and correlates with the risk of developing the disorder. Further, utilizing genome-wide marker data, we recognized three genes; MIR620; CCDC141 and LCT; that are closely related to the DISC1 pathway but which effects on anhedonia were observable only at the individuals who carried these specific DISC1 variants. Our findings significantly add up to the previous evidence for the involvement of DISC1 and the DISC1 pathway in the etiology of schizophrenia and psychosis. Our results support the concept of a number of DISC1 pathway related genes contributing in the etiology of schizophrenia along with DISC1 and provide new candidates for the studies of schizophrenia. Our findings also significantly increase the importance of DISC1 itself as having a role in psychological functioning in the general population.
Resumo:
Multiple sclerosis (MS) is a chronic inflammatory disease of the central nervous system (CNS). Both environmental factors and several predisposing genes are required to generate MS. Despite intensive research these risk factors are still largely unknown, the pathogenesis of MS demyelination is poorly understood, and no curative treatment exists. Both prevalence and familial occurrence of MS are exceptionally high in a Finnish population subisolate, Southern Ostrobothnia, presumably due to enrichment of predisposing genetic variants within this region. Previous linkage scan on MS pedigrees from Southern Ostrobothnia detected three main MS loci on chromosomes 5p, 6p (HLA) and 17q. Linkage studies in other populations have also provided independent evidence for the location of MS susceptibility genes in these regions. Further, these loci are syntenic to the experimental autoimmune encephalomyelitis (EAE) susceptibility loci of rodents. In this thesis work an effort was made to localize MS predisposing alleles of the linked loci outside the HLA region by studying familial MS cases from the Southern Ostrobothnia isolate. Analysis of the 5p locus revealed one region, flanking the complement component 7 (C7) gene. The identified relatively rare haplotype seems to have a fairly large effect on genetic susceptibility of MS (frequency MS 12%, controls 4%; p=0.000003, OR=2.73). Evidence for association with alleles of the region and MS was seen also in more heterogeneous populations. Convincingly, plasma C7 protein levels and complement activity correlated with the risk haplotype identified. The finding stimulated us to study other complement cascade genes in MS. No evidence for association could be observed with the complement component coding genes outside 5p. A scan of the 17q locus provided evidence for association with variants of the protein kinase C alpha (PRKCA) gene (p=0.0001). Modest evidence for association with PRKCA was observed also in Canadian MS families. Finally we used a candidate gene based approach to identify potential MS loci. Mutations of DAP12 and TREM2 cause a recessively inherited CNS white matter disease PLOSL. Interestingly, DAP12 and TREM2 are located in MS regions on 6p and 19q, and we tested them as potential candidate genes in the Finnish MS sample. No evidence for association with MS was observed. This thesis provides an example of how extended families from special populations can be utilized in fine-mapping of the linked loci. A first relatively rare MS variant was identified utilizing the strength of a Finnish population subisolate. This variant seems to have an effect on activity of the complement system, which has previously been suggested to have an important role in the pathogenesis of MS.
Studies of the genetic epidemiology of cardiovascular disease: focus on inflammatory candidate genes
Resumo:
Cardiovascular disease (CVD) is a complex disease with multifactorial aetiology. Both genetic and environmental factors contribute to the disease risk. The lifetime risk for CVD differs markedly between men and women, men being at increased risk. Inflammatory reaction contributes to the development of the disease by promoting atherosclerosis in artery walls. In the first part of this thesis, we identified several inflammatory related CVD risk factors associating with the amount of DNA from whole blood samples, indicating a potential source of bias if a genetic study selects the participants based on the available amount of DNA. In the following studies, this observation was taken into account by applying whole genome amplification to samples otherwise subjected to exclusion due to very low DNA yield. We continued by investigating the contribution of inflammatory genes to the risk for CVD separately in men and women, and looked for sex-genotype interaction. In the second part, we explored a new candidate gene and its role in the risk for CVD. Selenoprotein S (SEPS1) is a membrane protein residing in the endoplasmic reticulum where it participates in retro-translocation of unfolded proteins to cytosolic protein degradation. Previous studies have indicated that SEPS1 protects cells from oxidative stress and that variations in the gene are associated with circulating levels of inflammatory cytokines. In our study, we identified two variants in the SEPS1 gene, which associated with coronary heart disease and ischemic stroke in women. This is, to our knowledge, the first study suggesting a role of SEPS1 in the risk for CVD after extensively examining the variation within the gene region. In the third part of this thesis, we focused on a set of seven genes (angiotensin converting enzyme, angiotensin II receptor type I, C-reactive protein (CRP), and fibrinogen alpha-, beta-, and gamma-chains (FGA, FGB, FGG)) related to inflammatory cytokine interleukin 6 (IL6) and their association with the risk for CVD. We identified one variant in the IL6 gene conferring risk for CVD in men and a variant pair from IL6 and FGA genes associated with decreased risk. Moreover, we identified and confirmed an association between a rare variant in the CRP gene and lower CRP levels, and found two variants in the FGA and FGG genes associating with fibrinogen. The results from this third study suggest a role for the interleukin 6 pathway genes in the pathogenesis of CVD and warrant further studies in other populations. In addition to the IL6 -related genes, we describe in this thesis several sex-specific associations in other genes included in this study. The majority of the findings were evident only in women encouraging other studies of cardiovascular disease to include and analyse women separately from men.
Resumo:
Neuronal ceroid lipofuscinoses (NCLs) are a family of inherited pediatric neurodegenerative disorders, leading to retinal degeneration, death of selective neuronal populations and accumulation of autofluorscent ceroid-lipopigments. The clinical manifestations are generally similar in all forms. The Finnish variant late infantile neuronal ceroid lipofuscinosis (vLINCLFin) is a form of NCL, especially enriched in the Finnish population. The aim of this thesis was to analyse the brain pathology of vLINCLFin utilising the novel Cln5-/- mouse model. Gene expression profiling of the brains of already symptomatic Cln5-/- mice revealed that inflammation, neurodegeneration and defects in myelinization are the major characteristics of the later stages of the disease. Histological characterization of the brain pathology confirmed that the thalamocortical system is affected in Cln5-/- mice, similarly to the other NCL mouse models. However, whereas the brain pathology in all other analyzed NCL mice initiate in the thalamus and spread only months later to the cortex, we observed that the sequence of events is uniquely reversed in Cln5-/- mice; beginning in the cortex and spreading to the thalamus only months later. We could also show that even though neurodegeneration is inititated in the cortex, reactive gliosis and loss of myelin are evident in specific nuclei of the thalamus already in the 1 month old brain. To obtain a deeper insight into the disturbed metabolic pathways, we performed gene expression profiling of presymptomatic mouse brains. We validated these findings with immunohistological analyses, and could show that cytoskeleton and myelin were affected in Cln5-/- mice. Comparison of gene expression profiling results of Cln5-/- and Cln1-/- mice, further highlighted that these two NCL models share a common defective pathway, leading to disturbances in the neuronal growth cone and cytoskeleton. Encouraged by the evidence of this defected pathway, we analyzed the molecular interactions of NCL-proteins and observed that Cln5 and Cln1/Ppt1 proteins interact with each other. Furthermore, we demonstrated that Cln5 and Cln1/Ppt1 share an interaction partner, the F1-ATP synthase, potentially linking both vLINCLFIN and INCL diseases to disturbed lipid metabolism. In addition, Cln5 was shown to interact with other NCL proteins; Cln2, Cln3, Cln6 and Cln8, implicating a central role for Cln5 in the NCL pathophysiology. This study is the first to describe the brain pathology and gene expression changes in the Cln5-/- mouse. Together the findings presented in this thesis represent novel information of the disease processes and the molecular mechanisms behind vLINCLFin and have highlighted that vLINCLFin forms a very important model to analyze the pathophysiology of NCL diseases.
Resumo:
Colorectal cancer (CRC) is the third most common cancer in Finland. Of all CRC tumors, 15% display microsatellite-instability (MSI) caused by defective cellular mismatch repair. Cells displaying MSI accumulate a high number of mutations genome-wide, especially in short repeat areas, microsatellites. When targeting genes essential for cell growth or death, MSI can promote tumorigenesis. In non-coding areas, microsatellite mutations are generally considered as passenger events. Since the discovery of MSI and its linkage to cancer, more that 200 genes have been investigated for a role in MSI tumorigenesis. Although various criteria have been suggested for MSI target gene identification, the challenge has been to distinguish driver mutations from passenger mutations. This study aimed to clarify these key issues in the research field of MSI cancer. Prior to this, background mutation rate in MSI cancer has not been studied in a large-scale. We investigated the background mutation rate in MSI CRC by analyzing the spectrum of microsatellite mutations in non-coding areas. First, semenogelin I was studied for a possible role in MSI carcinogenesis. The intronic T9 repeat of semenogelin I was frequently mutated but no evidence for selection during tumorigenesis was obtained. Second, a sequencing approach was utilized to evaluate the general background mutation rate in MSI CRC. Both intronic and intergenic repeats harbored extremely high mutation rates of ≤ 87% and intergenic repeats were more unstable than the intronic repeats. As mutation rates of presumably neutral microsatellites can be high in MSI CRC in the absence of apparent selection pressure, high mutation frequency alone is not sufficient evidence for identification of driver MSI target genes. Next, an unbiased approach was designed to identify the mutatome of MSI CRC. By combining expression array data and a database search we identified novel genes possibly related to MSI CRC carcinogenesis. One of the genes was studied further. In the functional analysis this gene was observed to cause an abnormal cancer-prone cellular phenotype, possibly through altered responses to DNA damage. In our recent study, smooth muscle myosin heavy chain 11 (MYH11) was identified as a novel MSI CRC gene. Additionally, MYH11 has a well established role in acute myeloid leukemia (AML) through an oncogenic fusion protein CBFB-MYH11. We investigated further the role of MYH11 in AML by sequencing. Three novel missense variants of MYH11 were identified. None of the variants were present in the population-based control material. One of the identified variants, V71A, lies in the N-terminal SH3-like domain of MYH11 of unknown function. The other two variants, K1059E and R1792Q are located in the coil-coiled myosin rod essential for the regulation and filament formation of MYH11. The variant K1059E lies in the close proximity of the K1044N that has been functionally assessed in our earlier work of CRC and has been reported to cause total loss of MYH11 protein regulation. As the functional significance of the three novel variants examined in this work remains unknown, future studies should clarify the further role of MYH11 in AML leukaemogenesis and in other malignancies.
Resumo:
Cardiovascular diseases (CVD) are major contributors to morbidity and mortality worldwide. Several interacting environmental, biochemical, and genetic risk factors can increase disease susceptibility. While some of the genes involved in the etiology of CVD are known, many are yet to be discovered. During the last few decades, scientists have searched for these genes with genome-wide linkage and association methods, and with more targeted candidate gene studies. This thesis investigates variation within the upstream transcription factor 1 (USF1) gene locus in relation to CVD risk factors, atherosclerosis, and incidence and prevalence of CVD. This candidate gene was first identified in Finnish families ascertained for familial combined hyperlipidemia, a common dyslipidemia predisposing to coronary heart disease. The gene is a ubiquitously expressed transcription factor regulating expression of several genes from lipid and glucose metabolism, inflammation, and endothelial function. First, we examined association between USF1 variants and several CVD risk factors, such as lipid phenotypes, body composition measures, and metabolic syndrome, in two prospective population cohorts. Our data suggested that USF1 contributes to these CVD risk factors at the population level. Notably, the associations with quantitative measurements were mostly detected among study subjects with CVD or metabolic syndrome, suggesting complex interactions between USF1 effects and the pathophysiological state of an individual. Second, we investigated how variation at the USF1 locus contributes to atherosclerotic lesions of the coronary arteries and abdominal aorta. For this, we used two study samples of middle-aged men with detailed measurements of atherosclerosis obtained in autopsy. USF1 variation significantly associated with areas of several types of lesions, especially with calcification of the arteries. Next, we tested what effect the USF1 risk variants have on sudden cardiac death and incidence of CVD. The atherosclerosis-associated risk variant increased the risk of sudden cardiac death of the same study subjects. Furthermore, USF1 alleles associated with incidence of CVD in the Finnish population follow-up cohorts. These associations were especially prominent among women, suggesting a sex specific effect, which has also been detected in subsequent studies. Finally, as some of the low-yield DNA samples of the Finnish follow-up study cohort needed to be whole-genome amplified (WGA) prior to genotyping, we evaluated whether the produced WGA genotypes were of good quality. Although the samples giving genotype discrepancies could not be detected before genotyping with standard laboratory quality control methods, our results suggested that enhanced quality control at the time of the genotyping could identify such samples. In addition, combining two WGA reactions into one pooled DNA sample for genotyping markedly reduced the number of discrepancies and samples showing them. In conclusion, USF1 seems to have a role in the etiology of CVD. Additional studies are warranted to identify functional variants and to study interactions between USF1 and other genetic or environmental factors. This USF1 study, and other studies with low DNA yield of some samples, can benefit from whole genome amplification of the low-yield samples prior to genotyping. Careful quality control procedures are, however, needed in WGA genotyping.
Resumo:
Disorders resulting from degenerative changes in the nervous system are progressive and incurable. Both environmental and inherited factors affect neuron function, and neurodegenerative diseases are often the sum of both factors. The cellular events leading to neuronal death are still mostly unknown. Monogenic diseases can offer a model for studying the mechanisms of neurodegeneration. Neuronal ceroid lipofuscinoses, or NCLs, are a group of monogenic, recessively inherited diseases affecting mostly children. NCLs cause severe and specific loss of neurons in the central nervous system, resulting in the deterioration of motor and mental skills and leading to premature death. In this thesis, the focus has been on two forms of NCL, the infantile NCL (INCL, CLN1) and the Finnish variant of late infantile NCL (vLINCLFin, CLN5). INCL is caused by mutations in the CLN1 gene encoding for the PPT1 (palmitoyl protein thioesterase 1) enzyme. PPT1 removes a palmitate moiety from proteins in experimental conditions, but its substrates in vivo are not known. In the Finnish variant of late infantile NCL (vLINCLFin), the CLN5 gene is defective, but the function of the encoded CLN5 has remained unknown. The aim of this thesis was to elucidate the disease mechanisms of these two NCL diseases by focusing on the molecular interactions of the defective proteins. In this work, the first interaction partner for PPT1, the mitochondrial F1-ATP synthase, was described. This protein has been linked to HDL metabolism in addition to its well-known role in the mitochondrial energy production. The connection between PPT1 and the F1-ATP synthase was studied utilizing the INCL-disease model, the genetically modified Ppt1-deficient mice. The levels of F1-ATP synthase subunits were increased on the surface of Ppt1-deficient neurons when compared to controls. We also detected several changes in lipid metabolism both at the cellular and systemic levels in Ppt1-deficient mice when compared to controls. The interactions between different NCL proteins were also elucidated. We were able to detect novel interactions between CLN5 and other NCL proteins, and to replicate the previously reported interactions. Some of the novel interactions influenced the intracellular trafficking of the proteins. The multiple interactions between CLN5 and other NCL proteins suggest a connection between the NCL subtypes at the cellular level. The main results of this thesis elicit information about the neuronal function of PPT1. The connection between INCL and neuronal lipid metabolism introduces a new perspective to this rather poorly characterized subject. The evidence of the interactions between NCL proteins provides the basis for future research trying to untangle the NCL disease mechanisms and to develop strategies for therapies.
Resumo:
Mulibrey nanism is a hereditary developmental disorder, characterized by prenatal onset growth failure without postnatal catch-up growth, distinctive craniofacial features, progressive cardiopathy and failure of sexual maturation. In addition, the patients develop insulin resistance syndrome and type 2 diabetes and they have an increased risk of developing tumors. The TRIM37 gene that underlies mulibrey nanism encodes for a member of the tripartite motif (TRIM) protein family. The physiological function of TRIM37 and the pathogenetic mechanisms leading from TRIM37 dysfunction to the mulibrey nanism phenotype are unknown. However, TRIM37 localizes at least partially to peroxisomes, and possesses ubiquitin E3-ligase activity. Thus, it may mediate ubiquitin dependent protein degradation, suggesting that accumulation of yet unknown substrate proteins may underlie the disease pathogenesis. In this study, the TRIM37 gene was characterized in detail. A transcription initiation window, with several separate transcription start sites, was identified and the putative promoter region immediately upstream from the transcription initiation window was shown to possess basal promoter activity. Further, several alternative splice variants of the gene were identified, including a highly expressed testis specific variant, encoding for an identical protein product with the main transcript. Expression of TRIM37 mRNA was detected in several different tissues, with highest expression seen in testis and in brain, when the expression patterns of the two major transcripts in different human tissues were studied by quantitative real-time PCR. Several mulibrey nanism patients were studied and thirteen novel mutations in TRIM37 were found, including three mutations (p.Gly322Val, p.Cys109Ser, p.Glu271_Ser287), that are likely to express mutant TRIM37 proteins. These mutations were further shown to alter the subcellular localization of the mutant proteins. Most of the mulibrey nanism associated mutations however, lead to premature termination codons and degradation of mRNA. All the TRIM37 mutations identified to date predict loss-of-function alleles, and thus no phenotype-genotype correlation is seen among the patients. In order to understand the pathogenetic mechanisms underlying mulibrey nanism, an animal model for the disorder is needed. For the development of a Trim37 knock-out mouse, the mouse Trim37 gene was characterized. Alternative splice variants, were identified, including a testis specific variant predicting a longer protein product. Further, a strictly tissue and cell-specific pattern of Trim37 expression was observed in developing and adult mouse tissues, when studied by immunohistochemical methods. This distribution of Trim37 expression in mouse tissues is in agreement with the clinical findings in human mulibrey nanism patients. This thesis work gives new tools for the diagnostics of mulibrey nanism as well as for studying the molecular pathogenesis behind this interesting disorder.
Resumo:
Inorganic pyrophosphatases (PPases, EC 3.6.1.1) hydrolyse pyrophosphate in a reaction that provides the thermodynamic 'push' for many reactions in the cell, including DNA and protein synthesis. Soluble PPases can be classified into two families that differ completely in both sequence and structure. While Family I PPases are found in all kingdoms, family II PPases occur only in certain prokaryotes. The enzyme from baker's yeast (Saccharomyces cerevisiae) is very well characterised both kinetically and structurally, but the exact mechanism has remained elusive. The enzyme uses divalent cations as cofactors; in vivo the metal is magnesium. Two metals are permanently bound to the enzyme, while two come with the substrate. The reaction cycle involves the activation of the nucleophilic oxygen and allows different pathways for product release. In this thesis I have solved the crystal structures of wild type yeast PPase and seven active site variants in the presence of the native cofactor magnesium. These structures explain the effects of the mutations and have allowed me to describe each intermediate along the catalytic pathway with a structure. Although establishing the ʻchoreographyʼ of the heavy atoms is an important step in understanding the mechanism, hydrogen atoms are crucial for the mechanism. The most unambiguous method to determine the positions of these hydrogen atoms is neutron crystallography. In order to determine the neutron structure of yeast PPase I perdeuterated the enzyme and grew large crystals of it. Since the crystals were not stable at ambient temperature, a cooling device was developed to allow neutron data collection. In order to investigate the structural changes during the reaction in real time by time-resolved crystallography a photolysable substrate precursor is needed. I synthesised a candidate molecule and characterised its photolysis kinetics, but unfortunately it is hydrolysed by both yeast and Thermotoga maritima PPases. The mechanism of Family II PPases is subtly different from Family I. The native metal cofactor is manganese instead of magnesium, but the metal activation is more complex because the metal ions that arrive with the substrate are magnesium different from those permanently bound to the enzyme. I determined the crystal structures of wild type Bacillus subtilis PPase with the inhibitor imidodiphosphate and an inactive H98Q variant with the substrate pyrophosphate. These structures revealed a new trimetal site that activates the nucleophile. I also determined that the metal ion sites were partially occupied by manganese and iron using anomalous X- ray scattering.
Resumo:
Topic detection and tracking (TDT) is an area of information retrieval research the focus of which revolves around news events. The problems TDT deals with relate to segmenting news text into cohesive stories, detecting something new, previously unreported, tracking the development of a previously reported event, and grouping together news that discuss the same event. The performance of the traditional information retrieval techniques based on full-text similarity has remained inadequate for online production systems. It has been difficult to make the distinction between same and similar events. In this work, we explore ways of representing and comparing news documents in order to detect new events and track their development. First, however, we put forward a conceptual analysis of the notions of topic and event. The purpose is to clarify the terminology and align it with the process of news-making and the tradition of story-telling. Second, we present a framework for document similarity that is based on semantic classes, i.e., groups of words with similar meaning. We adopt people, organizations, and locations as semantic classes in addition to general terms. As each semantic class can be assigned its own similarity measure, document similarity can make use of ontologies, e.g., geographical taxonomies. The documents are compared class-wise, and the outcome is a weighted combination of class-wise similarities. Third, we incorporate temporal information into document similarity. We formalize the natural language temporal expressions occurring in the text, and use them to anchor the rest of the terms onto the time-line. Upon comparing documents for event-based similarity, we look not only at matching terms, but also how near their anchors are on the time-line. Fourth, we experiment with an adaptive variant of the semantic class similarity system. The news reflect changes in the real world, and in order to keep up, the system has to change its behavior based on the contents of the news stream. We put forward two strategies for rebuilding the topic representations and report experiment results. We run experiments with three annotated TDT corpora. The use of semantic classes increased the effectiveness of topic tracking by 10-30\% depending on the experimental setup. The gain in spotting new events remained lower, around 3-4\%. The anchoring the text to a time-line based on the temporal expressions gave a further 10\% increase the effectiveness of topic tracking. The gains in detecting new events, again, remained smaller. The adaptive systems did not improve the tracking results.