14 resultados para Single-nucleotide Polymorphisms
em Helda - Digital Repository of University of Helsinki
Resumo:
Evolutionary genetics incorporates traditional population genetics and studies of the origins of genetic variation by mutation and recombination, and the molecular evolution of genomes. Among the primary forces that have potential to affect the genetic variation within and among populations, including those that may lead to adaptation and speciation, are genetic drift, gene flow, mutations and natural selection. The main challenges in knowing the genetic basis of evolutionary changes is to distinguish the adaptive selection forces that cause existent DNA sequence variants and also to identify the nucleotide differences responsible for the observed phenotypic variation. To understand the effects of various forces, interpretation of gene sequence variation has been the principal basis of many evolutionary genetic studies. The main aim of this thesis was to assess different forms of teleost gene sequence polymorphisms in evolutionary genetic studies of Atlantic salmon (Salmo salar) and other species. Firstly, the level of Darwinian adaptive evolution affected coding regions of the growth hormone (GH) gene during the teleost evolution was investigated based on the sequence data existing in public databases. Secondly, a target gene approach was used to identify within population variation in the growth hormone 1 (GH1) gene in salmon. Then, a new strategy for single nucleotide polymorphisms (SNPs) discovery in salmonid fishes was introduced, and, finally, the usefulness of a limited number of SNP markers as molecular tools in several applications of population genetics in Atlantic salmon was assessed. This thesis showed that the gene sequences in databases can be utilized to perform comparative studies of molecular evolution, and some putative evidence of the existence of Darwinian selection during the teleost GH evolution was presented. In addition, existent sequence data was exploited to investigate GH1 gene variation within Atlantic salmon populations throughout its range. Purifying selection is suggested to be the predominant evolutionary force controlling the genetic variation of this gene in salmon, and some support for gene flow between continents was also observed. The novel approach to SNP discovery in species with duplicated genome fragments introduced here proved to be an effective method, and this may have several applications in evolutionary genetics with different species - e.g. when developing gene-targeted markers to investigate quantitative genetic variation. The thesis also demonstrated that only a few SNPs performed highly similar signals in some of the population genetic analyses when compared with the microsatellite markers. This may have useful applications when estimating genetic diversity in genes having a potential role in ecological and conservation issues, or when using hard biological samples in genetic studies as SNPs can be applied with relatively highly degraded DNA.
Resumo:
The leading cause of death in the Western world continues to be coronary heart disease (CHD). At the root of the disease process is dyslipidemia an aberration in the relevant amounts of circulating blood lipids. Cholesterol builds up in the arterial wall and following rupture of these plaques, myocardial infarction or stroke can occur. Heart disease runs in families and a number of hereditary forms are known. The leading cause of adult dyslipidemia presently however is overweight and obesity. This thesis work presents an investigation of the molecular genetics of common, hereditary dyslipidemia and the tightly related condition of obesity. Familial combined hyperlipidemia (FCHL) is the most common hereditary dyslipidemia in man with an estimated population prevalence of 1-6%. This complex disease is characterized by elevated levels of serum total cholesterol, triglycerides or both and is observed in about 20% of individuals with premature CHD. Our group identified the disease to be associated with genetic variation in the USF1 transcription factor gene. USF1 has a key role in regulating other genes that control lipid and glucose metabolism as well as the inflammatory response all central processes in the progression of atherosclerosis and CHD. The first two works of this thesis aimed at understanding how these USF1 variants result in increased disease risk. Among the many, non-coding single-nucleotide polymorphisms (SNPs) that associated with the disease, one was found to have a functional effect. The risk-enhancing allele of this SNP seems to eradicate the ability of the important hormone insulin to induce the expression of USF1 in peripheral tissues. The resultant changes in the expression of numerous USF1 target genes over time probably enhance and accelerate the atherogenic processes. Dyslipidemias often represent an outcome of obesity and in the final work of this thesis we wanted to address the metabolic pathways related to acquired obesity. It is recognized that active processes in adipose tissue play an important role in the development of dyslipidemia, insulin resistance and other pathological conditions associated with obesity. To minimize the confounding effects of genetic differences present in most human studies, we investigated a rare collection of identical twins that differed significantly in the amount of body fat. In the obese, but otherwise healthy young adults, several notable changes were observed. In addition to chronic inflammation, the adipose tissue of the obese co-twins was characterized by a marked (47%) decrease in amount of mitochondrial DNA (mtDNA) a change associated with mitochondrial dysfunction. The catabolism of branched chain amino acids (BCAAs) was identified as the most down-regulated process in the obese co-twins. A concordant increase in the serum level of these insulin secretagogues was identified. This hyperaminoacidemia may provide the feed-back signal from insulin resistant adipose tissue to the pancreas to ensure an appropriately augmented secretory response. The down regulation of BCAA catabolism correlated closely with liver fat accumulation and insulin. The single most up-regulated gene (5.9 fold) in the obese co-twins was osteopontin (SPP1) a cytokine involved in macrophage recruitment to adipose tissue. SPP1 is here implicated as an important player in the development of insulin resistance. These studies of exceptional study samples provide better understanding of the underlying pathology in common dyslipidemias and other obesity associated diseases important for future improvement of intervention strategies and treatments to combat atherosclerosis and coronary heart disease.
Resumo:
Celiac disease, or gluten intolerance, is triggered by dietary glutens in genetically susceptible individuals and it affects approximately 1% of the Caucasian population. The best known genetic risk factors for celiac disease are HLA DQ2 and DQ8 heterodimers, which are necessary for the development of the disease. However, they alone are not sufficient for disease induction, other risk factors are required. This thesis investigated genetic factors for celiac disease, concentrating on susceptibility loci on chromosomes 5q31-q33, 19p13 and 2q12 previously reported in genome-wide linkage and association studies. In addition, a novel genotyping method for the detection of HLA DQ2 and DQ8 coding haplotypes was validated. This study was conducted using Finnish and Hungarian family materials, and Finnish, Hungarian and Italian case-control materials. Genetic linkage and association were analysed in these materials using candidate gene and fine-mapping approaches. The results confirmed linkage to celiac disease on the chromosomal regions 5q31-q33 and 19p13. Fine-mapping on chromosome 5q31-q33 revealed several modest associations in the region, and highlighted the need for further investigations to locate the causal risk variants. The MYO9B gene on chromosome 19p13 showed evidence for linkage and association particularly with dermatitis herpetiformis, the skin manifestation of celiac disease. This implies a potential difference in the genetic background of the intestinal and skin forms of the disease, although studies on larger samplesets are required. The IL18RAP locus on chromosome 2q12, shown to be associated with celiac disease in a previous genome-wide association study and a subsequent follow-up, showed association in the Hungarian population in this study. The expression of IL18RAP was further investigated in small intestinal tissue and in peripheral blood mononuclear cells. The results showed that IL18RAP is expressed in the relevant tissues. Two putative isoforms of IL18RAP were detected by Western blot analysis, and the results suggested that the ratios and total levels of these isoforms may contribute to the aetiology of celiac disease. A novel genotyping method for celiac disease-associated HLA haplotypes was also validated in this thesis. The method utilises single-nucleotide polymorphisms tagging these HLA haplotypes with high sensitivity and specificity. Our results suggest that this method is transferable between populations, and it is suitable for large-scale analysis. In conclusion, this doctorate study provides an insight into the roles of the 5q31-q33, MYO9B, IL18RAP and HLA loci in the susceptibility to celiac disease in the Finnish, Hungarian and Italian populations, highlighting the need for further studies at these genetic loci and examination of the function of the candidate genes.
Resumo:
Glaucoma is the second leading cause of blindness worldwide. It is a group of optic neuropathies, characterized by progressive optic nerve degeneration, excavation of the optic disc due to apoptosis of retinal ganglion cells and corresponding visual field defects. Open angle glaucoma (OAG) is a subtype of glaucoma, classified according to the age of onset into juvenile and adult- forms with a cut-off point of 40 years of age. The prevalence of OAG is 1-2% of the population over 40 years and increases with age. During the last decade several candidate loci and three candidate genes, myocilin (MYOC), optineurin (OPTN) and WD40-repeat 36 (WDR36), for OAG have been identified. Exfoliation syndrome (XFS), age, elevated intraocular pressure and genetic predisposition are known risk factors for OAG. XFS is characterized by accumulation of grayish scales of fibrillogranular extracellular material in the anterior segment of the eye. XFS is overall the most common identifiable cause of glaucoma (exfoliation glaucoma, XFG). In the past year, three single nucleotide polymorphisms (SNPs) on the lysyl oxidase like 1 (LOXL1) gene have been associated with XFS and XFG in several populations. This thesis describes the first molecular genetic studies of OAG and XFS/XFG in the Finnish population. The role of the MYOC and OPTN genes and fourteen candidate loci was investigated in eight Finnish glaucoma families. Both candidate genes and loci were excluded in families, further confirming the heterogeneous nature of OAG. To investigate the genetic basis of glaucoma in a large Finnish family with juvenile and adult onset OAG, we analysed the MYOC gene in family members. Glaucoma associated mutation (Thr377Met) was identified in the MYOC gene segregating with the disease in the family. This finding has great significance for the family and encourages investigating the MYOC gene also in other Finnish OAG families. In order to identify the genetic susceptibility loci for XFS, we carried out a genome-wide scan in the extended Finnish XFS family. This scan produced promising candidate locus on chromosomal region 18q12.1-21.33 and several additional putative susceptibility loci for XFS. This locus on chromosome 18 provides a solid starting point for the fine-scale mapping studies, which are needed to identify variants conferring susceptibility to XFS in the region. A case-control and family-based association study and family-based linkage study was performed to evaluate whether SNPs in the LOXL1 gene contain a risk for XFS, XFG or POAG in the Finnish patients. A significant association between the LOXL1 gene SNPs and XFS and XFG was confirmed in the Finnish population. However, no association was detected with POAG. Probably also other genetic and environmental factors are involved in the pathogenesis of XFS and XFG.
Resumo:
Schizophrenia is a severe mental disorder affecting 0.4-1% of the population worldwide. It is characterized by impairments in the perception of reality and by significant social or occupational dysfunction. The disorder is one of the major contributors to the global burden of diseases. Studies of twins, families, and adopted children point to strong genetic components for schizophrenia, but environmental factors also play a role in the pathogenesis of disease. Molecular genetic studies have identified several potential positional candidate genes. The strongest evidence for putative schizophrenia susceptibility loci relates to the genes encoding dysbindin (DTNBP1) and neuregulin (NRG1), but studies lack impressive consistency in the precise genetic regions and alleles implicated. We have studied the role of three potential candidate genes by genotyping 28 single nucleotide polymorphisms in the DNTBP1, NRG1, and AKT1 genes in a large schizophrenia family sample consisting of 441 families with 865 affected individuals from Finland. Our results do not support a major role for these genes in the pathogenesis of schizophrenia in Finland. We have previously identified a region on chromosome 5q21-34 as a susceptibility locus for schizophrenia in a Finnish family sample. Recently, two studies reported association between the γ-aminobutyric acid type A receptor cluster of genes in this region and one study showed suggestive evidence for association with another regional gene encoding clathrin interactor 1 (CLINT1, also called Epsin 4 and ENTH). To further address the significance of these genes under the linkage peak in the Finnish families, we genotyped SNPs of these genes, and observed statistically significant association of variants between GABRG2 and schizophrenia. Furthermore, these variants also seem to affect the functioning of the working memory. Fetal events and obstetric complications are associated with schizophrenia. Rh incompatibility has been implicated as a risk factor for schizophrenia in several epidemiological studies. We conducted a family-based candidate-gene study that assessed the role of maternal-fetal genotype incompatibility at the RhD locus in schizophrenia. There was significant evidence for an RhD maternal-fetal genotype incompatibility, and the risk ratio was estimated at 2.3. This is the first candidate-gene study to explicitly test for and provide evidence of a maternal-fetal genotype incompatibility mechanism in schizophrenia. In conclusion, in this thesis we found evidence that one GABA receptor subunit, GABRG2, is significantly associated with schizophrenia. Furthermore, it also seems to affect to the functioning of the working memory. In addition, an RhD maternal-fetal genotype incompatibility increases the risk of schizophrenia by two-fold.
Resumo:
Large-scale chromosome rearrangements such as copy number variants (CNVs) and inversions encompass a considerable proportion of the genetic variation between human individuals. In a number of cases, they have been closely linked with various inheritable diseases. Single-nucleotide polymorphisms (SNPs) are another large part of the genetic variance between individuals. They are also typically abundant and their measuring is straightforward and cheap. This thesis presents computational means of using SNPs to detect the presence of inversions and deletions, a particular variety of CNVs. Technically, the inversion-detection algorithm detects the suppressed recombination rate between inverted and non-inverted haplotype populations whereas the deletion-detection algorithm uses the EM-algorithm to estimate the haplotype frequencies of a window with and without a deletion haplotype. As a contribution to population biology, a coalescent simulator for simulating inversion polymorphisms has been developed. Coalescent simulation is a backward-in-time method of modelling population ancestry. Technically, the simulator also models multiple crossovers by using the Counting model as the chiasma interference model. Finally, this thesis includes an experimental section. The aforementioned methods were tested on synthetic data to evaluate their power and specificity. They were also applied to the HapMap Phase II and Phase III data sets, yielding a number of candidates for previously unknown inversions, deletions and also correctly detecting known such rearrangements.
Resumo:
The basis of this work was the identification of a genomic region on chromosome 7p14-p15 that strongly associated with asthma and high serum total immunoglobulin E in a Finnish founder population from Kainuu. Using a hierarchical genotyping approach the linkage region was narrowed down until an evolutionary collectively inherited 133-kb haplotype block was discovered. The results were confirmed in two independent data sets: Asthma families from Quebec and allergy families from North-Karelia. In all the three cohorts studied, single nucleotide polymorphisms tagging seven common gene variants (haplotypes) were identified. Over half of the asthma patients carried three evolutionary closely related susceptibility haplotypes as opposed to approximately one third of the healthy controls. The risk effects of the gene variants varied from 1.4 to 2.5. In the disease-associated region, there was one protein-coding gene named GPRA (G Protein-coupled Receptor for Asthma susceptibility also known as NPSR1) which displayed extensive alternative splicing. Only the two isoforms with distinct intracellular tail sequences, GPRA-A and -B, encoded a full-length G protein-coupled receptor with seven transmembrane regions. Using various techniques, we showed that GPRA is expressed in multiple mucosal surfaces including epithelial cells throughout the respiratory tract. GPRA-A has additional expression in respiratory smooth muscle cells. However, in bronchial biopsies with unknown haplotypes, GPRA-B was upregulated in airways of all patient samples in contrast to the lack of expression in controls. Further support for GPRA as a common mediator of inflammation was obtained from a mouse model of ovalbumin-induced inflammation, where metacholine-induced airway hyperresponsiveness correlated with elevated GPRA mRNA levels in the lung and increased GPRA immunostaining in pulmonary macrophages. A novel GPRA agonist, Neuropeptide S (NPS), stimulated phagocytosis of Esterichia coli bacteria in a mouse macrophage cell line indicating a role for GPRA in the removal of inhaled allergens. The suggested GPRA functions prompted us to study, whether GPRA haplotypes associate with respiratory distress syndrome (RDS) and bronchopulmonary dysplasia (BPD) in infants sharing clinical symptoms with asthma. According to the results, near-term RDS and asthma may also share the same susceptibility and protective GPRA haplotypes. As in asthma, GPRA-B isoform expression was induced in bronchial smooth muscle cells in RDS and BPD suggesting a role for GPRA in bronchial hyperresponsiveness. In conclusion, the results of the present study suggest that the dysregulation of the GPRA/NPS pathway may not only be limited to the individuals carrying the risk variants of the gene but is also involved in the regulation of immune functions of asthma.
Resumo:
The glomerular epithelial cells and their intercellular junctions, termed slit diaphragms, are essential components of the filtration barrier in the kidney glomerulus. Nephrin is a transmembrane adhesion protein of the slit diaphragm and a signalling molecule regulating podocyte physiology. In congenital nephrotic syndrome of the Finnish type, mutation of nephrin leads to disruption of the permeability barrier and leakage of plasma proteins into the urine. This doctoral thesis hypothesises that novel nephrin-associated molecules are involved in the function of the filtration barrier in health and disease. Bioinformatics tools were utilized to identify novel nephrin-like molecules in genomic databases, and their distribution in the kidney and other tissues was investigated. Filtrin, a novel nephrin homologue, is expressed in the glomerular podocytes and, according to immunoelectron microscopy, localizes at the slit diaphragm. Interestingly, the nephrin and filtrin genes, NPHS1 and KIRREL2, locate in a head-to-head orientation on chromosome 19q13.12. Another nephrin-like molecule, Nphs1as was cloned in mouse, however, no expression was detected in the kidney but instead in the brain and lymphoid tissue. Notably, Nphs1as is transcribed from the nephrin locus in an antisense orientation. The glomerular mRNA and protein levels of filtrin were measured in kidney biopsies of patients with proteinuric diseases, and marked reduction of filtrin mRNA levels was detected in the proteinuric samples as compared to controls. In addition, altered distribution of filtrin in injured glomeruli was observed, with the most prominent decrease of the expression in focal segmental glomerulosclerosis. The role of the slit diaphragm-associated genes for the development of diabetic nephropathy was investigated by analysing single nucleotide polymorphisms. The genes encoding filtrin, densin-180, NEPH1, podocin, and alpha-actinin-4 were analysed, and polymorphisms at the alpha-actinin-4 gene were associated with diabetic nephropathy in a gender-dependent manner. Filtrin is a novel podocyte-expressed protein with localization at the slit diaphragm, and the downregulation of filtrin seems to be characteristic for human proteinuric diseases. In the context of the crucial role of nephrin for the glomerular filter, filtrin appears to be a potential candidate molecule for proteinuria. Although not expressed in the kidney, the nephrin antisense Nphs1as may regulate the expression of nephrin in extrarenal tissues. The genetic association analysis suggested that the alpha-actinin-4 gene, encoding an actin-filament cross-linking protein of the podocytes, may contribute to susceptibility for diabetic nephropathy.
Resumo:
Kohonneiden kolesterolipitoisuuksien alentamisessa käytettävien statiinien hyödyt sydän- ja verisuonisairauksien estossa on vahvasti osoitettu ja niiden käyttö on niin Suomessa kuin muuallakin maailmassa kasvanut voimakkaasti – Suomessa statiininkäyttäjiä on noin 600 000. Statiinilääkitys on pitkäaikaisessakin käytössä melko hyvin siedetty, mutta yleisimpinä haittavaikutuksina voi ilmetä lihasheikkoutta, -kipua ja -kramppeja, jotka voivat edetä jopa henkeä uhkaavaksi lihasvaurioksi. Lihashaittariski suurenee suhteessa statiiniannokseen ja plasman statiinipitoisuuksiin. Statiinien plasmapitoisuuksissa, tehossa ja haittavaikutusten ilmenemisessä on suuria potilaskohtaisia eroja. SLCO1B1-geenin koodaama OATP1B1-kuljetusproteiini kuljettaa monia elimistön omia aineita ja lääkeaineita verenkierrosta solukalvon läpi maksasoluun, mm. statiineja, joiden kolesterolia alentava vaikutus ja poistuminen elimistöstä tapahtuvat pääosin maksassa. Erään SLCO1B1-geenin nukleotidimuutoksen (c.521T>C) tiedetään heikentävän OATP1B1:n kuljetustehoa. Tässä väitöskirjatyössä selvitettiin SLCO1B1-geenin perinnöllistä muuntelua suomalaisilla ja eri väestöissä maailmanlaajuisesti. Lisäksi selvitettiin SLCO1B1:n muunnosten vaikutusta eri statiinien pitoisuuksiin (farmakokinetiikka) ja vaikutuksiin (farmakodynamiikka) sekä kolesteroliaineenvaihduntaan. Näihin tutkimuksiin valittiin SLCO1B1-genotyypin perusteella terveitä vapaaehtoisia koehenkilöitä, joille annettiin eri päivinä kerta-annos kutakin tutkittavaa statiinia: fluvastatiinia, pravastatiinia, simvastatiinia, rosuvastatiinia ja atorvastatiinia. Verinäytteistä määritettiin plasman statiinien ja niiden aineenvaihduntatuotteiden sekä kolesterolin ja sen muodostumista ja imeytymistä kuvaavien merkkiaineiden pitoisuuksia. Toiminnallisesti merkittävien SLCO1B1-geenimuunnosten esiintyvyydessä todettiin suuria eroja eri väestöjen välillä. Suomalaisilla SLCO1B1 c.521TC-genotyypin (geenimuunnos toisessa vastinkromosomissa) esiintyvyys oli noin 32 % ja SLCO1B1 c.521CC-genotyypin (geenimuunnos molemmissa vastinkromosomeissa) esiintyvyys noin 4 %. Globaalisti geenimuunnosten esiintyvyys korreloi maapallon leveyspiirien kanssa siten, että matalaan transportteriaktiivisuuteen johtavat muunnokset olivat yleisimpiä pohjoisessa ja korkeaan aktiivisuuteen johtavat päiväntasaajan lähellä asuvilla väestöillä. SLCO1B1-genotyypillä oli merkittävä vaikutus statiinien plasmapitoisuksiin lukuun ottamatta fluvastatiinia. Simvastatiinihapon plasmapitoisuudet olivat keskimäärin 220 %, atorvastatiinin 140 %, pravastatiinin 90 % ja rosuvastatiinin 70 % suuremmat c.521CC-genotyypin omaavilla koehenkilöillä verrattuna normaalin c.521TT-genotyypin omaaviin. Genotyypillä ei ollut merkittävää vaikutusta minkään statiinin tehoon tässä kerta-annostutkimuksessa, mutta geenimuunnoksen kantajilla perustason kolesterolisynteesinopeus oli suurempi. Tulokset osoittavat, että SLCO1B1 c.521T>C geenimuunnos on varsin yleinen suomalaisilla ja muilla ei-afrikkalaisilla väestöillä. Tämä geenimuunnos voi altistaa erityisesti simvastatiinin, mutta myös atorvastatiinin, pravastatiinin ja rosuvastatiinin, aiheuttamille lihashaitoille suurentamalla niiden plasmapitoisuuksia. SLCO1B1:n geenimuunnoksen testaamista voidaan tulevaisuudessa käyttää apuna valittaessa sopivaa statiinilääkitystä ja -annosta potilaalle, ja näin parantaa sekä statiinihoidon turvallisuutta että tehoa.
Resumo:
Electric activity of the heart consists of repeated cardiomyocyte depolarizations and repolarizations. Abnormalities in repolarization predispose to ventricular arrhythmias. In body surface electrocardiogram, ventricular repolarization generates the T wave. Several electrocardiographic measures have been developed both for clinical and research purposes to detect repolarization abnormalities. The study aim was to investigate modifiers of ventricular repolarization with the focus on the relationship of the left ventricular mass, antihypertensive drugs, and common gene variants, to electrocardiographic repolarization parameters. The prognostic value of repolarization parameters was also assessed. The study subjects originated from a population of more than 200 middle-aged hypertensive men attending the GENRES hypertension study, and from an epidemiological survey, the Health 2000 Study, including more than 6000 participants. Ventricular repolarization was analysed from digital standard 12-lead resting electrocardiograms with two QT-interval based repolarization parameters (QT interval, T-wave peak to T-wave end interval) and with a set of four T-wave morphology parameters. The results showed that in hypertensive men, a linear change in repolarization parameters is present even in the normal range of left ventricular mass, and that even mild left ventricular hypertrophy is associated with potentially adverse electrocardiographic repolarization changes. In addition, treatments with losartan, bisoprolol, amlodipine, and hydrochlorothiazide have divergent short-term effects on repolarization parameters in hypertensive men. Analyses of the general population sample showed that single nucleotide polymorphisms in KCNH2, KCNE1, and NOS1AP genes are associated with changes in QT-interval based repolarization parameters but not consistently with T-wave morphology parameters. T-wave morphology parameters, but not QT interval or T-wave peak to T-wave end interval, provided independent prognostic information on mortality. The prognostic value was specifically related to cardiovascular mortality. The results indicate that, in hypertension, altered ventricular repolarization is already present in mild left ventricular mass increase, and that commonly used antihypertensive drugs may relatively rapidly and treatment-specifically modify electrocardiographic repolarization parameters. Common variants in cardiac ion channel genes and NOS1AP gene may also modify repolarization-related arrhythmia vulnerability. In the general population, T-wave morphology parameters may be useful in the risk assessment of cardiovascular mortality.
Resumo:
Both inherited genetic variations and somatically acquired mutations drive cancer development. The aim of this thesis was to gain insight into the molecular mechanisms underlying colorectal cancer (CRC) predisposition and tumor progression. Whereas one-third of CRC may develop in the context of hereditary predisposition, the known highly penetrant syndromes only explain a small fraction of all cases. Genome-wide association studies have shown that ten common single nucleotide polymorphisms (SNPs) modestly predispose to CRC. Our population-based sample series of around thousand CRC cases and healthy controls was genotyped for these SNPs. Tumors of heterozygous patients were analyzed for allelic imbalance, in an attempt to reveal the role of these SNPs in somatic tumor progression. The risk allele of rs6983267 at 8q24 was favored in the tumors significantly more often than the neutral allele, indicating that this germline variant is somatically selected for. No imbalance targeting the risk allele was observed in the remaining loci, suggesting that most of the low-penetrance CRC SNPs mainly play a role in the early stages of the neoplastic process. The ten SNPs were further analyzed in 788 CRC cases, 97 of which had a family history of CRC, to evaluate their combined contribution. A significant association appeared between the overall number of risk alleles and familial CRC and these ten SNPs seem to explain around 9% of the familial clustering of CRC. Finding more CRC susceptibility alleles may facilitate individualized risk prediction and cancer prevention in the future. Microsatellite instability (MSI), resulting from defective mismatch repair function, is a hallmark of Lynch syndrome and observed in a subset of all CRCs. Our aim was to identify microsatellite frameshift mutations that inactivate tumor suppressor genes in MSI CRCs. By sequencing microsatellite repeats of underexpressed genes we found six novel MSI target genes that were frequently mutated in 100 MSI CRCs: 51% in GLYR1, 47% in ABCC5, 43% in WDTC1, 33% in ROCK1, 30% in OR51E2, and 28% in TCEB3. Immunohistochemical staining of GLYR1 revealed defective protein expression in homozygously mutated tumors, providing further support for the loss of function hypothesis. Another mutation screening effort sought to identify MSI target genes with putative oncogenic functions. Microsatellites were similarly sequenced in genes that were overexpressed and, upon mutation, predicted to avoid nonsense-mediated mRNA decay. The mitotic checkpoint kinase TTK harbored protein-elongating mutations in 59% of MSI CRCs and the mutant protein was detected in heterozygous MSI CRC cells. No checkpoint dysregulation or defective protein localization was observable however, and the biological relevance of this mutation may hence be related to other mechanisms. In conclusion, these two large-scale and unbiased efforts identified frequently mutated genes that are likely to contribute to the development of this cancer type and may be utilized in developing diagnostic and therapeutic applications.
Resumo:
Multiple sclerosis (MS) is the most common cause of neurological disability in young adults, affecting more than two million people worldwide. It manifests as a chronic inflammation in the central nervous system (CNS) and causes demyelination and neurodegeneration. Depending on the location of the demyelinated plaques and axonal loss, a variety of symptoms can be observed including deficits in vision, coordination, balance and movement. With a typical age of onset at 20-40 years, the social and economic impacts of MS on lives of the patients and their families are considerable. Unfortunately the current treatments are relatively inefficient and the development of more effective treatments has been impeded by our limited understanding of the causes and pathogenesis of MS. Risk of MS is higher in biological relatives of MS patients than in the general population. Twin and adoption studies have shown that familial clustering of MS is explained by shared genetic factors rather than by shared familial environment. While the involvement of the human leukocyte antigen (HLA) genes was first discovered four decades ago, additional genetic risk factors have only recently been identified through genome-wide association studies (GWAS). Current evidence suggests that MS is a highly polygenic disease with perhaps hundreds of common variants with relatively modest effects contributing to susceptibility. Despite extensive research, the majority of these risk factors still remain to be identified. In this thesis the aim was to identify novel genes and pathways involved in MS. Using genome-wide microarray technology, gene expression levels in peripheral blood mononuclear cells (PBMC) from 12 MS patients and 15 controls were profiled and more than 600 genes with altered expression in MS were identified. Three of five selected findings, DEFA1A3, LILRA4 and TNFRSF25, were successfully replicated in an independent sample. Increased expression of DEFA1A3 in MS is a particularly interesting observation, because its elevated levels have previously been reported also in several other autoimmune diseases. A systematic review of seven microarray studies was then performed leading to identification of 229 genes, in which either decreased or increased expression in MS had been reported in at least two studies. In general there was relatively little overlap across the experiments: 11 of the 229 genes had been reported in three studies and only HSPA1A in four studies. Nevertheless, these 229 genes were associated with several immunological pathways including interleukin pathways related to type 2 and type 17 helper T cells and regulatory T cells. However, whether these pathways are involved in causing MS or related to secondary processes activated after disease onset remains to be investigated. The 229 genes were also compared with loci identified in published MS GWASs. Single nucleotide polymorphisms (SNP) in 17 of the 229 loci had been reported to be associated with MS with P-value less than 0.0001 including variants in CXCR4 and SAPS2, which were the only loci where evidence for correlation between the associated variant and gene expression was found. The CXCR4 variant was further tested for association with MS in a large case-control sample and the previously reported suggestive association was replicated (P-value is 0.0004). Finally, common genetic variants in candidate genes, which had been selected on the basis of showing association with other autoimmune diseases (MYO9B) or showing differential expression in MS in our study (DEFA1A3, LILRA4 and TNFRSF25), were tested for association with MS, but no evidence of association was found. In conclusion, through a systematic review of genome-wide expression studies in MS we have identified several promising candidate genes and pathways for future studies. In addition, we have replicated a previously suggested association of a SNP variant upstream of CXCR4 with MS. Keywords: autoimmune disease, common variant, CXCR4, DEFA1A3, HSPA1A,gene expression, genetic association, GWAS, MS, multiple sclerosis, systematic review
Resumo:
In this thesis, two separate single nucleotide polymorphism (SNP) genotyping techniques were set up at the Finnish Genome Center, pooled genotyping was evaluated as a screening method for large-scale association studies, and finally, the former approaches were used to identify genetic factors predisposing to two distinct complex diseases by utilizing large epidemiological cohorts and also taking environmental factors into account. The first genotyping platform was based on traditional but improved restriction-fragment-length-polymorphism (RFLP) utilizing 384-microtiter well plates, multiplexing, small reaction volumes (5 µl), and automated genotype calling. We participated in the development of the second genotyping method, based on single nucleotide primer extension (SNuPeTM by Amersham Biosciences), by carrying out the alpha- and beta tests for the chemistry and the allele-calling software. Both techniques proved to be accurate, reliable, and suitable for projects with thousands of samples and tens of markers. Pooled genotyping (genotyping of pooled instead of individual DNA samples) was evaluated with Sequenom s MassArray MALDI-TOF, in addition to SNuPeTM and PCR-RFLP techniques. We used MassArray mainly as a point of comparison, because it is known to be well suited for pooled genotyping. All three methods were shown to be accurate, the standard deviations between measurements being 0.017 for the MassArray, 0.022 for the PCR-RFLP, and 0.026 for the SNuPeTM. The largest source of error in the process of pooled genotyping was shown to be the volumetric error, i.e., the preparation of pools. We also demonstrated that it would have been possible to narrow down the genetic locus underlying congenital chloride diarrhea (CLD), an autosomal recessive disorder, by using the pooling technique instead of genotyping individual samples. Although the approach seems to be well suited for traditional case-control studies, it is difficult to apply if any kind of stratification based on environmental factors is needed. Therefore we chose to continue with individual genotyping in the following association studies. Samples in the two separate large epidemiological cohorts were genotyped with the PCR-RFLP and SNuPeTM techniques. The first of these association studies concerned various pregnancy complications among 100,000 consecutive pregnancies in Finland, of which we genotyped 2292 patients and controls, in addition to a population sample of 644 blood donors, with 7 polymorphisms in the potentially thrombotic genes. In this thesis, the analysis of a sub-study of pregnancy-related venous thromboses was included. We showed that the impact of factor V Leiden polymorphism on pregnancy-related venous thrombosis, but not the other tested polymorphisms, was fairly large (odds ratio 11.6; 95% CI 3.6-33.6), and increased multiplicatively when combined with other risk factors such as obesity or advanced age. Owing to our study design, we were also able to estimate the risks at the population level. The second epidemiological cohort was the Helsinki Birth Cohort of men and women who were born during 1924-1933 in Helsinki. The aim was to identify genetic factors that might modify the well known link between small birth size and adult metabolic diseases, such as type 2 diabetes and impaired glucose tolerance. Among ~500 individuals with detailed birth measurements and current metabolic profile, we found that an insertion/deletion polymorphism of the angiotensin converting enzyme (ACE) gene was associated with the duration of gestation, and weight and length at birth. Interestingly, the ACE insertion allele was also associated with higher indices of insulin secretion (p=0.0004) in adult life, but only among individuals who were born small (those among the lowest third of birth weight). Likewise, low birth weight was associated with higher indices of insulin secretion (p=0.003), but only among carriers of the ACE insertion allele. The association with birth measurements was also found with a common haplotype of the glucocorticoid receptor (GR) gene. Furthermore, the association between short length at birth and adult impaired glucose tolerance was confined to carriers of this haplotype (p=0.007). These associations exemplify the interaction between environmental factors and genotype, which, possibly due to altered gene expression, predisposes to complex metabolic diseases. Indeed, we showed that the common GR gene haplotype associated with reduced mRNA expression in thymus of three individuals (p=0.0002).
Resumo:
The prevalence of obesity is increasing at an alarming rate in all age groups worldwide. Obesity is a serious health problem due to increased risk of morbidity and mortality. Although environmental factors play a major role in the development of obesity, the identification of rare monogenic defects in human genes have confirmed that obesity has a strong genetic component. Mutations have been identified in genes encoding proteins of the leptin-melanocortin signaling system, which has an important role in the regulation of appetite and energy balance. The present study aimed at identifying mutations and genetic variations in the melanocortin receptors 2-5 and other genes active on the same signaling pathway accounting for severe early-onset obesity in children and morbid obesity in adults. The main achievement of this thesis was the identification of melanocortin-4 receptor (MC4R) mutations in Finnish patients. Six pathogenic MC4R mutations (308delT, P299H, two S127L and two -439delGC mutations) were identified, corresponding to a prevalence of 3% in severe early-onset obesity. No obesity causing MC4R mutations were found among patients with adult-onset morbid obesity. The MC4R 308delT deletion is predicted to result in a grossly truncated nonfunctional receptor of only 107 amino acids. The C-terminal residues, which are important in MC4R cell surface targeting, are totally absent from the mutant 308delT receptor. In vitro functional studies supported a pathogenic role for the S127L mutation since agonist induced signaling of the receptor was impaired. Cell membrane localization of the S127L receptor did not differ from that of the wild-type receptor, confirming that impaired function of the S127L receptor was due to reduced signaling properties. The P299H mutation leads to intracellular retention of the receptor. The -439delGC deletion is situated at a potential nescient helix-loop-helix 2 (NHLH2) -binding site in the MC4R promoter. It was demonstrated that the transcription factor NHLH2 binds to the consensus sequence at the -439delGC site in vitro, possibly resulting in altered promoter activity. Several genetic variants were identified in the melanocortin-3 receptor (MC3R) and pro-opiomelanocortin (POMC) genes. These polymorphisms do not explain morbid obesity, but the results indicate that some of these genetic variations may be modifying factors in obesity, resulting in subtle changes in obesity-related traits. A risk haplotype for obesity was identified in the ectonucleotide pyrophosphatase phosphodiesterase 1 (ENPP1) gene through a candidate gene single nucleotide polymorphism (SNP) genotyping approach. An ENPP1 haplotype, composed of SNPs rs1800949 and rs943003, was shown to be significantly associated with morbid obesity in adults. Accordingly, the MC3R, POMC and ENPP1 genes represent examples of susceptibility genes in which genetic variants predispose to obesity. In conclusion, pathogenic mutations in the MC4R gene were shown to account for 3% of cases with severe early-onset obesity in Finland. This is in line with results from other populations demonstrating that mutations in the MC4R gene underlie 1-6% of morbid obesity worldwide. MC4R deficiency thus represents the most common monogenic defect causing human obesity reported so far. The severity of the MC4-receptor defect appears to be associated with time of onset and the degree of obesity. Classification of MC4R mutations may provide a useful tool when predicting the outcome of the disease. In addition, several other genetic variants conferring susceptibility to obesity were detected in the MC3R, MC4R, POMC and ENPP1 genes.