48 resultados para SNP genotyping
Resumo:
Colorectal cancer (CRC) is one of the most frequent malignancies in Western countries. Inherited factors have been suggested to be involved in 35% of CRCs. The hereditary CRC syndromes explain only ~6% of all CRCs, indicating that a large proportion of the inherited susceptibility is still unexplained. Much of the remaining genetic predisposition for CRC is probably due to undiscovered low-penetrance variations. This study was conducted to identify germline and somatic changes that contribute to CRC predisposition and tumorigenesis. MLH1 and MSH2, that underlie Hereditary non-polyposis colorectal cancer (HNPCC) are considered to be tumor suppressor genes; the first hit is inherited in the germline and somatic inactivation of the wild type allele is required for tumor initiation. In a recent study, frequent loss of the mutant allele in HNPCC tumors was detected and a new model, arguing against the two-hit hypothesis, was proposed for somatic HNPCC tumorigenesis. We tested this hypothesis by conducting LOH analysis on 25 colorectal HNPCC tumors with a known germline mutation in the MLH1 or MSH2 genes. LOH was detected in 56% of the tumors. All the losses targeted the wild type allele supporting the classical two-hit model for HNPCC tumorigenesis. The variants 3020insC, R702W and G908R in NOD2 predispose to Crohn s disease. Contribution of NOD2 to CRC predisposition has been examined in several case-control series, with conflicting results. We have previously shown that 3020insC does not predispose to CRC in Finnish CRC patients. To expand our previous study the variants R702W and G908R were genotyped in a population-based series of 1042 Finnish CRC patients and 508 healthy controls. Association analyses did not show significant evidence for association of the variants with CRC. Single nucleotide polymorphism (SNP) rs6983267 at chromosome 8q24 was the first CRC susceptibility variant identified through genome-wide association studies. To characterize the role of rs6983267 in CRC predisposition in the Finnish population, we genotyped the SNP in the case-control material of 1042 cases and 1012 controls and showed that G allele of rs6983267 is associated with the increased risk of CRC (OR 1.22; P=0.0018). Examination of allelic imbalance in the tumors heterozygous for rs6983267 revealed that copy number increase affected 22% of the tumors and interestingly, it favored the G allele. By utilizing a computer algorithm, Enhancer Element Locator (EEL), an evolutionary conserved regulatory motif containing rs6983267 was identified. The SNP affected the binding site of TCF4, a transcription factor that mediates Wnt signaling in cells, and has proven to be crucial in colorectal neoplasia. The preferential binding of TCF4 to the risk allele G was showed in vitro and in vivo. The element drove lacZ marker gene expression in mouse embryos in a pattern that is consistent with genes regulated by the Wnt signaling pathway. These results suggest that rs6983267 at 8q24 exerts its effect in CRC predisposition by regulating gene expression. The most obvious target gene for the enhancer element is MYC, residing ~335 kb downstream, however further studies are required to establish the transcriptional target(s) of the predicted enhancer element.
Resumo:
Hereditary Leiomyomatosis and Renal Cell Cancer (HLRCC) is a hereditary tumour predisposition syndrome. Its phenotype includes benign cutaneous and uterine leiomyomas (CLM, ULM) with high penetrance and rarer renal cell cancer (RCC), most commonly of papillary type 2 subtype. Over 130 HLRCC families have been identified world-wide but the RCC phenotype seems to concentrate in families from Finland and North America for unknown reasons. HLRCC is caused by heterozygous germline mutations in the fumarate hydratase (FH) gene. FH encodes the enzyme fumarase from mitochondrial citric acid cycle. Fumarase enzyme activity or type or site of the FH mutation are unassociated with disease phenotype. The strongest evidence for tumourigenesis mechanism in HLRCC supports a hypoxia inducible factor driven process called pseudohypoxia resulting from accumulation of the fumarase substrate fumarate. In this study, to assess the importance of gene- or exon-level deletions or amplifications of FH in patients with HLRCC-associated phenotypes, multiplex ligation-dependent probe amplification (MLPA) method was used. One novel FH mutation, deletion of exon 1, was found in a Swedish male patient with an evident HLRCC phenotype with CLM, RCC, and a family history of ULM and RCC. Six other patients with CLM and 12 patients with only RCC or uterine leiomyosarcoma (ULMS) remained FH mutation-negative. These results suggest that copy number aberrations of FH or its exons are an infrequent cause of HLRCC and that only co-occurrence of benign tumour types justifies FH-mutation screening in RCC or ULMS patients. Determination of the genomic profile of 11 HLRCC-associated RCCs from Finnish patients was performed by array comparative genomic hybridization. The most common copy number aberrations were gains of 2, 7, and 17 and losses of 13q12.3-q21.1, 14, 18, and X. When compared to aberrations of sporadic papillary RCCs, HLRCC-associated RCCs harboured a distinct DNA copy number profile and lacked many of the changes characterizing the sporadic RCCs. The findings suggest a divergent molecular pathway for tumourigenesis of papillary RCCs in HLRCC. In order to find a genetic modifier of RCC risk in HLRCC, genome-wide linkage and identical by descent (IBD) analysis studies were performed in Finnish HLRCC families with microsatellite marker mapping and SNP-array platforms. The linkage analysis identified only one locus of interest, the FH gene locus in 1q43, but no mutations were found in the genes of the region. IBD analysis yielded no convincing haplotypes shared by RCC patients. Although these results do not exclude the existence of a genetic modifier for RCC risk in HLRCC, they emphasize the role of FH mutations in the malignant tumourigenesis of HLRCC. To study the benign tumours in HLRCC, genome-wide DNA copy number and gene expression profiles of sporadic and HLRCC ULMs were defined with modern SNP- and gene-expression array platforms. The gene expression array suggests novel genes involved in FH-deficient ULM tumourigenesis and novel genes with putative roles in propagation of sporadic ULM. Both the gene expression and copy number profiles of HLRCC ULMs differed from those of sporadic ULMs indicating distinct molecular basis of the FH-deficient HLRCC tumours.
Resumo:
Chromosomal alterations in leukemia have been shown to have prognostic and predictive significance and are also important minimal residual disease (MRD) markers in the follow-up of leukemia patients. Although specific oncogenes and tumor suppressors have been discovered in some of the chromosomal alterations, the role and target genes of many alterations in leukemia remain unknown. In addition, a number of leukemia patients have a normal karyotype by standard cytogenetics, but have variability in clinical course and are often molecularly heterogeneous. Cytogenetic methods traditionally used in leukemia analysis and diagnostics; G-banding, various fluorescence in situ hybridization (FISH) techniques, and chromosomal comparative genomic hybridization (cCGH), have enormously increased knowledge about the leukemia genome, but have limitations in resolution or in genomic coverage. In the last decade, the development of microarray comparative genomic hybridization (array-CGH, aCGH) for DNA copy number analysis and the SNP microarray (SNP-array) method for simultaneous copy number and loss of heterozygosity (LOH) analysis has enabled investigation of chromosomal and gene alterations genome-wide with high resolution and high throughput. In these studies, genetic alterations were analyzed in acute myeloid leukemia (AML) and chronic lymphocytic leukemia (CLL). The aim was to screen and characterize genomic alterations that could play role in leukemia pathogenesis by using aCGH and SNP-arrays. One of the most important goals was to screen cryptic alterations in karyotypically normal leukemia patients. In addition, chromosomal changes were evaluated to narrow the target regions, to find new markers, and to obtain tumor suppressor and oncogene candidates. The work presented here shows the capability of aCGH to detect submicroscopic copy number alterations in leukemia, with information about breakpoints and genes involved in the alterations, and that genome-wide microarray analyses with aCGH and SNP-array are advantageous methods in the research and diagnosis of leukemia. The most important findings were the cryptic changes detected with aCGH in karyotypically normal AML and CLL, characterization of amplified genes in 11q marker chromosomes, detection of deletion-based mechanisms of MLL-ARHGEF12 fusion gene formation, and detection of LOH without copy number alteration in karyotypically normal AML. These alterations harbor candidate oncogenes and tumor suppressors for further studies.
Resumo:
Celiac disease, or gluten intolerance, is triggered by dietary glutens in genetically susceptible individuals and it affects approximately 1% of the Caucasian population. The best known genetic risk factors for celiac disease are HLA DQ2 and DQ8 heterodimers, which are necessary for the development of the disease. However, they alone are not sufficient for disease induction, other risk factors are required. This thesis investigated genetic factors for celiac disease, concentrating on susceptibility loci on chromosomes 5q31-q33, 19p13 and 2q12 previously reported in genome-wide linkage and association studies. In addition, a novel genotyping method for the detection of HLA DQ2 and DQ8 coding haplotypes was validated. This study was conducted using Finnish and Hungarian family materials, and Finnish, Hungarian and Italian case-control materials. Genetic linkage and association were analysed in these materials using candidate gene and fine-mapping approaches. The results confirmed linkage to celiac disease on the chromosomal regions 5q31-q33 and 19p13. Fine-mapping on chromosome 5q31-q33 revealed several modest associations in the region, and highlighted the need for further investigations to locate the causal risk variants. The MYO9B gene on chromosome 19p13 showed evidence for linkage and association particularly with dermatitis herpetiformis, the skin manifestation of celiac disease. This implies a potential difference in the genetic background of the intestinal and skin forms of the disease, although studies on larger samplesets are required. The IL18RAP locus on chromosome 2q12, shown to be associated with celiac disease in a previous genome-wide association study and a subsequent follow-up, showed association in the Hungarian population in this study. The expression of IL18RAP was further investigated in small intestinal tissue and in peripheral blood mononuclear cells. The results showed that IL18RAP is expressed in the relevant tissues. Two putative isoforms of IL18RAP were detected by Western blot analysis, and the results suggested that the ratios and total levels of these isoforms may contribute to the aetiology of celiac disease. A novel genotyping method for celiac disease-associated HLA haplotypes was also validated in this thesis. The method utilises single-nucleotide polymorphisms tagging these HLA haplotypes with high sensitivity and specificity. Our results suggest that this method is transferable between populations, and it is suitable for large-scale analysis. In conclusion, this doctorate study provides an insight into the roles of the 5q31-q33, MYO9B, IL18RAP and HLA loci in the susceptibility to celiac disease in the Finnish, Hungarian and Italian populations, highlighting the need for further studies at these genetic loci and examination of the function of the candidate genes.
Resumo:
Positional cloning has enabled hypothesis-free, genome-wide scans for genetic factors contributing to disorders or traits. Traditionally linkage analysis has been used to identify regions of interest, followed by meticulous fine mapping and candidate gene screening using association methods and finally sequencing of regions of interest. More recently, genome-wide association analysis has enabled a more direct approach to identify specific genetic variants explaining a part of the variance of the phenotype of interest. Autism spectrum disorders (ASDs) are a group of childhood onset neuropsychiatric disorders with shared core symptoms but varying severity. Although a strong genetic component has been established in ASDs, genetic susceptibility factors have largely eluded characterization. Here, we have utilized modern molecular genetic methods combined with the advantages provided by the special population structure in Finland to identify genetic risk factors for ASDs. The results of this study show that numerous genetic risk factors exist for ASDs even within a population isolate. Stratification based on clinical phenotype resulted in encouraging results, as previously identified linkage to 3p14-p24 was replicated in an independent family set of families with Asperger syndrome, but no other ASDs. Fine-mapping of the previously identified linkage peak for ASDs at 3q25-q27 revealed association between autism and a subunit of the 5-hydroxytryptamine receptor 3C (HTR3C). We also used dense, genome-wide single nucleotide polymorphism (SNP) data to characterize the population structure of Finns. We observed significant population substructure which correlates with the known history of multiple consecutive bottle-necks experienced by the Finnish population. We used this information to ascertain a genetically homogenous subset of autism families to identify possible rare, enriched risk variants using genome-wide SNP data. No rare enriched genetic risk factors were identified in this dataset, although a subset of families could be genealogically linked to form two extended pedigrees. The lack of founder mutations in this isolated population suggests that the majority of genetic risk factors are rare, de novo mutations unique to individual nuclear families. The results of this study are consistent with others in the field. The underlying genetic architecture for this group of disorders appears highly heterogeneous, with common variants accounting for only a subset of genetic risk. The majority of identified risk factors have turned out to be exceedingly rare, and only explain a subset of the genetic risk in the general population in spite of their high penetrance within individual families. The results of this study, together with other results obtained in this field, indicate that family specific linkage, homozygosity mapping and resequencing efforts are needed to identify these rare genetic risk factors.
Resumo:
Cardiovascular diseases (CVD) are major contributors to morbidity and mortality worldwide. Several interacting environmental, biochemical, and genetic risk factors can increase disease susceptibility. While some of the genes involved in the etiology of CVD are known, many are yet to be discovered. During the last few decades, scientists have searched for these genes with genome-wide linkage and association methods, and with more targeted candidate gene studies. This thesis investigates variation within the upstream transcription factor 1 (USF1) gene locus in relation to CVD risk factors, atherosclerosis, and incidence and prevalence of CVD. This candidate gene was first identified in Finnish families ascertained for familial combined hyperlipidemia, a common dyslipidemia predisposing to coronary heart disease. The gene is a ubiquitously expressed transcription factor regulating expression of several genes from lipid and glucose metabolism, inflammation, and endothelial function. First, we examined association between USF1 variants and several CVD risk factors, such as lipid phenotypes, body composition measures, and metabolic syndrome, in two prospective population cohorts. Our data suggested that USF1 contributes to these CVD risk factors at the population level. Notably, the associations with quantitative measurements were mostly detected among study subjects with CVD or metabolic syndrome, suggesting complex interactions between USF1 effects and the pathophysiological state of an individual. Second, we investigated how variation at the USF1 locus contributes to atherosclerotic lesions of the coronary arteries and abdominal aorta. For this, we used two study samples of middle-aged men with detailed measurements of atherosclerosis obtained in autopsy. USF1 variation significantly associated with areas of several types of lesions, especially with calcification of the arteries. Next, we tested what effect the USF1 risk variants have on sudden cardiac death and incidence of CVD. The atherosclerosis-associated risk variant increased the risk of sudden cardiac death of the same study subjects. Furthermore, USF1 alleles associated with incidence of CVD in the Finnish population follow-up cohorts. These associations were especially prominent among women, suggesting a sex specific effect, which has also been detected in subsequent studies. Finally, as some of the low-yield DNA samples of the Finnish follow-up study cohort needed to be whole-genome amplified (WGA) prior to genotyping, we evaluated whether the produced WGA genotypes were of good quality. Although the samples giving genotype discrepancies could not be detected before genotyping with standard laboratory quality control methods, our results suggested that enhanced quality control at the time of the genotyping could identify such samples. In addition, combining two WGA reactions into one pooled DNA sample for genotyping markedly reduced the number of discrepancies and samples showing them. In conclusion, USF1 seems to have a role in the etiology of CVD. Additional studies are warranted to identify functional variants and to study interactions between USF1 and other genetic or environmental factors. This USF1 study, and other studies with low DNA yield of some samples, can benefit from whole genome amplification of the low-yield samples prior to genotyping. Careful quality control procedures are, however, needed in WGA genotyping.
Resumo:
Genetics, the science of heredity and variation in living organisms, has a central role in medicine, in breeding crops and livestock, and in studying fundamental topics of biological sciences such as evolution and cell functioning. Currently the field of genetics is under a rapid development because of the recent advances in technologies by which molecular data can be obtained from living organisms. In order that most information from such data can be extracted, the analyses need to be carried out using statistical models that are tailored to take account of the particular genetic processes. In this thesis we formulate and analyze Bayesian models for genetic marker data of contemporary individuals. The major focus is on the modeling of the unobserved recent ancestry of the sampled individuals (say, for tens of generations or so), which is carried out by using explicit probabilistic reconstructions of the pedigree structures accompanied by the gene flows at the marker loci. For such a recent history, the recombination process is the major genetic force that shapes the genomes of the individuals, and it is included in the model by assuming that the recombination fractions between the adjacent markers are known. The posterior distribution of the unobserved history of the individuals is studied conditionally on the observed marker data by using a Markov chain Monte Carlo algorithm (MCMC). The example analyses consider estimation of the population structure, relatedness structure (both at the level of whole genomes as well as at each marker separately), and haplotype configurations. For situations where the pedigree structure is partially known, an algorithm to create an initial state for the MCMC algorithm is given. Furthermore, the thesis includes an extension of the model for the recent genetic history to situations where also a quantitative phenotype has been measured from the contemporary individuals. In that case the goal is to identify positions on the genome that affect the observed phenotypic values. This task is carried out within the Bayesian framework, where the number and the relative effects of the quantitative trait loci are treated as random variables whose posterior distribution is studied conditionally on the observed genetic and phenotypic data. In addition, the thesis contains an extension of a widely-used haplotyping method, the PHASE algorithm, to settings where genetic material from several individuals has been pooled together, and the allele frequencies of each pool are determined in a single genotyping.
Resumo:
Growth is a fundamental aspect of life cycle of all organisms. Body size varies highly in most animal groups, such as mammals. Moreover, growth of a multicellular organism is not uniform enlargement of size, but different body parts and organs grow to their characteristic sizes at different times. Currently very little is known about the molecular mechanisms governing this organ-specific growth. The genome sequencing projects have provided complete genomic DNA sequences of several species over the past decade. The amount of genomic sequence information, including sequence variants within species, is constantly increasing. Based on the universal genetic code, we can make sense of this sequence information as far as it codes proteins. However, less is known about the molecular mechanisms that control expression of genes, and about the variations in gene expression that underlie many pathological states in humans. This is caused in part by lack of information about the second genetic code that consists of the binding specificities of transcription factors and the combinatorial code by which transcription factor binding sites are assembled to form tissue-specific and/or ligand-regulated enhancer elements. This thesis presents a high-throughput assay for identification of transcription factor binding specificities, which were then used to measure the DNA binding profiles of transcription factors involved in growth control. We developed ‘enhancer element locator’, a computational tool, which can be used to predict functional enhancer elements. A genome-wide prediction of human and mouse enhancer elements generated a large database of enhancer elements. This database can be used to identify target genes of signaling pathways, and to predict activated transcription factors based on changes in gene expression. Predictions validated in transgenic mouse embryos revealed the presence of multiple tissue-specific enhancers in mouse c- and N-Myc genes, which has implications to organ specific growth control and tumor type specificity of oncogenes. Furthermore, we were able to locate a variation in a single nucleotide, which carries a susceptibility to colorectal cancer, to an enhancer element and propose a mechanism by which this SNP might be involved in generation of colorectal cancer.
Resumo:
Wood decay fungi belonging to the species complex Heterobasidion annosum sensu lato are among the most common and economically important species causing root rot and stem decay in conifers of the northern temperate regions. New infections by these pathogens can be suppressed by tree stump treatments using chemical or biological control agents. In Finland, the corticiaceous fungus Phlebiopsis gigantea has been formulated into a commercial biocontrol agent called Rotstop (Verdera Ltd.). This thesis addresses the ecological impacts of Rotstop biocontrol treatment on the mycoflora of conifer stumps. Locally, fungal communities within Rotstop-treated and untreated stumps were analyzed using a novel method based on DGGE profiling of small subunit ribosomal DNA fragments amplified directly from wood samples. Population analyses for P. gigantea and H. annosum s.l. were conducted to evaluate possible risks associated with local and/or global distribution of the Rotstop strain. Based on molecular community profiling by DGGE, we detected a few individual wood-inhabiting fungal species (OTUs) that seemed to have suffered or benefited from the Rotstop biocontrol treatment. The DGGE analyses also revealed fungal diversity not retrieved by cultivation and some fungal sequence types untypical for decomposing conifer wood. However, statistical analysis of DGGE community profiles obtained from Rotstop-treated and untreated conifer stumps revealed that the Rotstop treatment had not caused a statistically significant reduction in the species diversity of wood-inhabiting fungi within our experimental forest plots. Locally, ISSR genotyping of cultured P. gigantea strains showed that the Rotstop biocontrol strain was capable of surviving up to six years within treated Norway spruce stumps, while in Scots pine stumps it was sooner replaced by successor fungal species. In addition, the spread of resident P. gigantea strains into Rotstop-treated forest stands seemed effective in preventing the formation of genetically monomorphic populations in the short run. On a global scale, we detected a considerable level of genetic differentiation between the interfertile European and North American populations of P. gigantea. These results strongly suggest that local biocontrol strains should be used in order to prevent global spread of P. gigantea and hybrid formation between geographically isolated populations. The population analysis for H. annosum s.l. revealed a collection of Chinese fungal strains that showed a high degree of laboratory fertility with three different allopatric H. annosum s.l. taxa. However, based on the molecular markers, the Chinese strains could be clearly affiliated with the H. parviporum taxonomical cluster, which thus appears to have a continuous distribution range from Europe through southern Siberia to northern China. Keywords: Rotstop, wood decay, DGGE, ISSR fingerprinting, ribosomal DNA
Resumo:
The basis of this work was the identification of a genomic region on chromosome 7p14-p15 that strongly associated with asthma and high serum total immunoglobulin E in a Finnish founder population from Kainuu. Using a hierarchical genotyping approach the linkage region was narrowed down until an evolutionary collectively inherited 133-kb haplotype block was discovered. The results were confirmed in two independent data sets: Asthma families from Quebec and allergy families from North-Karelia. In all the three cohorts studied, single nucleotide polymorphisms tagging seven common gene variants (haplotypes) were identified. Over half of the asthma patients carried three evolutionary closely related susceptibility haplotypes as opposed to approximately one third of the healthy controls. The risk effects of the gene variants varied from 1.4 to 2.5. In the disease-associated region, there was one protein-coding gene named GPRA (G Protein-coupled Receptor for Asthma susceptibility also known as NPSR1) which displayed extensive alternative splicing. Only the two isoforms with distinct intracellular tail sequences, GPRA-A and -B, encoded a full-length G protein-coupled receptor with seven transmembrane regions. Using various techniques, we showed that GPRA is expressed in multiple mucosal surfaces including epithelial cells throughout the respiratory tract. GPRA-A has additional expression in respiratory smooth muscle cells. However, in bronchial biopsies with unknown haplotypes, GPRA-B was upregulated in airways of all patient samples in contrast to the lack of expression in controls. Further support for GPRA as a common mediator of inflammation was obtained from a mouse model of ovalbumin-induced inflammation, where metacholine-induced airway hyperresponsiveness correlated with elevated GPRA mRNA levels in the lung and increased GPRA immunostaining in pulmonary macrophages. A novel GPRA agonist, Neuropeptide S (NPS), stimulated phagocytosis of Esterichia coli bacteria in a mouse macrophage cell line indicating a role for GPRA in the removal of inhaled allergens. The suggested GPRA functions prompted us to study, whether GPRA haplotypes associate with respiratory distress syndrome (RDS) and bronchopulmonary dysplasia (BPD) in infants sharing clinical symptoms with asthma. According to the results, near-term RDS and asthma may also share the same susceptibility and protective GPRA haplotypes. As in asthma, GPRA-B isoform expression was induced in bronchial smooth muscle cells in RDS and BPD suggesting a role for GPRA in bronchial hyperresponsiveness. In conclusion, the results of the present study suggest that the dysregulation of the GPRA/NPS pathway may not only be limited to the individuals carrying the risk variants of the gene but is also involved in the regulation of immune functions of asthma.
Resumo:
Dispersal is a highly important life history trait. In fragmented landscapes the long-term persistence of populations depends on dispersal. Evolution of dispersal is affected by costs and benefits and these may differ between different landscapes. This results in differences in the strength and direction of natural selection on dispersal in fragmented landscapes. Dispersal has been shown to be a nonrandom process that is associated with traits such as flight ability in insects. This thesis examines genetic and physiological traits affecting dispersal in the Glanville fritillary butterfly (Melitaea cinxia). Flight metabolic rate is a repeatable trait representing flight ability. Unlike in many vertebrates, resting metabolic rate cannot be used as a surrogate of maximum metabolic rate as no strong correlation between the two was found in the Glanville fritillary. Resting and flight metabolic rate are affected by environmental variables, most notably temperature. However, only flight metabolic rate has a strong genetic component. Molecular variation in the much-studied candidate locus phosphoglucose isomerase (Pgi), which encodes the glycolytic enzyme PGI, has an effect on carbohydrate metabolism in flight. This effect is temperature dependent: in low to moderate temperatures individuals with the heterozygous genotype at the single nucleotide polymorphism (SNP) AA111 have higher flight metabolic rate than the common homozygous genotype. At high temperatures the situation is reversed. This finding suggests that variation in enzyme properties is indeed translated to organismal performance. High-resolution data on individual female Glanville fritillaries moving freely in the field were recorded using harmonic radar. There was a strong positive correlation between flight metabolic rate and dispersal rate. Flight metabolic rate explained one third of the observed variation in the one-hour movement distance. A fine-scaled analysis of mobility showed that mobility peaked at intermediate ambient temperatures but the two common Pgi genotypes differed in their reaction norms to temperature. As with flight metabolic rate, heterozygotes at SNP AA111 were the most active genotype in low to moderate temperatures. The results show that molecular variation is associated with variation in dispersal rate through the link of flight physiology under the influence of environmental conditions. The evolutionary pressures for dispersal differ between males and females. The effect of flight metabolic rate on dispersal was examined in both sexes in field and laboratory conditions. The relationship between flight metabolic rate and dispersal rate in the field and flight duration in the laboratory were found to differ between the two sexes. In females the relationship was positive, but in males the longest distances and flight durations were recorded for individuals with low flight metabolic rate. These findings may reflect male investment in mate locating. Instead of dispersing, males with high flight metabolic rate may establish territories and follow a perching strategy when locating females and hence move less on the landscape level. Males with low metabolic rate may be forced to disperse due to low competitive success or may show adaptations to an alternative strategy: patrolling. In the light of life history trade-offs and the rate of living theory having high metabolic rate may carry a cost in the form of shortened lifespan. Experiments relating flight metabolic rate to longevity showed a clear correlation in the opposite direction: high flight metabolic rate was associated with long lifespan. This suggests that individuals with high metabolic rate do not pay an extra physiological cost for their high flight capacity, rather there are positive correlations between different measures of fitness. These results highlight the importance of condition.
Resumo:
Evolutionary genetics incorporates traditional population genetics and studies of the origins of genetic variation by mutation and recombination, and the molecular evolution of genomes. Among the primary forces that have potential to affect the genetic variation within and among populations, including those that may lead to adaptation and speciation, are genetic drift, gene flow, mutations and natural selection. The main challenges in knowing the genetic basis of evolutionary changes is to distinguish the adaptive selection forces that cause existent DNA sequence variants and also to identify the nucleotide differences responsible for the observed phenotypic variation. To understand the effects of various forces, interpretation of gene sequence variation has been the principal basis of many evolutionary genetic studies. The main aim of this thesis was to assess different forms of teleost gene sequence polymorphisms in evolutionary genetic studies of Atlantic salmon (Salmo salar) and other species. Firstly, the level of Darwinian adaptive evolution affected coding regions of the growth hormone (GH) gene during the teleost evolution was investigated based on the sequence data existing in public databases. Secondly, a target gene approach was used to identify within population variation in the growth hormone 1 (GH1) gene in salmon. Then, a new strategy for single nucleotide polymorphisms (SNPs) discovery in salmonid fishes was introduced, and, finally, the usefulness of a limited number of SNP markers as molecular tools in several applications of population genetics in Atlantic salmon was assessed. This thesis showed that the gene sequences in databases can be utilized to perform comparative studies of molecular evolution, and some putative evidence of the existence of Darwinian selection during the teleost GH evolution was presented. In addition, existent sequence data was exploited to investigate GH1 gene variation within Atlantic salmon populations throughout its range. Purifying selection is suggested to be the predominant evolutionary force controlling the genetic variation of this gene in salmon, and some support for gene flow between continents was also observed. The novel approach to SNP discovery in species with duplicated genome fragments introduced here proved to be an effective method, and this may have several applications in evolutionary genetics with different species - e.g. when developing gene-targeted markers to investigate quantitative genetic variation. The thesis also demonstrated that only a few SNPs performed highly similar signals in some of the population genetic analyses when compared with the microsatellite markers. This may have useful applications when estimating genetic diversity in genes having a potential role in ecological and conservation issues, or when using hard biological samples in genetic studies as SNPs can be applied with relatively highly degraded DNA.
Resumo:
In the present study, we identified a novel asthma susceptibility gene, NPSR1 (neuropeptide S receptor 1) on chromosome 7p14.3 by the positional cloning strategy. An earlier significant linkage mapping result among Finnish Kainuu asthma families was confirmed in two independent cohorts: in asthma families from Quebec, Canada and in allergy families from North Karelia, Finland. The linkage region was narrowed down to a 133-kb segment by a hierarchial genotyping method. The observed 77-kb haplotype block showed 7 haplotypes and a similar risk and nonrisk pattern in all three populations studied. All seven haplotypes occur in all three populations at frequences > 2%. Significant elevated relative risks were detected for elevated total IgE (immunoglobulin E) or asthma. Risk effects of the gene variants varied from 1.4 to 2.5. NPSR1 belongs to the G protein-coupled receptor (GPCR) family with a topology of seven transmembrane domains. NPSR1 has 9 exons, with the two main transcripts, A and B, encoding proteins of 371 and 377 amino acids, respectively. We detected a low but ubiquitous expression level of NPSR1-B in various tissues and endogenous cell lines while NPSR1-A has a more restricted expression pattern. Both isoforms were expressed in the lung epithelium. We observed aberrant expression levels of NPSR1-B in smooth muscle in asthmatic bronchi as compared to healthy. In an experimental mouse model, the induced lung inflammation resulted in elevated Npsr1 levels. Furthermore, we demonstrated that the activation of NPSR1 with its endogenous agonist, neuropeptide S (NPS), resulted in a significant inhibition of the growth of NPSR1-A overexpressing stable cell lines (NPSR1-A cells). To determine which target genes were regulated by the NPS-NPSR1 pathway, NPSR1-A cells were stimulated with NPS, and differentially expressed genes were identified using the Affymetrix HGU133Plus2 GeneChip. A total of 104 genes were found significantly up-regulated and 42 down-regulated 6 h after NPS administration. The up-regulated genes included many neuronal genes and some putative susceptibility genes for respiratory disorders. By Gene Ontology enrichment analysis, the biological process terms, cell proliferation, morphogenesis and immune response were among the most altered. The expression of four up-regulated genes, matrix metallopeptidase 10 (MMP10), INHBA (activin A), interleukin 8 (IL8) and EPH receptor A2 (EPHA2), were verified and confirmed by quantitative reverse-transcriptase-PCR. In conclusion, we identified a novel asthma susceptibility gene, NPSR1, on chromosome 7p14.3. NPS-NPSR1 represents a novel pathway that regulates cell proliferation and immune responses, and thus may have functional relevance in the pathogenesis of asthma.
Resumo:
Multiple sclerosis (MS) is an immune-mediated demyelinating disorder of the central nervous system (CNS) affecting 0.1-0.2% of Northern European descent population. MS is considered to be a multifactorial disease, both environment and genetics play a role in its pathogenesis. Despite several decades of intense research, the etiological and pathogenic mechanisms underlying MS remain still largely unknown and no curative treatment exists. The genetic architecture underlying MS is complex with multiple genes involved. The strongest and the best characterized predisposing genetic factors for MS are located, as in other immune-mediated diseases, in the major histocompatibility complex (MHC) on chromosome 6. In humans MHC is called human leukocyte antigen (HLA). Alleles of the HLA locus have been found to associate strongly with MS and remained for many years the only consistently replicable genetic associations. However, recently other genes located outside the MHC region have been proposed as strong candidates for susceptibility to MS in several studies. In this thesis a new genetic locus located on chromosome 7q32, interferon regulatory factor 5 (IRF5), was identified in the susceptibility to MS. In particular, we found that common variation of the gene was associated with the disease in three different populations, Spanish, Swedish and Finnish. We also suggested a possible functional role for one of the risk alleles with impact on the expression of the IRF5 locus. Previous studies have pointed out a possible role played by chromosome 2q33 in the susceptibility to MS and other autoimmune disorders. The work described here also investigated the involvement of this chromosomal region in MS predisposition. After the detection of genetic association with 2q33 (article-1), we extended our analysis through fine-scale single nucleotide polymorphism (SNP) mapping to define further the contribution of this genomic area to disease pathogenesis (article-4). We found a trend (p=0.04) for association to MS with an intronic SNP located in the inducible T-cell co-stimulator (ICOS) gene, an important player in the co-stimulatory pathway of the immune system. Expression analysis of ICOS revealed a novel, previously uncharacterized, alternatively spliced isoform, lacking the extracellular domain that is needed for ligand binding. The stability of the newly-identified transcript variant and its subcellular localization were analyzed. These studies indicated that the novel isoform is stable and shows different subcellular localization as compared to full-length ICOS. The novel isoform might have a regulatory function, but further studies are required to elucidate its function. Chromosome 19q13 has been previously suggested as one of the genomic areas involved in MS predisposition. In several populations, suggestive linkage signals between MS predisposition and 19q13 have been obtained. Here, we analysed the role of allelic variation in 19q13 by family based association analysis in 782 MS families collected from Finland. In this dataset, we were not able to detect any statistically significant associations, although several previously suggested markers were included to the analysis. Replication of the previous findings on the basis of linkage disequilibrium between marker allele and disease/risk allele appears notoriously difficult because of limitations such as allelic heterogeneity. Re-sequencing based approaches may be required for elucidating the role of chromosome 19q13 with MS. This thesis has resulted in the identification of a new MS susceptibility locus (IRF5) previously associated with other inflammatory or autoimmune disorders, such as SLE. IRF5 is one of the mediators of interferons biological function. In addition to providing new insight in the possible pathogenetic pathway of the disease, this finding suggests that there might be common mechanisms between different immune-mediated disorders. Furthermore the work presented here has uncovered a novel isoform of ICOS, which may play a role in regulatory mechanisms of ICOS, an important mediator of lymphocyte activation. Further work is required to uncover its functions and possible involvement of the ICOS locus in MS susceptibility.
Resumo:
Bipolar disorder (BP) is a complex psychiatric disorder characterized by episodes of mania and depression. BP affects approximately 1% of the world’s population and shows no difference in lifetime prevalence between males and females. BP arises from complex interactions among genetic, developmental and environmental factors, and it is likely that several predisposing genes are involved in BP. The genetic background of BP is still poorly understood, although intensive and long-lasting research has identified several chromosomal regions and genes involved in susceptibility to BP. This thesis work aims to identify the genetic variants that influence bipolar disorder in the Finnish population by candidate gene and genome-wide linkage analyses in families with many BP cases. In addition to diagnosis-based phenotypes, neuropsychological traits that can be seen as potential endophenotypes or intermediate traits for BP were analyzed. In the first part of the thesis, we examined the role of the allelic variants of the TSNAX/DISC1 gene cluster to psychotic and bipolar spectrum disorders and found association of distinct allelic haplotypes with these two groups of disorders. The haplotype at the 5’ end of the Disrupted-in-Schizophrenia-1 gene (DISC1) was over-transmitted to males with psychotic disorder (p = 0.008; for an extended haplotype p = 0.0007 with both genders), whereas haplotypes at the 3’ end of DISC1 associated with bipolar spectrum disorder (p = 0.0002; for an extended haplotype p = 0.0001). The variants of these haplotypes also showed association with different cognitive traits. The haplotypes at the 5’ end associated with perseverations and auditory attention, while the variants at the 3’ end associated with several cognitive traits including verbal fluency and psychomotor processing speed. Second, in our complete set of BP families with 723 individuals we studied six functional candidate genes from three distinct signalling systems: serotonin-related genes (SLC6A4 and TPH2), BDNF -related genes (BDNF, CREB1 and NTRK2) and one gene related to the inflammation and cytokine system (P2RX7). We replicated association of the functional variant Val66Met of BDNF with BP and better performance in retention. The variants at the 5’ end of SLC6A4 also showed some evidence of association among males (p = 0.004), but the widely studied functional variants did not yield any significant results. A protective four-variant haplotype on P2RX7 showed evidence of association with BP and executive functions: semantic and phonemic fluency (p = 0.006 and p = 0.0003, respectively). Third, we analyzed 23 bipolar families originating from the North-Eastern region of Finland. A genome-wide scan was performed using the 6K single nucleotide polymorphism (SNP) array. We identified susceptibility loci at chromosomes 7q31 with a LOD score of 3.20 and at 9p13.1 with a LOD score of 4.02. We followed up both linkage findings in the complete set of 179 Finnish bipolar families. The finding on chromosome 9p13 was supported (maximum LOD score of 3.02), but the susceptibility gene itself remains unclarified. In the fourth part of the thesis, we wanted to test the role of the allelic variants that have associated with bipolar disorder in recent genome-wide association studies (GWAS). We could confirm findings for the DFNB31, SORCS2, SCL39A3, and DGKH genes. The best signal in this study comes from DFNB31, which remained significant after multiple testing corrections. Two variants of SORCS2 were allelic replications and presented the same signal as the haplotype analysis. However, no association was detected with the PALB2 gene, which was the most significantly associated region in the previous GWAS. Our results indicate that BP is heterogeneous and its genetic background may accordingly vary in different populations. In order to fully understand the allelic heterogeneity that underlies common diseases such as BP, complete genome sequencing for many individuals with and without the disease is required. Identification of the specific risk variants will help us better understand the pathophysiology underlying BP and will lead to the development of treatments with specific biochemical targets. In addition, it will further facilitate the identification of environmental factors that alter risk, which will potentially provide improved occupational, social and psychological advice for individuals with high risk of BP.