71 results for Evolutionary Genetics
Abstract:
Congenital lactase deficiency (CLD) (MIM 223000) is a rare autosomal recessive gastrointestinal disorder characterized by watery diarrhea in infants fed with breast milk or lactose-containing formulas. The CLD locus was previously assigned by linkage and linkage disequilibrium analyses to 2q21 in 19 Finnish families. In this study, the molecular background of this disorder is reported. The CLD locus was refined in 32 CLD patients in 24 families by using microsatellite and single nucleotide polymorphism (SNP) haplotypes. Mutation analyses were performed by direct sequencing. We identified 5 distinct mutations in the lactase (LCT) gene, which encodes the enzyme that hydrolyzes lactose in the intestinal lumen. These findings facilitate genetic testing of CLD in clinical practice and enable genetic counseling. The present data also provide the basis for detailed characterization of the molecular pathogenesis of this disorder. Adult-type hypolactasia (MIM 223100) (lactase non-persistence, lactose intolerance) is an autosomal recessive gastrointestinal condition that results from a decline in the activity of lactase in the intestinal lumen after weaning. Adult-type hypolactasia is considered to be a normal phenomenon among mammals, and its symptoms are remarkably milder than those experienced in CLD. Recently, a variant C/T-13910, located 13.9 kb upstream of the LCT gene, was shown to associate with the adult-type hypolactasia trait. In this study, the functional significance of the C/T-13910 variant was determined by studying the LCT mRNA levels in intestinal biopsy samples from children and adults with different genotypes. RT-PCR followed by solid-phase minisequencing was applied to determine the relative expression levels of the LCT alleles using an informative SNP located in exon 1. In children, the C-13910 allele was observed to be downregulated after five years of age in parallel with lactase enzyme activity.
The expression of the LCT mRNA in the intestinal mucosa in individuals with the T-13910, A-22018 alleles was 11.5 times higher than that found in individuals with the C-13910, G-22018 alleles. These findings suggest that the C/T-13910 variant linked to adult-type hypolactasia is associated with the transcriptional regulation of the LCT gene. The presence of the T-13910, A-22018 allele was also accompanied by significantly elevated lactase activity. Galactose, a hydrolysis product of the milk sugar lactose, has been hypothesized to be toxic to ovarian epithelial cells. Hence, consumption of dairy products and lactase persistence have been proposed to be risk factors for ovarian carcinoma. To investigate whether lactase persistence is related to the risk of ovarian carcinoma, the C/T-13910 genotype was determined in a cohort of 782 women with ovarian carcinoma and 1331 individuals serving as controls. Lactase persistence did not associate significantly with the risk of ovarian carcinoma in the Finnish, Polish or Swedish populations. The findings do not support the hypothesis that lactase persistence increases the risk of ovarian carcinoma.
Abstract:
Colorectal cancer (CRC) is one of the most frequent malignancies in Western countries. Inherited factors have been suggested to be involved in 35% of CRCs. The hereditary CRC syndromes explain only ~6% of all CRCs, indicating that a large proportion of the inherited susceptibility is still unexplained. Much of the remaining genetic predisposition to CRC is probably due to undiscovered low-penetrance variations. This study was conducted to identify germline and somatic changes that contribute to CRC predisposition and tumorigenesis. MLH1 and MSH2, which underlie hereditary non-polyposis colorectal cancer (HNPCC), are considered to be tumor suppressor genes; the first hit is inherited in the germline, and somatic inactivation of the wild-type allele is required for tumor initiation. In a recent study, frequent loss of the mutant allele in HNPCC tumors was detected, and a new model, arguing against the two-hit hypothesis, was proposed for somatic HNPCC tumorigenesis. We tested this hypothesis by conducting loss of heterozygosity (LOH) analysis on 25 colorectal HNPCC tumors with a known germline mutation in the MLH1 or MSH2 genes. LOH was detected in 56% of the tumors. All the losses targeted the wild-type allele, supporting the classical two-hit model for HNPCC tumorigenesis. The variants 3020insC, R702W and G908R in NOD2 predispose to Crohn's disease. The contribution of NOD2 to CRC predisposition has been examined in several case-control series, with conflicting results. We have previously shown that 3020insC does not predispose to CRC in Finnish CRC patients. To expand our previous study, the variants R702W and G908R were genotyped in a population-based series of 1042 Finnish CRC patients and 508 healthy controls. Association analyses did not show significant evidence for association of the variants with CRC. Single nucleotide polymorphism (SNP) rs6983267 at chromosome 8q24 was the first CRC susceptibility variant identified through genome-wide association studies.
To characterize the role of rs6983267 in CRC predisposition in the Finnish population, we genotyped the SNP in the case-control material of 1042 cases and 1012 controls and showed that the G allele of rs6983267 is associated with an increased risk of CRC (OR 1.22; P=0.0018). Examination of allelic imbalance in the tumors heterozygous for rs6983267 revealed that copy number increase affected 22% of the tumors and, interestingly, favored the G allele. By utilizing a computer algorithm, Enhancer Element Locator (EEL), an evolutionarily conserved regulatory motif containing rs6983267 was identified. The SNP affected the binding site of TCF4, a transcription factor that mediates Wnt signaling in cells and has proven to be crucial in colorectal neoplasia. The preferential binding of TCF4 to the risk allele G was shown in vitro and in vivo. The element drove lacZ marker gene expression in mouse embryos in a pattern that is consistent with genes regulated by the Wnt signaling pathway. These results suggest that rs6983267 at 8q24 exerts its effect in CRC predisposition by regulating gene expression. The most obvious target gene for the enhancer element is MYC, residing ~335 kb downstream; however, further studies are required to establish the transcriptional target(s) of the predicted enhancer element.
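As an illustration of how an allelic odds ratio such as the one reported for rs6983267 is typically obtained, the following minimal sketch computes an OR with a Woolf (logit) 95% confidence interval from a 2x2 table of allele counts. The function name and the counts are hypothetical and chosen only for the example; they are not the study's data, and the abstract does not state which association test the authors used.

```python
import math


def allelic_odds_ratio(case_risk, case_other, ctrl_risk, ctrl_other, z=1.96):
    """Return (OR, lower, upper) for a 2x2 table of allele counts.

    The interval is the Woolf (logit) approximation; z=1.96 gives ~95%.
    """
    counts = (case_risk, case_other, ctrl_risk, ctrl_other)
    if min(counts) <= 0:
        raise ValueError("all four allele counts must be positive")
    # Odds of carrying the risk allele in cases divided by the odds in controls.
    odds_ratio = (case_risk * ctrl_other) / (case_other * ctrl_risk)
    # Standard error of log(OR) under the Woolf approximation.
    se_log_or = math.sqrt(sum(1.0 / c for c in counts))
    log_or = math.log(odds_ratio)
    lower = math.exp(log_or - z * se_log_or)
    upper = math.exp(log_or + z * se_log_or)
    return odds_ratio, lower, upper


if __name__ == "__main__":
    # Hypothetical allele counts (NOT taken from the study).
    or_, lo, hi = allelic_odds_ratio(60, 40, 50, 50)
    print(f"OR = {or_:.2f}, 95% CI {lo:.2f} to {hi:.2f}")
```

An interval that excludes 1 would indicate a nominally significant association at the chosen level; real analyses would usually also adjust for covariates and multiple testing.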
Abstract:
Schizophrenia, affecting about 1% of the population worldwide, is a severe mental disorder characterized by positive and negative symptoms, such as psychosis and anhedonia, as well as cognitive deficits. At present, schizophrenia is considered a complex disorder of neurodevelopmental origin with both genetic and environmental factors contributing to its onset. Although a number of candidate genes for schizophrenia have been highlighted, only very few schizophrenia patients are likely to share identical genetic liability. This study is based on the nationwide schizophrenia family sample of the National Institute for Health and Welfare, which represents one of the largest and most well-characterized familial series in the world. In the first part of this study, we investigated the roles of the DTNBP1, NRG1, and AKT1 genes in the background of schizophrenia in Finland. Although these genes are associated with schizophrenia liability in several populations, no significant association with clinical diagnostic information of schizophrenia was found in our sample of 441 schizophrenia families. In the second part of this study, we first replicated schizophrenia linkage on the long arm of chromosome 7 in 352 schizophrenia families. In the following association analysis, we utilized additional clinical disorder features and intermediate phenotypes (endophenotypes) in addition to diagnostic information from altogether 290 neuropsychologically assessed schizophrenia families. An intragenic short tandem repeat allele of the regional RELN gene, which is thought to play a role in the background of several neurodevelopmental disorders, showed significant association with poorer cognitive functioning and more severe schizophrenia symptoms. Additionally, this risk allele was significantly more prevalent among the individuals affected with schizophrenia spectrum disorders.
We have previously identified linkage of schizophrenia and its cognitive endophenotypes on the long arms of chromosomes 2, 4, and 5. In the last part of this study, we selected altogether 104 functionally relevant candidate genes from the linked regions. We detected several promising associations, of which especially interesting are the ERBB4 gene, showing association with the severity of schizophrenia symptoms and impairments in traits related to verbal abilities, and the GRIA1 gene, showing association with the severity of schizophrenia symptoms. Our results extend the previous evidence that the genetic risk for schizophrenia is at least partially mediated via the effects of the candidate genes and their combinations on relevant brain systems, resulting in alterations in different disorder domains, such as the cognitive deficits.
Abstract:
Celiac disease, or gluten intolerance, is triggered by dietary glutens in genetically susceptible individuals and affects approximately 1% of the Caucasian population. The best known genetic risk factors for celiac disease are the HLA DQ2 and DQ8 heterodimers, which are necessary for the development of the disease. However, they alone are not sufficient for disease induction; other risk factors are required. This thesis investigated genetic factors for celiac disease, concentrating on susceptibility loci on chromosomes 5q31-q33, 19p13 and 2q12 previously reported in genome-wide linkage and association studies. In addition, a novel genotyping method for the detection of HLA DQ2 and DQ8 coding haplotypes was validated. This study was conducted using Finnish and Hungarian family materials, and Finnish, Hungarian and Italian case-control materials. Genetic linkage and association were analysed in these materials using candidate gene and fine-mapping approaches. The results confirmed linkage to celiac disease on the chromosomal regions 5q31-q33 and 19p13. Fine-mapping on chromosome 5q31-q33 revealed several modest associations in the region, and highlighted the need for further investigations to locate the causal risk variants. The MYO9B gene on chromosome 19p13 showed evidence for linkage and association particularly with dermatitis herpetiformis, the skin manifestation of celiac disease. This implies a potential difference in the genetic background of the intestinal and skin forms of the disease, although studies on larger sample sets are required. The IL18RAP locus on chromosome 2q12, shown to be associated with celiac disease in a previous genome-wide association study and a subsequent follow-up, showed association in the Hungarian population in this study. The expression of IL18RAP was further investigated in small intestinal tissue and in peripheral blood mononuclear cells. The results showed that IL18RAP is expressed in the relevant tissues.
Two putative isoforms of IL18RAP were detected by Western blot analysis, and the results suggested that the ratios and total levels of these isoforms may contribute to the aetiology of celiac disease. A novel genotyping method for celiac disease-associated HLA haplotypes was also validated in this thesis. The method utilises single-nucleotide polymorphisms tagging these HLA haplotypes with high sensitivity and specificity. Our results suggest that this method is transferable between populations, and it is suitable for large-scale analysis. In conclusion, this doctorate study provides an insight into the roles of the 5q31-q33, MYO9B, IL18RAP and HLA loci in the susceptibility to celiac disease in the Finnish, Hungarian and Italian populations, highlighting the need for further studies at these genetic loci and examination of the function of the candidate genes.
Abstract:
Glaucoma is the second leading cause of blindness worldwide. It is a group of optic neuropathies characterized by progressive optic nerve degeneration, excavation of the optic disc due to apoptosis of retinal ganglion cells, and corresponding visual field defects. Open angle glaucoma (OAG) is a subtype of glaucoma, classified according to the age of onset into juvenile and adult forms with a cut-off point of 40 years of age. The prevalence of OAG is 1-2% of the population over 40 years and increases with age. During the last decade, several candidate loci and three candidate genes for OAG, myocilin (MYOC), optineurin (OPTN) and WD40-repeat 36 (WDR36), have been identified. Exfoliation syndrome (XFS), age, elevated intraocular pressure and genetic predisposition are known risk factors for OAG. XFS is characterized by accumulation of grayish scales of fibrillogranular extracellular material in the anterior segment of the eye. XFS is overall the most common identifiable cause of glaucoma (exfoliation glaucoma, XFG). In the past year, three single nucleotide polymorphisms (SNPs) in the lysyl oxidase-like 1 (LOXL1) gene have been associated with XFS and XFG in several populations. This thesis describes the first molecular genetic studies of OAG and XFS/XFG in the Finnish population. The role of the MYOC and OPTN genes and fourteen candidate loci was investigated in eight Finnish glaucoma families. Both candidate genes and all the loci were excluded in these families, further confirming the heterogeneous nature of OAG. To investigate the genetic basis of glaucoma in a large Finnish family with juvenile and adult onset OAG, we analysed the MYOC gene in family members. A glaucoma-associated mutation (Thr377Met) in the MYOC gene was identified, segregating with the disease in the family. This finding has great significance for the family and encourages investigation of the MYOC gene in other Finnish OAG families as well.
In order to identify the genetic susceptibility loci for XFS, we carried out a genome-wide scan in an extended Finnish XFS family. This scan produced a promising candidate locus on chromosomal region 18q12.1-21.33 and several additional putative susceptibility loci for XFS. This locus on chromosome 18 provides a solid starting point for the fine-scale mapping studies that are needed to identify variants conferring susceptibility to XFS in the region. A case-control and family-based association study and a family-based linkage study were performed to evaluate whether SNPs in the LOXL1 gene confer a risk for XFS, XFG or primary open angle glaucoma (POAG) in Finnish patients. A significant association between the LOXL1 gene SNPs and XFS and XFG was confirmed in the Finnish population. However, no association was detected with POAG. Other genetic and environmental factors are probably also involved in the pathogenesis of XFS and XFG.
Abstract:
Cardiovascular diseases (CVD) are major contributors to morbidity and mortality worldwide. Several interacting environmental, biochemical, and genetic risk factors can increase disease susceptibility. While some of the genes involved in the etiology of CVD are known, many are yet to be discovered. During the last few decades, scientists have searched for these genes with genome-wide linkage and association methods, and with more targeted candidate gene studies. This thesis investigates variation within the upstream transcription factor 1 (USF1) gene locus in relation to CVD risk factors, atherosclerosis, and incidence and prevalence of CVD. This candidate gene was first identified in Finnish families ascertained for familial combined hyperlipidemia, a common dyslipidemia predisposing to coronary heart disease. The gene is a ubiquitously expressed transcription factor regulating expression of several genes from lipid and glucose metabolism, inflammation, and endothelial function. First, we examined association between USF1 variants and several CVD risk factors, such as lipid phenotypes, body composition measures, and metabolic syndrome, in two prospective population cohorts. Our data suggested that USF1 contributes to these CVD risk factors at the population level. Notably, the associations with quantitative measurements were mostly detected among study subjects with CVD or metabolic syndrome, suggesting complex interactions between USF1 effects and the pathophysiological state of an individual. Second, we investigated how variation at the USF1 locus contributes to atherosclerotic lesions of the coronary arteries and abdominal aorta. For this, we used two study samples of middle-aged men with detailed measurements of atherosclerosis obtained in autopsy. USF1 variation significantly associated with areas of several types of lesions, especially with calcification of the arteries. 
Next, we tested what effect the USF1 risk variants have on sudden cardiac death and incidence of CVD. The atherosclerosis-associated risk variant increased the risk of sudden cardiac death of the same study subjects. Furthermore, USF1 alleles associated with incidence of CVD in the Finnish population follow-up cohorts. These associations were especially prominent among women, suggesting a sex specific effect, which has also been detected in subsequent studies. Finally, as some of the low-yield DNA samples of the Finnish follow-up study cohort needed to be whole-genome amplified (WGA) prior to genotyping, we evaluated whether the produced WGA genotypes were of good quality. Although the samples giving genotype discrepancies could not be detected before genotyping with standard laboratory quality control methods, our results suggested that enhanced quality control at the time of the genotyping could identify such samples. In addition, combining two WGA reactions into one pooled DNA sample for genotyping markedly reduced the number of discrepancies and samples showing them. In conclusion, USF1 seems to have a role in the etiology of CVD. Additional studies are warranted to identify functional variants and to study interactions between USF1 and other genetic or environmental factors. This USF1 study, and other studies with low DNA yield of some samples, can benefit from whole genome amplification of the low-yield samples prior to genotyping. Careful quality control procedures are, however, needed in WGA genotyping.
Abstract:
The studies presented in this thesis contribute to the understanding of the evolutionary ecology of three major viruses threatening cultivated sweetpotato (Ipomoea batatas Lam) in East Africa: Sweet potato feathery mottle virus (SPFMV; genus Potyvirus; Potyviridae), Sweet potato chlorotic stunt virus (SPCSV; genus Crinivirus; Closteroviridae) and Sweet potato mild mottle virus (SPMMV; genus Ipomovirus; Potyviridae). The viruses were serologically detected and the positive results confirmed by RT-PCR and sequencing. SPFMV was detected in 24 wild plant species of the family Convolvulaceae (genera Ipomoea, Lepistemon and Hewittia), of which 19 species were new natural hosts for SPFMV. SPMMV and SPCSV were detected in wild plants belonging to 21 and 12 species (genera Ipomoea, Lepistemon and Hewittia), respectively, all of which were previously unknown to be natural hosts of these viruses. SPFMV was the most abundant virus, being detected in 17% of the plants, while SPMMV and SPCSV were detected in 9.8% and 5.4% of the assessed plants, respectively. Wild plants in Uganda were infected with the East African (EA), common (C), and ordinary (O) strains, or co-infected with the EA and the C strain of SPFMV. The viruses and virus-like diseases were more frequent in the eastern agro-ecological zone than in the western and central zones, which contrasted with known incidences of these viruses in sweetpotato crops, except for the northern zone, where incidences were lowest in wild plants as in sweetpotato. The NIb/CP junction in SPMMV was determined experimentally, which facilitated CP-based phylogenetic and evolutionary analyses of SPMMV. Isolates of all three viruses from wild plants were genetically similar to those found in cultivated sweetpotatoes in East Africa. There was no evidence of host-driven population genetic structures, suggesting frequent transmission of these viruses between their wild and cultivated hosts.
The p22 RNA silencing suppressor-encoding sequence was absent in a few SPCSV isolates, but regardless of this, SPCSV isolates incited sweet potato virus disease (SPVD) in sweetpotato plants co-infected with SPFMV, indicating that p22 is redundant for synergism between SPCSV and SPFMV. Molecular evolutionary analysis revealed that isolates of strain EA of SPFMV, which is largely restricted geographically to East Africa, experience frequent recombination in comparison to isolates of strain C, which is globally distributed. Moreover, non-homologous recombination events between strains EA and C were rare, despite frequent co-infections of these strains in wild plants, suggesting purifying selection against non-homologous recombinants between these strains or that such recombinants are mostly not infectious. Recombination was also detected in the 5'- and 3'-proximal regions of the SPMMV genome, providing the first evidence of recombination in genus Ipomovirus, but no recombination events were detected in the characterized genomic regions of SPCSV. Strong purifying selection was implicated in the evolution of the majority of amino acids of the proteins encoded by the analyzed genomic regions of SPFMV, SPMMV and SPCSV. However, positive selection was predicted on 17 amino acids distributed over the whole coat protein (CP) in the globally distributed strain C, as compared to only 4 amino acids in the multifunctional CP N-terminus (CP-NT) of strain EA, which is largely restricted geographically to East Africa. A few amino acid sites in the N-terminus of SPMMV P1, the p7 protein and the RNA silencing suppressor proteins p22 and RNase3 of SPCSV were also subject to positive selection. Positively selected amino acids may constitute ligand-binding domains that determine interactions with plant host and/or insect vector factors.
The P1 proteinase of SPMMV (genus Ipomovirus) seems to respond to needs of adaptation, which was not observed with the helper component proteinase (HC-Pro) of SPMMV, although the HC-Pro is responsible for many important molecular interactions in genus Potyvirus. Because the centre of origin of cultivated sweetpotato is in the Americas, from where the crop was dispersed to other continents in recent history (except for the Australasia and South Pacific region), it would be expected that identical viruses and their strains occur worldwide, presuming virus dispersal with the host. However, this seems not to be the case with SPMMV, the strain EA of SPFMV and the strain EA of SPCSV, which are largely geographically confined to East Africa, where they are predominant and occur both in natural and agro-ecosystems. The geographical distribution of plant viruses is constrained more by virus-vector relations than by virus-host interactions, which, together with the wide range of natural host species and the geographical confinement to East Africa, suggests that these viruses existed in East African wild plants before the introduction of sweetpotato. Consequently, these studies provide compelling evidence that East Africa constitutes a cradle of SPFMV strain EA, SPCSV strain EA, and SPMMV. Therefore, sweet potato virus disease (SPVD) in East Africa may be one of the examples of damaging virus diseases resulting from exchange of viruses between introduced crops and indigenous wild plant species.
Keywords: Convolvulaceae, East Africa, epidemiology, evolution, genetic variability, Ipomoea, recombination, SPCSV, SPFMV, SPMMV, selection pressure, sweetpotato, wild plant species
Author's Address: Arthur K. Tugume, Department of Agricultural Sciences, Faculty of Agriculture and Forestry, University of Helsinki, Latokartanonkaari 7, P.O. Box 27, FIN-00014, Helsinki, Finland. Email: tugume.arthur@helsinki.fi
Author's Present Address: Arthur K. Tugume, Department of Botany, Faculty of Science, Makerere University, P.O. Box 7062, Kampala, Uganda. Email: aktugume@botany.mak.ac.ug, tugumeka@yahoo.com
Abstract:
Genetics, the science of heredity and variation in living organisms, has a central role in medicine, in breeding crops and livestock, and in studying fundamental topics of biological sciences such as evolution and cell functioning. The field of genetics is currently developing rapidly because of recent advances in the technologies by which molecular data can be obtained from living organisms. To extract the most information from such data, the analyses need to be carried out using statistical models that are tailored to take account of the particular genetic processes. In this thesis we formulate and analyze Bayesian models for genetic marker data of contemporary individuals. The major focus is on the modeling of the unobserved recent ancestry of the sampled individuals (say, for tens of generations or so), which is carried out by using explicit probabilistic reconstructions of the pedigree structures accompanied by the gene flows at the marker loci. For such a recent history, the recombination process is the major genetic force that shapes the genomes of the individuals, and it is included in the model by assuming that the recombination fractions between the adjacent markers are known. The posterior distribution of the unobserved history of the individuals is studied conditionally on the observed marker data by using a Markov chain Monte Carlo (MCMC) algorithm. The example analyses consider estimation of the population structure, relatedness structure (both at the level of whole genomes and at each marker separately), and haplotype configurations. For situations where the pedigree structure is partially known, an algorithm to create an initial state for the MCMC algorithm is given. Furthermore, the thesis includes an extension of the model for the recent genetic history to situations where a quantitative phenotype has also been measured from the contemporary individuals.
In that case the goal is to identify positions on the genome that affect the observed phenotypic values. This task is carried out within the Bayesian framework, where the number and the relative effects of the quantitative trait loci are treated as random variables whose posterior distribution is studied conditionally on the observed genetic and phenotypic data. In addition, the thesis contains an extension of a widely-used haplotyping method, the PHASE algorithm, to settings where genetic material from several individuals has been pooled together, and the allele frequencies of each pool are determined in a single genotyping.
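To make the idea of studying a posterior distribution with MCMC concrete, here is a deliberately simple random-walk Metropolis sketch. It is a toy stand-in, not the thesis's pedigree model: it samples the posterior of a single allele frequency given hypothetical allele counts and a uniform prior, a case where the exact posterior, Beta(k+1, n-k+1), is known and can be used as a check. All names, counts and tuning values are assumptions made for illustration.

```python
import math
import random


def log_posterior(p, k, n):
    """Unnormalised log posterior of allele frequency p under a uniform prior,
    given k copies of the allele among n sampled alleles."""
    if not 0.0 < p < 1.0:
        return float("-inf")
    return k * math.log(p) + (n - k) * math.log(1.0 - p)


def metropolis_allele_freq(k, n, n_iter=20000, burn_in=2000, step=0.1, seed=0):
    """Random-walk Metropolis sampler; returns the post-burn-in samples of p."""
    rng = random.Random(seed)
    p = 0.5
    current = log_posterior(p, k, n)
    samples = []
    for i in range(n_iter):
        proposal = p + rng.gauss(0.0, step)  # symmetric Gaussian proposal
        candidate = log_posterior(proposal, k, n)
        # Accept with probability min(1, posterior ratio), computed in log space.
        # 1 - random() lies in (0, 1], so the logarithm is always defined.
        if math.log(1.0 - rng.random()) < candidate - current:
            p, current = proposal, candidate
        if i >= burn_in:
            samples.append(p)
    return samples


if __name__ == "__main__":
    k, n = 30, 100  # hypothetical allele counts
    draws = metropolis_allele_freq(k, n)
    print("MCMC posterior mean:", sum(draws) / len(draws))
    print("exact posterior mean:", (k + 1) / (n + 2))
```

In the models described above the sampled state would be a whole pedigree with gene flows rather than a single number, so the proposals are far more elaborate, but the accept/reject step follows the same principle.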