33 resultados para Genomic Regions
em Helda - Digital Repository of University of Helsinki
Resumo:
Lypsylehmien maidon juoksettumiskyvyn jalostuskeinot Väitöskirjassa tutkittiin lypsylehmien maidon juustonvalmistuslaadun parantamista jalostusvalinnan avulla. Tutkimusaihe on tärkeä, sillä yhä suurempi osa maidosta käytetään juustonvalmistukseen. Tutkimuksen kohteena oli maidon juoksettumiskyky, sillä se on yksi keskeisistä juustomäärään vaikuttavista tekijöistä. Maidon juoksettumiskyky vaihteli huomattavasti lehmien, sonnien, karjojen, rotujen ja lypsykauden vaiheiden välillä. Vaikka tankkimaidon juoksettumiskyvyssä olikin suuria eroja karjoittain, karja selitti vain pienen osan juoksettumiskyvyn kokonaisvaihtelusta. Todennäköisesti perinnölliset erot lehmien välillä selittävät suurimman osan karjojen tankkimaitojen juoksettumiskyvyssä havaituista eroista. Hyvä hoito ja ruokinta vähensivät kuitenkin jossain määrin huonosti juoksettuvien tankkimaitojen osuutta karjoissa. Holstein-friisiläiset lehmät olivat juoksettumiskyvyltään ayrshire-rotuisia lehmiä parempia. Huono juoksettuminen ja juoksettumattomuus oli vain vähäinen ongelma holstein-friisiläisillä (10 %), kun taas kolmannes ayrshire-lehmistä tuotti huonosti juoksettuvaa tai juoksettumatonta maitoa. Maitoa sanotaan huonosti juoksettuvaksi silloin, kun juustomassa ei ole riittävän kiinteää leikattavaksi puolen tunnin kuluttua juoksetteen lisäyksestä. Juoksettumattomaksi määriteltävä maito ei saostu lainkaan puolen tunnin aikana ja on siksi erittäin huonoa raaka-ainetta juustomeijereille. Noin 40 % lehmien välisistä eroista maidon juoksettumiskyvyssä selittyi perinnöllisillä tekijöillä. Juoksettumiskykyä voikin sanoa hyvin periytyväksi ominaisuudeksi. Kolme mittauskertaa lehmää kohti riittää varsin hyvin lehmän maidon keskimääräisen juoksettumiskyvyn arvioimiseen. Tällä hetkellä juoksettumiskyvyn suoran jalostamisen ongelmana on kuitenkin automatisoidun, laajamittaiseen käyttöön soveltuvan mittalaitteen puute. Tämän takia väitöskirjassa tutkittiin mahdollisuuksia jalostaa maidon juoksettumiskykyä epäsuorasti, jonkin toisen ominaisuuden kautta. Tällaisen ominaisuuden pitää olla kyllin voimakkaasti perinnöllisesti kytkeytynyt juoksettumiskykyyn, jotta jalostus olisi mahdollista sen avulla. Tutkittavat ominaisuudet olivat sonnien kokonaisjalostusarvossa jo mukana olevat maitotuotos ja utareterveyteen liittyvät ominaisuudet sekä kokonaisjalostusarvoon kuulumattomat maidon valkuais- ja kaseiinipitoisuus sekä maidon pH. Väitöskirjassa tutkittiin myös mahdollisuuksia ns. merkkiavusteiseen valintaan tutkimalla maidon juoksettumattomuuden perinnöllisyyttä ja kartoittamalla siihen liittyvät kromosomialueet. Tutkimuksen tulosten perusteella lehmien utareterveyden jalostaminen parantaa jonkin verran myös maidon juoksettumiskykyä sekä vähentää juoksettumattomuutta ayrshire-rotuisilla lehmillä. Lehmien maitotuotos ja maidon juoksettumiskyky sekä juoksettumattomuus ovat sen sijaan perinnöllisesti toisistaan riippumattomia ominaisuuksia. Myöskin maidon valkuais- ja kaseiinipitoisuuden perinnöllinen yhteys juoksettumiskykyyn oli likimain nolla. Maidon pH:n ja juoksettumiskyvyn välillä oli melko voimakas perinnöllinen yhteys, joten maidon pH:n jalostaminen parantaisi myös maidon juoksettumiskykyä. Todennäköisesti sen jalostaminen ei kuitenkaan vähentäisi juoksettumatonta maitoa tuottavien lehmien määrää. Koska maidon juoksettumattomuus on niin yleinen ongelma suomalaisilla ayrshire-lehmillä, väitöksessä selvitettiin tarkemmin ilmiön taustoja. Kaikissa kolmessa tutkimusaineistoissa noin 10 % ayrshire-lehmistä tuotti juoksettumatonta maitoa. Kahden vuoden kuukausittaisen seurannan aikana osa lehmistä tuotti juoksettumatonta maitoa lähes joka mittauskerralla. Maidon juoksettumattomuus oli yhteydessä lypsykauden vaiheeseen, mutta mikään ympäristötekijöistä ei pystynyt täysin selittämään sitä. Sen sijaan viitteet sen periytyvyydestä vahvistuivat tutkimusten edetessä. Lopuksi tutkimusryhmä onnistui kartoittamaan juoksettumattomuutta aiheuttavat kromosomialueet kromosomeihin 2 ja 18, lähelle DNA-merkkejä BMS1126 ja BMS1355. Tulosten perusteella maidon juoksettumattomuus ei ole yhteydessä maidon juoksettumistapahtumassa keskeisiin kaseiinigeeneihin. Sen sijaan on mahdollista, että juoksettumattomuusongelman aiheuttavat kaseiinigeenien syntetisoinnin jälkeisessä muokkauksessa tapahtuvat virheet. Asia vaatii kuitenkin perusteellista tutkimista. Väitöksen tulosten perusteella maidon juoksettumattomuusgeeniä kantavien eläinten karsiminen jalostuseläinten joukosta olisi tehokkain tapa jalostaa maidon juoksettumiskykyä suomalaisessa lypsykarjapopulaatiossa.
Resumo:
The studies presented in this thesis contribute to the understanding of evolutionary ecology of three major viruses threatening cultivated sweetpotato (Ipomoea batatas Lam) in East Africa: Sweet potato feathery mottle virus (SPFMV; genus Potyvirus; Potyviridae), Sweet potato chlorotic stunt virus (SPCSV; genus Crinivirus; Closteroviridae) and Sweet potato mild mottle virus (SPMMV; genus Ipomovirus; Potyviridae). The viruses were serologically detected and the positive results confirmed by RT-PCR and sequencing. SPFMV was detected in 24 wild plant species of family Convolvulacea (genera Ipomoea, Lepistemon and Hewittia), of which 19 species were new natural hosts for SPFMV. SPMMV and SPCSV were detected in wild plants belonging to 21 and 12 species (genera Ipomoea, Lepistemon and Hewittia), respectively, all of which were previously unknown to be natural hosts of these viruses. SPFMV was the most abundant virus being detected in 17% of the plants, while SPMMV and SPCSV were detected in 9.8% and 5.4% of the assessed plants, respectively. Wild plants in Uganda were infected with the East African (EA), common (C), and the ordinary (O) strains, or co-infected with the EA and the C strain of SPFMV. The viruses and virus-like diseases were more frequent in the eastern agro-ecological zone than the western and central zones, which contrasted with known incidences of these viruses in sweetpotato crops, except for northern zone where incidences were lowest in wild plants as in sweetpotato. The NIb/CP junction in SPMMV was determined experimentally which facilitated CP-based phylogenetic and evolutionary analyses of SPMMV. Isolates of all the three viruses from wild plants were genetically similar to those found in cultivated sweetpotatoes in East Africa. There was no evidence of host-driven population genetic structures suggesting frequent transmission of these viruses between their wild and cultivated hosts. The p22 RNA silencing suppressor-encoding sequence was absent in a few SPCSV isolates, but regardless of this, SPCSV isolates incited sweet potato virus disease (SPVD) in sweetpotato plants co-infected with SPFMV, indicating that p22 is redundant for synergism between SCSV and SPFMV. Molecular evolutionary analysis revealed that isolates of strain EA of SPFMV that is largely restricted geographically in East Africa experience frequent recombination in comparison to isolates of strain C that is globally distributed. Moreover, non-homologous recombination events between strains EA and C were rare, despite frequent co-infections of these strains in wild plants, suggesting purifying selection against non-homologous recombinants between these strains or that such recombinants are mostly not infectious. Recombination was detected also in the 5 - and 3 -proximal regions of the SPMMV genome providing the first evidence of recombination in genus Ipomovirus, but no recombination events were detected in the characterized genomic regions of SPCSV. Strong purifying selection was implicated on evolution of majority of amino acids of the proteins encoded by the analyzed genomic regions of SPFMV, SPMMV and SPCSV. However, positive selection was predicted on 17 amino acids distributed over the whole the coat protein (CP) in the globally distributed strain C, as compared to only 4 amino acids in the multifunctional CP N-terminus (CP-NT) of strain EA largely restricted geographically to East Africa. A few amino acid sites in the N-terminus of SPMMV P1, the p7 protein and RNA silencing suppressor proteins p22 and RNase3 of SPCSV were also submitted to positive selection. Positively selected amino acids may constitute ligand-binding domains that determine interactions with plant host and/or insect vector factors. The P1 proteinase of SPMMV (genus Ipomovirus) seems to respond to needs of adaptation, which was not observed with the helper component proteinase (HC-Pro) of SPMMV, although the HC-Pro is responsible for many important molecular interactions in genus Potyvirus. Because the centre of origin of cultivated sweetpotato is in the Americas from where the crop was dispersed to other continents in recent history (except for the Australasia and South Pacific region), it would be expected that identical viruses and their strains occur worldwide, presuming virus dispersal with the host. Apparently, this seems not to be the case with SPMMV, the strain EA of SPFMV and the strain EA of SPCSV that are largely geographically confined in East Africa where they are predominant and occur both in natural and agro-ecosystems. The geographical distribution of plant viruses is constrained more by virus-vector relations than by virus-host interactions, which in accordance of the wide range of natural host species and the geographical confinement to East Africa suggest that these viruses existed in East African wild plants before the introduction of sweetpotato. Subsequently, these studies provide compelling evidence that East Africa constitutes a cradle of SPFMV strain EA, SPCSV strain EA, and SPMMV. Therefore, sweet potato virus disease (SPVD) in East Africa may be one of the examples of damaging virus diseases resulting from exchange of viruses between introduced crops and indigenous wild plant species. Keywords: Convolvulaceae, East Africa, epidemiology, evolution, genetic variability, Ipomoea, recombination, SPCSV, SPFMV, SPMMV, selection pressure, sweetpotato, wild plant species Author s Address: Arthur K. Tugume, Department of Agricultural Sciences, Faculty of Agriculture and Forestry, University of Helsinki, Latokartanonkaari 7, P.O Box 27, FIN-00014, Helsinki, Finland. Email: tugume.arthur@helsinki.fi Author s Present Address: Arthur K. Tugume, Department of Botany, Faculty of Science, Makerere University, P.O. Box 7062, Kampala, Uganda. Email: aktugume@botany.mak.ac.ug, tugumeka@yahoo.com
Resumo:
Olfaction, the sense of smell, has many important functions in humans. Human responses to odors show substantial individual variation. Olfactory receptor genes have been identified and other genes may also influence olfaction. However, the proportion of phenotypic variation in odor response due to genetic variation remains largely unknown. Little is also known about which genes modify specific responses to odors. This study aimed to elucidate genetic and environmental influences on human responses to odors. Individuals from Finnish families (n=146) and Australian (n=413), British (n=163), Danish (n=336), and Finnish (n=399) twins rated intensity and pleasantness of a set of 12 (families) or 6 (twins) odors and tried to identify the odors. In addition, the participants rated their own sense of smell and annoyance experienced with different environmental odors. The odor stimuli of a commercial smell test (The Brief Smell Identification Test; banana, chocolate, cinnamon, gasoline, lemon, onion, paint thinner, pineapple, rose, smoke, soap, and turpentine) were presented in the family study. Based on the results of the family study and a literature survey, a new set of odor stimuli (androstenone, chocolate, cinnamon, isovaleric acid, lemon, and turpentine) was designed for the twin studies. In the family sample, heritabilities of the traits were estimated and underlying genomic regions were searched using a genome-wide linkage scan. In the pooled twin sample, variation in the measured traits was decomposed into genetic and environmental components using quantitative genetic modeling. In addition, associations between nongenetic factors (e.g., sex, age, and smoking) and olfactory-related traits were explored. Suggestive evidence for a genetic linkage for pleasantness of cinnamon at a locus on chromosome 4q32.3 emerged from the family sample. High heritability for the pleasantness of cinnamon was found in the family but not the twin study. Heritability of perceived intensity of androstenone odor was determined to be ~30% in the twin sample. A strong genetic correlation between perceived intensity and pleasantness of androstenone, in the absence of any environmental correlation, indicated that only the genetic correlation explained the phenotypic correlation between the traits (r=-0.27) and that the traits were influenced by an overlapping set of genes. Self-rated olfactory function appeared to reflect the odor annoyance experienced rather than actual olfactory acuity or genetic involvement. Results from nongenetic analyses supported the speculated superiority of females' olfactory abilities, the age-related diminishing of olfactory acuity, and the influences of experience-dependent factors on odor responses. This was the first study to estimate heritabilities and perform linkage screens for individual odors. A genetic effect was detected for only a few responses to specific odors, suggesting the predominance of environmental effects in odor perceptions.
Resumo:
This thesis presents a highly sensitive genome wide search method for recessive mutations. The method is suitable for distantly related samples that are divided into phenotype positives and negatives. High throughput genotype arrays are used to identify and compare homozygous regions between the cohorts. The method is demonstrated by comparing colorectal cancer patients against unaffected references. The objective is to find homozygous regions and alleles that are more common in cancer patients. We have designed and implemented software tools to automate the data analysis from genotypes to lists of candidate genes and to their properties. The programs have been designed in respect to a pipeline architecture that allows their integration to other programs such as biological databases and copy number analysis tools. The integration of the tools is crucial as the genome wide analysis of the cohort differences produces many candidate regions not related to the studied phenotype. CohortComparator is a genotype comparison tool that detects homozygous regions and compares their loci and allele constitutions between two sets of samples. The data is visualised in chromosome specific graphs illustrating the homozygous regions and alleles of each sample. The genomic regions that may harbour recessive mutations are emphasised with different colours and a scoring scheme is given for these regions. The detection of homozygous regions, cohort comparisons and result annotations are all subjected to presumptions many of which have been parameterized in our programs. The effect of these parameters and the suitable scope of the methods have been evaluated. Samples with different resolutions can be balanced with the genotype estimates of their haplotypes and they can be used within the same study.
Resumo:
Transposons are mobile elements of genetic material that are able to move in the genomes of their host organisms using a special form of recombination called transposition. Bacteriophage Mu was the first transposon for which a cell-free in vitro transposition reaction was developed. Subsequently, the reaction has been refined and the minimal Mu in vitro reaction is useful in the generation of comprehensive libraries of mutant DNA molecules that can be used in a variety of applications. To date, the functional genetics applications of Mu in vitro technology have been subjected to either plasmids or genomic regions and entire genomes of viruses cloned on specific vectors. This study expands the use of Mu in vitro transposition in functional genetics and genomics by describing novel methods applicable to the targeted transgenesis of mouse and the whole-genome analysis of bacteriophages. The methods described here are rapid, efficient, and easily applicable to a wide variety of organisms, demonstrating the potential of the Mu transposition technology in the functional analysis of genes and genomes. First, an easy-to-use, rapid strategy to generate construct for the targeted mutagenesis of mouse genes was developed. To test the strategy, a gene encoding a neuronal K+/Cl- cotransporter was mutagenised. After a highly efficient transpositional mutagenesis, the gene fragments mutagenised were cloned into a vector backbone and transferred into bacterial cells. These constructs were screened with PCR using an effective 3D matrix system. In addition to traditional knock-out constructs, the method developed yields hypomorphic alleles that lead into reduced expression of the target gene in transgenic mice and have since been used in a follow-up study. Moreover, a scheme is devised to rapidly produce conditional alleles from the constructs produced. Next, an efficient strategy for the whole-genome analysis of bacteriophages was developed based on the transpositional mutagenesis of uncloned, infective virus genomes and their subsequent transfer into susceptible host cells. Mutant viruses able to produce viable progeny were collected and their transposon integration sites determined to map genomic regions nonessential to the viral life cycle. This method, applied here to three very different bacteriophages, PRD1, ΦYeO3 12, and PM2, does not require the target genome to be cloned and is directly applicable to all DNA and RNA viruses that have infective genomes. The method developed yielded valuable novel information on the three bacteriophages studied and whole-genome data can be complemented with concomitant studies on individual genes. Moreover, end-modified transposons constructed for this study can be used to manipulate genomes devoid of suitable restriction sites.
Resumo:
The first part of this work investigates the molecular epidemiology of a human enterovirus (HEV), echovirus 30 (E-30). This project is part of a series of studies performed in our research team analyzing the molecular epidemiology of HEV-B viruses. A total of 129 virus strains had been isolated in different parts of Europe. The sequence analysis was performed in three different genomic regions: 420 nucleotides (nt) in the VP4/VP2 capsid protein coding region, the entire VP1 capsid protein coding gene of 876 nt, and 150 nt in the VP1/2A junction region. The analysis revealed a succession of dominant sublineages within a major genotype. The temporally earlier genotypes had been replaced by a genetically homogenous lineage that has been circulating in Europe since the late 1970s. The same genotype was found by other research groups in North America and Australia. Globally, other cocirculating genetic lineages also exist. The prevalence of a dominant genotype makes E-30 different from other previously studied HEVs, such as polioviruses and coxsackieviruses B4 and B5, for which several coexisting genetic lineages have been reported. The second part of this work deals with molecular epidemiology of human rhinoviruses (HRVs). A total of 61 field isolates were studied in the 420-nt stretch in the capsid coding region of VP4/VP2. The isolates were collected from children under two years of age in Tampere, Finland. Sequences from the clinical isolates clustered in the two previously known phylogenetic clades. Seasonal clustering was found. Also, several distinct serotype-like clusters were found to co-circulate during the same epidemic season. Reappearance of a cluster after disappearing for a season was observed. The molecular epidemiology of the analyzed strains turned out to be complex, and we decided to continue our studies of HRV. Only five previously published complete genome sequences of HRV prototype strains were available for analysis. Therefore, all designated HRV prototype strains (n=102) were sequenced in the VP4/VP2 region, and the possibility of genetic typing of HRV was evaluated. Seventy-six of the 102 prototype strains clustered in HRV genetic group A (HRV-A) and 25 in group B (HRV-B). Serotype 87 clustered separately from other HRVs with HEV species D. The field strains of HRV represented as many as 19 different genotypes, as judged with an approximate demarcation of a 20% nt difference in the VP4/VP2 region. The interserotypic differences of HRV were generally similar to those reported between different HEV serotypes (i.e. about 20%), but smaller differences, less than 10%, were also observed. Because some HRV serotypes are genetically so closely related, we suggest that the genetic typing be performed using the criterion "the closest prototype strain". This study is the first systematic genetic characterization of all known HRV prototype strains, providing a further taxonomic proposal for classification of HRV. We proposed to divide the genus Human rhinoviruses into HRV-A and HRV-B. The final part of the work comprises a phylogenetic analysis of a subset (48) of HRV prototype strains and field isolates (12) in the nonstructural part of the genome coding for the RNA-dependent RNA polymerase (3D). The proposed division of the HRV strains in the species HRV-A and HRV-B was also supported by 3D region. HRV-B clustered closer to HEV species B, C, and also to polioviruses than to HRV-A. Intraspecies variation within both HRV-A and HRV-B was greater in the 3D coding region than in the VP4/VP2 coding region, in contrast to HEV. Moreover, the diversity of HRV in 3D exceeded that of HEV. One group of HRV-A, designated HRV-A', formed a separate cluster outside other HRV-A in the 3D region. It formed a cluster also in the capsid region, but located within HRV-A. This may reflect a different evolutionary history of distinct genomic regions among HRV-A. Furthermore, the tree topology within HRV-A in the 3D region differed from that in the VP4/VP2, suggesting possible recombination events in the evolution of the strains. No conflicting phylogenies were observed in any of the 12 field isolates. Possible recombination was further studied using the Similarity and Bootscanning analyses of the complete genome sequences of HRV available in public databases. Evidence for recombination among HRV-A was found, as HRV2 and HRV39 showed higher similarity in the nonstructural part of the genome. Whether HRV2 and HRV39 strains - and perhaps also some other HRV-A strains not yet completely sequenced - are recombinants remains to be determined.
Resumo:
Background: Using array comparative genomic hybridization (aCGH), a large number of deleted genomic regions have been identified in human cancers. However, subsequent efforts to identify target genes selected for inactivation in these regions have often been challenging. Methods: We integrated here genome-wide copy number data with gene expression data and non-sense mediated mRNA decay rates in breast cancer cell lines to prioritize gene candidates that are likely to be tumour suppressor genes inactivated by bi-allelic genetic events. The candidates were sequenced to identify potential mutations. Results: This integrated genomic approach led to the identification of RIC8A at 11p15 as a putative candidate target gene for the genomic deletion in the ZR-75-1 breast cancer cell line. We identified a truncating mutation in this cell line, leading to loss of expression and rapid decay of the transcript. We screened 127 breast cancers for RIC8A mutations, but did not find any pathogenic mutations. No promoter hypermethylation in these tumours was detected either. However, analysis of gene expression data from breast tumours identified a small group of aggressive tumours that displayed low levels of RIC8A transcripts. qRT-PCR analysis of 38 breast tumours showed a strong association between low RIC8A expression and the presence of TP53 mutations (P = 0.006). Conclusion: We demonstrate a data integration strategy leading to the identification of RIC8A as a gene undergoing a classical double-hit genetic inactivation in a breast cancer cell line, as well as in vivo evidence of loss of RIC8A expression in a subgroup of aggressive TP53 mutant breast cancers.
Resumo:
Over the past years, much research on sarcomas based on low-resolution cytogenetic and molecular cytogenetic methods has been published, leading to the identification of genetic abnormalities partially underlying the tumourigenesis. Continued progress in the identification of genetic events such as copy number aberrations relies upon adapting the rapidly evolving high-resolution microarray technology, which will eventually provide novel insights into sarcoma biology, and targets for both diagnostics and drug development. The aim of this Thesis was to characterize DNA copy number changes that are involved in the pathogenesis of soft tissue leiomyosarcoma (LMS), dermatofibrosarcoma protuberans (DFSP), osteosarcoma (OS), malignant fibrous histiocytoma (MFH), and uterine leiomyosarcoma (ULMS) by applying fine resolution array comparative genomic hybridization (aCGH) technology. Both low- and high-grade LMS tumours showed distinct copy number patterns, in addition to sharing two minimal common regions of gains and losses. Small aberrations were detected by aCGH, which were beyond the resolution of chromosomal comparative genomic hybridization (cCGH). DFSP tumours analysed by aCGH showed gains in 17q, 22q, and 21 additional gained regions, but only one region (22q) with copy number loss. Recurrent amplicons identified in OS by aCGH were 12q11-q15, 8q, 6p12-p21, and 17p. Amplicons 12q and 17p were further characterized in detail. The amplicon at 17p was characterized by aCGH in low- and high-grade LMS, OS, and MFH. In all but one case this amplicon, with minimal common regions of gains at 17p11-p12, started with the distal loss of 17p13-pter. OS and high-grade LMS were grouped together as they showed a complex pattern of copy number gains and amplifications at 17p, whereas MFH and low-grade LMS showed a continuous pattern of copy number gains and amplification at 17p. In addition to the commonly gained and lost regions identified in ULMS by aCGH, various biological processes affected by these copy number changes were also indicated by pathway analysis. The three novel findings obtained in this work were: characterization of amplicon 17p in low- and high-grade LMS and MFH, profiles of DNA copy number changes in LMS, and detection of various pathways affected by copy number changes in ULMS. These studies have not been undertaken previously by aCGH technology, thus this Thesis adds new information regarding DNA copy number changes in sarcomas. In conclusion, the aCGH technique used in this Thesis has provided new insights into the genetics of sarcomas by detecting the precise regions affected by copy number changes and some potential candidate target genes within those regions, which had not been uncovered by previously applied low resolution techniques.
Resumo:
Chromosomal alterations in leukemia have been shown to have prognostic and predictive significance and are also important minimal residual disease (MRD) markers in the follow-up of leukemia patients. Although specific oncogenes and tumor suppressors have been discovered in some of the chromosomal alterations, the role and target genes of many alterations in leukemia remain unknown. In addition, a number of leukemia patients have a normal karyotype by standard cytogenetics, but have variability in clinical course and are often molecularly heterogeneous. Cytogenetic methods traditionally used in leukemia analysis and diagnostics; G-banding, various fluorescence in situ hybridization (FISH) techniques, and chromosomal comparative genomic hybridization (cCGH), have enormously increased knowledge about the leukemia genome, but have limitations in resolution or in genomic coverage. In the last decade, the development of microarray comparative genomic hybridization (array-CGH, aCGH) for DNA copy number analysis and the SNP microarray (SNP-array) method for simultaneous copy number and loss of heterozygosity (LOH) analysis has enabled investigation of chromosomal and gene alterations genome-wide with high resolution and high throughput. In these studies, genetic alterations were analyzed in acute myeloid leukemia (AML) and chronic lymphocytic leukemia (CLL). The aim was to screen and characterize genomic alterations that could play role in leukemia pathogenesis by using aCGH and SNP-arrays. One of the most important goals was to screen cryptic alterations in karyotypically normal leukemia patients. In addition, chromosomal changes were evaluated to narrow the target regions, to find new markers, and to obtain tumor suppressor and oncogene candidates. The work presented here shows the capability of aCGH to detect submicroscopic copy number alterations in leukemia, with information about breakpoints and genes involved in the alterations, and that genome-wide microarray analyses with aCGH and SNP-array are advantageous methods in the research and diagnosis of leukemia. The most important findings were the cryptic changes detected with aCGH in karyotypically normal AML and CLL, characterization of amplified genes in 11q marker chromosomes, detection of deletion-based mechanisms of MLL-ARHGEF12 fusion gene formation, and detection of LOH without copy number alteration in karyotypically normal AML. These alterations harbor candidate oncogenes and tumor suppressors for further studies.
Resumo:
Helicobacter pylori infection is a risk factor for gastric cancer, which is a major health issue worldwide. Gastric cancer has a poor prognosis due to the unnoticeable progression of the disease and surgery is the only available treatment in gastric cancer. Therefore, gastric cancer patients would greatly benefit from identifying biomarker genes that would improve diagnostic and prognostic prediction and provide targets for molecular therapies. DNA copy number amplifications are the hallmarks of cancers in various anatomical locations. Mechanisms of amplification predict that DNA double-strand breaks occur at the margins of the amplified region. The first objective of this thesis was to identify the genes that were differentially expressed in H. pylori infection as well as the transcription factors and signal transduction pathways that were associated with the gene expression changes. The second objective was to identify putative biomarker genes in gastric cancer with correlated expression and copy number, and the last objective was to characterize cancers based on DNA copy number amplifications. DNA microarrays, an in vitro model and real-time polymerase chain reaction were used to measure gene expression changes in H. pylori infected AGS cells. In order to identify the transcription factors and signal transduction pathways that were activated after H. pylori infection, gene expression profiling data from the H. pylori experiments and a bioinformatics approach accompanied by experimental validation were used. Genome-wide expression and copy number microarray analysis of clinical gastric cancer samples and immunohistochemistry on tissue microarray were used to identify putative gastric cancer genes. Data mining and machine learning techniques were applied to study amplifications in a cross-section of cancers. FOS and various stress response genes were regulated by H. pylori infection. H. pylori regulated genes were enriched in the chromosomal regions that are frequently changed in gastric cancer, suggesting that molecular pathways of gastric cancer and premalignant H. pylori infection that induces gastritis are interconnected. 16 transcription factors were identified as being associated with H. pylori infection induced changes in gene expression. NF-κB transcription factor and p50 and p65 subunits were verified using elecrophoretic mobility shift assays. ERBB2 and other genes located in 17q12- q21 were found to be up-regulated in association with copy number amplification in gastric cancer. Cancers with similar cell type and origin clustered together based on the genomic localization of the amplifications. Cancer genes and large genes were co-localized with amplified regions and fragile sites, telomeres, centromeres and light chromosome bands were enriched at the amplification boundaries. H. pylori activated transcription factors and signal transduction pathways function in cellular mechanisms that might be capable of promoting carcinogenesis of the stomach. Intestinal and diffuse type gastric cancers showed distinct molecular genetic profiles. Integration of gene expression and copy number microarray data allowed the identification of genes that might be involved in gastric carcinogenesis and have clinical relevance. Gene amplifications were demonstrated to be non-random genomic instabilities. Cell lineage, properties of precursor stem cells, tissue microenvironment and genomic map localization of specific oncogenes define the site specificity of DNA amplifications, whereas labile genomic features define the structures of amplicons. These conclusions suggest that the definition of genomic changes in cancer is based on the interplay between the cancer cell and the tumor microenvironment.
Resumo:
Gastric cancer is the fourth most common cancer and the second most common cause of cancer-related death worldwide. Due to lack of early symptoms, gastric cancer is characterized by late stage diagnosis and unsatisfactory options for curative treatment. Several genomic alterations have been identified in gastric cancer, but the major factors contributing to initiation and progression of gastric cancer remain poorly known. Gene copy number alterations play a key role in the development of gastric cancer, and a change in gene copy number is one of the fundamental mechanisms for a cancer cell to control the expression of potential oncogenes and tumor suppressor genes. This thesis aims at clarifying the complex genomic alterations of gastric cancer to identify novel molecular biomarkers for diagnostic purposes as well as for targeted treatment. To highlight genes of potential biological and clinical relevance, we carried out a systematic microarray-based survey of gene expression and copy number levels in primary gastric tumors and gastric cancer cell lines. Results were validated using immunohistochemistry, real-time qRT-PCR, and affinity capture-based transcript (TRAC) assay. Altogether 192 clinical gastric tissue samples and 7 gastric cancer cell lines were included in this study. Multiple chromosomal regions with recurrent copy number alterations were detected. The most frequent chromosomal alterations included gains at 7q, 8q, 17q, 19q, and 20q and losses at 9p, 18q, and 21q. Distinctive patterns of copy number alterations were detected for different histological subtypes (intestinal and diffuse) and for cancers located in different parts of the stomach. The impact of copy number alterations on gene expression was significant, as 6-10% of genes located in the regions of gains and losses also showed concomitant alterations in their expression. By combining the information from the DNA- and RNA-level analyses many novel gastric cancer-related genes, such as ALPK2, ENAH, HHIPL2, and OSMR, were identified. Independent genome-wide gene expression analysis of Finnish and Japanese gastric tumors revealed an additional set of genes that was differentially expressed in cancerous gastric tissues compared with normal tissue. Overexpression of one of these genes, CXCL1, was associated with an improved survival of gastric cancer. Thus, using an integrative microarray analysis, several novel genes were identified that may be critically important for gastric carcinogenesis. Further studies of these genes may lead to novel biomarkers for gastric cancer diagnosis and targeted therapy.
Resumo:
Extraintestinal pathogenic Escherichia coli (ExPEC) represent a diverse group of strains of E. coli, which infect extraintestinal sites, such as the urinary tract, the bloodstream, the meninges, the peritoneal cavity, and the lungs. Urinary tract infections (UTIs) caused by uropathogenic E. coli (UPEC), the major subgroup of ExPEC, are among the most prevalent microbial diseases world wide and a substantial burden for public health care systems. UTIs are responsible for serious morbidity and mortality in the elderly, in young children, and in immune-compromised and hospitalized patients. ExPEC strains are different, both from genetic and clinical perspectives, from commensal E. coli strains belonging to the normal intestinal flora and from intestinal pathogenic E. coli strains causing diarrhea. ExPEC strains are characterized by a broad range of alternate virulence factors, such as adhesins, toxins, and iron accumulation systems. Unlike diarrheagenic E. coli, whose distinctive virulence determinants evoke characteristic diarrheagenic symptoms and signs, ExPEC strains are exceedingly heterogeneous and are known to possess no specific virulence factors or a set of factors, which are obligatory for the infection of a certain extraintestinal site (e. g. the urinary tract). The ExPEC genomes are highly diverse mosaic structures in permanent flux. These strains have obtained a significant amount of DNA (predictably up to 25% of the genomes) through acquisition of foreign DNA from diverse related or non-related donor species by lateral transfer of mobile genetic elements, including pathogenicity islands (PAIs), plasmids, phages, transposons, and insertion elements. The ability of ExPEC strains to cause disease is mainly derived from this horizontally acquired gene pool; the extragenous DNA facilitates rapid adaptation of the pathogen to changing conditions and hence the extent of the spectrum of sites that can be infected. However, neither the amount of unique DNA in different ExPEC strains (or UPEC strains) nor the mechanisms lying behind the observed genomic mobility are known. Due to this extreme heterogeneity of the UPEC and ExPEC populations in general, the routine surveillance of ExPEC is exceedingly difficult. In this project, we presented a novel virulence gene algorithm (VGA) for the estimation of the extraintestinal virulence potential (VP, pathogenicity risk) of clinically relevant ExPECs and fecal E. coli isolates. The VGA was based on a DNA microarray specific for the ExPEC phenotype (ExPEC pathoarray). This array contained 77 DNA probes homologous with known (e.g. adhesion factors, iron accumulation systems, and toxins) and putative (e.g. genes predictably involved in adhesion, iron uptake, or in metabolic functions) ExPEC virulence determinants. In total, 25 of DNA probes homologous with known virulence factors and 36 of DNA probes representing putative extraintestinal virulence determinants were found at significantly higher frequency in virulent ExPEC isolates than in commensal E. coli strains. We showed that the ExPEC pathoarray and the VGA could be readily used for the differentiation of highly virulent ExPECs both from less virulent ExPEC clones and from commensal E. coli strains as well. Implementing the VGA in a group of unknown ExPECs (n=53) and fecal E. coli isolates (n=37), 83% of strains were correctly identified as extraintestinal virulent or commensal E. coli. Conversely, 15% of clinical ExPECs and 19% of fecal E. coli strains failed to raster into their respective pathogenic and non-pathogenic groups. Clinical data and virulence gene profiles of these strains warranted the estimated VPs; UPEC strains with atypically low risk-ratios were largely isolated from patients with certain medical history, including diabetes mellitus or catheterization, or from elderly patients. In addition, fecal E. coli strains with VPs characteristic for ExPEC were shown to represent the diagnostically important fraction of resident strains of the gut flora with a high potential of causing extraintestinal infections. Interestingly, a large fraction of DNA probes associated with the ExPEC phenotype corresponded to novel DNA sequences without any known function in UTIs and thus represented new genetic markers for the extraintestinal virulence. These DNA probes included unknown DNA sequences originating from the genomic subtractions of four clinical ExPEC isolates as well as from five novel cosmid sequences identified in the UPEC strains HE300 and JS299. The characterized cosmid sequences (pJS332, pJS448, pJS666, pJS700, and pJS706) revealed complex modular DNA structures with known and unknown DNA fragments arranged in a puzzle-like manner and integrated into the common E. coli genomic backbone. Furthermore, cosmid pJS332 of the UPEC strain HE300, which carried a chromosomal virulence gene cluster (iroBCDEN) encoding the salmochelin siderophore system, was shown to be part of a transmissible plasmid of Salmonella enterica. Taken together, the results of this project pointed towards the assumptions that first, (i) homologous recombination, even within coding genes, contributes to the observed mosaicism of ExPEC genomes and secondly, (ii) besides en block transfer of large DNA regions (e.g. chromosomal PAIs) also rearrangements of small DNA modules provide a means of genomic plasticity. The data presented in this project supplemented previous whole genome sequencing projects of E. coli and indicated that each E. coli genome displays a unique assemblage of individual mosaic structures, which enable these strains to successfully colonize and infect different anatomical sites.
Resumo:
Multiple sclerosis (MS) is an immune-mediated demyelinating disorder of the central nervous system (CNS) affecting 0.1-0.2% of Northern European descent population. MS is considered to be a multifactorial disease, both environment and genetics play a role in its pathogenesis. Despite several decades of intense research, the etiological and pathogenic mechanisms underlying MS remain still largely unknown and no curative treatment exists. The genetic architecture underlying MS is complex with multiple genes involved. The strongest and the best characterized predisposing genetic factors for MS are located, as in other immune-mediated diseases, in the major histocompatibility complex (MHC) on chromosome 6. In humans MHC is called human leukocyte antigen (HLA). Alleles of the HLA locus have been found to associate strongly with MS and remained for many years the only consistently replicable genetic associations. However, recently other genes located outside the MHC region have been proposed as strong candidates for susceptibility to MS in several studies. In this thesis a new genetic locus located on chromosome 7q32, interferon regulatory factor 5 (IRF5), was identified in the susceptibility to MS. In particular, we found that common variation of the gene was associated with the disease in three different populations, Spanish, Swedish and Finnish. We also suggested a possible functional role for one of the risk alleles with impact on the expression of the IRF5 locus. Previous studies have pointed out a possible role played by chromosome 2q33 in the susceptibility to MS and other autoimmune disorders. The work described here also investigated the involvement of this chromosomal region in MS predisposition. After the detection of genetic association with 2q33 (article-1), we extended our analysis through fine-scale single nucleotide polymorphism (SNP) mapping to define further the contribution of this genomic area to disease pathogenesis (article-4). We found a trend (p=0.04) for association to MS with an intronic SNP located in the inducible T-cell co-stimulator (ICOS) gene, an important player in the co-stimulatory pathway of the immune system. Expression analysis of ICOS revealed a novel, previously uncharacterized, alternatively spliced isoform, lacking the extracellular domain that is needed for ligand binding. The stability of the newly-identified transcript variant and its subcellular localization were analyzed. These studies indicated that the novel isoform is stable and shows different subcellular localization as compared to full-length ICOS. The novel isoform might have a regulatory function, but further studies are required to elucidate its function. Chromosome 19q13 has been previously suggested as one of the genomic areas involved in MS predisposition. In several populations, suggestive linkage signals between MS predisposition and 19q13 have been obtained. Here, we analysed the role of allelic variation in 19q13 by family based association analysis in 782 MS families collected from Finland. In this dataset, we were not able to detect any statistically significant associations, although several previously suggested markers were included to the analysis. Replication of the previous findings on the basis of linkage disequilibrium between marker allele and disease/risk allele appears notoriously difficult because of limitations such as allelic heterogeneity. Re-sequencing based approaches may be required for elucidating the role of chromosome 19q13 with MS. This thesis has resulted in the identification of a new MS susceptibility locus (IRF5) previously associated with other inflammatory or autoimmune disorders, such as SLE. IRF5 is one of the mediators of interferons biological function. In addition to providing new insight in the possible pathogenetic pathway of the disease, this finding suggests that there might be common mechanisms between different immune-mediated disorders. Furthermore the work presented here has uncovered a novel isoform of ICOS, which may play a role in regulatory mechanisms of ICOS, an important mediator of lymphocyte activation. Further work is required to uncover its functions and possible involvement of the ICOS locus in MS susceptibility.
Resumo:
Pharmacogenetics deals with genetically determined variation in drug response. In this context, three phase I drug-metabolizing enzymes, CYP2D6, CYP2C9, and CYP2C19, have a central role, affecting the metabolism of about 20-30% of clinically used drugs. Since genes coding for these enzymes in human populations exhibit high genetic polymorphism, they are of major pharmacogenetic importance. The aims of this study were to develop new genotyping methods for CYP2D6, CYP2C9, and CYP2C19 that would cover the most important genetic variants altering the enzyme activity, and, for the first time, to describe the distribution of genetic variation at these loci on global and microgeographic scales. In addition, pharmacogenetics was applied to a postmortem forensic setting to elucidate the role of genetic variation in drug intoxications, focusing mainly on cases related to tricyclic antidepressants, which are commonly involved in fatal drug poisonings in Finland. Genetic variability data were obtained by genotyping new population samples by the methods developed based on PCR and multiplex single-nucleotide primer extension reaction, as well as by collecting data from the literature. Data consisted of 138, 129, and 146 population samples for CYP2D6, CYP2C9, and CYP2C19, respectively. In addition, over 200 postmortem forensic cases were examined with respect to drug and metabolite concentrations and genotypic variation at CYP2D6 and CYP2C19. The distribution of genetic variation within and among human populations was analyzed by descriptive statistics and variance analysis and by correlating the genetic and geographic distances using Mantel tests and spatial autocorrelation. The correlation between phenotypic and genotypic variation in drug metabolism observed in postmortem cases was also analyzed statistically. The genotyping methods developed proved to be informative, technically feasible, and cost-effective. Detailed molecular analysis of CYP2D6 genetic variation in a global survey of human populations revealed that the pattern of variation was similar to those of neutral genomic markers. Most of the CYP2D6 diversity was observed within populations, and the spatial pattern of variation was best described as clinal. On the other hand, genetic variants of CYP2D6, CYP2C9, and CYP2C19 associated with altered enzymatic activity could reach extremely high frequencies in certain geographic regions. Pharmacogenetic variation may also be significantly affected by population-specific demographic histories, as seen within the Finnish population. When pharmacogenetics was applied to a postmortem forensic setting, a correlation between amitriptyline metabolic ratios and genetic variation at CYP2D6 and CYP2C19 was observed in the sample material, even in the presence of confounding factors typical for these cases. In addition, a case of doxepin-related fatal poisoning was shown to be associated with a genetic defect at CYP2D6. Each of the genes studied showed a distinct variation pattern in human populations and high frequencies of altered activity variants, which may reflect the neutral evolution and/or selective pressures caused by dietary or environmental exposure. The results are relevant also from the clinical point of view since the genetic variation at CYP2D6, CYP2C9, and CYP2C19 already has a range of clinical applications, e.g. in cancer treatment and oral anticoagulation therapy. This study revealed that pharmacogenetics may also contribute valuable information to the medicolegal investigation of sudden, unexpected deaths.
Resumo:
Hereditary non-polyposis colorectal carcinoma (HNPCC; Lynch syndrome) is among the most common hereditary cancers in man and a model of cancers arising through deficient DNA mismatch repair (MMR). It is inherited in a dominant manner with predisposing germline mutations in the MMR genes, mainly MLH1, MSH2, MSH6 and PMS2. Both copies of the MMR gene need to be inactivated for cancer development. Since Lynch syndrome family members are born with one defective copy of one of the MMR genes in their germline, they only need to acquire a so called second hit to inactivate the MMR gene. Hence, they usually develop cancer at an early age. MMR gene inactivation leads to accumulation of mutations particularly in short repeat tracts, known as microsatellites, causing microsatellite instability (MSI). MSI is the hallmark of Lynch syndrome tumors, but is present in approximately 15% of sporadic tumors as well. There are several possible mechanisms of somatic inactivation (i.e. the second hit ) of MMR genes, for instance deletion of the wild-type copy, leading to loss of heterozygosity (LOH), methylation of promoter regions necessary for gene transcription, or mitotic recombination or gene conversion. In the Lynch syndrome tumors carrying germline mutations in the MMR gene, LOH was found to be the most frequent mechanism of somatic inactivation in the present study. We also studied MLH1/MSH2 deletion carriers and found that somatic mutations identical to the ones in the germline occurred frequently in colorectal cancers and were also present in extracolonic Lynch syndrome-associated tumors. Chromosome-specific marker analysis implied that gene conversion, rather than mitotic recombination or deletion of the respective gene locus accounted for wild-type inactivation. Lynch syndrome patients are predisposed to certain types of cancers, the most common ones being colorectal, endometrial and gastric cancer. Gastric cancer and uroepithelial tumors of bladder and ureter were observed to be true Lynch syndrome tumors with MMR deficiency as the driving force of tumorigenesis. Brain tumors and kidney carcinoma, on the other hand, were mostly MSS, implying the possibility of alternative routes of tumor development. These results present possible implications in clinical cancer surveillance. In about one-third of families suspected of Lynch syndrome, mutations in MMR genes are not found, and we therefore looked for alternative mechanisms of predisposition. According to our results, large genomic deletions, mainly in MSH2, and germline epimutations in MLH1, together explain a significant fraction of point mutation-negative families suspected of Lynch syndrome and are associated with characteristic clinical and family features. Our findings have important implications in the diagnosis and management of Lynch syndrome families.