61 resultados para Phonetic Similarity
Resumo:
Inadvertent climate modification has led to an increase in urban temperatures compared to the surrounding rural area. The main reason for the temperature rise is the altered energy portioning of input net radiation to heat storage and sensible and latent heat fluxes in addition to the anthropogenic heat flux. The heat storage flux and anthropogenic heat flux have not yet been determined for Helsinki and they are not directly measurable. To the contrary, turbulent fluxes of sensible and latent heat in addition to net radiation can be measured, and the anthropogenic heat flux together with the heat storage flux can be solved as a residual. As a result, all inaccuracies in the determination of the energy balance components propagate to the residual term and special attention must be paid to the accurate determination of the components. One cause of error in the turbulent fluxes is the fluctuation attenuation at high frequencies which can be accounted for by high frequency spectral corrections. The aim of this study is twofold: to assess the relevance of high frequency corrections to water vapor fluxes and to assess the temporal variation of the energy fluxes. Turbulent fluxes of sensible and latent heat have been measured at SMEAR III station, Helsinki, since December 2005 using the eddy covariance technique. In addition, net radiation measurements have been ongoing since July 2007. The used calculation methods in this study consist of widely accepted eddy covariance data post processing methods in addition to Fourier and wavelet analysis. The high frequency spectral correction using the traditional transfer function method is highly dependent on relative humidity and has an 11% effect on the latent heat flux. This method is based on an assumption of spectral similarity which is shown not to be valid. A new correction method using wavelet analysis is thus initialized and it seems to account for the high frequency variation deficit. Anyhow, the resulting wavelet correction remains minimal in contrast to the traditional transfer function correction. The energy fluxes exhibit a behavior characteristic for urban environments: the energy input is channeled to sensible heat as latent heat flux is restricted by water availability. The monthly mean residual of the energy balance ranges from 30 Wm-2 in summer to -35 Wm-2 in winter meaning a heat storage to the ground during summer. Furthermore, the anthropogenic heat flux is approximated to be 50 Wm-2 during winter when residential heating is important.
Resumo:
Topic detection and tracking (TDT) is an area of information retrieval research the focus of which revolves around news events. The problems TDT deals with relate to segmenting news text into cohesive stories, detecting something new, previously unreported, tracking the development of a previously reported event, and grouping together news that discuss the same event. The performance of the traditional information retrieval techniques based on full-text similarity has remained inadequate for online production systems. It has been difficult to make the distinction between same and similar events. In this work, we explore ways of representing and comparing news documents in order to detect new events and track their development. First, however, we put forward a conceptual analysis of the notions of topic and event. The purpose is to clarify the terminology and align it with the process of news-making and the tradition of story-telling. Second, we present a framework for document similarity that is based on semantic classes, i.e., groups of words with similar meaning. We adopt people, organizations, and locations as semantic classes in addition to general terms. As each semantic class can be assigned its own similarity measure, document similarity can make use of ontologies, e.g., geographical taxonomies. The documents are compared class-wise, and the outcome is a weighted combination of class-wise similarities. Third, we incorporate temporal information into document similarity. We formalize the natural language temporal expressions occurring in the text, and use them to anchor the rest of the terms onto the time-line. Upon comparing documents for event-based similarity, we look not only at matching terms, but also how near their anchors are on the time-line. Fourth, we experiment with an adaptive variant of the semantic class similarity system. The news reflect changes in the real world, and in order to keep up, the system has to change its behavior based on the contents of the news stream. We put forward two strategies for rebuilding the topic representations and report experiment results. We run experiments with three annotated TDT corpora. The use of semantic classes increased the effectiveness of topic tracking by 10-30\% depending on the experimental setup. The gain in spotting new events remained lower, around 3-4\%. The anchoring the text to a time-line based on the temporal expressions gave a further 10\% increase the effectiveness of topic tracking. The gains in detecting new events, again, remained smaller. The adaptive systems did not improve the tracking results.
Resumo:
This thesis which consists of an introduction and four peer-reviewed original publications studies the problems of haplotype inference (haplotyping) and local alignment significance. The problems studied here belong to the broad area of bioinformatics and computational biology. The presented solutions are computationally fast and accurate, which makes them practical in high-throughput sequence data analysis. Haplotype inference is a computational problem where the goal is to estimate haplotypes from a sample of genotypes as accurately as possible. This problem is important as the direct measurement of haplotypes is difficult, whereas the genotypes are easier to quantify. Haplotypes are the key-players when studying for example the genetic causes of diseases. In this thesis, three methods are presented for the haplotype inference problem referred to as HaploParser, HIT, and BACH. HaploParser is based on a combinatorial mosaic model and hierarchical parsing that together mimic recombinations and point-mutations in a biologically plausible way. In this mosaic model, the current population is assumed to be evolved from a small founder population. Thus, the haplotypes of the current population are recombinations of the (implicit) founder haplotypes with some point--mutations. HIT (Haplotype Inference Technique) uses a hidden Markov model for haplotypes and efficient algorithms are presented to learn this model from genotype data. The model structure of HIT is analogous to the mosaic model of HaploParser with founder haplotypes. Therefore, it can be seen as a probabilistic model of recombinations and point-mutations. BACH (Bayesian Context-based Haplotyping) utilizes a context tree weighting algorithm to efficiently sum over all variable-length Markov chains to evaluate the posterior probability of a haplotype configuration. Algorithms are presented that find haplotype configurations with high posterior probability. BACH is the most accurate method presented in this thesis and has comparable performance to the best available software for haplotype inference. Local alignment significance is a computational problem where one is interested in whether the local similarities in two sequences are due to the fact that the sequences are related or just by chance. Similarity of sequences is measured by their best local alignment score and from that, a p-value is computed. This p-value is the probability of picking two sequences from the null model that have as good or better best local alignment score. Local alignment significance is used routinely for example in homology searches. In this thesis, a general framework is sketched that allows one to compute a tight upper bound for the p-value of a local pairwise alignment score. Unlike the previous methods, the presented framework is not affeced by so-called edge-effects and can handle gaps (deletions and insertions) without troublesome sampling and curve fitting.
Resumo:
In this thesis we present and evaluate two pattern matching based methods for answer extraction in textual question answering systems. A textual question answering system is a system that seeks answers to natural language questions from unstructured text. Textual question answering systems are an important research problem because as the amount of natural language text in digital format grows all the time, the need for novel methods for pinpointing important knowledge from the vast textual databases becomes more and more urgent. We concentrate on developing methods for the automatic creation of answer extraction patterns. A new type of extraction pattern is developed also. The pattern matching based approach chosen is interesting because of its language and application independence. The answer extraction methods are developed in the framework of our own question answering system. Publicly available datasets in English are used as training and evaluation data for the methods. The techniques developed are based on the well known methods of sequence alignment and hierarchical clustering. The similarity metric used is based on edit distance. The main conclusions of the research are that answer extraction patterns consisting of the most important words of the question and of the following information extracted from the answer context: plain words, part-of-speech tags, punctuation marks and capitalization patterns, can be used in the answer extraction module of a question answering system. This type of patterns and the two new methods for generating answer extraction patterns provide average results when compared to those produced by other systems using the same dataset. However, most answer extraction methods in the question answering systems tested with the same dataset are both hand crafted and based on a system-specific and fine-grained question classification. The the new methods developed in this thesis require no manual creation of answer extraction patterns. As a source of knowledge, they require a dataset of sample questions and answers, as well as a set of text documents that contain answers to most of the questions. The question classification used in the training data is a standard one and provided already in the publicly available data.
Resumo:
Spiritualiteetti viittaa syvälliseen, inhimilliseen ulottuvuuteen ja ominaisuuteen, jonka tarkka määritteleminen on haasteellista, ellei mahdotonta. Sitä vastaa yhtäältä uskonnollisuuden kautta toteutuva, elämän tarkoitukseen ja syvemmän olemuksen etsintään liittyvä hengellisyys, mutta toisaalta myös kaikkea muuta hengen viljelyä ja mielekkään olemisen tavoittelua tarkoittava henkisyys. John Swintonin mukaan hengen ulottuvuus on se inhimilliseen olemukseen kuuluva, dynaaminen elinvoima, joka virkistää ja elävöittää ihmistä ja motivoi häntä etsimään Jumalaa, arvoja, merkitystä, tarkoitusta ja toivoa. Tämä tutkimus nostaa tarkastelun kohteeksi kokonaisvaltaisen hengellisyyden, jolloin huomio kiinnitetään niihin sidoksiin, joiden kautta hengen ulottuvuus liittyy muihin inhimillisen elämän olennaisiin toimintoihin ja näkökulmiin. Tällaisia ovat 1) ajattelu 2) teot ja käytännön toiminta 3) suhteet ja vuorovaikutusverkostot 4) tunteet ja kanssakäymistä ohjaavat asenteet 5) olemassaolon ja olemisen ulottuvuudet. Kokemusten merkitys, arvo ja mielekkyys hahmottuvat juuri hengen alueella, toisin sanoen sisäisesti, hengellisenä ja henkisenä asiana. Tutkimusmateriaalina tässä tutkimuksessa on amerikkalaisen vuosina 1827 1915 eläneen Ellen Whiten kuusi myöhäiskauden teosta vuosilta 1892 1905 ja tutkimusmenetelmänä on käytetty systemaattista analyysiä. Olennaista Whiten tavassa käsitellä uskonnon harjoitukseen liittyviä aiheita on hänen käytännöllinen ja elämän arkeen kiinteästi niveltyvä otteensa. Tutkimus paljastaa, että Martti Lutherin käsitykset ovat merkittävästi vaikuttaneet Whiten ajatteluun. Lähteistä paljastuu samankaltaisuutta hänen näkemystensä ja uusimman suomalaisen Luther-tutkimuksen Martti Lutherin tuotannosta esiin nostaman ajattelutavan välillä. Vaikka teologisen oppineisuuden kannalta White ja Luther ovat eri tasoilla, kummankin käsitys ihmisen ja Jumalan välisen suhteen perusolemuksesta on samankaltainen: Lähtökohtana sille on Jumalan rakkaus ja hänen armostaan lähtenyt toiminta. Toiseksi, ihmisen ja Kristuksen välinen, olemuksellinen yhteys, unio , on perustana sille, että Jumala hyväksyy ihmisen ja huolehtii hänestä nyt ja ikuisesti. Kolmanneksi, tämä ihmisen ja Kristuksen liittoutuminen ja yhdistyminen ilmenee yhteistoimintana ja kumppanuutena yhteisten tavoitteiden saavuttamiseksi maailmassa. White korostaa ihmisen ja Kristuksen välisen hengellisen suhteen vuorovaikutteista ja toiminnallista luonnetta, joka tulee ilmi epäitsekkyytenä, toisten ihmisten ja heidän tarpeittensa huomioimisena sekä myötätuntona ja kykynä asettua toisen asemaan. Terveellistä elämäntapaa ja kasvatusta koskevat ajatuksensa White liittää siihen laaja-alaiseen näkemykseen hengellisyydestä, jonka tavoitteena on ihmisen kokonaisvaltainen hyvinvointi. Hän ei näe spiritualiteettia elämän arjesta irrallisena tai erillisenä saarekkeena, vaan ihmistä kaikessa ohjaavana, voimaannuttavana ja mielekkyyttä tuottavana, ensisijaisena ulottuvuutena. Tutkimuksen kuluessa myös Whiten usein käyttämät Jumalalle antautumisen ja luonteen käsitteet nousevat tarkastelun kohteiksi. Hänen mukaansa ihminen ei tahdonponnistuksillaan yksin pysty tavoittamaan Jumalaa vaan hänen on lakattava Jumalan rakastavan kutsun edessä itse tahtomasta ja suostuttava liittymään Jumalan tahtoon ja tarkoitukseen. Tämä liittyy siihen sisäiseen muutokseen, jota White kuvaa luonteen käsitteen avulla. Jumalan armon vaikuttama tahdon uudelleen suuntaaminen muuttaa ihmisen olemusta, arvoja, asennoitumisen tapaa ja myötätuntoisen vuorovaikutuksen kykyä niin ettei ihminen ole enää aivan sama kuin ennen. Kysymys on toisaalta yhtäkkisestä ja kertakaikkisesta olemuksellisesta muuttumisesta, mutta samalla myös hiljaisesta, elämänmittaisesta kasvusta ja kypsymisestä. Juuri luonteen käsitteen avulla White kuvaa hengellisyyttä ja siihen kuuluvaa sisästä matkaa. Tässä tutkimuksessa spiritualiteettia lähestytään yleisinhimillisenä piirteenä ja ominaisuutena, jolloin huomio ei ole ensisijaisesti yksittäisissä opillisissa käsityksissä tai uskonnollisuuden harjoittamisen muodoissa. Tarkoituksena on luoda kokoava rakenne, jonka puitteissa holistinen spiritualiteetti voidaan selkeämmin hahmottaa ja yksilöidymmin ymmärtää.
Resumo:
The first glycyl radical in an enzyme was described 20 years ago and since then the family of glycyl radical enzymes (GREs) has expanded to include enzymes catalysing five chemically distinct reactions. The type enzymes of the family, anaerobic ribonucleotide reductase (RNRIII) and pyruvate formate lyase (PFL) had been studied long before it was known that they are GREs. Spectroscopic measurements on the radical and an observation that exposure to oxygen irreversibly inactivates the enzymes by cleavage of the protein proved that the radical is located on a particular glycine residue, close to the C-terminus of the protein. Both anaerobic RNRIII and PFL, are important for many anaerobic and facultative anaerobic bacteria as RNRIII is responsible for the synthesis of DNA precursors and PFL catalyses a key metabolic reaction in glycolysis. The crystal structures of both were solved in 1999 and they revealed that, although the enzymes do not share significant sequence identity, they share a similar structure - the radical site and residues necessary for catalysis are buried inside a ten stranded $\ualpha $/$\ubeta $-barrel. GREs are synthesised in an inactive form and are post-translationally activated by an activating enzyme which uses S-adenosyl methionine and an iron-sulphur cluster to generate the radical. One of the goals of this thesis work was to crystallise the activating enzyme of PFL. This task is challenging as, like GREs, the activating component is inactivated by oxygen. The experiments were therefore carried out in an oxygen free atmosphere. This is the first report of a crystalline GRE activating enzyme. Recently several new GREs have been characterised, all sharing sequence similarity to PFL but not to RNRIII. Also, the genome sequencing projects have identified many PFL-like GREs of unknown function, usually annotated as PFLs. In the present thesis I describe the grouping of these PFL family enzymes based on the sequence similarity and analyse the conservation patterns when compared to the structure of E. coli PFL. Based on this information an activation route is proposed. I also report a crystal structure of one of the PFL-like enzymes with unknown function, PFL2 from Archaeoglobus fulgidus. As A. fulgidus is a hyperthermophilic organism, possible mechanisms stabilising the structure are discussed. The organisation of an active site of PFL2 suggests that the enzyme may be a dehydratase. Keywords: glycyl radical, enzyme, pyruvate formate lyase, x-ray crystallography, bioinformatics
Resumo:
Many Gram-negative bacteria pathogenic to plants and animals possess type III secretion systems that are used to cause disease. Effector proteins are injected into host cells using the type III secretion machineries. Despite vigorous studies, the nature of the secretion signal for type III secreted proteins still remains elusive. Both mRNA and proteinaceous signals have been proposed. Findings on coupling of translation to secretion by the type III secretion systems are also still contradictory. This study dealt with the secretion signal of HrpA from Pseudomonas syringae pathovar tomato. HrpA is the major component of the type III secretion system-associated Hrp pilus and a substrate for the type III secretion systems. The secretion signal was shown to reside in the first 15 codons or amino acids, a location typical for type III secretion signals. Translation of HrpA in the absence of a functional type III secretion system was established, but it does not exclude the possibility of coupling of translation to secretion when the secretion apparatus is present. The hrpA transcripts from various unrelated plant pathogenic bacteria were shown to be extremely stable. The biological relevance of this observation is unknown, but possible explanations include the high prevalence of HrpA protein, an mRNA secretion signal or timing of secretion. The hrpA mRNAs are stable over a wide range of temperatures, in the absence of translating ribosomes and even in the heterologous host Escherichia coli. The untranslated regions (UTRs) of hrpA transcripts from at least 20 pathovars of Pseudomonas syringae are highly homologous, whilst their coding regions exhibit low similarity. The stable nature of hrpA messenger RNAs is likely to be due to the folding of their 5 and 3 UTRs. In silico the UTRs seem to form stem-loop structures, the hairpin structures in the 3 UTRs being rich in guanidine and cytosine residues. The stable nature of the hrpA transcript redirected the studies to the stabilization of heterologous transcripts and to the use of stable messenger RNAs in recombinant protein production. Fragments of the hrpA transcript can be used to confer stability on heterologous transcripts from several sources of bacterial and eukaryotic origin, and to elevate the levels of production of the corresponding recombinant proteins several folds. hrpA transcript stabilizing elements can be used for improving the yields of recombinant proteins even in Escherichia coli, one of the most commonly used industrial protein production hosts.
Resumo:
Transposable elements, transposons, are discrete DNA segments that are able to move or copy themselves from one locus to another within or between their host genome(s) without a requirement for DNA homology. They are abundant residents in virtually all the genomes studied, for instance, the genomic portion of TEs is approximately 3% in Saccharomyces cerevisiae, 45% in humans, and apparently more than 70% in some plant genomes such as maize and barley. Transposons plays essential role in genome evolution, in lateral transfer of antibiotic resistance genes among bacteria and in life cycle of certain viruses such as HIV-1 and bacteriophage Mu. Despite the diversity of transposable elements they all use a fundamentally similar mechanism called transpositional DNA recombination (transposition) for the movement within and between the genomes of their host organisms. The DNA breakage and joining reactions that underlie their transposition are chemically similar in virtually all known transposition systems. The similarity of the reactions is also reflected in the structure and function of the catalyzing enzymes, transposases and integrases. The transposition reactions take place within the context of a transposition machinery, which can be particularly complex, as in the case of the VLP (virus like particle) machinery of retroelements, which in vivo contains RNA or cDNA and a number of element encoded structural and catalytic proteins. Yet, the minimal core machinery required for transposition comprises a multimer of transposase or integrase proteins and their binding sites at the element DNA ends only. Although the chemistry of DNA transposition is fairly well characterized, the components and function of the transposition machinery have been investigated in detail for only a small group of elements. This work focuses on the identification, characterization, and functional studies of the molecular components of the transposition machineries of BARE-1, Hin-Mu and Mu. For BARE-1 and Hin-Mu transpositional activity has not been shown previously, whereas bacteriophage Mu is a general model of transposition. For BARE-1, which is a retroelement of barley (Hordeum vulgare), the protein and DNA components of the functional VLP machinery were identified from cell extracts. In the case of Hin-Mu, which is a Mu-like prophage in Haemophilus influenzae Rd genome, the components of the core machinery (transposase and its binding sites) were characterized and their functionality was studied by using an in vitro methodology developed for Mu. The function of Mu core machinery was studied for its ability to use various DNA substrates: Hin-Mu end specific DNA substrates and Mu end specific hairpin substrates. The hairpin processing reaction by MuA was characterized in detail. New information was gained of all three machineries. The components or their activity required for functional BARE-1 VLP machinery and retrotransposon life cycle were present in vivo and VLP-like structures could be detected. The Hin-Mu core machinery components were identified and shown to be functional. The components of the Mu and Hin-Mu core machineries were partially interchangeable, reflecting both evolutionary conservation and flexibility within the core machineries. The Mu core machinery displayed surprising flexibility in substrate usage, as it was able to utilize Hin-Mu end specific DNA substrates and to process Mu end DNA hairpin substrates. This flexibility may be evolutionarily and mechanistically important.
Resumo:
While environmental variation is an ubiquitous phenomenon in the natural world which has for long been appreciated by the scientific community recent changes in global climatic conditions have begun to raise consciousness about the economical, political and sociological ramifications of global climate change. Climate warming has already resulted in documented changes in ecosystem functioning, with direct repercussions on ecosystem services. While predicting the influence of ecosystem changes on vital ecosystem services can be extremely difficult, knowledge of the organisation of ecological interactions within natural communities can help us better understand climate driven changes in ecosystems. The role of environmental variation as an agent mediating population extinctions is likely to become increasingly important in the future. In previous studies population extinction risk in stochastic environmental conditions has been tied to an interaction between population density dependence and the temporal autocorrelation of environmental fluctuations. When populations interact with each other, forming ecological communities, the response of such species assemblages to environmental stochasticity can depend, e.g., on trophic structure in the food web and the similarity in species-specific responses to environmental conditions. The results presented in this thesis indicate that variation in the correlation structure between species-specific environmental responses (environmental correlation) can have important qualitative and quantitative effects on community persistence and biomass stability in autocorrelated (coloured) environments. In addition, reddened environmental stochasticity and ecological drift processes (such as demographic stochasticity and dispersal limitation) have important implications for patterns in species relative abundances and community dynamics over time and space. Our understanding of patterns in biodiversity at local and global scale can be enhanced by considering the relevance of different drift processes for community organisation and dynamics. Although the results laid out in this thesis are based on mathematical simulation models, they can be valuable in planning effective empirical studies as well as in interpreting existing empirical results. Most of the metrics considered here are directly applicable to empirical data.
Resumo:
The Baltic Sea is a geologically young, large brackish water basin, and few of the species living there have fully adapted to its special conditions. Many of the species live on the edge of their distribution range in terms of one or more environmental variables such as salinity or temperature. Environmental fluctuations are know to cause fluctuations in populations abundance, and this effect is especially strong near the edges of the distribution range, where even small changes in an environmental variable can be critical to the success of a species. This thesis examines which environmental factors are the most important in relation to the success of various commercially exploited fish species in the northern Baltic Sea. It also examines the uncertainties related to fish stocks current and potential status as well as to their relationship with their environment. The aim is to quantify the uncertainties related to fisheries and environmental management, to find potential management strategies that can be used to reduce uncertainty in management results and to develop methodology related to uncertainty estimation in natural resources management. Bayesian statistical methods are utilized due to their ability to treat uncertainty explicitly in all parts of the statistical model. The results show that uncertainty about important parameters of even the most intensively studied fish species such as salmon (Salmo salar L.) and Baltic herring (Clupea harengus membras L.) is large. On the other hand, management approaches that reduce uncertainty can be found. These include utilising information about ecological similarity of fish stocks and species, and using management variables that are directly related to stock parameters that can be measured easily and without extrapolations or assumptions.
Resumo:
Cell proliferation, transcription and metabolism are regulated by complex partly overlapping signaling networks involving proteins in various subcellular compartments. The objective of this study was to increase our knowledge on such regulatory networks and their interrelationships through analysis of MrpL55, Vig, and Mat1 representing three gene products implicated in regulation of cell cycle, transcription, and metabolism. Genome-wide and biochemical in vitro studies have previously revealed MrpL55 as a component of the large subunit of the mitochondrial ribosome and demonstrated a possible role for the protein in cell cycle regulation. Vig has been implicated in heterochromatin formation and identified as a constituent of the RNAi-induced silencing complex (RISC) involved in cell cycle regulation and RNAi-directed transcriptional gene silencing (TGS) coupled to RNA polymerase II (RNAPII) transcription. Mat1 has been characterized as a regulatory subunit of cyclin-dependent kinase 7 (Cdk7) complex phosphorylating and regulating critical targets involved in cell cycle progression, energy metabolism and transcription by RNAPII. The first part of the study explored whether mRpL55 is required for cell viability or involved in a regulation of energy metabolism and cell proliferation. The results revealed a dynamic requirement of the essential Drosophila mRpL55 gene during development and suggested a function of MrpL55 in cell cycle control either at the G1/S or G2/M transition prior to cell differentiation. This first in vivo characterization of a metazoan-specific constituent of the large subunit of mitochondrial ribosome also demonstrated forth compelling evidence of the interconnection of nuclear and mitochondrial genomes as well as complex functions of the evolutionarily young metazoan-specific mitochondrial ribosomal proteins. In studies on the Drosophila RISC complex regulation, it was noted that Vig, a protein involved in heterochromatin formation, unlike other analyzed RISC associated proteins Argonaute2 and R2D2, is dynamically phosphorylated in a dsRNA-independent manner. Vig displays similarity with a known in vivo substrate for protein kinase C (PKC), human chromatin remodeling factor Ki-1/57, and is efficiently phosphorylated by PKC on multiple sites in vitro. These results suggest that function of the RISC complex protein Vig in RNAi-directed TGS and chromatin modification may be regulated through dsRNA-independent phosphorylation by PKC. In the third part of this study the role of Mat1 in regulating RNAPII transcription was investigated using cultured murine immortal fibroblasts with a conditional allele of Mat1. The results demonstrated that phosphorylation of the carboxy-terminal domain (CTD) of the large subunit of RNAPII in the heptapeptide YSPTSPS repeat in Mat-/- cells was over 10-fold reduced on Serine-5 and subsequently on Serine-2. Occupancy of the hypophosphorylated RNAPII in gene bodies was detectably decreased, whereas capping, splicing, histone methylation and mRNA levels were generally not affected. However, a subset of transcripts in absence of Mat1 was repressed and associated with decreased occupancy of RNAPII at promoters as well as defective capping. The results identify the Cdk7-CycH-Mat1 kinase submodule of TFIIH as a stimulatory non-essential regulator of transcriptional elongation and a genespecific essential factor for stable binding of RNAPII at the promoter region and capping. The results of these studies suggest important roles for both MrpL55 and Mat1 in cell cycle progression and their possible interplay at the G2/M stage in undifferentiated cells. The identified function of Mat1 and of TFIIH kinase complex in gene-specific transcriptional repression is challenging for further studies in regard to a possible link to Vig and RISC-mediated transcriptional gene silencing.
Resumo:
Four GDNF ligands (GDNF, neurturin, artemin and persephin), and mesencephalic astrocyte-derived neurotrophic factor (MANF) and conserved dopamine neurotrophic factor (CDNF) protect midbrain dopaminergic neurons that degenerate in Parkinson's disease. Each GDNF ligand binds a specific coreceptor GDNF family receptor α (GFRα), leading to the formation of a heterotetramer complex, which then interacts with receptor tyrosine kinase RET, the signalling receptor. The present thesis describes the structural and biochemical characterization of the GDNF2-GFRα12 complex and the MANF and CDNF proteins. Previous and current mutation data and comparison between GDNF-GFRα1 and artemin-GFRα3 binding interfaces show that N162GFRα1, I175GFRα1, V230GFRα1, Y120GDNF and L114GDNF are the specificity determinants among different ligand-coreceptor pairs. The structure suggests that sucrose octasulphate, a heparin mimic, interacts with a region R190-K202 within domain 2 of GFRα1. Mutating these residues on the GFRα1 surface, which are not in the GDNF binding region, affected RET phosphorylation, which provides a putative RET binding region in domain 2 and 3 of GFRα1. The structural comparison of the GDNF-GFRα1 and artemin-GFRα3 complexes shows a difference in bend angle between the ligand monomers. This variation in bend angle of the ligand may affect the kinetics of RET phosphorylation. To confirm that the difference is not due to crystallization artefacts, I crystallized the GDNF-GFRα1 complex without SOS in different cell dimensions. The structure of the second GDNF-GFRα1 complex is very similar to the previous one, suggesting that the difference between the artemin-GFRα3 and GDNF-GFRα1 complexes are intrinsic, not due to crystal packing. Finally, MANF and CDNF are bifunctional proteins with extracellular neurotrophic activity and ER resident cytoprotective role. The crystal structures of MANF and CDNF are presented here. Intriguingly, the structures of both the neurotrophic factors do not show structural similarity to any of previously known growth factor superfamilies; instead they are similar to saposins, the lipid-binding proteins. The N-terminal domain of MANF and CDNF contain conserved lysines and arginines on its surface, which may interact with negatively charged head groups of phospholipids, as saposins do. Thus MANF and CDNF may provide neurotrophic activities by interacting with a lipo-receptor. The structure of MANF shows a CXXC motif forming internal disulphide bridge in the natively unfolded C-terminus. This motif is common to reductases and disulphide isomerases. It is thus tempting to speculate that the CXXC motif of MANF and CDNF may be involved in oxidative protein folding, which may explain its cytoprotective role in the ER.
Resumo:
In the present thesis, questions of spectral tuning, the relation of spectral and thermal properties of visual pigments, and evolutionary adaptation to different light environments were addressed using a group of small crustaceans of the genus Mysis as a model. The study was based on microspectrophotometric measurements of visual pigment absorbance spectra, electrophysiological measurements of spectral sensitivities of dark-adapted eyes, and sequencing of the opsin gene retrieved through PCR. The spectral properties were related to the spectral transmission of the respective light environments, as well as to the phylogentic histories of the species. The photoactivation energy (Ea) was estimated from temperature effects on spectral sensitivity in the long-wavelength range, and calculations were made for optimal quantum catch and optimal signal-to-noise ratio in the different light environments. The opsin amino acid sequences of spectrally characterized individuals were compared to find candidate residues for spectral tuning. The general purpose was to clarify to what extent and on what time scale adaptive evolution has driven the functional properties of (mysid) visual pigments towards optimal performance in different light environments. An ultimate goal was to find the molecular mechanisms underlying the spectral tuning and to understand the balance between evolutionary adaptation and molecular constraints. The totally consistent segregation of absorption maxima (λmax) into (shorter-wavelength) marine and (longer-wavelength) freshwater populations suggests that truly adaptive evolution is involved in tuning the visual pigment for optimal performance, driven by selection for high absolute visual sensitivity. On the other hand, the similarity in λmax and opsin sequence between several populations of freshwater M. relicta in spectrally different lakes highlights the limits to adaptation set by evolutionary history and time. A strong inverse correlation between Ea and λmax was found among all visual pigments studied in these respects, including those of M. relicta and 10 species of vertebrate pigments, and this was used to infer thermal noise. The conceptual signal-to-noise ratios thus calculated for pigments with different λmax in the Baltic Sea and Lake Pääjärvi light environments supported the notion that spectral adaptation works towards maximizing the signal-to-noise ratio rather than quantum catch as such. Judged by the shape of absorbance spectra, the visual pigments of all populations of M. relicta and M. salemaai used exclusively the A2 chromophore (3, 4-dehydroretinal). A comparison of amino acid substitutions between M. relicta and M. salemaai indicated that mysid shrimps have a small number of readily available tuning sites to shift between a shorter - and a longer -wavelength opsin. However, phylogenetic history seems to have prevented marine M. relicta from converting back to the (presumably) ancestral opsin form, and thus the more recent reinvention of marine spectral sensitivity has been accomplished by some other novel mechanism, yet to be found
Resumo:
Microbial degradation pathways play a key role in the detoxification and the mineralization of polyaromatic hydrocarbons (PAHs), which are widespread pollutants in soil and constituents of petroleum hydrocarbons. In microbiology the aromatic degradation pathways are traditionally studied from single bacterial strains with capacity to degrade certain pollutant. In soil the degradation of aromatics is performed by a diverse community of micro-organisms. The aim of this thesis was to study biodegradation on different levels starting from a versatile aromatic degrader Sphingobium sp. HV3 and its megaplasmid, extending to revelation of diversity of key catabolic enzymes in the environment and finally studying birch rhizoremediation in PAH-polluted soil. To understand biodegradation of aromatics on bacterial species level, the aromatic degradation capacity of Sphingobium sp. HV3 and the role of the plasmid pSKY4, was studied. Toluene, m-xylene, biphenyl, fluorene, phenanthrene were detected as carbon and energy sources of the HV3 strain. Tn5 transposon mutagenesis linked the degradation capacity of toluene, m-xylene, biphenyl and naphthalene to the pSKY4 plasmid and qPCR expression analysis showed that plasmid extradiol dioxygenases genes (bphC and xylE) are inducted by phenanthrene, m-xylene and biphenyl whereas the 2,4-dichlorophenoxyacetic acid herbicide induced the chlorocatechol 1,2-dioxygenase gene (tfdC) from the ortho-pathway. A method to study upper meta-pathway extradiol dioxygenase gene diversity in soil was developed. The extradiol dioxygenases catalyse cleavage of the aromatic ring between a hydroxylated carbon and an adjacent non-hydroxylated carbon (meta-cleavage). A high diversity of extradiol dioxygenases were detected from polluted soils. The detected extradiol dioxygenases showed sequence similarity to known catabolic genes of Alpha-, Beta-, and Gammaproteobacteria. Five groups of extradiol dioxygenases contained sequences with no close homologues in the database, representing novel genes. In rhizoremediation experiment with birch (Betula pendula) treatment specific changes of extradiol dioxygenase communities were shown. PAH pollution changed the bulk soil extradiol dioxygenase community structure and birch rhizosphere contained a more diverse extradiol dioxygenase community than the bulk soil showing a rhizosphere effect. The degradation of pyrene in soil was enhanced with birch seedlings compared to soil without birch. The complete 280,923 kb nucleotide sequence of pSKY4 plasmid was determined. The open reading frames of pSKY4 were divided into putative conjugative transfer, aromatic degradation, replication/maintaining and transposition/integration function-encoding proteins. Aromatic degradation orfs shared high similarity to corresponding genes in pNL1, a plasmid from the deep subsurface strain Novosphingobium aromaticivorans F199. The plasmid backbones were considerably more divergent with lower similarity, which suggests that the aromatic pathway has functioned as a plasmid independent mobile genetic element. The functional diversity of microbial communities in soil is still largely unknown. Several novel clusters of extradiol dioxygenases representing catabolic bacteria, whose function, biodegradation pathways and phylogenetic position is not known were amplified with single primer pair from polluted soils. These extradiol dioxygenase communities were shown to change upon PAH pollution, which indicates that their hosts function in PAH biodegradation in soil. Although the degradation pathways of specific bacterial species are substantially better depicted than pathways in situ, the evolution of degradation pathways for the xenobiotic compounds is largely unknown. The pSKY4 plasmid contains aromatic degradation genes in putative mobile genetic element causing flexibility/instability to the pathway. The localisation of the aromatic biodegradation pathway in mobile genetic elements suggests that gene transfer and rearrangements are a competetive advantage for Sphingomonas bacteria in the environment.
Resumo:
The first part of this work investigates the molecular epidemiology of a human enterovirus (HEV), echovirus 30 (E-30). This project is part of a series of studies performed in our research team analyzing the molecular epidemiology of HEV-B viruses. A total of 129 virus strains had been isolated in different parts of Europe. The sequence analysis was performed in three different genomic regions: 420 nucleotides (nt) in the VP4/VP2 capsid protein coding region, the entire VP1 capsid protein coding gene of 876 nt, and 150 nt in the VP1/2A junction region. The analysis revealed a succession of dominant sublineages within a major genotype. The temporally earlier genotypes had been replaced by a genetically homogenous lineage that has been circulating in Europe since the late 1970s. The same genotype was found by other research groups in North America and Australia. Globally, other cocirculating genetic lineages also exist. The prevalence of a dominant genotype makes E-30 different from other previously studied HEVs, such as polioviruses and coxsackieviruses B4 and B5, for which several coexisting genetic lineages have been reported. The second part of this work deals with molecular epidemiology of human rhinoviruses (HRVs). A total of 61 field isolates were studied in the 420-nt stretch in the capsid coding region of VP4/VP2. The isolates were collected from children under two years of age in Tampere, Finland. Sequences from the clinical isolates clustered in the two previously known phylogenetic clades. Seasonal clustering was found. Also, several distinct serotype-like clusters were found to co-circulate during the same epidemic season. Reappearance of a cluster after disappearing for a season was observed. The molecular epidemiology of the analyzed strains turned out to be complex, and we decided to continue our studies of HRV. Only five previously published complete genome sequences of HRV prototype strains were available for analysis. Therefore, all designated HRV prototype strains (n=102) were sequenced in the VP4/VP2 region, and the possibility of genetic typing of HRV was evaluated. Seventy-six of the 102 prototype strains clustered in HRV genetic group A (HRV-A) and 25 in group B (HRV-B). Serotype 87 clustered separately from other HRVs with HEV species D. The field strains of HRV represented as many as 19 different genotypes, as judged with an approximate demarcation of a 20% nt difference in the VP4/VP2 region. The interserotypic differences of HRV were generally similar to those reported between different HEV serotypes (i.e. about 20%), but smaller differences, less than 10%, were also observed. Because some HRV serotypes are genetically so closely related, we suggest that the genetic typing be performed using the criterion "the closest prototype strain". This study is the first systematic genetic characterization of all known HRV prototype strains, providing a further taxonomic proposal for classification of HRV. We proposed to divide the genus Human rhinoviruses into HRV-A and HRV-B. The final part of the work comprises a phylogenetic analysis of a subset (48) of HRV prototype strains and field isolates (12) in the nonstructural part of the genome coding for the RNA-dependent RNA polymerase (3D). The proposed division of the HRV strains in the species HRV-A and HRV-B was also supported by 3D region. HRV-B clustered closer to HEV species B, C, and also to polioviruses than to HRV-A. Intraspecies variation within both HRV-A and HRV-B was greater in the 3D coding region than in the VP4/VP2 coding region, in contrast to HEV. Moreover, the diversity of HRV in 3D exceeded that of HEV. One group of HRV-A, designated HRV-A', formed a separate cluster outside other HRV-A in the 3D region. It formed a cluster also in the capsid region, but located within HRV-A. This may reflect a different evolutionary history of distinct genomic regions among HRV-A. Furthermore, the tree topology within HRV-A in the 3D region differed from that in the VP4/VP2, suggesting possible recombination events in the evolution of the strains. No conflicting phylogenies were observed in any of the 12 field isolates. Possible recombination was further studied using the Similarity and Bootscanning analyses of the complete genome sequences of HRV available in public databases. Evidence for recombination among HRV-A was found, as HRV2 and HRV39 showed higher similarity in the nonstructural part of the genome. Whether HRV2 and HRV39 strains - and perhaps also some other HRV-A strains not yet completely sequenced - are recombinants remains to be determined.