40 results for Sequence dependent setups
Abstract:
Bacteria play an important role in many ecological systems. The molecular characterization of bacteria using either cultivation-dependent or cultivation-independent methods reveals the large scale of bacterial diversity in natural communities, and the vastness of subpopulations within a species or genus. Understanding how bacterial diversity varies across different environments and also within populations should provide insights into many important questions of bacterial evolution and population dynamics. This thesis presents novel statistical methods for analyzing bacterial diversity using widely employed molecular fingerprinting techniques. The first objective of this thesis was to develop Bayesian clustering models to identify bacterial population structures. Bacterial isolates were identified using multilocus sequence typing (MLST), and Bayesian clustering models were used to explore the evolutionary relationships among isolates. Our method involves the inference of genetic population structures via an unsupervised clustering framework where the dependence between loci is represented using graphical models. The population dynamics that generate such a population stratification were investigated using a stochastic model, in which homologous recombination between subpopulations can be quantified within a gene flow network. The second part of the thesis focuses on cluster analysis of community compositional data produced by two different cultivation-independent analyses: terminal restriction fragment length polymorphism (T-RFLP) analysis, and fatty acid methyl ester (FAME) analysis. The cluster analysis aims to group bacterial communities that are similar in composition, which is an important step for understanding the overall influences of environmental and ecological perturbations on bacterial diversity.
A common feature of T-RFLP and FAME data is zero-inflation, which indicates that the observation of a zero value is much more frequent than would be expected, for example, from a Poisson distribution in the discrete case, or a Gaussian distribution in the continuous case. We provided two strategies for modeling zero-inflation in the clustering framework, which were validated on both synthetic and empirical complex data sets. We show in the thesis that our model, which takes into account dependencies between loci in MLST data, can produce better clustering results than methods that assume independent loci. Furthermore, computer algorithms that are efficient in analyzing large-scale data were adopted to meet the increasing computational need. Our method for detecting homologous recombination in subpopulations may provide a theoretical criterion for defining bacterial species. The clustering of bacterial community data, including T-RFLP and FAME data, provides an initial effort toward discovering the evolutionary dynamics that structure and maintain bacterial diversity in the natural environment.
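To make the notion of zero-inflation concrete, the sketch below contrasts an ordinary Poisson distribution with a zero-inflated Poisson mixture. The abstract does not say which two modeling strategies the thesis uses, so this is only a standard textbook formulation given for illustration; the function names and the parameters `lam` (Poisson mean) and `pi` (probability of a structural zero) are assumptions, not taken from the thesis.

```python
import math


def poisson_pmf(k: int, lam: float) -> float:
    """Probability of observing count k under a Poisson distribution with mean lam."""
    return math.exp(-lam) * lam ** k / math.factorial(k)


def zip_pmf(k: int, lam: float, pi: float) -> float:
    """Zero-inflated Poisson (illustrative assumption, not the thesis model).

    With probability pi the observation is a structural zero; otherwise it is
    drawn from Poisson(lam). Zeros therefore occur more often than a plain
    Poisson distribution with the same lam would predict.
    """
    base = (1.0 - pi) * poisson_pmf(k, lam)
    return pi + base if k == 0 else base


if __name__ == "__main__":
    lam, pi = 2.0, 0.3  # hypothetical parameter values
    print("P(0) under Poisson:", poisson_pmf(0, lam))
    print("P(0) under zero-inflated Poisson:", zip_pmf(0, lam, pi))
```

In a clustering setting, a mixture of this kind would typically replace the per-cluster count likelihood, so that the many zeros in the fingerprint data do not dominate the fit.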
Abstract:
The analysis of sequential data is required in many diverse areas such as telecommunications, stock market analysis, and bioinformatics. A basic problem related to the analysis of sequential data is the sequence segmentation problem. A sequence segmentation is a partition of the sequence into a number of non-overlapping segments that cover all data points, such that each segment is as homogeneous as possible. This problem can be solved optimally using a standard dynamic programming algorithm. In the first part of the thesis, we present a new approximation algorithm for the sequence segmentation problem. This algorithm has a smaller running time than the optimal dynamic programming algorithm, while retaining a bounded approximation ratio. The basic idea is to divide the input sequence into subsequences, solve the problem optimally in each subsequence, and then appropriately combine the solutions to the subproblems into one final solution. In the second part of the thesis, we study alternative segmentation models that are devised to better fit the data. More specifically, we focus on clustered segmentations and segmentations with rearrangements. While in the standard segmentation of a multidimensional sequence all dimensions share the same segment boundaries, in a clustered segmentation the multidimensional sequence is segmented in such a way that dimensions are allowed to form clusters. Each cluster of dimensions is then segmented separately. We formally define the problem of clustered segmentations and we experimentally show that segmenting sequences using this segmentation model leads to solutions with smaller error for the same model cost. Segmentation with rearrangements is a novel variation of the segmentation problem: in addition to partitioning the sequence, we also seek to apply a limited amount of reordering, so that the overall representation error is minimized.
We formulate the problem of segmentation with rearrangements and we show that it is an NP-hard problem to solve or even to approximate. We devise effective algorithms for the proposed problem, combining ideas from dynamic programming and outlier detection algorithms in sequences. In the final part of the thesis, we discuss the problem of aggregating results of segmentation algorithms on the same set of data points. In this case, we are interested in producing a partitioning of the data that agrees as much as possible with the input partitions. We show that this problem can be solved optimally in polynomial time using dynamic programming. Furthermore, we show that not all data points are candidates for segment boundaries in the optimal solution.
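The standard dynamic programming solution mentioned in this abstract can be sketched as follows. This is a minimal illustration, not the thesis's code: it assumes a one-dimensional sequence and measures homogeneity by the sum of squared deviations from each segment's mean, a common choice that the abstract itself does not specify. The function name `optimal_segmentation` is hypothetical.

```python
from typing import List, Tuple


def optimal_segmentation(seq: List[float], k: int) -> Tuple[float, List[int]]:
    """Split seq into k contiguous segments with minimum total squared error.

    Returns (error, boundaries), where boundaries are the start indices of
    segments 2..k. Runs in O(k * n^2) time.
    """
    n = len(seq)
    if not 1 <= k <= n:
        raise ValueError("k must satisfy 1 <= k <= len(seq)")

    # Prefix sums of values and squared values give O(1) segment costs.
    s = [0.0] * (n + 1)
    q = [0.0] * (n + 1)
    for i, x in enumerate(seq):
        s[i + 1] = s[i] + x
        q[i + 1] = q[i] + x * x

    def cost(i: int, j: int) -> float:
        """Squared error of representing seq[i:j] by its mean (requires j > i)."""
        total = s[j] - s[i]
        return (q[j] - q[i]) - total * total / (j - i)

    inf = float("inf")
    # err[m][j]: best error for the first j points using exactly m segments.
    err = [[inf] * (n + 1) for _ in range(k + 1)]
    back = [[0] * (n + 1) for _ in range(k + 1)]
    err[0][0] = 0.0
    for m in range(1, k + 1):
        for j in range(m, n + 1):
            for i in range(m - 1, j):
                c = err[m - 1][i] + cost(i, j)
                if c < err[m][j]:
                    err[m][j] = c
                    back[m][j] = i

    # Walk the back-pointers to recover where each segment starts.
    starts = []
    j = n
    for m in range(k, 0, -1):
        j = back[m][j]
        starts.append(j)
    starts.reverse()
    return err[k][n], starts[1:]


if __name__ == "__main__":
    error, boundaries = optimal_segmentation([1, 1, 1, 5, 5, 5, 9, 9], 3)
    print(error, boundaries)
```

The quadratic dependence on the sequence length is what motivates the faster approximation algorithm described in the abstract, which solves subsequences separately and then combines their solutions.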
Abstract:
The results presented in this thesis show that not all females of a given population necessarily choose similar mating partners. Specifically, partner preferences of a fish, the sand goby (Pomatoschistus minutus), varied among individual females and depended on the social context at the time of choice. I also show that females assess multiple mate choice cues simultaneously; partner preferences were based more strongly on an interaction effect between different choice cues than on any individual cue. Furthermore, I found that preferred matings involved fitness benefits in the form of increased offspring success, but these benefits were not significantly affected by mate compatibility. Hence, mate choice for partner compatibility does not appear to be an important determinant of the observed variation in female mate preferences in this species. The context-dependency of female mating preferences revealed here is relevant to how genetic variation in sexually selected traits might be maintained: as the mating success of a certain male type varies according to the choice context, directional sexual selection on male traits is shown to be less intense than generally thought, making for a slower loss of genetic variation in these traits. Mating preferences of sand gobies were assessed by giving females a binary choice between males that differed in body size and/or other focus traits. These association preferences were found to be sexually motivated, repeatable and to correspond to actual mating decisions.
Abstract:
Cell adhesion and extracellular matrix (ECM) molecules play a significant role in neuronal plasticity both during development and in the adult. Plastic changes in which ECM components are implicated may underlie important nervous system functions, such as memory formation and learning. Heparin-binding growth-associated molecule (HB-GAM, also known as pleiotrophin) is an ECM protein involved in neurite outgrowth, axonal guidance and synaptogenesis during the perinatal period. In the adult brain HB-GAM expression is restricted to the regions which display pronounced synaptic plasticity (e.g., hippocampal CA3-CA1 areas, cerebral cortex laminae II-IV, olfactory bulb). Expression of HB-GAM is regulated in an activity-dependent manner and is also induced in response to neuronal injury. In this work mutant mice were used to study the in vivo function of HB-GAM and its receptor syndecan-3 in hippocampal synaptic plasticity and in hippocampus-dependent behavioral tasks. Phenotypic analysis of HB-GAM null mutants and mice overexpressing HB-GAM revealed that opposite genetic manipulations result in reverse changes in synaptic plasticity as well as behavior in the mutants. Electrophysiological recordings showed that mice lacking HB-GAM have an increased level of long-term potentiation (LTP) in the area CA1 of hippocampus and impaired spatial learning, whereas animals with enhanced level of HB-GAM expression have attenuated LTP, but outperformed their wild-type controls in spatial learning. It was also found that GABA(A) receptor-mediated synaptic transmission is altered in the transgenic mice overexpressing HB-GAM. The results suggest that these animals have accentuated hippocampal GABAergic inhibition, which may contribute to the altered glutamatergic synaptic plasticity. Structural studies of HB-GAM demonstrated that this protein belongs to the thrombospondin type I repeat (TSR) superfamily and contains two β-sheet domains connected by a flexible linker.
It was found that didomain structure is necessary for biological activity of HB-GAM and electrophysiological phenotype displayed by the HB-GAM mutants. The individual domains displayed weaker binding to heparan sulfate and failed to promote neurite outgrowth as well as affect hippocampal LTP. Effects of HB-GAM on hippocampal synaptic plasticity are believed to be mediated by one of its (co-)receptor molecules, namely syndecan-3. In support of that, HB-GAM did not attenuate LTP in mice deficient in syndecan-3 as it did in wild-type controls. In addition, syndecan-3 knockout mice displayed electrophysiological and behavioral phenotype similar to that of HB-GAM knockouts (i.e. enhanced LTP and impaired learning in Morris water-maze). Thus HB-GAM and syndecan-3 are important modulators of synaptic plasticity in hippocampus and play a role in regulation of learning-related behavior.
Abstract:
The work covered in this thesis is focused on the development of technology for bioconversion of glucose into D-erythorbic acid (D-EA) and 5-ketogluconic acid (5-KGA). The task was to show on proof-of-concept level the functionality of the enzymatic conversion or one-step bioconversion of glucose to these acids. The feasibility of both studies to be further developed for production processes was also evaluated. The glucose - D-EA bioconversion study was based on the use of a cloned gene encoding a D-EA forming soluble flavoprotein, D-gluconolactone oxidase (GLO). GLO was purified from Penicillium cyaneo-fulvum and partially sequenced. The peptide sequences obtained were used to isolate a cDNA clone encoding the enzyme. The cloned gene (GenBank accession no. AY576053) is homologous to the other known eukaryotic lactone oxidases and also to some putative prokaryotic lactone oxidases. Analysis of the deduced protein sequence of GLO indicated the presence of a typical secretion signal sequence at the N-terminus of the enzyme. No other targeting/anchoring signals were found, suggesting that GLO is the first known lactone oxidase that is secreted rather than targeted to the membranes of the endoplasmic reticulum or mitochondria. Experimental evidence supports this analysis, as near complete secretion of GLO was observed in two different yeast expression systems. Highest expression levels of GLO were obtained using Pichia pastoris as an expression host. Recombinant GLO was characterised and the suitability of purified GLO for the production of D-EA was studied. Immobilised GLO was found to be rapidly inactivated during D-EA production. The feasibility of in vivo glucose - D-EA conversion using a P. pastoris strain co-expressing the genes of GLO and glucose oxidase (GOD, E.C. 1.1.3.4) of A. niger was demonstrated. The glucose - 5-KGA bioconversion study followed a similar strategy to that used in the D-EA production research. 
The rationale was based on the use of a cloned gene encoding a membrane-bound pyrroloquinoline quinone (PQQ)-dependent gluconate 5-dehydrogenase (GA 5-DH). GA 5-DH was purified to homogeneity from the only source of this enzyme known in literature, Gluconobacter suboxydans, and partially sequenced. Using the amino acid sequence information, the GA 5-DH gene was cloned from a genomic library of G. suboxydans. The cloned gene was sequenced (GenBank accession no. AJ577472) and found to be an operon of two adjacent genes encoding two subunits of GA 5-DH. It turned out that GA 5-DH is a rather close homologue of a sorbitol dehydrogenase from another G. suboxydans strain. It was also found that GA 5-DH has significant polyol dehydrogenase activity. The G. suboxydans GA 5-DH gene was poorly expressed in E. coli. Under optimised conditions maximum expression levels of GA 5-DH did not exceed the levels found in wild-type G. suboxydans. Attempts to increase expression levels resulted in repression of growth and extensive cell lysis. However, the expression levels were sufficient to demonstrate the possibility of bioconversion of glucose and gluconate into 5-KGA using recombinant strains of E. coli. An uncharacterised homologue of GA 5-DH was identified in Xanthomonas campestris using in silico screening. This enzyme encoded by chromosomal locus NP_636946 was found by a sequencing project of X. campestris and named as a hypothetical glucose dehydrogenase. The gene encoding this uncharacterised enzyme was cloned, expressed in E. coli and found to encode a gluconate/polyol dehydrogenase without glucose dehydrogenase activity. Moreover, the X. campestris GA 5-DH gene was expressed in E. coli at nearly 30 times higher levels than the G. suboxydans GA 5-DH gene. Good expressability of the X. campestris GA 5-DH gene makes it a valuable tool not only for 5-KGA production in the tartaric acid (TA) bioprocess, but possibly also for other bioprocesses (e.g. oxidation of sorbitol into L-sorbose).
In addition to glucose - 5-KGA bioconversion, a preliminary study of the feasibility of enzymatic conversion of 5-KGA into TA was carried out. Here, the efficacy of the first step of a prospective two-step conversion route including a transketolase and a dehydrogenase was confirmed. It was found that transketolase converts 5-KGA into TA semialdehyde. A candidate for the second step was suggested to be succinic dehydrogenase, but this was not tested. The analysis of the two subprojects indicated that bioconversion of glucose to TA using X. campestris GA 5-DH should be prioritised and that future process development efforts should focus on developing more efficient GA 5-DH production strains by screening for a more suitable production host and by protein engineering.
Abstract:
For most RNA viruses RNA-dependent RNA polymerases (RdRPs) encoded by the virus are responsible for the entire RNA metabolism. Thus, RdRPs are critical components in the viral life cycle. However, it is not fully understood how these important enzymes function during viral replication. Double-stranded RNA (dsRNA) viruses perform the synthesis of their RNA genome within a proteinaceous viral particle containing an RdRP as a minor constituent. The phi6 bacteriophage is the best-studied dsRNA virus, providing an excellent background for studies of its RNA synthesis. The purified recombinant phi6 RdRP is highly active in vitro and it possesses both RNA replication and transcription activities. The crystal structure of the phi6 polymerase, solved in complex with a number of ligands, provides a working model for detailed in vitro studies of RNA-dependent RNA polymerization. In this thesis, the primer-independent initiation of the phi6 RdRP was studied in vitro using biochemical and structural methods. A C-terminal, four-amino-acid-long loop protruding into the central cavity of the phi6 RdRP has been suggested to stabilize the incoming nucleotides of the initiation complex formation through stacking interactions. A similar structural element has been found in several other viral RdRPs. In this thesis, this so-called initiation platform loop was subjected to site-directed mutagenesis to address its role in the initiation. It was found that the initiation mode of the mutants is primer-dependent, requiring either an oligonucleotide primer or a back-priming initiation mechanism for the RNA synthesis. The crystal structure of a mutant RdRP with altered initiation platform revealed a set of contacts important for primer-independent initiation. Since phi6 RdRP is structurally and functionally homologous to several viral RdRPs, among them the hepatitis C virus RdRP, these results provide further general insight into primer-independent initiation.
In this study it is demonstrated that manganese phasing could be used as a practical tool for solving structures of large proteins with a bound manganese ion. The phi6 RdRP was used as a case study to obtain phases for crystallographic analysis. Manganese ions are naturally bound to the phi6 RdRP at the palm domain of the enzyme. In a crystallographic experiment, X-ray diffraction data from a phi6 RdRP crystal were collected at a wavelength of 1.89 Å, which is the K edge of manganese. With this data an automatically built model of the core region of the protein could be obtained. Finally, in this work terminal nucleotidyl transferase (TNTase) activity of the phi6 RdRP was documented in the isolated polymerase as well as in the viral particle. This is the first time that such an activity has been reported in a polymerase of a dsRNA virus. The phi6 RdRP used uridine triphosphates as the sole substrate in a TNTase reaction but could accept several heterologous templates. The RdRP was able to add one or a few non-templated nucleotides to the 3' end of the single- or double-stranded RNA substrate. Based on the results on particle-mediated TNTase activity and previous structural information of the polymerase, a model for termination of the RNA-dependent RNA synthesis is suggested in this thesis.
Abstract:
Evolutionary genetics incorporates traditional population genetics and studies of the origins of genetic variation by mutation and recombination, and the molecular evolution of genomes. Among the primary forces that have potential to affect the genetic variation within and among populations, including those that may lead to adaptation and speciation, are genetic drift, gene flow, mutations and natural selection. The main challenge in knowing the genetic basis of evolutionary changes is to distinguish the adaptive selection forces that cause existent DNA sequence variants and also to identify the nucleotide differences responsible for the observed phenotypic variation. To understand the effects of various forces, interpretation of gene sequence variation has been the principal basis of many evolutionary genetic studies. The main aim of this thesis was to assess different forms of teleost gene sequence polymorphisms in evolutionary genetic studies of Atlantic salmon (Salmo salar) and other species. Firstly, the level of Darwinian adaptive evolution affecting the coding regions of the growth hormone (GH) gene during teleost evolution was investigated based on the sequence data existing in public databases. Secondly, a target gene approach was used to identify within-population variation in the growth hormone 1 (GH1) gene in salmon. Then, a new strategy for single nucleotide polymorphism (SNP) discovery in salmonid fishes was introduced, and, finally, the usefulness of a limited number of SNP markers as molecular tools in several applications of population genetics in Atlantic salmon was assessed. This thesis showed that the gene sequences in databases can be utilized to perform comparative studies of molecular evolution, and some putative evidence of the existence of Darwinian selection during teleost GH evolution was presented.
In addition, existent sequence data was exploited to investigate GH1 gene variation within Atlantic salmon populations throughout its range. Purifying selection is suggested to be the predominant evolutionary force controlling the genetic variation of this gene in salmon, and some support for gene flow between continents was also observed. The novel approach to SNP discovery in species with duplicated genome fragments introduced here proved to be an effective method, and this may have several applications in evolutionary genetics with different species - e.g. when developing gene-targeted markers to investigate quantitative genetic variation. The thesis also demonstrated that only a few SNPs produced signals highly similar to those of microsatellite markers in some of the population genetic analyses. This may have useful applications when estimating genetic diversity in genes having a potential role in ecological and conservation issues, or when using difficult biological samples in genetic studies, as SNPs can be applied to relatively highly degraded DNA.
Abstract:
Visual pigments of different animal species must have evolved at some stage to match the prevailing light environments, since all visual functions depend on their ability to absorb available photons and transduce the event into a reliable neural signal. There is a large literature on correlation between the light environment and spectral sensitivity between different fish species. However, little work has been done on evolutionary adaptation between separated populations within species. More generally, little is known about the rate of evolutionary adaptation to changing spectral environments. The objective of this thesis is to illuminate the constraints under which the evolutionary tuning of visual pigments works as evident in: scope, tempo, available molecular routes, and signal/noise trade-offs. Aquatic environments offer Nature's own laboratories for research on visual pigment properties, as naturally occurring light environments offer an enormous range of variation in both spectral composition and intensity. The present thesis focuses on the visual pigments that serve dim-light vision in two groups of model species, teleost fishes and mysid crustaceans. The geographical emphasis is on the brackish Baltic Sea area with its well-known postglacial isolation history and its aquatic fauna of both marine and fresh-water origin. The absorbance spectrum of the (single) dim-light visual pigment was recorded by microspectrophotometry (MSP) in single rods of 26 fish species and single rhabdoms of 8 opossum shrimp populations of the genus Mysis inhabiting marine, brackish or freshwater environments. Additionally, spectral sensitivity was determined from six Mysis populations by electroretinogram (ERG) recording. The rod opsin gene was sequenced in individuals of four allopatric populations of the sand goby (Pomatoschistus minutus). Rod opsins of two other goby species were investigated as outgroups for comparison.
Rod absorbance spectra of the Baltic subspecies or populations of the primarily marine species herring (Clupea harengus membras), sand goby (P. minutus), and flounder (Platichthys flesus) were long-wavelength-shifted compared to their marine populations. The spectral shifts are consistent with adaptation for improved quantum catch (QC) as well as improved signal-to-noise ratio (SNR) of vision in the Baltic light environment. Since the chromophore of the pigment was pure A1 in all cases, this has apparently been achieved by evolutionary tuning of the opsin visual pigment. By contrast, no opsin-based differences were evident between lake and sea populations of species of fresh-water origin, which can tune their pigment by varying chromophore ratios. A more detailed analysis of differences in absorbance spectra and opsin sequence between and within populations was conducted using the sand goby as model species. Four allopatric populations from the Baltic Sea (B), Swedish west coast (S), English Channel (E), and Adriatic Sea (A) were examined. Rod absorbance spectra, characterized by the wavelength of maximum absorbance (λmax), differed between populations and correlated with differences in the spectral light transmission of the respective water bodies. The greatest λmax shift as well as the greatest opsin sequence difference was between the Baltic and the Adriatic populations. The significant within-population variation of the Baltic λmax values (506-511 nm) was analyzed on the level of individuals and was shown to correlate well with opsin sequence substitutions. The sequences of individuals with λmax at shorter wavelengths were identical to that of the Swedish population, whereas those with λmax at longer wavelengths additionally had substitution F261F/Y in the sixth transmembrane helix of the protein. This substitution (Y261) was also present in the Baltic common gobies and is known to redshift spectra. 
The tuning mechanism of the long-wavelength type Baltic sand gobies is assumed to be the co-expression of F261 and Y261 in all rods to produce ≈ 5 nm redshift. The polymorphism of the Baltic sand goby population possibly indicates ambiguous selection pressures in the Baltic Sea. The visual pigments of all lake populations of the opossum shrimp (Mysis relicta) were red-shifted by 25 nm compared with all Baltic Sea populations. This is calculated to confer a significant advantage in both QC and SNR in many humus-rich lakes with reddish water. Since only A2 chromophore was present, the differences obviously reflect evolutionary tuning of the visual protein, the opsin. The changes have occurred within the ca. 9000 years that the lakes have been isolated from the Sea after the most recent glaciation. At present, it seems that the mechanism explaining the spectral differences between lake and sea populations is not an amino acid substitution at any other conventional tuning site, but the mechanism is yet to be found.
Abstract:
The first part of this work investigates the molecular epidemiology of a human enterovirus (HEV), echovirus 30 (E-30). This project is part of a series of studies performed in our research team analyzing the molecular epidemiology of HEV-B viruses. A total of 129 virus strains had been isolated in different parts of Europe. The sequence analysis was performed in three different genomic regions: 420 nucleotides (nt) in the VP4/VP2 capsid protein coding region, the entire VP1 capsid protein coding gene of 876 nt, and 150 nt in the VP1/2A junction region. The analysis revealed a succession of dominant sublineages within a major genotype. The temporally earlier genotypes had been replaced by a genetically homogenous lineage that has been circulating in Europe since the late 1970s. The same genotype was found by other research groups in North America and Australia. Globally, other cocirculating genetic lineages also exist. The prevalence of a dominant genotype makes E-30 different from other previously studied HEVs, such as polioviruses and coxsackieviruses B4 and B5, for which several coexisting genetic lineages have been reported. The second part of this work deals with molecular epidemiology of human rhinoviruses (HRVs). A total of 61 field isolates were studied in the 420-nt stretch in the capsid coding region of VP4/VP2. The isolates were collected from children under two years of age in Tampere, Finland. Sequences from the clinical isolates clustered in the two previously known phylogenetic clades. Seasonal clustering was found. Also, several distinct serotype-like clusters were found to co-circulate during the same epidemic season. Reappearance of a cluster after disappearing for a season was observed. The molecular epidemiology of the analyzed strains turned out to be complex, and we decided to continue our studies of HRV. Only five previously published complete genome sequences of HRV prototype strains were available for analysis. 
Therefore, all designated HRV prototype strains (n=102) were sequenced in the VP4/VP2 region, and the possibility of genetic typing of HRV was evaluated. Seventy-six of the 102 prototype strains clustered in HRV genetic group A (HRV-A) and 25 in group B (HRV-B). Serotype 87 clustered separately from other HRVs with HEV species D. The field strains of HRV represented as many as 19 different genotypes, as judged with an approximate demarcation of a 20% nt difference in the VP4/VP2 region. The interserotypic differences of HRV were generally similar to those reported between different HEV serotypes (i.e. about 20%), but smaller differences, less than 10%, were also observed. Because some HRV serotypes are genetically so closely related, we suggest that the genetic typing be performed using the criterion "the closest prototype strain". This study is the first systematic genetic characterization of all known HRV prototype strains, providing a further taxonomic proposal for classification of HRV. We proposed to divide the genus Human rhinoviruses into HRV-A and HRV-B. The final part of the work comprises a phylogenetic analysis of a subset (48) of HRV prototype strains and field isolates (12) in the nonstructural part of the genome coding for the RNA-dependent RNA polymerase (3D). The proposed division of the HRV strains in the species HRV-A and HRV-B was also supported by 3D region. HRV-B clustered closer to HEV species B, C, and also to polioviruses than to HRV-A. Intraspecies variation within both HRV-A and HRV-B was greater in the 3D coding region than in the VP4/VP2 coding region, in contrast to HEV. Moreover, the diversity of HRV in 3D exceeded that of HEV. One group of HRV-A, designated HRV-A', formed a separate cluster outside other HRV-A in the 3D region. It formed a cluster also in the capsid region, but located within HRV-A. This may reflect a different evolutionary history of distinct genomic regions among HRV-A. 
Furthermore, the tree topology within HRV-A in the 3D region differed from that in the VP4/VP2, suggesting possible recombination events in the evolution of the strains. No conflicting phylogenies were observed in any of the 12 field isolates. Possible recombination was further studied using the Similarity and Bootscanning analyses of the complete genome sequences of HRV available in public databases. Evidence for recombination among HRV-A was found, as HRV2 and HRV39 showed higher similarity in the nonstructural part of the genome. Whether HRV2 and HRV39 strains - and perhaps also some other HRV-A strains not yet completely sequenced - are recombinants remains to be determined.
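The "closest prototype strain" criterion described in this abstract can be illustrated with a small sketch. It assumes pre-aligned sequences of equal length and uses the uncorrected proportion of differing nucleotides (p-distance); the abstract does not state which distance measure or software was used, so both are assumptions. The function names and the toy sequences are hypothetical and are not real HRV data.

```python
from typing import Dict, Tuple


def p_distance(a: str, b: str) -> float:
    """Proportion of differing sites between two aligned nucleotide sequences.

    Sites where either sequence has a gap or an ambiguous base are skipped.
    """
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    pairs = [(x, y) for x, y in zip(a.upper(), b.upper())
             if x in "ACGT" and y in "ACGT"]
    if not pairs:
        raise ValueError("no comparable sites")
    return sum(x != y for x, y in pairs) / len(pairs)


def closest_prototype(query: str, prototypes: Dict[str, str]) -> Tuple[str, float]:
    """Return the name of the nearest prototype and its distance to the query."""
    name = min(prototypes, key=lambda n: p_distance(query, prototypes[n]))
    return name, p_distance(query, prototypes[name])


if __name__ == "__main__":
    # Toy aligned fragments, invented for illustration only.
    prototypes = {"P1": "ACGTACGTAC", "P2": "ACGTTTTTAC"}
    name, dist = closest_prototype("ACGTACGTAA", prototypes)
    print(name, dist)
```

Under the approximate demarcation mentioned in the abstract, a field strain whose distance to its nearest prototype exceeds about 20% in the VP4/VP2 region would be treated as a different genotype, while closely related serotypes are resolved by taking the nearest prototype.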
Abstract:
Double-stranded RNA (dsRNA) viruses encode only a single protein species that contains RNA-dependent RNA polymerase (RdRP) motifs. This protein is a central component in the life cycle of a dsRNA virus, carrying out both RNA transcription and replication. The architecture of viral RdRPs resembles that of a 'cupped right hand' with fingers, palm and thumb domains. Those applying de novo initiation have additional structural features, including a flexible C-terminal domain that constitutes the priming platform. Moreover, viral RdRPs must be able to interact with the incoming 3'-terminus of the template and position it so that a productive binary complex is formed. Bacteriophage phi6 of the Cystoviridae family is to date one of the best studied dsRNA viruses. The purified recombinant phi6 RdRP is highly active in vitro and possesses both RNA replication and transcription activities. The extensive biochemical observations and the atomic-level crystal structure of the phi6 RdRP provide an excellent platform for in-depth studies of RNA replication in vitro. In this thesis, targeted structure-based mutagenesis, enzymatic assays and molecular mapping of phi6 RdRP and its RNA were used to elucidate the formation of productive RNA-polymerase binary complexes. The positively charged rim of the template tunnel was shown to have a significant role in the engagement of highly structured ssRNA molecules, whereas specific interactions further down in the template tunnel promote ssRNA entry to the catalytic site. This work demonstrated that by aiding the formation of a stable binary complex with optimized RNA templates, the overall polymerization activity of the phi6 RdRP can be greatly enhanced. Furthermore, proteolyzed phi6 RdRPs that possess a nick in the polypeptide chain at the hinge region, which is part of the extended loop, were better suited for catalysis at higher temperatures whilst favouring back-primed initiation.
The clipped C-terminus remains associated with the main body of the polymerase, and the hinge region, although structurally disordered, is involved in the control of C-terminal domain displacement. The accumulated know-how on bacteriophage phi6 was utilized in the development of two technologies for the production of dsRNA: (i) an in vitro system that combines the T7 RNA polymerase and the phi6 RdRP to generate dsRNA molecules of practically unlimited length, and (ii) an in vivo RNA replication system based on restricted infection with phi6 polymerase complexes in bacterial cells to produce virtually unlimited amounts of dsRNA. The pools of small interfering RNAs derived from dsRNA produced by these systems were validated and shown to efficiently decrease the expression of both exogenous and endogenous targets.
Abstract:
Mutation and recombination are the fundamental processes leading to genetic variation in natural populations. This variation forms the raw material for evolution through natural selection and drift. Therefore, studying mutation rates may reveal information about evolutionary histories as well as phylogenetic interrelationships of organisms. In this thesis two molecular tools, DNA barcoding and the molecular clock, were examined. In the first part, the efficiency of mutations to delineate closely related species was tested and the implications for conservation practices were assessed. The second part investigated the proposition that a constant mutation rate exists within invertebrates, in the form of a metabolic-rate dependent molecular clock, which can be applied to accurately date speciation events. DNA barcoding aspires to be an efficient technique to not only distinguish between species but also reveal population-level variation, relying solely on mutations found on a short stretch of a single gene. In this thesis barcoding was applied to discriminate between Hylochares populations from Russian Karelia and new Hylochares findings from the greater Helsinki region in Finland. Although barcoding failed to delineate the two reproductively isolated groups, their distinct morphological features and differing life-history traits led to their classification as two closely related, although separate, species. The lack of genetic differentiation appears to be due to a recent divergence event not yet reflected in the beetles' molecular make-up. Thus, the Russian Hylochares was described as a new species. The Finnish species, previously considered locally extinct, was recognized as endangered. Even if, due to their identical genetic make-up, the populations had been regarded as conspecific, conservation strategies based on prior knowledge from Russia would not have guaranteed the survival of the Finnish beetle.
Therefore, new conservation actions based on detailed studies of the biology and life-history of the Finnish Hylochares were conducted to protect this endemic rarity in Finland. The idea behind the strict molecular clock is that mutation rates are constant over evolutionary time and may thus be used to infer species divergence dates. However, one of the most recent theories argues that a strict clock does not tick per unit of time but that it has a constant substitution rate per unit of mass-specific metabolic energy. Therefore, according to this hypothesis, molecular clocks have to be recalibrated taking body size and temperature into account. This thesis tested the temperature effect on mutation rates in equally sized invertebrates. For the first dataset (family Eucnemidae, Coleoptera) the phylogenetic interrelationships and evolutionary history of the genus Arrhipis had to be inferred before the influence of temperature on substitution rates could be studied. Further, a second, larger invertebrate dataset (family Syrphidae, Diptera) was employed. Several methodological approaches, a number of genes and multiple molecular clock models revealed that there was no consistent relationship between temperature and mutation rate for the taxa under study. Thus, the body size effect, observed in vertebrates but controversial for invertebrates, rather than temperature, may be the underlying driving force behind the metabolic-rate dependent molecular clock. Therefore, the metabolic-rate dependent molecular clock does not hold for the invertebrate groups studied here. This thesis emphasizes that molecular techniques relying on mutation rates have to be applied with caution. Whereas they may work satisfactorily under certain conditions for specific taxa, they may fail for others. The molecular clock as well as DNA barcoding should incorporate all the information and data available to obtain comprehensive estimations of the existing biodiversity and its evolutionary history.
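The strict-clock reasoning mentioned above reduces to a one-line calculation, sketched here in Python. This is an illustration under stated assumptions rather than the models used in the thesis: the function name `divergence_time` is hypothetical, and the distance and rate in the usage note are made-up values. If two lineages accumulate substitutions at the same constant rate r (per site, per lineage, per unit time), a pairwise distance d (substitutions per site) implies a divergence time of t = d / (2r), because both lineages have been diverging since the split.

```python
def divergence_time(pairwise_distance, rate_per_lineage):
    """Divergence time under a strict molecular clock: t = d / (2 * r).

    pairwise_distance: substitutions per site between the two taxa.
    rate_per_lineage: substitutions per site per lineage per time unit
    (assumed constant, which is exactly what a strict clock posits).
    The result is expressed in the time unit of the rate.
    """
    if rate_per_lineage <= 0:
        raise ValueError("rate must be positive")
    if pairwise_distance < 0:
        raise ValueError("distance cannot be negative")
    # Factor 2: substitutions accumulate along both lineages since the split.
    return pairwise_distance / (2.0 * rate_per_lineage)
```

For example, with an illustrative distance of 0.04 substitutions per site and an illustrative rate of 0.01 per site per lineage per million years, `divergence_time(0.04, 0.01)` gives a split about 2 million years ago. A metabolic-rate dependent clock would replace the constant rate with one that varies with body size and temperature, which is not modelled in this sketch.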
Abstract:
Cyclosporine is an immunosuppressant drug with a narrow therapeutic index and large variability in pharmacokinetics. To improve cyclosporine dose individualization in children, we used population pharmacokinetic modeling to study the effects of developmental, clinical, and genetic factors on cyclosporine pharmacokinetics in a total of 176 subjects (age range: 0.36–20.2 years) before and up to 16 years after renal transplantation. Pre-transplantation test doses of cyclosporine were given intravenously (3 mg/kg) and orally (10 mg/kg), on separate occasions, followed by blood sampling for 24 hours (n=175). After transplantation, in a total of 137 patients, cyclosporine concentration was quantified at trough, two hours post-dose, or with dose-interval curves. One hundred and four of the studied patients were genotyped for 17 putatively functionally significant sequence variations in the ABCB1, SLCO1B1, ABCC2, CYP3A4, CYP3A5, and NR1I2 genes. Pharmacokinetic modeling was performed with the nonlinear mixed effects modeling computer program, NONMEM. A 3-compartment population pharmacokinetic model with first order absorption without lag-time was used to describe the data. The most important covariate affecting systemic clearance and distribution volume was allometrically scaled body weight, i.e. body weight raised to the power of 3/4 for clearance and absolute body weight for volume of distribution. The clearance adjusted by absolute body weight declined with age, and pre-pubertal children (< 8 years) had an approximately 25% higher clearance/body weight (L/h/kg) than did older children. Adjustment of clearance for allometric body weight removed its relationship to age after the first year of life. This finding is consistent with a gradual reduction in relative liver size towards adult values, and a relatively constant CYP3A content in the liver from about 6–12 months of age to adulthood.
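The allometric scaling described above can be illustrated with a short Python sketch. It is not the NONMEM model from the study: the function names, the 70 kg reference weight, and the reference clearance and volume in the usage note are assumptions chosen only to show the arithmetic. Clearance scales with body weight to the power 3/4, and volume of distribution scales linearly with body weight.

```python
def scaled_clearance(weight_kg, cl_ref, ref_weight_kg=70.0):
    """Clearance scaled allometrically: CL = CL_ref * (WT / WT_ref) ** 0.75.

    cl_ref is the clearance (L/h) of a subject at the reference weight.
    The 70 kg reference is a common convention, assumed here.
    """
    return cl_ref * (weight_kg / ref_weight_kg) ** 0.75


def scaled_volume(weight_kg, v_ref, ref_weight_kg=70.0):
    """Volume of distribution scaled linearly: V = V_ref * (WT / WT_ref)."""
    return v_ref * (weight_kg / ref_weight_kg)


def clearance_per_kg(weight_kg, cl_ref, ref_weight_kg=70.0):
    """Clearance normalised by absolute body weight (L/h/kg)."""
    return scaled_clearance(weight_kg, cl_ref, ref_weight_kg) / weight_kg
```

With an illustrative reference clearance of 20 L/h, `clearance_per_kg(17.5, 20.0)` is larger than `clearance_per_kg(70.0, 20.0)`. This is the pattern the abstract reports: clearance per kilogram is higher in small children, and the apparent age effect disappears once clearance is expressed on the allometric scale.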
The other significant covariates affecting cyclosporine clearance and volume of distribution were hematocrit, plasma cholesterol, and serum creatinine, explaining up to 20%–30% of inter-individual differences before transplantation. After transplantation, their predictive role was smaller, as the variations in hematocrit, plasma cholesterol, and serum creatinine were also smaller. Before transplantation, no clinical or demographic covariates were found to affect oral bioavailability, and no systematic age-related changes in oral bioavailability were observed. After transplantation, older children receiving cyclosporine twice daily as the gelatine capsule microemulsion formulation had about 1.25–1.3 times higher bioavailability than did the younger children receiving the liquid microemulsion formulation thrice daily. Moreover, cyclosporine oral bioavailability increased over 1.5-fold in the first month after transplantation, returning thereafter gradually to its initial value in 1–1.5 years. The largest cyclosporine doses were administered in the first 3–6 months after transplantation, and thereafter the single doses of cyclosporine were often smaller than 3 mg/kg. Thus, the results suggest that cyclosporine displays dose-dependent, saturable pre-systemic metabolism even at low single doses, whereas complete saturation of CYP3A4 and MDR1 (P-glycoprotein) renders cyclosporine pharmacokinetics dose-linear at higher doses. No significant associations were found between genetic polymorphisms and cyclosporine pharmacokinetics before transplantation in the whole population for which genetic data were available (n=104). However, in children older than eight years (n=22), heterozygous and homozygous carriers of the ABCB1 c.2677T or c.1236T alleles had about 1.3 times or 1.6 times higher oral bioavailability, respectively, than did non-carriers.
After transplantation, none of the ABCB1 SNPs or any other SNPs were found to be associated with cyclosporine clearance or oral bioavailability in the whole population, in the patients older than eight years, or in the patients younger than eight years. In the whole population, in those patients carrying the NR1I2 g.-25385C–g.-24381A–g.-205_-200GAGAAG–g.7635G–g.8055C haplotype, however, the bioavailability of cyclosporine was about one tenth lower, per allele, than in non-carriers. This effect was significant also in a subgroup of patients older than eight years. Furthermore, in patients carrying the NR1I2 g.-25385C–g.-24381A–g.-205_-200GAGAAG–g.7635G–g.8055T haplotype, the bioavailability was almost one fifth higher, per allele, than in non-carriers. It may be possible to improve individualization of cyclosporine dosing in children by accounting for the effects of developmental factors (body weight, liver size), time after transplantation, and cyclosporine dosing frequency/formulation. Further studies are required on the predictive value of genotyping for individualization of cyclosporine dosing in children.