29 results for Genomes
in Helda - Digital Repository of University of Helsinki
Abstract:
A repetitive sequence collection is one where portions of a base sequence of length n are repeated many times with small variations, forming a collection of total length N. Examples of such collections are version control data and genome sequences of individuals, where the differences can be expressed by lists of basic edit operations. Flexible and efficient data analysis on such a typically huge collection is feasible using suffix trees. However, a suffix tree occupies O(N log N) bits, which very soon inhibits in-memory analyses. Recent advances in full-text self-indexing reduce the space of the suffix tree to O(N log σ) bits, where σ is the alphabet size. In practice, the space reduction is more than 10-fold, for example on the suffix tree of the human genome. However, this reduction factor remains constant when more sequences are added to the collection. We develop a new family of self-indexes suited for the repetitive sequence collection setting. Their expected space requirement depends only on the length n of the base sequence and the number s of variations in its repeated copies. That is, the space reduction factor is no longer constant, but depends on N / n. We believe the structures developed in this work will provide a fundamental basis for storage and retrieval of individual genomes as they become available due to rapid progress in sequencing technologies.
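The asymptotic bounds quoted in this abstract can be compared with a little arithmetic. The Python sketch below is illustrative only: it drops all constant factors and lower-order terms, so the results are ratios of bounds rather than measured index sizes. The function names and the example values of N, n and σ are assumptions made for this illustration and do not come from the work itself.

```python
import math


def suffix_tree_bits(total_length: int) -> float:
    """Order-of-magnitude bound N * log2(N) for a plain suffix tree (constants ignored)."""
    return total_length * math.log2(total_length)


def self_index_bits(total_length: int, alphabet_size: int) -> float:
    """Order-of-magnitude bound N * log2(sigma) for a compressed self-index (constants ignored)."""
    return total_length * math.log2(alphabet_size)


def reduction_factor(total_length: int, alphabet_size: int) -> float:
    """Ratio of the two bounds above, i.e. log2(N) / log2(sigma)."""
    return suffix_tree_bits(total_length) / self_index_bits(total_length, alphabet_size)


def collection_ratio(total_length: int, base_length: int) -> float:
    """N / n: how many times longer the whole collection is than its base sequence."""
    return total_length / base_length


if __name__ == "__main__":
    # Hypothetical example: a DNA alphabet (sigma = 4) and N = 2**32 symbols.
    print(reduction_factor(2**32, 4))
    # Hypothetical collection: 1000 near-identical copies of a 3,000,000-symbol base sequence.
    print(collection_ratio(1000 * 3_000_000, 3_000_000))
```

For N = 2^32 and σ = 4 the ratio of the two bounds is 16, the same order as the reduction of more than 10-fold reported in practice, and it changes only logarithmically as N grows. The ratio N / n grows linearly with the number of near-identical copies, which is why an index whose size depends on n and s rather than on N becomes more attractive as the collection grows.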
Abstract:
Epilysin (MMP-28) is the most recently identified member of the matrix metalloproteinase (MMP) family of extracellular proteases. Together these enzymes are capable of degrading almost all components of the extracellular matrix (ECM) and are thus involved in important biological processes such as development, wound healing and immune functions, but also in pathological processes such as tumor invasion, metastasis and arthritis. MMPs do not act solely by degrading the ECM. They also regulate cell behavior by releasing growth factors and biologically active peptides from the ECM, by modulating cell surface receptors and adhesion molecules and by regulating the activity of many important mediators in inflammatory pathways. The aim of this study was to define the unique role of epilysin within the MMP family, to elucidate how and when it is expressed and how its catalytic activity is regulated. To gain information on its essential functions and substrates, the specific aim was to characterize how epilysin affects the phenotype of epithelial cells, where it is biologically expressed. During the course of the study we found that the epilysin promoter contains a well-conserved GT-box that is essential for the basic expression of this gene. Transcription factors Sp1 and Sp3 bind this sequence and could hence regulate both the basic and the cell type- and differentiation stage-specific expression of epilysin. We cloned mouse epilysin cDNA and found that epilysin is well conserved between the human and mouse genomes and that epilysin is glycosylated and activated by furin. As in human tissues, epilysin is normally expressed in a number of mouse tissues. The expression pattern differs from that of most other MMPs, which are expressed only in response to injury or inflammation and in pathological processes like cancer.
These findings imply that epilysin could be involved in tissue homeostasis, perhaps fine-tuning the phenotype of epithelial cells according to signals from the ECM. In view of these results, it was unexpected to find that epilysin can induce a stable epithelial to mesenchymal transition (EMT) when overexpressed in epithelial lung carcinoma cells. Transforming growth factor β (TGF-β) was recognized as a crucial mediator of this process, which was characterized by the loss of E-cadherin mediated cell-cell adhesion, elevated expression of gelatinase B and MT1-MMP and increased cell migration and invasion into collagen I gels. We also observed that epilysin is bound to the surface of epithelial cells and that this interaction is lost upon cell transformation and is susceptible to degradation by membrane type-1-MMP (MT1-MMP). The wide expression of epilysin under physiological conditions implies that its effects on epithelial cell phenotype in vivo are not as dramatic as seen in our in vitro cell system. Nevertheless, current results indicate a possible interaction between epilysin and TGF-β also under physiological circumstances, where epilysin activity may not induce EMT but, instead, trigger less permanent changes in TGF-β signaling and cell motility. Epilysin may thus play an important role in TGF-β regulated events such as wound healing and inflammation, processes where involvement of epilysin has been indicated.
Abstract:
The studies presented in this thesis aimed at a better understanding of the molecular biology of Sweet potato chlorotic stunt virus (SPCSV, Crinivirus, Closteroviridae) and its role in the development of synergistic viral diseases. The emphasis was on the severe sweet potato virus disease (SPVD) that results from a synergistic interaction of SPCSV and Sweet potato feathery mottle virus (SPFMV, Potyvirus, Potyviridae). SPVD is the most important disease affecting sweetpotato. It is manifested as a significant increase in symptom severity and SPFMV titres. This is accompanied by a dramatic sweetpotato yield reduction. SPCSV titres remain little affected in the diseased plants. Viral synergistic interactions have been associated with the suppression of an adaptive general defence mechanism discovered in plants and known as RNA silencing. In the studies of this thesis two novel proteins (RNase3 and p22) identified in the genome of a Ugandan SPCSV isolate were shown to be involved in suppression of RNA silencing. RNase3 displayed a dsRNA-specific endonuclease activity that enhanced the RNA-silencing suppression activity of p22. Comparative analyses of criniviral genomes revealed variability in the gene content at the 3′ end of the genomic RNA1. Molecular analyses of different isolates of SPCSV indicated a marked intraspecific heterogeneity in this region where the p22 and RNase3 genes are located. Isolates of the East African strain of SPCSV from Tanzania and Peru and an isolate from Israel were missing a 767-nt fragment that included the p22 gene. However, regardless of the absence of p22, all SPCSV isolates acted synergistically with SPFMV in co-infected sweetpotato, enhanced SPFMV titres and caused SPVD. These results showed that p22 is dispensable for development of SPVD. The role of RNase3 in SPVD was then studied by generating transgenic plants expressing the RNase3 protein. These plants had increased titres of SPFMV (ca. 600-fold higher in comparison with nontransgenic plants) 2-3 weeks after graft inoculation and displayed the characteristic SPVD symptoms. RNA silencing suppression (RSS) activity of RNase3 was detected in agroinfiltrated leaves of Nicotiana benthamiana. In vitro studies showed that RNase3 was able to cleave small interfering RNAs (siRNA) to products of ~14 nt. The data thus identified RNase3 as a suppressor of RNA silencing able to cleave siRNAs. RNase3 expression alone was sufficient for breaking down resistance to SPFMV in sweetpotato and for the development of SPVD. Similar RNase III-like genes exist in animal viruses, which points to a novel and possibly more general mechanism of RSS by viruses. A reproducible method of sweetpotato transformation was used to target RNA silencing against the SPCSV polymerase region (RdRp) with an intron-spliced hairpin construct. Hence, engineered resistance to SPCSV was obtained. Ten out of 20 transgenic events challenged with SPCSV alone showed significantly reduced virus titres. This was, however, not sufficient to prevent SPVD upon coinfection with SPFMV. Immunity to SPCSV seems to be required to control SPVD, and targeting of different SPCSV regions needs to be assessed in further studies. Given the identified key role of RNase3 in SPVD, constructs designed to target this gene might prove more efficient in future studies.
Abstract:
Genetics, the science of heredity and variation in living organisms, has a central role in medicine, in breeding crops and livestock, and in studying fundamental topics of biological sciences such as evolution and cell functioning. Currently the field of genetics is developing rapidly because of recent advances in the technologies by which molecular data can be obtained from living organisms. To extract the most information from such data, the analyses need to be carried out using statistical models that are tailored to take account of the particular genetic processes. In this thesis we formulate and analyze Bayesian models for genetic marker data of contemporary individuals. The major focus is on the modeling of the unobserved recent ancestry of the sampled individuals (say, for tens of generations or so), which is carried out by using explicit probabilistic reconstructions of the pedigree structures accompanied by the gene flows at the marker loci. For such a recent history, the recombination process is the major genetic force that shapes the genomes of the individuals, and it is included in the model by assuming that the recombination fractions between the adjacent markers are known. The posterior distribution of the unobserved history of the individuals is studied conditionally on the observed marker data by using a Markov chain Monte Carlo (MCMC) algorithm. The example analyses consider estimation of the population structure, relatedness structure (both at the level of whole genomes and at each marker separately), and haplotype configurations. For situations where the pedigree structure is partially known, an algorithm to create an initial state for the MCMC algorithm is given. Furthermore, the thesis includes an extension of the model for the recent genetic history to situations where a quantitative phenotype has also been measured from the contemporary individuals.
In that case the goal is to identify positions on the genome that affect the observed phenotypic values. This task is carried out within the Bayesian framework, where the number and the relative effects of the quantitative trait loci are treated as random variables whose posterior distribution is studied conditionally on the observed genetic and phenotypic data. In addition, the thesis contains an extension of a widely-used haplotyping method, the PHASE algorithm, to settings where genetic material from several individuals has been pooled together, and the allele frequencies of each pool are determined in a single genotyping.
Abstract:
Evolutionary history of biological entities is recorded within their nucleic acid sequences and can (sometimes) be deciphered by thorough genomic analysis. In this study we sought to gain insights into the diversity and evolution of bacterial and archaeal viruses. Our primary interest was in those virus groups/families for which comprehensive genomic analysis was not previously possible due to the lack of sufficient genomic data. During the course of this work twenty-five putative proviruses integrated into various prokaryotic genomes were identified, enabling us to undertake a comparative genomics approach. This analysis allowed us to test the previously formulated evolutionary hypotheses and also provided valuable information on the molecular mechanisms behind the genome evolution of the studied virus groups.
Abstract:
Transposable elements (TEs), or transposons, are discrete DNA segments that are able to move or copy themselves from one locus to another within or between their host genome(s) without a requirement for DNA homology. They are abundant residents in virtually all the genomes studied; for instance, the genomic proportion of TEs is approximately 3% in Saccharomyces cerevisiae, 45% in humans, and apparently more than 70% in some plant genomes such as maize and barley. Transposons play an essential role in genome evolution, in the lateral transfer of antibiotic resistance genes among bacteria and in the life cycle of certain viruses such as HIV-1 and bacteriophage Mu. Despite the diversity of transposable elements they all use a fundamentally similar mechanism called transpositional DNA recombination (transposition) for the movement within and between the genomes of their host organisms. The DNA breakage and joining reactions that underlie their transposition are chemically similar in virtually all known transposition systems. The similarity of the reactions is also reflected in the structure and function of the catalyzing enzymes, transposases and integrases. The transposition reactions take place within the context of a transposition machinery, which can be particularly complex, as in the case of the VLP (virus-like particle) machinery of retroelements, which in vivo contains RNA or cDNA and a number of element-encoded structural and catalytic proteins. Yet, the minimal core machinery required for transposition comprises only a multimer of transposase or integrase proteins and their binding sites at the element DNA ends. Although the chemistry of DNA transposition is fairly well characterized, the components and function of the transposition machinery have been investigated in detail for only a small group of elements. This work focuses on the identification, characterization, and functional studies of the molecular components of the transposition machineries of BARE-1, Hin-Mu and Mu.
For BARE-1 and Hin-Mu, transpositional activity has not been shown previously, whereas bacteriophage Mu is a general model of transposition. For BARE-1, which is a retroelement of barley (Hordeum vulgare), the protein and DNA components of the functional VLP machinery were identified from cell extracts. In the case of Hin-Mu, which is a Mu-like prophage in the Haemophilus influenzae Rd genome, the components of the core machinery (transposase and its binding sites) were characterized and their functionality was studied by using an in vitro methodology developed for Mu. The function of the Mu core machinery was studied for its ability to use various DNA substrates: Hin-Mu end-specific DNA substrates and Mu end-specific hairpin substrates. The hairpin processing reaction by MuA was characterized in detail. New information was gained on all three machineries. The components or their activity required for functional BARE-1 VLP machinery and retrotransposon life cycle were present in vivo and VLP-like structures could be detected. The Hin-Mu core machinery components were identified and shown to be functional. The components of the Mu and Hin-Mu core machineries were partially interchangeable, reflecting both evolutionary conservation and flexibility within the core machineries. The Mu core machinery displayed surprising flexibility in substrate usage, as it was able to utilize Hin-Mu end-specific DNA substrates and to process Mu end DNA hairpin substrates. This flexibility may be evolutionarily and mechanistically important.
Abstract:
Ectomycorrhizal formation between the host tree Pinus sylvestris and the fungal symbiont Suillus bovinus was investigated at the molecular level by isolating genes regulating the organization of the actin cytoskeleton in the fungal partner S. bovinus. An Agrobacterium tumefaciens mediated transformation (ATMT) system was developed for the ectomycorrhizal fungi in order to assign specific functions to the cloned molecules. The developed ATMT system was also used to transform a plant pathogenic fungus, Helminthosporium turcicum, to hygromycin B resistance. Small GTPases Cdc42 and Rac1, the regulators of the actin cytoskeleton in eukaryotes, were isolated from S. bovinus. Sbcdc42 and Sbrac1 are both expressed in the vegetative and in the symbiotic hyphae of S. bovinus. Using IIF microscopy, Cdc42 and actin were co-localized at the tips of vegetative hyphae and were visualized in association with the plasma membrane in swollen cells typical of the symbiotic hyphae. These results suggest that the small GTPase Cdc42 may play a significant role in the polarized growth of S. bovinus hyphae and regulate fungal morphogenesis during ectomycorrhiza formation through reorganization of the actin cytoskeleton. The functional equivalence of Cdc42 was tested in yeast complementation experiments using a Saccharomyces cerevisiae temperature-sensitive mutant, cdc42-1ts. The genomic clone of CDC42 was isolated from S. bovinus genomic DNA via specific primers for Cdc42. The analogous S. cerevisiae cdc42 mutations, dominant active G12V and dominant negative D118A, were generated in the Sbcdc42 gene by in vitro mutagenesis. The ectomycorrhizal fungi S. bovinus, P. involutus and H. cylindrosporum were transformed using ATMT and phleomycin as a selectable marker. PCR screening suggested that the T-DNA was inserted in all three fungal genomes but the fate of integration could not be proved by Southern blot analysis.
An alternative Agrobacterium strain, AGL-1, and selection marker, hygromycin, were used to transform our model fungus S. bovinus. PCR and Southern analysis suggested an improved efficiency of transformation. All the transformed fungal colonies selected for hygromycin gave positives in PCR and the Southerns showed multiple or single copy T-DNA integrations into the S. bovinus genome. Using the same Agrobacterium strain and the selectable marker, a maize pathogen, H. turcicum, was also subjected to ATMT. The H. turcicum transformation data suggested single copy T-DNA integrations into the genome of the screened transformants, which further confirms the wider applicability of ATMT. The plasmids carrying the wild-type (pHGCDC42) and the mutated Sbcdc42 alleles (pHGGV; pHGDA) under the Agaricus bisporus gpd promoter were constructed in an A. tumefaciens vector. ATMT was used to transform S. bovinus with the plasmids carrying the wild-type and mutated Sbcdc42 alleles. The isolation of the Sbcdc42 and Sbrac1 genes and some other functionally related genes from the ectomycorrhizal fungus S. bovinus will form the basis of future work to resolve the signalling pathway leading to ectomycorrhizal symbiosis. The development of the ATMT system will be a valuable tool in analysing the exact function of signalling pathway components in ectomycorrhizal symbiosis or in plant pathogenic interactions. The transformation frequency and broad applicability along with the simplicity of T-DNA integration make Agrobacterium a valuable, new and powerful tool for targeted and insertional mutagenesis in these plant-associated fungi. The developed ATMT systems should therefore make it possible to generate a large number of transformants with tagged genes which could then be screened for their specific roles in symbiosis and pathogenicity, respectively.
Abstract:
During the past ten years, large-scale transcript analysis using microarrays has become a powerful tool to identify and predict functions for new genes. It allows simultaneous monitoring of the expression of thousands of genes and has become a routinely used tool in laboratories worldwide. Microarray analysis will, together with other functional genomics tools, take us closer to understanding the functions of all genes in genomes of living organisms. Flower development is a genetically regulated process which has mostly been studied in the traditional model species Arabidopsis thaliana, Antirrhinum majus and Petunia hybrida. The molecular mechanisms behind flower development in them are partly applicable in other plant systems. However, not all biological phenomena can be approached with just a few model systems. In order to understand and apply the knowledge to ecologically and economically important plants, other species also need to be studied. Sequencing of 17 000 ESTs from nine different cDNA libraries of the ornamental plant Gerbera hybrida made it possible to construct a cDNA microarray with 9000 probes. The probes of the microarray represent all different ESTs in the database. Of the gerbera ESTs, 20% were unique to gerbera while 373 were specific to the Asteraceae family of flowering plants. Gerbera has composite inflorescences with three different types of flowers that vary from each other morphologically. The marginal ray flowers are large, often pigmented and female, while the central disc flowers are smaller and more radially symmetrical perfect flowers. Intermediate trans flowers are similar to ray flowers but smaller in size. This feature, together with the molecular tools applied to gerbera, makes gerbera a unique system in comparison to the common model plants with only a single kind of flower in their inflorescence.
In the first part of this thesis, conditions for gerbera microarray analysis were optimised including experimental design, sample preparation and hybridization, as well as data analysis and verification. Moreover, in the first study, the flower and flower organ-specific genes were identified. After the reliability and reproducibility of the method were confirmed, the microarrays were utilized to investigate transcriptional differences between ray and disc flowers. This study revealed novel information about the morphological development as well as the transcriptional regulation of early stages of development in various flower types of gerbera. The most interesting finding was differential expression of MADS-box genes, suggesting the existence of flower type-specific regulatory complexes in the specification of different types of flowers. The gerbera microarray was further used to profile changes in expression during petal development. Gerbera ray flower petals are large, which makes them an ideal model to study organogenesis. Six different stages were compared and specifically analysed. Expression profiles of genes related to cell structure and growth implied that during stage two, cells divide, a process which is marked by expression of histones, cyclins and tubulins. Stage 4 was found to be a transition stage between cell division and expansion and by stage 6 cells had stopped division and instead underwent expansion. Interestingly, at the last analysed stage, stage 9, when cells did not grow any more, the highest number of upregulated genes was detected. The gerbera microarray is a fully functioning tool for large-scale studies of flower development, and correlation with real-time RT-PCR results shows that it is also highly sensitive and reliable. Gene expression data presented here will be a source for gene expression mining or marker gene discovery in future studies that will be performed in the Gerbera Laboratory.
The publicly available data will also serve the plant research community world-wide.
Abstract:
The ultimate goal of this study has been to construct metabolically engineered microbial strains capable of fermenting glucose into the pentitols D-arabitol and, especially, xylitol. The path that was chosen to achieve this goal required discovery, isolation and sequencing of at least two pentitol phosphate dehydrogenases of different specificity, followed by cloning and expression of their genes and characterization of recombinant arabitol and xylitol phosphate dehydrogenases. An enzyme of a previously unknown specificity, D-arabitol phosphate dehydrogenase (APDH), was discovered in Enterococcus avium. The enzyme was purified to homogeneity from E. avium strain ATCC 33665. SDS/PAGE revealed that the enzyme has a molecular mass of 41 ± 2 kDa, whereas a molecular mass of 160 ± 5 kDa was observed under non-denaturing conditions, implying that the APDH may exist as a tetramer with identical subunits. Purified APDH was found to have narrow substrate specificity, converting only D-arabitol 1-phosphate and D-arabitol 5-phosphate into D-xylulose 5-phosphate and D-ribulose 5-phosphate, respectively, in the oxidative reaction. Both NAD+ and NADP+ were accepted as co-factors. Based on the partial protein sequences, the gene encoding APDH was cloned. Homology comparisons place APDH within the medium chain dehydrogenase family. Unlike most members of this family, APDH requires Mn2+ but no Zn2+ for enzymatic activity. The DNA sequence surrounding the gene suggests that it belongs to an operon that also contains several components of the phosphotransferase system (PTS). The apparent role of the enzyme is to participate in arabitol catabolism via the arabitol phosphate route similar to the ribitol and xylitol catabolic routes described previously. Xylitol phosphate dehydrogenase (XPDH) was isolated from Lactobacillus rhamnosus strain ATCC 15820. The enzyme was partially sequenced. Amino acid sequences were used to isolate the gene encoding the enzyme.
The homology comparisons of the deduced amino acid sequence of L. rhamnosus XPDH revealed several similar enzymes in genomes of various species of Gram-positive bacteria. Two enzymes of Clostridium difficile and an enzyme of Bacillus halodurans were cloned and their substrate specificities together with the substrate specificity of L. rhamnosus XPDH were compared. It was found that one of the XPDH enzymes of C. difficile and the XPDH of L. rhamnosus had the highest selectivity towards D-xylulose 5-phosphate. A known transketolase-deficient and D-ribose-producing mutant of Bacillus subtilis (ATCC 31094) was further modified by disrupting its rpi (D-ribose phosphate isomerase) gene to create a D-ribulose- and D-xylulose-producing strain. Expression of APDH of E. avium and XPDH of L. rhamnosus and C. difficile in the D-ribulose- and D-xylulose-producing strain of B. subtilis resulted in strains capable of converting D-glucose into D-arabitol and xylitol, respectively. The D-arabitol yield on D-glucose was 38 % (w/w). Xylitol production was accompanied by co-production of ribitol, limiting xylitol yield to 23 %.
Abstract:
Transposons are mobile elements of genetic material that are able to move in the genomes of their host organisms using a special form of recombination called transposition. Bacteriophage Mu was the first transposon for which a cell-free in vitro transposition reaction was developed. Subsequently, the reaction has been refined and the minimal Mu in vitro reaction is useful in the generation of comprehensive libraries of mutant DNA molecules that can be used in a variety of applications. To date, the functional genetics applications of Mu in vitro technology have been limited to either plasmids or genomic regions and entire genomes of viruses cloned on specific vectors. This study expands the use of Mu in vitro transposition in functional genetics and genomics by describing novel methods applicable to the targeted transgenesis of mouse and the whole-genome analysis of bacteriophages. The methods described here are rapid, efficient, and easily applicable to a wide variety of organisms, demonstrating the potential of the Mu transposition technology in the functional analysis of genes and genomes. First, an easy-to-use, rapid strategy to generate constructs for the targeted mutagenesis of mouse genes was developed. To test the strategy, a gene encoding a neuronal K+/Cl- cotransporter was mutagenised. After a highly efficient transpositional mutagenesis, the mutagenised gene fragments were cloned into a vector backbone and transferred into bacterial cells. These constructs were screened with PCR using an effective 3D matrix system. In addition to traditional knock-out constructs, the method developed yields hypomorphic alleles that lead to reduced expression of the target gene in transgenic mice and have since been used in a follow-up study. Moreover, a scheme is devised to rapidly produce conditional alleles from the constructs produced.
Next, an efficient strategy for the whole-genome analysis of bacteriophages was developed based on the transpositional mutagenesis of uncloned, infective virus genomes and their subsequent transfer into susceptible host cells. Mutant viruses able to produce viable progeny were collected and their transposon integration sites determined to map genomic regions nonessential to the viral life cycle. This method, applied here to three very different bacteriophages, PRD1, ΦYeO3-12, and PM2, does not require the target genome to be cloned and is directly applicable to all DNA and RNA viruses that have infective genomes. The method developed yielded valuable novel information on the three bacteriophages studied, and whole-genome data can be complemented with concomitant studies on individual genes. Moreover, end-modified transposons constructed for this study can be used to manipulate genomes devoid of suitable restriction sites.
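The mapping logic described in this abstract (viable mutants, their integration sites, and from these the regions that tolerate insertions) can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the procedure used in the study: the function name, the gene names and all coordinates below are hypothetical, and a real analysis would also have to consider where within a gene an insertion falls and whether it actually disrupts function.

```python
from typing import List, Tuple

# A gene annotation as (name, start, end), with 1-based inclusive coordinates.
Gene = Tuple[str, int, int]


def genes_tolerating_insertions(genes: List[Gene], insertion_sites: List[int]) -> List[str]:
    """Return the genes that contain at least one insertion site from a viable mutant.

    Such genes are candidates for being nonessential; genes without any
    insertion are not thereby shown to be essential, since they may simply
    not have been hit.
    """
    tolerated = []
    for name, start, end in genes:
        if any(start <= site <= end for site in insertion_sites):
            tolerated.append(name)
    return tolerated


if __name__ == "__main__":
    # Hypothetical annotation and hypothetical insertion sites from viable mutants.
    example_genes = [("geneA", 1, 100), ("geneB", 101, 250), ("geneC", 251, 400)]
    example_sites = [42, 300, 305]
    print(genes_tolerating_insertions(example_genes, example_sites))
```

With the hypothetical data above, geneA and geneC each contain an insertion from a viable mutant and are reported as candidates, while geneB is not.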
Abstract:
Viral genomes are encapsidated within protective protein shells. This encapsidation can be achieved either by a co-condensation reaction of the nucleic acid and coat proteins, or by first forming empty viral particles which are subsequently packaged with nucleic acid, the latter mechanism being typical for many dsDNA bacteriophages. Bacteriophage PRD1 is an icosahedral, non-tailed dsDNA virus that has an internal lipid membrane, the hallmark of the Tectiviridae family. Although PRD1 has been known to assemble empty particles into which the genome is subsequently packaged, the mechanism for this has been unknown, and there has been no evidence for a separate packaging vertex, similar to the portal structures used for packaging in the tailed bacteriophages and herpesviruses. In this study, a unique DNA packaging vertex was identified for PRD1, containing the packaging ATPase P9, packaging factor P6 and two small membrane proteins, P20 and P22, extending the packaging vertex to the internal membrane. Lack of small membrane protein P20 was shown to totally abolish packaging, making it an essential part of the PRD1 packaging mechanism. The minor capsid protein P6 was shown to be an important packaging factor, its absence leading to greatly reduced packaging efficiency. An in vitro DNA packaging system consisting of recombinant packaging ATPase P9, empty procapsids and mutant PRD1 DNA with a LacZ insert was developed for the analysis of PRD1 packaging, the first such system ever for a virus containing an internal membrane. A new tectiviral sequence, a linear plasmid called pBClin15, was identified in Bacillus cereus, providing material for sequence analysis of the tectiviruses. Analysis of PRD1 P9 and other putative tectiviral ATPase sequences revealed several conserved sequence motifs, among them a new tectiviral packaging ATPase motif. Mutagenesis studies on PRD1 P9 were used to confirm the significance of the motifs.
P9-type putative ATPase sequences carrying a similar sequence motif were identified in several other membrane containing dsDNA viruses of bacterial, archaeal and eukaryotic hosts, suggesting that these viruses may have similar packaging mechanisms. Interestingly, almost the same set of viruses that were found to have similar putative packaging ATPases had earlier been found to share similar coat protein folds and capsid structures, and a common origin for these viruses had been suggested. The finding in this study of similar packaging proteins further supports the idea that these viruses are descendants of a common ancestor.
Abstract:
Cell proliferation, transcription and metabolism are regulated by complex partly overlapping signaling networks involving proteins in various subcellular compartments. The objective of this study was to increase our knowledge of such regulatory networks and their interrelationships through analysis of MrpL55, Vig, and Mat1, representing three gene products implicated in regulation of cell cycle, transcription, and metabolism. Genome-wide and biochemical in vitro studies have previously revealed MrpL55 as a component of the large subunit of the mitochondrial ribosome and demonstrated a possible role for the protein in cell cycle regulation. Vig has been implicated in heterochromatin formation and identified as a constituent of the RNAi-induced silencing complex (RISC) involved in cell cycle regulation and RNAi-directed transcriptional gene silencing (TGS) coupled to RNA polymerase II (RNAPII) transcription. Mat1 has been characterized as a regulatory subunit of the cyclin-dependent kinase 7 (Cdk7) complex phosphorylating and regulating critical targets involved in cell cycle progression, energy metabolism and transcription by RNAPII. The first part of the study explored whether mRpL55 is required for cell viability or involved in the regulation of energy metabolism and cell proliferation. The results revealed a dynamic requirement of the essential Drosophila mRpL55 gene during development and suggested a function of MrpL55 in cell cycle control either at the G1/S or G2/M transition prior to cell differentiation. This first in vivo characterization of a metazoan-specific constituent of the large subunit of the mitochondrial ribosome also provided compelling evidence of the interconnection of nuclear and mitochondrial genomes as well as complex functions of the evolutionarily young metazoan-specific mitochondrial ribosomal proteins.
In studies on the regulation of the Drosophila RISC complex, it was noted that Vig, a protein involved in heterochromatin formation, is dynamically phosphorylated in a dsRNA-independent manner, unlike the other analyzed RISC-associated proteins Argonaute2 and R2D2. Vig displays similarity with a known in vivo substrate for protein kinase C (PKC), the human chromatin remodeling factor Ki-1/57, and is efficiently phosphorylated by PKC on multiple sites in vitro. These results suggest that the function of the RISC complex protein Vig in RNAi-directed TGS and chromatin modification may be regulated through dsRNA-independent phosphorylation by PKC. In the third part of this study the role of Mat1 in regulating RNAPII transcription was investigated using cultured murine immortal fibroblasts with a conditional allele of Mat1. The results demonstrated that phosphorylation of the heptapeptide YSPTSPS repeat in the carboxy-terminal domain (CTD) of the large subunit of RNAPII was over 10-fold reduced in Mat1-/- cells on Serine-5 and subsequently on Serine-2. Occupancy of the hypophosphorylated RNAPII in gene bodies was detectably decreased, whereas capping, splicing, histone methylation and mRNA levels were generally not affected. However, a subset of transcripts was repressed in the absence of Mat1, and this repression was associated with decreased occupancy of RNAPII at promoters as well as defective capping. The results identify the Cdk7-CycH-Mat1 kinase submodule of TFIIH as a stimulatory, non-essential regulator of transcriptional elongation and a gene-specific essential factor for stable binding of RNAPII at the promoter region and for capping. The results of these studies suggest important roles for both MrpL55 and Mat1 in cell cycle progression and their possible interplay at the G2/M stage in undifferentiated cells.
The identified function of Mat1 and of the TFIIH kinase complex in gene-specific transcriptional repression calls for further studies of a possible link to Vig and RISC-mediated transcriptional gene silencing.
Resumo:
This study addressed the large-scale molecular zoogeography of two brackish water bivalve molluscs, Macoma balthica and Cerastoderma glaucum, and the genetic signatures of their postglacial colonization of Northern Europe. The traditional view holds that M. balthica in the Baltic, White and Barents seas (i.e. marginal seas) represent direct postglacial descendants of the adjacent Northeast Atlantic populations, but this has recently been challenged by observations of close genetic affinities between these marginal populations and those of the Northeast Pacific. The primary aim of the thesis was to verify, quantify and characterize the Pacific genetic contribution across North European populations of M. balthica and to resolve the phylogeographic histories of the two bivalve taxa in range-wide studies using information from mitochondrial DNA (mtDNA) and nuclear allozyme polymorphisms. The presence of recent Pacific genetic influence in M. balthica of the Baltic, White and Barents seas, along with an Atlantic element, was confirmed by mtDNA sequence data. On a broader temporal and geographical scale, altogether four independent trans-Arctic invasions of Macoma from the Pacific since the Miocene seem to have been involved in generating the current North Atlantic lineage diversity. The latest trans-Arctic invasion, which affected the current Baltic, White and Barents Sea populations, probably took place in the early postglacial period. The nuclear genetic compositions of these marginal sea populations are intermediate between those of the pure Pacific and Atlantic subspecies. In the marginal sea populations of mixed ancestry (Barents, White and Northern Baltic seas), the Pacific and Atlantic components are now randomly associated in the genomes of individual clams, which indicates both pervasive historical interbreeding between the previously long-isolated lineages (subspecies) and current isolation of these populations from the adjacent pure Atlantic populations.
These mixed populations can be characterized as self-supporting hybrid swarms, and they arguably represent the most extensive marine animal hybrid swarms so far documented. Each of the three swarms still has a distinct genetic composition, and the relative Pacific contributions vary from 30 to 90 % in local populations. This diversity highlights the potential of introgressive hybridization to rapidly give rise to new evolutionarily and ecologically significant units in the marine realm. South of the Danish straits and in the Southern Baltic Sea, a broad genetic transition zone links the pure North Sea subspecies M. balthica rubra to the inner Baltic hybrid swarm, whose genome has a Pacific contribution of about 60 %. This transition zone has no regular smooth clinal structure; instead, its populations show strong genotypic disequilibria typical of a hybrid zone maintained by the interplay of selection and gene flow by dispersing pelagic larvae. The structure of the genetic transition is partly in line with features of Baltic water circulation and salinity stratification, with greater penetration of Atlantic genes on the Baltic south coast and in deeper water populations. In all, the scenarios of historical isolation and secondary contact that arise from the phylogeographic studies of both Macoma and Cerastoderma shed light on the more general but enigmatic patterns seen in marine phylogeography, where deep genetic breaks are often seen in species with high dispersal potential.
Resumo:
Extraintestinal pathogenic Escherichia coli (ExPEC) represent a diverse group of E. coli strains that infect extraintestinal sites, such as the urinary tract, the bloodstream, the meninges, the peritoneal cavity, and the lungs. Urinary tract infections (UTIs) caused by uropathogenic E. coli (UPEC), the major subgroup of ExPEC, are among the most prevalent microbial diseases worldwide and a substantial burden for public health care systems. UTIs are responsible for serious morbidity and mortality in the elderly, in young children, and in immune-compromised and hospitalized patients. ExPEC strains differ, from both genetic and clinical perspectives, from commensal E. coli strains belonging to the normal intestinal flora and from intestinal pathogenic E. coli strains causing diarrhea. ExPEC strains are characterized by a broad range of alternate virulence factors, such as adhesins, toxins, and iron accumulation systems. Unlike diarrheagenic E. coli, whose distinctive virulence determinants evoke characteristic diarrheagenic symptoms and signs, ExPEC strains are exceedingly heterogeneous and are not known to possess any specific virulence factor or set of factors that is obligatory for the infection of a certain extraintestinal site (e.g. the urinary tract). The ExPEC genomes are highly diverse mosaic structures in permanent flux. These strains have obtained a significant amount of DNA (predictably up to 25% of the genomes) from diverse related or non-related donor species by lateral transfer of mobile genetic elements, including pathogenicity islands (PAIs), plasmids, phages, transposons, and insertion elements. The ability of ExPEC strains to cause disease is mainly derived from this horizontally acquired gene pool; the exogenous DNA facilitates rapid adaptation of the pathogen to changing conditions and hence broadens the spectrum of sites that can be infected.
However, neither the amount of unique DNA in different ExPEC strains (or UPEC strains) nor the mechanisms behind the observed genomic mobility are known. Due to this extreme heterogeneity of the UPEC and ExPEC populations in general, the routine surveillance of ExPEC is exceedingly difficult. In this project, we presented a novel virulence gene algorithm (VGA) for the estimation of the extraintestinal virulence potential (VP, pathogenicity risk) of clinically relevant ExPECs and fecal E. coli isolates. The VGA was based on a DNA microarray specific for the ExPEC phenotype (ExPEC pathoarray). This array contained 77 DNA probes homologous with known (e.g. adhesion factors, iron accumulation systems, and toxins) and putative (e.g. genes predictably involved in adhesion, iron uptake, or in metabolic functions) ExPEC virulence determinants. In total, 25 of the DNA probes homologous with known virulence factors and 36 of the DNA probes representing putative extraintestinal virulence determinants were found at significantly higher frequency in virulent ExPEC isolates than in commensal E. coli strains. We showed that the ExPEC pathoarray and the VGA could be readily used to differentiate highly virulent ExPECs both from less virulent ExPEC clones and from commensal E. coli strains. When the VGA was applied to a group of unknown ExPECs (n=53) and fecal E. coli isolates (n=37), 83% of strains were correctly identified as extraintestinal virulent or commensal E. coli. Conversely, 15% of clinical ExPECs and 19% of fecal E. coli strains failed to fall into their respective pathogenic and non-pathogenic groups. Clinical data and virulence gene profiles of these strains supported the estimated VPs; UPEC strains with atypically low risk-ratios were largely isolated from patients with a particular medical history, including diabetes mellitus or catheterization, or from elderly patients.
In addition, fecal E. coli strains with VPs characteristic of ExPEC were shown to represent the diagnostically important fraction of resident strains of the gut flora with a high potential of causing extraintestinal infections. Interestingly, a large fraction of DNA probes associated with the ExPEC phenotype corresponded to novel DNA sequences without any known function in UTIs and thus represented new genetic markers of extraintestinal virulence. These DNA probes included unknown DNA sequences originating from the genomic subtractions of four clinical ExPEC isolates as well as from five novel cosmid sequences identified in the UPEC strains HE300 and JS299. The characterized cosmid sequences (pJS332, pJS448, pJS666, pJS700, and pJS706) revealed complex modular DNA structures with known and unknown DNA fragments arranged in a puzzle-like manner and integrated into the common E. coli genomic backbone. Furthermore, cosmid pJS332 of the UPEC strain HE300, which carried a chromosomal virulence gene cluster (iroBCDEN) encoding the salmochelin siderophore system, was shown to be part of a transmissible plasmid of Salmonella enterica. Taken together, the results of this project pointed to two assumptions: (i) homologous recombination, even within coding genes, contributes to the observed mosaicism of ExPEC genomes, and (ii) besides en bloc transfer of large DNA regions (e.g. chromosomal PAIs), rearrangements of small DNA modules also provide a means of genomic plasticity. The data presented in this project supplemented previous whole genome sequencing projects of E. coli and indicated that each E. coli genome displays a unique assemblage of individual mosaic structures, which enable these strains to successfully colonize and infect different anatomical sites.
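The classification figures reported in this abstract (n=53 ExPECs, n=37 fecal isolates, 15% and 19% misassigned, 83% correct overall) can be cross-checked with a little arithmetic. The sketch below is only an illustration: the abstract does not give raw strain counts, so the per-group counts are an assumption obtained by rounding the reported percentages, and the variable names are hypothetical.

```python
# Consistency check of the percentages reported in the abstract.
# Assumption: raw counts are not given, so they are estimated by
# rounding each reported percentage to the nearest whole strain.

n_expec = 53   # unknown ExPEC isolates tested with the VGA
n_fecal = 37   # fecal E. coli isolates tested with the VGA

# Reported fractions that did not fall into the expected group
miss_rate_expec = 0.15
miss_rate_fecal = 0.19

# Estimated (not reported) numbers of misassigned strains
miss_expec = round(n_expec * miss_rate_expec)
miss_fecal = round(n_fecal * miss_rate_fecal)

total = n_expec + n_fecal
correct = total - miss_expec - miss_fecal
overall_accuracy = correct / total

print(f"misassigned ExPEC: {miss_expec}, misassigned fecal: {miss_fecal}")
print(f"overall correctly identified: {overall_accuracy:.1%}")
```

Under this rounding assumption the two group-level error rates combine to an overall rate that agrees with the reported 83%.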
Resumo:
Evolutionary genetics incorporates traditional population genetics and studies of the origins of genetic variation by mutation and recombination, and the molecular evolution of genomes. Among the primary forces that have the potential to affect genetic variation within and among populations, including those that may lead to adaptation and speciation, are genetic drift, gene flow, mutations and natural selection. The main challenges in understanding the genetic basis of evolutionary changes are to distinguish the adaptive selection forces that cause existing DNA sequence variants and to identify the nucleotide differences responsible for the observed phenotypic variation. To understand the effects of various forces, interpretation of gene sequence variation has been the principal basis of many evolutionary genetic studies. The main aim of this thesis was to assess different forms of teleost gene sequence polymorphisms in evolutionary genetic studies of Atlantic salmon (Salmo salar) and other species. Firstly, the extent to which Darwinian adaptive evolution affected coding regions of the growth hormone (GH) gene during teleost evolution was investigated based on the sequence data existing in public databases. Secondly, a target gene approach was used to identify within-population variation in the growth hormone 1 (GH1) gene in salmon. Then, a new strategy for single nucleotide polymorphism (SNP) discovery in salmonid fishes was introduced, and, finally, the usefulness of a limited number of SNP markers as molecular tools in several applications of population genetics in Atlantic salmon was assessed. This thesis showed that the gene sequences in databases can be utilized to perform comparative studies of molecular evolution, and some putative evidence of Darwinian selection during teleost GH evolution was presented.
In addition, existing sequence data were exploited to investigate GH1 gene variation within Atlantic salmon populations throughout the species' range. Purifying selection is suggested to be the predominant evolutionary force controlling the genetic variation of this gene in salmon, and some support for gene flow between continents was also observed. The novel approach to SNP discovery in species with duplicated genome fragments introduced here proved to be an effective method, and this may have several applications in evolutionary genetics with different species - e.g. when developing gene-targeted markers to investigate quantitative genetic variation. The thesis also demonstrated that only a few SNPs produced signals highly similar to those of microsatellite markers in some of the population genetic analyses. This may have useful applications when estimating genetic diversity in genes having a potential role in ecological and conservation issues, or when using difficult biological samples in genetic studies, since SNPs can be applied to relatively highly degraded DNA.