12 resultados para molecular evolution
em Helda - Digital Repository of University of Helsinki
Resumo:
Evolutionary genetics incorporates traditional population genetics and studies of the origins of genetic variation by mutation and recombination, and the molecular evolution of genomes. Among the primary forces that have potential to affect the genetic variation within and among populations, including those that may lead to adaptation and speciation, are genetic drift, gene flow, mutations and natural selection. The main challenges in knowing the genetic basis of evolutionary changes is to distinguish the adaptive selection forces that cause existent DNA sequence variants and also to identify the nucleotide differences responsible for the observed phenotypic variation. To understand the effects of various forces, interpretation of gene sequence variation has been the principal basis of many evolutionary genetic studies. The main aim of this thesis was to assess different forms of teleost gene sequence polymorphisms in evolutionary genetic studies of Atlantic salmon (Salmo salar) and other species. Firstly, the level of Darwinian adaptive evolution affected coding regions of the growth hormone (GH) gene during the teleost evolution was investigated based on the sequence data existing in public databases. Secondly, a target gene approach was used to identify within population variation in the growth hormone 1 (GH1) gene in salmon. Then, a new strategy for single nucleotide polymorphisms (SNPs) discovery in salmonid fishes was introduced, and, finally, the usefulness of a limited number of SNP markers as molecular tools in several applications of population genetics in Atlantic salmon was assessed. This thesis showed that the gene sequences in databases can be utilized to perform comparative studies of molecular evolution, and some putative evidence of the existence of Darwinian selection during the teleost GH evolution was presented. In addition, existent sequence data was exploited to investigate GH1 gene variation within Atlantic salmon populations throughout its range. Purifying selection is suggested to be the predominant evolutionary force controlling the genetic variation of this gene in salmon, and some support for gene flow between continents was also observed. The novel approach to SNP discovery in species with duplicated genome fragments introduced here proved to be an effective method, and this may have several applications in evolutionary genetics with different species - e.g. when developing gene-targeted markers to investigate quantitative genetic variation. The thesis also demonstrated that only a few SNPs performed highly similar signals in some of the population genetic analyses when compared with the microsatellite markers. This may have useful applications when estimating genetic diversity in genes having a potential role in ecological and conservation issues, or when using hard biological samples in genetic studies as SNPs can be applied with relatively highly degraded DNA.
Resumo:
Puumala virus (PUUV) is the causative agent of nephropathia epidemica (NE), a mild form of hemorrhagic fever with renal syndrome. Finland has the highest documented incidence of NE with around 1000 cases diagnosed annually. PUUV is also found in other Scandinavian countries, Central Europe and the European part of Russia. PUUV belongs to the genus Hantavirus in the family Bunyaviridae. Hantaviruses are rodent-borne viruses each carried by a specific host that is persistently and asymptomatically infected by the virus. PUUV is carried by the bank voles (Myodes glareolus, previously known as Clethrionomys glareolus). Hantaviruses have co-evolved with their carrier rodents for millions of years and these host animals are the evolutionary scene of hantaviruses. In this study, PUUV sequences were recovered from bank voles captured in Denmark and Russian Karelia to study the evolution of PUUV in Scandinavia. Phylogenetic analysis of these strains showed a geographical clustering of genetic variants following the presumable migration pattern of bank voles during the recolonization of Scandinavia after the last ice age approximately 10 000 years ago. The currently known PUUV genome sequences were subjected to in-depth phylogenetic analyses and the results showed that genetic drift seems to be the major mechanism of PUUV evolution. In general, PUUV seems to evolve quite slowly following a molecular clock. We also found evidence for recombination in the evolution of some genetic lineages of PUUV. Viral microevolution was studied in controlled virus transmission in colonized bank voles and changes in quasispecies dynamics were recorded as the virus was transmitted from one animal to another. We witnessed PUUV evolution in vivo, as one synonymous mutation became repeatedly fixed in the viral genome during the experiment. The detailed knowledge on the PUUV diversity was used to establish new sensitive and specific detection methods for this virus. Direct viral invasion of the hypophysis was demonstrated for the first time in a lethal case of NE. PUUV detection was done by immunohistochemistry, in situ hybridization and RT-nested-PCR of the autopsy tissue samples.
Resumo:
Mass occurrences (blooms) of cyanobacteria are common in aquatic environments worldwide. These blooms are often toxic, due to the presence of hepatotoxins or neurotoxins. The most common cyanobacterial toxins are hepatotoxins: microcystins and nodularins. In freshwaters, the main producers of microcystins are Microcystis, Anabaena, and Planktothrix. Nodularins are produced by strains of Nodularia spumigena in brackish waters. Toxic and nontoxic strains of cyanobacteria co-occur and cannot be differentiated by conventional microscopy. Molecular biological methods based on microcystin and nodularin synthetase genes enable detection of potentially hepatotoxic cyanobacteria. In the present study, molecular detection methods for hepatotoxin-producing cyanobacteria were developed, based on microcystin synthetase gene E (mcyE) and the orthologous nodularin synthetase gene F (ndaF) sequences. General primers were designed to amplify the mcyE/ndaF gene region from microcystin-producing Anabaena, Microcystis, Planktothrix, and Nostoc, and nodularin-producing Nodularia strains. The sequences were used for phylogenetic analyses to study how cyanobacterial mcy genes have evolved. The results showed that mcy genes and microcystin are very old and were already present in the ancestor of many modern cyanobacterial genera. The results also suggested that the sporadic distribution of biosynthetic genes in modern cyanobacteria is caused by repeated gene losses in the more derived lineages of cyanobacteria and not by horizontal gene transfer. Phylogenetic analysis also proposed that nda genes evolved from mcy genes. The frequency and composition of the microcystin producers in 70 lakes in Finland were studied by conventional polymerase chain reaction (PCR). Potential microcystin producers were detected in 84% of the lakes, using general mcyE primers, and in 91% of the lakes with the three genus-specific mcyE primers. Potential microcystin-producing Microcystis were detected in 70%, Planktothrix in 63%, and Anabaena in 37% of the lakes. The presence and co-occurrence of potential microcystin producers were more frequent in eutrophic lakes, where the total phosphorus concentration was high. The PCR results could also be associated with various environmental factors by correlation and regression analyses. In these analyses, the total nitrogen concentration and pH were both associated with the presence of multiple microcystin-producing genera and partly explained the probability of occurrence of mcyE genes. In general, the results showed that higher nutrient concentrations increased the occurrence of potential microcystin producers and the risk for toxic bloom formation. Genus-specific probe pairs for microcystin-producing Anabaena, Microcystis, Planktothrix, and Nostoc, and nodularin-producing Nodularia were designed to be used in a DNA-chip assay. The DNA-chip can be used to simultaneously detect all these potential microcystin/nodularin producers in environmental water samples. The probe pairs detected the mcyE/ndaF genes specifically and sensitively when tested with cyanobacterial strains. In addition, potential microcystin/nodularin producers were identified in lake and Baltic Sea samples by the DNA-chip almost as sensitively as by quantitative real-time PCR (qPCR), which was used to validate the DNA-chip results. Further improvement of the DNA-chip assay was achieved by optimization of the PCR, the first step in the assay. Analysis of the mcy and nda gene clusters from various hepatotoxin-producing cyanobacteria was rewarding; it revealed that the genes were ancient. In addition, new methods detecting all the main producers of hepatotoxins could be developed. Interestingly, potential microcystin-producing cyanobacterial strains of Microcystis, Planktothrix, and Anabaena, co-occurred especially in eutrophic and hypertrophic lakes. Protecting waters from eutrophication and restoration of lakes may thus decrease the prevalence of toxic cyanobacteria and the frequency of toxic blooms.
Resumo:
Transposable elements, transposons, are discrete DNA segments that are able to move or copy themselves from one locus to another within or between their host genome(s) without a requirement for DNA homology. They are abundant residents in virtually all the genomes studied, for instance, the genomic portion of TEs is approximately 3% in Saccharomyces cerevisiae, 45% in humans, and apparently more than 70% in some plant genomes such as maize and barley. Transposons plays essential role in genome evolution, in lateral transfer of antibiotic resistance genes among bacteria and in life cycle of certain viruses such as HIV-1 and bacteriophage Mu. Despite the diversity of transposable elements they all use a fundamentally similar mechanism called transpositional DNA recombination (transposition) for the movement within and between the genomes of their host organisms. The DNA breakage and joining reactions that underlie their transposition are chemically similar in virtually all known transposition systems. The similarity of the reactions is also reflected in the structure and function of the catalyzing enzymes, transposases and integrases. The transposition reactions take place within the context of a transposition machinery, which can be particularly complex, as in the case of the VLP (virus like particle) machinery of retroelements, which in vivo contains RNA or cDNA and a number of element encoded structural and catalytic proteins. Yet, the minimal core machinery required for transposition comprises a multimer of transposase or integrase proteins and their binding sites at the element DNA ends only. Although the chemistry of DNA transposition is fairly well characterized, the components and function of the transposition machinery have been investigated in detail for only a small group of elements. This work focuses on the identification, characterization, and functional studies of the molecular components of the transposition machineries of BARE-1, Hin-Mu and Mu. For BARE-1 and Hin-Mu transpositional activity has not been shown previously, whereas bacteriophage Mu is a general model of transposition. For BARE-1, which is a retroelement of barley (Hordeum vulgare), the protein and DNA components of the functional VLP machinery were identified from cell extracts. In the case of Hin-Mu, which is a Mu-like prophage in Haemophilus influenzae Rd genome, the components of the core machinery (transposase and its binding sites) were characterized and their functionality was studied by using an in vitro methodology developed for Mu. The function of Mu core machinery was studied for its ability to use various DNA substrates: Hin-Mu end specific DNA substrates and Mu end specific hairpin substrates. The hairpin processing reaction by MuA was characterized in detail. New information was gained of all three machineries. The components or their activity required for functional BARE-1 VLP machinery and retrotransposon life cycle were present in vivo and VLP-like structures could be detected. The Hin-Mu core machinery components were identified and shown to be functional. The components of the Mu and Hin-Mu core machineries were partially interchangeable, reflecting both evolutionary conservation and flexibility within the core machineries. The Mu core machinery displayed surprising flexibility in substrate usage, as it was able to utilize Hin-Mu end specific DNA substrates and to process Mu end DNA hairpin substrates. This flexibility may be evolutionarily and mechanistically important.
Resumo:
This thesis work focuses on the role of TGF-beta family antagonists during the development of mouse dentition. Tooth develops through an interaction between the dental epithelium and underlying neural crest derived mesenchyme. The reciprocal signaling between these tissues is mediated by soluble signaling molecules and the balance between activatory and inhibitory signals appears to be essential for the pattern formation. We showed the importance of Sostdc1 in the regulation of tooth shape and number. The absence of Sostdc1 altered the molar cusp patterning and led to supernumerary tooth formation both in the molar and incisor region. We showed that initially, Sostdc1 expression is in the mesenchyme, suggesting that dental mesenchyme may limit supernumerary tooth induction. We tested this in wild-type incisors by minimizing the amount of mesenchymal tissue surrounding the incisor tooth germs prior to culture in vitro. The cultured teeth phenocopied the extra incisor phenotype of the Sostdc1-deficient mice. Furthermore, we showed that minimizing the amount of dental mesenchyme in cultured Sostdc1-deficient incisors caused the formation of additional de novo incisors that resembled the successional incisor development resulting from activated Wnt signaling. Sostdc1 seemed to be able to inhibit both mesenchymal BMP4 and epithelial canonical Wnt signaling, which thus allows Sostdc1 to restrict the enamel knot size and regulate the tooth shape and number. Our work emphasizes the dual role for the tooth mesenchyme as a suppressor as well as an activator during tooth development. We found that the placode, forming the thick mouse incisor, is prone to disintegration during initiation of tooth development. The balance between two mesenchymal TGF-beta family signals, BMP4 and Activin is essential in this regulation. The inhibition of BMP4 or increase in Activin signaling led to the splitting of the large incisor placode into two smaller placodes resulting in thin incisors. These two signals appeared to have different effects on tooth epithelium and the analysis of the double null mutant mice lacking Sostdc1 and Follistatin indicated that these TGF-beta inhibitors regulate the mutual balance of BMP and Activin in vivo. In addition, this work provides an alternative explanation for the issue of incisor identity published in Science by Tucker et al. in 1998 and proposes that the molar like morphology that can be obtained by inhibiting BMP signaling is due to partial splitting of the incisor placodes and not due to change in tooth identity from the incisor to the molar. This thesis work presents possible molecular mechanisms that may have modified the mouse dental pattern during evolution leading to the typical rodent dentition of modern mouse. The rodent dentition is specialized for gnawing and consists of two large continuously growing incisors and toothless diastema region separating the molars and incisors. The ancestors of rodents had higher number of more slender incisors together with canines and premolars. Additionally, murine rodents, which include the mouse, have lost their ability for tooth replacement. This work has revealed that the inhibitory molecules appear to play a role in the tooth number suppression by delineating the spatial and temporal action of the inductive signals. The results suggest that Sostdc1 plays an essential role in several stages of tooth development through the regulation of both the BMP and Wnt pathway. The work shows a dormant sequential tooth forming potential present in wild type mouse incisor region and gives a new perspective on tooth suppression by dental mesenchyme. It reveals as well a novel mechanism to create a large mouse incisor through the regulation of mesenchymal balance between inductive and inhibitory signals.
Resumo:
The first part of this work investigates the molecular epidemiology of a human enterovirus (HEV), echovirus 30 (E-30). This project is part of a series of studies performed in our research team analyzing the molecular epidemiology of HEV-B viruses. A total of 129 virus strains had been isolated in different parts of Europe. The sequence analysis was performed in three different genomic regions: 420 nucleotides (nt) in the VP4/VP2 capsid protein coding region, the entire VP1 capsid protein coding gene of 876 nt, and 150 nt in the VP1/2A junction region. The analysis revealed a succession of dominant sublineages within a major genotype. The temporally earlier genotypes had been replaced by a genetically homogenous lineage that has been circulating in Europe since the late 1970s. The same genotype was found by other research groups in North America and Australia. Globally, other cocirculating genetic lineages also exist. The prevalence of a dominant genotype makes E-30 different from other previously studied HEVs, such as polioviruses and coxsackieviruses B4 and B5, for which several coexisting genetic lineages have been reported. The second part of this work deals with molecular epidemiology of human rhinoviruses (HRVs). A total of 61 field isolates were studied in the 420-nt stretch in the capsid coding region of VP4/VP2. The isolates were collected from children under two years of age in Tampere, Finland. Sequences from the clinical isolates clustered in the two previously known phylogenetic clades. Seasonal clustering was found. Also, several distinct serotype-like clusters were found to co-circulate during the same epidemic season. Reappearance of a cluster after disappearing for a season was observed. The molecular epidemiology of the analyzed strains turned out to be complex, and we decided to continue our studies of HRV. Only five previously published complete genome sequences of HRV prototype strains were available for analysis. Therefore, all designated HRV prototype strains (n=102) were sequenced in the VP4/VP2 region, and the possibility of genetic typing of HRV was evaluated. Seventy-six of the 102 prototype strains clustered in HRV genetic group A (HRV-A) and 25 in group B (HRV-B). Serotype 87 clustered separately from other HRVs with HEV species D. The field strains of HRV represented as many as 19 different genotypes, as judged with an approximate demarcation of a 20% nt difference in the VP4/VP2 region. The interserotypic differences of HRV were generally similar to those reported between different HEV serotypes (i.e. about 20%), but smaller differences, less than 10%, were also observed. Because some HRV serotypes are genetically so closely related, we suggest that the genetic typing be performed using the criterion "the closest prototype strain". This study is the first systematic genetic characterization of all known HRV prototype strains, providing a further taxonomic proposal for classification of HRV. We proposed to divide the genus Human rhinoviruses into HRV-A and HRV-B. The final part of the work comprises a phylogenetic analysis of a subset (48) of HRV prototype strains and field isolates (12) in the nonstructural part of the genome coding for the RNA-dependent RNA polymerase (3D). The proposed division of the HRV strains in the species HRV-A and HRV-B was also supported by 3D region. HRV-B clustered closer to HEV species B, C, and also to polioviruses than to HRV-A. Intraspecies variation within both HRV-A and HRV-B was greater in the 3D coding region than in the VP4/VP2 coding region, in contrast to HEV. Moreover, the diversity of HRV in 3D exceeded that of HEV. One group of HRV-A, designated HRV-A', formed a separate cluster outside other HRV-A in the 3D region. It formed a cluster also in the capsid region, but located within HRV-A. This may reflect a different evolutionary history of distinct genomic regions among HRV-A. Furthermore, the tree topology within HRV-A in the 3D region differed from that in the VP4/VP2, suggesting possible recombination events in the evolution of the strains. No conflicting phylogenies were observed in any of the 12 field isolates. Possible recombination was further studied using the Similarity and Bootscanning analyses of the complete genome sequences of HRV available in public databases. Evidence for recombination among HRV-A was found, as HRV2 and HRV39 showed higher similarity in the nonstructural part of the genome. Whether HRV2 and HRV39 strains - and perhaps also some other HRV-A strains not yet completely sequenced - are recombinants remains to be determined.
Resumo:
Mutation and recombination are the fundamental processes leading to genetic variation in natural populations. This variation forms the raw material for evolution through natural selection and drift. Therefore, studying mutation rates may reveal information about evolutionary histories as well as phylogenetic interrelationships of organisms. In this thesis two molecular tools, DNA barcoding and the molecular clock were examined. In the first part, the efficiency of mutations to delineate closely related species was tested and the implications for conservation practices were assessed. The second part investigated the proposition that a constant mutation rate exists within invertebrates, in form of a metabolic-rate dependent molecular clock, which can be applied to accurately date speciation events. DNA barcoding aspires to be an efficient technique to not only distinguish between species but also reveal population-level variation solely relying on mutations found on a short stretch of a single gene. In this thesis barcoding was applied to discriminate between Hylochares populations from Russian Karelia and new Hylochares findings from the greater Helsinki region in Finland. Although barcoding failed to delineate the two reproductively isolated groups, their distinct morphological features and differing life-history traits led to their classification as two closely related, although separate species. The lack of genetic differentiation appears to be due to a recent divergence event not yet reflected in the beetles molecular make-up. Thus, the Russian Hylochares was described as a new species. The Finnish species, previously considered as locally extinct, was recognized as endangered. Even if, due to their identical genetic make-up, the populations had been regarded as conspecific, conservation strategies based on prior knowledge from Russia would not have guaranteed the survival of the Finnish beetle. Therefore, new conservation actions based on detailed studies of the biology and life-history of the Finnish Hylochares were conducted to protect this endemic rarity in Finland. The idea behind the strict molecular clock is that mutation rates are constant over evolutionary time and may thus be used to infer species divergence dates. However, one of the most recent theories argues that a strict clock does not tick per unit of time but that it has a constant substitution rate per unit of mass-specific metabolic energy. Therefore, according to this hypothesis, molecular clocks have to be recalibrated taking body size and temperature into account. This thesis tested the temperature effect on mutation rates in equally sized invertebrates. For the first dataset (family Eucnemidae, Coleoptera) the phylogenetic interrelationships and evolutionary history of the genus Arrhipis had to be inferred before the influence of temperature on substitution rates could be studied. Further, a second, larger invertebrate dataset (family Syrphidae, Diptera) was employed. Several methodological approaches, a number of genes and multiple molecular clock models revealed that there was no consistent relationship between temperature and mutation rate for the taxa under study. Thus, the body size effect, observed in vertebrates but controversial for invertebrates, rather than temperature may be the underlying driving force behind the metabolic-rate dependent molecular clock. Therefore, the metabolic-rate dependent molecular clock does not hold for the here studied invertebrate groups. This thesis emphasizes that molecular techniques relying on mutation rates have to be applied with caution. Whereas they may work satisfactorily under certain conditions for specific taxa, they may fail for others. The molecular clock as well as DNA barcoding should incorporate all the information and data available to obtain comprehensive estimations of the existing biodiversity and its evolutionary history.
Resumo:
The parasitic wasps are one of the largest insect groups and their life histories are remarkably variable. Common to all parasitic wasps is that they kill their hosts, which are usually beetles, butterflies and sometimes spiders. Hosts are often at a larval or pupal stage and live in concealed conditions, such as in plant tissue. Parasitic wasps have two main ways of finding their host. 1) They can detect chemical compounds emitted by damaged plant material or released by larvae living in plant tissue, and 2) detect the larvae by sound vibrations. Even though pupae are immobile and silent, and therefore do not cause vibration, parasitoids have, however, adapted to find passive developmental stages by producing vibration themselves by knocking the substrate with their antennae, and then detecting the echoes with their legs. This echolocation allows a parasitoid to locate its potential hosts that are deeply buried in wood. This study focuses on the relationships of the subfamily Cryptinae (Hymenoptera: Ichneumonidae) and related taxa, and the evolution of host location mechanism. There are no earlier studies of the phylogeny of the Cryptinae, and the position of related taxa are unclear. According to the earlier classification, which is entirely intuitional, the Cryptinae is divided into three tribes: Cryptini, Hemigasterini and Phygadeuontini. Further, these tribes are subdiveded into numerous subtribes. This work, based on molecular characters, shows that the cryptine tribes Cryptini, Phygadeuon¬tini and Hemigasterini come out largely as monophyletic groups, thus agreeing with the earlier classification. The earlier subtribal classification had no support. In addition, it is shown that modified antennal structures are associated with host usage of wood-boring coleopteran hosts. The cryptines have a clear modification series on their antennal tips from a simply tip to a hammer-like structure. The species with strongly modified antennae belong mostly to the tribe Cryptini and they utilise wood-boring beetles as hosts. Also, field observations on insect behaviour support this result.
Resumo:
New stars form in dense interstellar clouds of gas and dust called molecular clouds. The actual sites where the process of star formation takes place are the dense clumps and cores deeply embedded in molecular clouds. The details of the star formation process are complex and not completely understood. Thus, determining the physical and chemical properties of molecular cloud cores is necessary for a better understanding of how stars are formed. Some of the main features of the origin of low-mass stars, like the Sun, are already relatively well-known, though many details of the process are still under debate. The mechanism through which high-mass stars form, on the other hand, is poorly understood. Although it is likely that the formation of high-mass stars shares many properties similar to those of low-mass stars, the very first steps of the evolutionary sequence are unclear. Observational studies of star formation are carried out particularly at infrared, submillimetre, millimetre, and radio wavelengths. Much of our knowledge about the early stages of star formation in our Milky Way galaxy is obtained through molecular spectral line and dust continuum observations. The continuum emission of cold dust is one of the best tracers of the column density of molecular hydrogen, the main constituent of molecular clouds. Consequently, dust continuum observations provide a powerful tool to map large portions across molecular clouds, and to identify the dense star-forming sites within them. Molecular line observations, on the other hand, provide information on the gas kinematics and temperature. Together, these two observational tools provide an efficient way to study the dense interstellar gas and the associated dust that form new stars. The properties of highly obscured young stars can be further examined through radio continuum observations at centimetre wavelengths. For example, radio continuum emission carries useful information on conditions in the protostar+disk interaction region where protostellar jets are launched. In this PhD thesis, we study the physical and chemical properties of dense clumps and cores in both low- and high-mass star-forming regions. The sources are mainly studied in a statistical sense, but also in more detail. In this way, we are able to examine the general characteristics of the early stages of star formation, cloud properties on large scales (such as fragmentation), and some of the initial conditions of the collapse process that leads to the formation of a star. The studies presented in this thesis are mainly based on molecular line and dust continuum observations. These are combined with archival observations at infrared wavelengths in order to study the protostellar content of the cloud cores. In addition, centimetre radio continuum emission from young stellar objects (YSOs; i.e., protostars and pre-main sequence stars) is studied in this thesis to determine their evolutionary stages. The main results of this thesis are as follows: i) filamentary and sheet-like molecular cloud structures, such as infrared dark clouds (IRDCs), are likely to be caused by supersonic turbulence but their fragmentation at the scale of cores could be due to gravo-thermal instability; ii) the core evolution in the Orion B9 star-forming region appears to be dynamic and the role played by slow ambipolar diffusion in the formation and collapse of the cores may not be significant; iii) the study of the R CrA star-forming region suggests that the centimetre radio emission properties of a YSO are likely to change with its evolutionary stage; iv) the IRDC G304.74+01.32 contains candidate high-mass starless cores which may represent the very first steps of high-mass star and star cluster formation; v) SiO outflow signatures are seen in several high-mass star-forming regions which suggest that high-mass stars form in a similar way as their low-mass counterparts, i.e., via disk accretion. The results presented in this thesis provide constraints on the initial conditions and early stages of both low- and high-mass star formation. In particular, this thesis presents several observational results on the early stages of clustered star formation, which is the dominant mode of star formation in our Galaxy.
Resumo:
Earlier phylogenetic studies, including species belonging to the Neckeraceae, have indicated that this pleurocarpous moss family shares a strongly supported sister group relationship with the Lembophyllaceae, but the family delimitation of the former needs adjustment. To test the monophyly of the Neckeraceae, as well as to redefine the family circumscription and to pinpoint its phylogenetic position in a larger context, a phylogenetic study based on molecular data was carried out. Sequence data were compiled, combining data from all three genomes: nuclear ITS1 and 2, plastid trnS-rps4-trnT-trnL-trnF and rpl16, and mitochondrial nad5 intron. The Neckeraceae have sometimes been divided into the two families, Neckeraceae and Thamnobryaceae, a division rejected here. Both parsimony and Bayesian analyses of molecular data revealed that the family concept of the Neckeraceae needs several further adjustments, such as the exclusion of some individual species and smaller genera as well as the inclusion of the Leptodontaceae. Within the family three well-supported clades (A, B and C) can be distinguished. Members of clade A are mainly non-Asiatic and nontropical. Most species have a weak costa and immersed capsules with reduced peristomes (mainly Neckera spp.) and the teeth at the leaf margins are usually unicellular. Clade B members are also mainly non-Asiatic. They are typically fairly robust, distinctly stipilate, having a single, at least relatively strong costa, long setae (capsules exserted), and the peristomes are well developed or only somewhat reduced. Members of clade C are essentially Asiatic and tropical. The species of this clade usually have a strong costa and a long seta, the seta often being mammillose in its upper part. The peristome types in this clade are mixed, since both reduced and unreduced types are found. Several neckeraceous genera that were recognised on a morphological basis are polyphyletic (e.g. Neckera, Homalia, Thamnobryum, Porotrichum). Ancestral state reconstructions revealed that currently used diagnostic traits, such as the leaf asymmetry and costa strength are highly homoplastic. Similarly, the reconstructions revealed that the 'reduced' sporophyte features have evolved independently in each of the three clades.
Resumo:
Cassava brown streak disease (CBSD) was described for the first time in Tanganyika (now Tanzania) about seven decades ago. Tanganyika (now Tanzania) about seven decades ago. It was endemic in the lowland areas of East Africa and inland parts of Malawi and caused by Cassava brown streak virus (CBSV; genus Ipomovirus; Potyviridae). However, in 1990s CBSD was observed at high altitude areas in Uganda. The causes for spread to new locations were not known.The present work was thus initiated to generate information on genetic variability, clarify the taxonomy of the virus or viruses associated with CBSD in Eastern Africa as well as to understand the evolutionary forces acting on their genes. It also sought to develop a molecular based diagnostic tool for detection of CBSD-associated virus isolates. Comparison of the CP-encoding sequences of CBSD-associated virus isolates collected from Uganda and north-western Tanzania in 2007 and the partial sequences available in Genbank revealed occurrence of two genetically distinct groups of isolates. Two isolates were selected to represent the two groups. The complete genomes of isolates MLB3 (TZ:Mlb3:07) and Kor6 (TZ:Kor6:08) obtained from North-Western (Kagera) and North-Eastern (Tanga) Tanzania, respectively, were sequenced. The genomes were 9069 and 8995 nucleotides (nt), respectively. They translated into polyproteins that were predicted to yield ten mature proteins after cleavage. Nine proteins were typical in the family Potyviridae, namely P1, P3, 6K1, CI, 6K2, VPg, NIa-Pro, NIb and CP, but the viruses did not contain HC-Pro. Interestingly, genomes of both isolates contained a Maf/HAM1-like sequence (HAM1h; 678 nucleotides, 25 kDa) recombined between the NIb and CP domains in the 3’-proximal part of the genomes. HAM1h was also identified in Euphorbia ringspot virus (EuRSV) whose sequence was in GenBank. The HAM1 gene is widely spread in both prokaryotes and eukaryotes. In yeast (Saccharomyces cerevisiae) it is known to be a nucleoside triphosphate (NTP) pyrophosphatase. Novel information was obtained on the structural variation at the N-termini of polyproteins of viruses in the genus Ipomovirus. Cucumber vein yellowing virus (CVYV) and Squash vein yellowing virus (SqVYV) contain a duplicated P1 (P1a and P1b) but lack the HC-Pro. On the other hand, Sweet potato mild mottle virus (SPMMV), has a single but large P1 and has HC-Pro. Both virus isolates (TZ:Mlb3:07 & TZ:Kor6:08) characterized in this study contained a single P1 and lacked the HC-Pro which indicates unique evolution in the family Potyviridae. Comparison of 12 complete genomes of CBSD-associated viruses which included two genomes characterized in this study, revealed genetic identity of 69.0–70.3% (nt) and amino acid (aa) identities of 73.6–74.4% at polyprotein level. Comparison was also made among 68 complete CP sequences, which indicated 69.0-70.3 and 73.6-74.4 % identity at nt and aa levels, respectively. The genetic variation was large enough for dermacation of CBSD-associated virus isolates into two distinct species. The name CBSV was retained for isolates that were related to CBSV isolates available in database whereas the new virus described for the first time in this study was named Ugandan cassava brown streak virus (UCBSV) by the International Committee on Virus Taxonomy (ICTV). The isolates TZ:Mlb3:07 and TZ:Kor6:08 belong to UCBSV and CBSV, respectively. The isolates of CBSV and UCBSV were 79.3-95.5% and 86.3-99.3 % identitical at nt level, respectively, suggesting more variation amongst CBSV isolates. The main sources of variation in plant viruses are mutations and recombination. Signals for recombination events were detected in 50% of isolates of each virus. Recombination events were detected in coding and non-coding (3’-UTR) sequences except in the 5’UTR and P3. There was no evidence for recombination between isolates of CBSV and UCBSV. The non-synonomous (dN) to synonomous (dS) nucleotide substitution ratio (ω) for the HAM1h and CP domains of both viruses were ≤ 0.184 suggesting that most sites of these proteins were evolving under strong purifying selection. However, there were individual amino acid sites that were submitted to adaptive evolution. For instance, adaptive evolution was detected in the HAM1h of UCBSV (n=15) where 12 aa sites were under positive selection (P< 0.05) but not in CBSV (n=12). The CP of CBSV (n=23) contained 12 aa sites (p<0.01) while only 5 aa sites in the CP gene of UCBSV were predicted to be submitted to positive selection pressure (p<0.01). The advantages offered by the aa sites under positive selection could not be established but occurrence of such sites in the terminal ends of UCBSV-HAMIh, for example, was interpreted as a requirement for proteolysis during polyprotein processing. Two different primer pairs that simultaneously detect UCBSV and CBSV isolates were developed in this study. They were used successfully to study distribution of CBSV, UCBSV and their mixed infections in Tanzania and Uganda. It was established that the two viruses co-infect cassava and that incidences of co-infection could be as high as 50% around Lake Victoria on the Tanzanian side. Furthermore, it was revealed for the first time that both UCBSV and CBSV were widely distributed in Eastern Africa. The primer pair was also used to confirm infection in a close relative of cassava, Manihot glaziovii (Müller Arg.) with CBSV. DNA barcoding of M. glaziovii was done by sequencing the matK gene. Two out of seven M. glaziovii from the coastal areas of Korogwe and Kibaha in north eastern Tanzania were shown to be infected by CBSV but not UCBSV isolates. Detection in M. glaziovii has an implication in control and management of CBSD as it is likely to serve as virus reservoir. This study has contributed to the understanding of evolution of CBSV and UCBSV, which cause CBSD epidemic in Eastern Africa. The detection tools developed in this work will be useful in plant breeding, verification of the phytosanitary status of materials in regional and international movement of germplasm, and in all diagnostic activities related to management of CBSD. Whereas there are still many issues to be resolved such as the function and biological significance of HAM1h and its origin, this work has laid a foundation upon which the studies on these aspects can be based.