22 resultados para Nucleotide sequence
em Helda - Digital Repository of University of Helsinki
Resumo:
Torque teno virus (TTV) was discovered in 1997 in the serum of a Japanese patient who had a post-transfusion hepatitis of unknown etiology. It is a small virus containing a circular single-stranded DNA genome which is unique among human viruses. Within a few years after its discovery, the TTVs were noted to form a large family of viruses with numerous genotypes. TTV is highly prevalent among the general population throughout the world, and persistent infections and co-infections with several genotypes occur frequently. However, the pathogenicity and the mechanism for the sustained occurrence of the virus in blood are at present unclear. To determine the prevalence of TTV in Finland, we set up PCR methods and examined the sera of asymptomatic subjects for the presence of TTV DNA and for genotype-6 DNA. TTV was found to be highly prevalent also in Finland; 85% of adults harbored TTV in their blood, and 4% were infected with genotype-6. In addition, TTV DNA was detected in a number of different tissues, with no tissue-type or symptom specificity. Most cell-biological events during TTV infections are at the moment unknown. Replicating TTV DNA has, however, been detected in liver and the hematopoietic compartment, and three mRNAs are known to be generated. To characterize TTV cell biology in more detail, we cloned in full length the genome of TTV genotype 6. We showed that in human kidney-derived cells TTV produces altogether six proteins with distinct subcellular localizations. TTV mRNA transcription was detected in all cell lines transfected with the full-length clone, and TTV DNA replicated in several of them, including those of erythroid, kidney, and hepatic origin. Furthermore, the viral DNA replication was shown to utilize the cellular DNA polymerases. Diagnoses of TTV infections have been based almost solely on PCR, whereas serological tests, measuring antibody responses, would give more information on many aspects of these infections. To investigate the TTV immunology in more detail, we produced all six TTV proteins for use as antigens in serological tests. We detected in human sera IgM and IgG antibodies to occur simultaneously with TTV DNA, and observed appearance of TTV DNA regardless of pre-existing antibodies, and disappearance of TTV DNA after antibody appearance. The genotype-6 nucleotide sequence remained stable for years within the infected subjects, suggesting that some mechanism other than mutations is used by this minute virus to evade our immune system and to establish chronic infections in immunocompetent subjects.
Resumo:
Microbial degradation pathways play a key role in the detoxification and the mineralization of polyaromatic hydrocarbons (PAHs), which are widespread pollutants in soil and constituents of petroleum hydrocarbons. In microbiology the aromatic degradation pathways are traditionally studied from single bacterial strains with capacity to degrade certain pollutant. In soil the degradation of aromatics is performed by a diverse community of micro-organisms. The aim of this thesis was to study biodegradation on different levels starting from a versatile aromatic degrader Sphingobium sp. HV3 and its megaplasmid, extending to revelation of diversity of key catabolic enzymes in the environment and finally studying birch rhizoremediation in PAH-polluted soil. To understand biodegradation of aromatics on bacterial species level, the aromatic degradation capacity of Sphingobium sp. HV3 and the role of the plasmid pSKY4, was studied. Toluene, m-xylene, biphenyl, fluorene, phenanthrene were detected as carbon and energy sources of the HV3 strain. Tn5 transposon mutagenesis linked the degradation capacity of toluene, m-xylene, biphenyl and naphthalene to the pSKY4 plasmid and qPCR expression analysis showed that plasmid extradiol dioxygenases genes (bphC and xylE) are inducted by phenanthrene, m-xylene and biphenyl whereas the 2,4-dichlorophenoxyacetic acid herbicide induced the chlorocatechol 1,2-dioxygenase gene (tfdC) from the ortho-pathway. A method to study upper meta-pathway extradiol dioxygenase gene diversity in soil was developed. The extradiol dioxygenases catalyse cleavage of the aromatic ring between a hydroxylated carbon and an adjacent non-hydroxylated carbon (meta-cleavage). A high diversity of extradiol dioxygenases were detected from polluted soils. The detected extradiol dioxygenases showed sequence similarity to known catabolic genes of Alpha-, Beta-, and Gammaproteobacteria. Five groups of extradiol dioxygenases contained sequences with no close homologues in the database, representing novel genes. In rhizoremediation experiment with birch (Betula pendula) treatment specific changes of extradiol dioxygenase communities were shown. PAH pollution changed the bulk soil extradiol dioxygenase community structure and birch rhizosphere contained a more diverse extradiol dioxygenase community than the bulk soil showing a rhizosphere effect. The degradation of pyrene in soil was enhanced with birch seedlings compared to soil without birch. The complete 280,923 kb nucleotide sequence of pSKY4 plasmid was determined. The open reading frames of pSKY4 were divided into putative conjugative transfer, aromatic degradation, replication/maintaining and transposition/integration function-encoding proteins. Aromatic degradation orfs shared high similarity to corresponding genes in pNL1, a plasmid from the deep subsurface strain Novosphingobium aromaticivorans F199. The plasmid backbones were considerably more divergent with lower similarity, which suggests that the aromatic pathway has functioned as a plasmid independent mobile genetic element. The functional diversity of microbial communities in soil is still largely unknown. Several novel clusters of extradiol dioxygenases representing catabolic bacteria, whose function, biodegradation pathways and phylogenetic position is not known were amplified with single primer pair from polluted soils. These extradiol dioxygenase communities were shown to change upon PAH pollution, which indicates that their hosts function in PAH biodegradation in soil. Although the degradation pathways of specific bacterial species are substantially better depicted than pathways in situ, the evolution of degradation pathways for the xenobiotic compounds is largely unknown. The pSKY4 plasmid contains aromatic degradation genes in putative mobile genetic element causing flexibility/instability to the pathway. The localisation of the aromatic biodegradation pathway in mobile genetic elements suggests that gene transfer and rearrangements are a competetive advantage for Sphingomonas bacteria in the environment.
Resumo:
Evolutionary genetics incorporates traditional population genetics and studies of the origins of genetic variation by mutation and recombination, and the molecular evolution of genomes. Among the primary forces that have potential to affect the genetic variation within and among populations, including those that may lead to adaptation and speciation, are genetic drift, gene flow, mutations and natural selection. The main challenges in knowing the genetic basis of evolutionary changes is to distinguish the adaptive selection forces that cause existent DNA sequence variants and also to identify the nucleotide differences responsible for the observed phenotypic variation. To understand the effects of various forces, interpretation of gene sequence variation has been the principal basis of many evolutionary genetic studies. The main aim of this thesis was to assess different forms of teleost gene sequence polymorphisms in evolutionary genetic studies of Atlantic salmon (Salmo salar) and other species. Firstly, the level of Darwinian adaptive evolution affected coding regions of the growth hormone (GH) gene during the teleost evolution was investigated based on the sequence data existing in public databases. Secondly, a target gene approach was used to identify within population variation in the growth hormone 1 (GH1) gene in salmon. Then, a new strategy for single nucleotide polymorphisms (SNPs) discovery in salmonid fishes was introduced, and, finally, the usefulness of a limited number of SNP markers as molecular tools in several applications of population genetics in Atlantic salmon was assessed. This thesis showed that the gene sequences in databases can be utilized to perform comparative studies of molecular evolution, and some putative evidence of the existence of Darwinian selection during the teleost GH evolution was presented. In addition, existent sequence data was exploited to investigate GH1 gene variation within Atlantic salmon populations throughout its range. Purifying selection is suggested to be the predominant evolutionary force controlling the genetic variation of this gene in salmon, and some support for gene flow between continents was also observed. The novel approach to SNP discovery in species with duplicated genome fragments introduced here proved to be an effective method, and this may have several applications in evolutionary genetics with different species - e.g. when developing gene-targeted markers to investigate quantitative genetic variation. The thesis also demonstrated that only a few SNPs performed highly similar signals in some of the population genetic analyses when compared with the microsatellite markers. This may have useful applications when estimating genetic diversity in genes having a potential role in ecological and conservation issues, or when using hard biological samples in genetic studies as SNPs can be applied with relatively highly degraded DNA.
Resumo:
NMR spectroscopy enables the study of biomolecules from peptides and carbohydrates to proteins at atomic resolution. The technique uniquely allows for structure determination of molecules in solution-state. It also gives insights into dynamics and intermolecular interactions important for determining biological function. Detailed molecular information is entangled in the nuclear spin states. The information can be extracted by pulse sequences designed to measure the desired molecular parameters. Advancement of pulse sequence methodology therefore plays a key role in the development of biomolecular NMR spectroscopy. A range of novel pulse sequences for solution-state NMR spectroscopy are presented in this thesis. The pulse sequences are described in relation to the molecular information they provide. The pulse sequence experiments represent several advances in NMR spectroscopy with particular emphasis on applications for proteins. Some of the novel methods are focusing on methyl-containing amino acids which are pivotal for structure determination. Methyl-specific assignment schemes are introduced for increasing the size range of 13C,15N labeled proteins amenable to structure determination without resolving to more elaborate labeling schemes. Furthermore, cost-effective means are presented for monitoring amide and methyl correlations simultaneously. Residual dipolar couplings can be applied for structure refinement as well as for studying dynamics. Accurate methods for measuring residual dipolar couplings in small proteins are devised along with special techniques applicable when proteins require high pH or high temperature solvent conditions. Finally, a new technique is demonstrated to diminish strong-coupling induced artifacts in HMBC, a routine experiment for establishing long-range correlations in unlabeled molecules. The presented experiments facilitate structural studies of biomolecules by NMR spectroscopy.
Resumo:
The analysis of sequential data is required in many diverse areas such as telecommunications, stock market analysis, and bioinformatics. A basic problem related to the analysis of sequential data is the sequence segmentation problem. A sequence segmentation is a partition of the sequence into a number of non-overlapping segments that cover all data points, such that each segment is as homogeneous as possible. This problem can be solved optimally using a standard dynamic programming algorithm. In the first part of the thesis, we present a new approximation algorithm for the sequence segmentation problem. This algorithm has smaller running time than the optimal dynamic programming algorithm, while it has bounded approximation ratio. The basic idea is to divide the input sequence into subsequences, solve the problem optimally in each subsequence, and then appropriately combine the solutions to the subproblems into one final solution. In the second part of the thesis, we study alternative segmentation models that are devised to better fit the data. More specifically, we focus on clustered segmentations and segmentations with rearrangements. While in the standard segmentation of a multidimensional sequence all dimensions share the same segment boundaries, in a clustered segmentation the multidimensional sequence is segmented in such a way that dimensions are allowed to form clusters. Each cluster of dimensions is then segmented separately. We formally define the problem of clustered segmentations and we experimentally show that segmenting sequences using this segmentation model, leads to solutions with smaller error for the same model cost. Segmentation with rearrangements is a novel variation to the segmentation problem: in addition to partitioning the sequence we also seek to apply a limited amount of reordering, so that the overall representation error is minimized. We formulate the problem of segmentation with rearrangements and we show that it is an NP-hard problem to solve or even to approximate. We devise effective algorithms for the proposed problem, combining ideas from dynamic programming and outlier detection algorithms in sequences. In the final part of the thesis, we discuss the problem of aggregating results of segmentation algorithms on the same set of data points. In this case, we are interested in producing a partitioning of the data that agrees as much as possible with the input partitions. We show that this problem can be solved optimally in polynomial time using dynamic programming. Furthermore, we show that not all data points are candidates for segment boundaries in the optimal solution.
Resumo:
Growth is a fundamental aspect of life cycle of all organisms. Body size varies highly in most animal groups, such as mammals. Moreover, growth of a multicellular organism is not uniform enlargement of size, but different body parts and organs grow to their characteristic sizes at different times. Currently very little is known about the molecular mechanisms governing this organ-specific growth. The genome sequencing projects have provided complete genomic DNA sequences of several species over the past decade. The amount of genomic sequence information, including sequence variants within species, is constantly increasing. Based on the universal genetic code, we can make sense of this sequence information as far as it codes proteins. However, less is known about the molecular mechanisms that control expression of genes, and about the variations in gene expression that underlie many pathological states in humans. This is caused in part by lack of information about the second genetic code that consists of the binding specificities of transcription factors and the combinatorial code by which transcription factor binding sites are assembled to form tissue-specific and/or ligand-regulated enhancer elements. This thesis presents a high-throughput assay for identification of transcription factor binding specificities, which were then used to measure the DNA binding profiles of transcription factors involved in growth control. We developed ‘enhancer element locator’, a computational tool, which can be used to predict functional enhancer elements. A genome-wide prediction of human and mouse enhancer elements generated a large database of enhancer elements. This database can be used to identify target genes of signaling pathways, and to predict activated transcription factors based on changes in gene expression. Predictions validated in transgenic mouse embryos revealed the presence of multiple tissue-specific enhancers in mouse c- and N-Myc genes, which has implications to organ specific growth control and tumor type specificity of oncogenes. Furthermore, we were able to locate a variation in a single nucleotide, which carries a susceptibility to colorectal cancer, to an enhancer element and propose a mechanism by which this SNP might be involved in generation of colorectal cancer.
Resumo:
Visual pigments of different animal species must have evolved at some stage to match the prevailing light environments, since all visual functions depend on their ability to absorb available photons and transduce the event into a reliable neural signal. There is a large literature on correlation between the light environment and spectral sensitivity between different fish species. However, little work has been done on evolutionary adaptation between separated populations within species. More generally, little is known about the rate of evolutionary adaptation to changing spectral environments. The objective of this thesis is to illuminate the constraints under which the evolutionary tuning of visual pigments works as evident in: scope, tempo, available molecular routes, and signal/noise trade-offs. Aquatic environments offer Nature s own laboratories for research on visual pigment properties, as naturally occurring light environments offer an enormous range of variation in both spectral composition and intensity. The present thesis focuses on the visual pigments that serve dim-light vision in two groups of model species, teleost fishes and mysid crustaceans. The geographical emphasis is in the brackish Baltic Sea area with its well-known postglacial isolation history and its aquatic fauna of both marine and fresh-water origin. The absorbance spectrum of the (single) dim-light visual pigment were recorded by microspectrophotometry (MSP) in single rods of 26 fish species and single rhabdoms of 8 opossum shrimp populations of the genus Mysis inhabiting marine, brackish or freshwater environments. Additionally, spectral sensitivity was determined from six Mysis populations by electroretinogram (ERG) recording. The rod opsin gene was sequenced in individuals of four allopatric populations of the sand goby (Pomatoschistus minutus). Rod opsins of two other goby species were investigated as outgroups for comparison. Rod absorbance spectra of the Baltic subspecies or populations of the primarily marine species herring (Clupea harengus membras), sand goby (P. minutus), and flounder (Platichthys flesus) were long-wavelength-shifted compared to their marine populations. The spectral shifts are consistent with adaptation for improved quantum catch (QC) as well as improved signal-to-noise ratio (SNR) of vision in the Baltic light environment. Since the chromophore of the pigment was pure A1 in all cases, this has apparently been achieved by evolutionary tuning of the opsin visual pigment. By contrast, no opsin-based differences were evident between lake and sea populations of species of fresh-water origin, which can tune their pigment by varying chromophore ratios. A more detailed analysis of differences in absorbance spectra and opsin sequence between and within populations was conducted using the sand goby as model species. Four allopatric populations from the Baltic Sea (B), Swedish west coast (S), English Channel (E), and Adriatic Sea (A) were examined. Rod absorbance spectra, characterized by the wavelength of maximum absorbance (λmax), differed between populations and correlated with differences in the spectral light transmission of the respective water bodies. The greatest λmax shift as well as the greatest opsin sequence difference was between the Baltic and the Adriatic populations. The significant within-population variation of the Baltic λmax values (506-511 nm) was analyzed on the level of individuals and was shown to correlate well with opsin sequence substitutions. The sequences of individuals with λmax at shorter wavelengths were identical to that of the Swedish population, whereas those with λmax at longer wavelengths additionally had substitution F261F/Y in the sixth transmembrane helix of the protein. This substitution (Y261) was also present in the Baltic common gobies and is known to redshift spectra. The tuning mechanism of the long-wavelength type Baltic sand gobies is assumed to be the co-expression of F261 and Y261 in all rods to produce ≈ 5 nm redshift. The polymorphism of the Baltic sand goby population possibly indicates ambiguous selection pressures in the Baltic Sea. The visual pigments of all lake populations of the opossum shrimp (Mysis relicta) were red-shifted by 25 nm compared with all Baltic Sea populations. This is calculated to confer a significant advantage in both QC and SNR in many humus-rich lakes with reddish water. Since only A2 chromophore was present, the differences obviously reflect evolutionary tuning of the visual protein, the opsin. The changes have occurred within the ca. 9000 years that the lakes have been isolated from the Sea after the most recent glaciation. At present, it seems that the mechanism explaining the spectral differences between lake and sea populations is not an amino acid substitution at any other conventional tuning site, but the mechanism is yet to be found.
Resumo:
The prevalence of obesity is increasing at an alarming rate in all age groups worldwide. Obesity is a serious health problem due to increased risk of morbidity and mortality. Although environmental factors play a major role in the development of obesity, the identification of rare monogenic defects in human genes have confirmed that obesity has a strong genetic component. Mutations have been identified in genes encoding proteins of the leptin-melanocortin signaling system, which has an important role in the regulation of appetite and energy balance. The present study aimed at identifying mutations and genetic variations in the melanocortin receptors 2-5 and other genes active on the same signaling pathway accounting for severe early-onset obesity in children and morbid obesity in adults. The main achievement of this thesis was the identification of melanocortin-4 receptor (MC4R) mutations in Finnish patients. Six pathogenic MC4R mutations (308delT, P299H, two S127L and two -439delGC mutations) were identified, corresponding to a prevalence of 3% in severe early-onset obesity. No obesity causing MC4R mutations were found among patients with adult-onset morbid obesity. The MC4R 308delT deletion is predicted to result in a grossly truncated nonfunctional receptor of only 107 amino acids. The C-terminal residues, which are important in MC4R cell surface targeting, are totally absent from the mutant 308delT receptor. In vitro functional studies supported a pathogenic role for the S127L mutation since agonist induced signaling of the receptor was impaired. Cell membrane localization of the S127L receptor did not differ from that of the wild-type receptor, confirming that impaired function of the S127L receptor was due to reduced signaling properties. The P299H mutation leads to intracellular retention of the receptor. The -439delGC deletion is situated at a potential nescient helix-loop-helix 2 (NHLH2) -binding site in the MC4R promoter. It was demonstrated that the transcription factor NHLH2 binds to the consensus sequence at the -439delGC site in vitro, possibly resulting in altered promoter activity. Several genetic variants were identified in the melanocortin-3 receptor (MC3R) and pro-opiomelanocortin (POMC) genes. These polymorphisms do not explain morbid obesity, but the results indicate that some of these genetic variations may be modifying factors in obesity, resulting in subtle changes in obesity-related traits. A risk haplotype for obesity was identified in the ectonucleotide pyrophosphatase phosphodiesterase 1 (ENPP1) gene through a candidate gene single nucleotide polymorphism (SNP) genotyping approach. An ENPP1 haplotype, composed of SNPs rs1800949 and rs943003, was shown to be significantly associated with morbid obesity in adults. Accordingly, the MC3R, POMC and ENPP1 genes represent examples of susceptibility genes in which genetic variants predispose to obesity. In conclusion, pathogenic mutations in the MC4R gene were shown to account for 3% of cases with severe early-onset obesity in Finland. This is in line with results from other populations demonstrating that mutations in the MC4R gene underlie 1-6% of morbid obesity worldwide. MC4R deficiency thus represents the most common monogenic defect causing human obesity reported so far. The severity of the MC4-receptor defect appears to be associated with time of onset and the degree of obesity. Classification of MC4R mutations may provide a useful tool when predicting the outcome of the disease. In addition, several other genetic variants conferring susceptibility to obesity were detected in the MC3R, MC4R, POMC and ENPP1 genes.
Resumo:
Kohonneiden kolesterolipitoisuuksien alentamisessa käytettävien statiinien hyödyt sydän- ja verisuonisairauksien estossa on vahvasti osoitettu ja niiden käyttö on niin Suomessa kuin muuallakin maailmassa kasvanut voimakkaasti – Suomessa statiininkäyttäjiä on noin 600 000. Statiinilääkitys on pitkäaikaisessakin käytössä melko hyvin siedetty, mutta yleisimpinä haittavaikutuksina voi ilmetä lihasheikkoutta, -kipua ja -kramppeja, jotka voivat edetä jopa henkeä uhkaavaksi lihasvaurioksi. Lihashaittariski suurenee suhteessa statiiniannokseen ja plasman statiinipitoisuuksiin. Statiinien plasmapitoisuuksissa, tehossa ja haittavaikutusten ilmenemisessä on suuria potilaskohtaisia eroja. SLCO1B1-geenin koodaama OATP1B1-kuljetusproteiini kuljettaa monia elimistön omia aineita ja lääkeaineita verenkierrosta solukalvon läpi maksasoluun, mm. statiineja, joiden kolesterolia alentava vaikutus ja poistuminen elimistöstä tapahtuvat pääosin maksassa. Erään SLCO1B1-geenin nukleotidimuutoksen (c.521T>C) tiedetään heikentävän OATP1B1:n kuljetustehoa. Tässä väitöskirjatyössä selvitettiin SLCO1B1-geenin perinnöllistä muuntelua suomalaisilla ja eri väestöissä maailmanlaajuisesti. Lisäksi selvitettiin SLCO1B1:n muunnosten vaikutusta eri statiinien pitoisuuksiin (farmakokinetiikka) ja vaikutuksiin (farmakodynamiikka) sekä kolesteroliaineenvaihduntaan. Näihin tutkimuksiin valittiin SLCO1B1-genotyypin perusteella terveitä vapaaehtoisia koehenkilöitä, joille annettiin eri päivinä kerta-annos kutakin tutkittavaa statiinia: fluvastatiinia, pravastatiinia, simvastatiinia, rosuvastatiinia ja atorvastatiinia. Verinäytteistä määritettiin plasman statiinien ja niiden aineenvaihduntatuotteiden sekä kolesterolin ja sen muodostumista ja imeytymistä kuvaavien merkkiaineiden pitoisuuksia. Toiminnallisesti merkittävien SLCO1B1-geenimuunnosten esiintyvyydessä todettiin suuria eroja eri väestöjen välillä. Suomalaisilla SLCO1B1 c.521TC-genotyypin (geenimuunnos toisessa vastinkromosomissa) esiintyvyys oli noin 32 % ja SLCO1B1 c.521CC-genotyypin (geenimuunnos molemmissa vastinkromosomeissa) esiintyvyys noin 4 %. Globaalisti geenimuunnosten esiintyvyys korreloi maapallon leveyspiirien kanssa siten, että matalaan transportteriaktiivisuuteen johtavat muunnokset olivat yleisimpiä pohjoisessa ja korkeaan aktiivisuuteen johtavat päiväntasaajan lähellä asuvilla väestöillä. SLCO1B1-genotyypillä oli merkittävä vaikutus statiinien plasmapitoisuksiin lukuun ottamatta fluvastatiinia. Simvastatiinihapon plasmapitoisuudet olivat keskimäärin 220 %, atorvastatiinin 140 %, pravastatiinin 90 % ja rosuvastatiinin 70 % suuremmat c.521CC-genotyypin omaavilla koehenkilöillä verrattuna normaalin c.521TT-genotyypin omaaviin. Genotyypillä ei ollut merkittävää vaikutusta minkään statiinin tehoon tässä kerta-annostutkimuksessa, mutta geenimuunnoksen kantajilla perustason kolesterolisynteesinopeus oli suurempi. Tulokset osoittavat, että SLCO1B1 c.521T>C geenimuunnos on varsin yleinen suomalaisilla ja muilla ei-afrikkalaisilla väestöillä. Tämä geenimuunnos voi altistaa erityisesti simvastatiinin, mutta myös atorvastatiinin, pravastatiinin ja rosuvastatiinin, aiheuttamille lihashaitoille suurentamalla niiden plasmapitoisuuksia. SLCO1B1:n geenimuunnoksen testaamista voidaan tulevaisuudessa käyttää apuna valittaessa sopivaa statiinilääkitystä ja -annosta potilaalle, ja näin parantaa sekä statiinihoidon turvallisuutta että tehoa.
Resumo:
"The genetic diversity of Puumala hantavirus (PUUV) was studied in a local population of its natural host, the bank vole (Myodes glareolus). The trapping area (2.5x2.5 km) at Konnevesi, Central Finland, included 14 trapping sites, at least 500 m apart; altogether, 147 voles were captured during May and October 2005. Partial sequences of the S, M and L viral genome segments were recovered from 40 animals. Seven, 12 and 17 variants were detected for the S, M and L sequences, respectively; these represent new wild-type PUUV strains that belong to the Finnish genetic lineage. The genetic diversity of PUUV strains from Konnevesi was 0.2-4.9% for the S segment, 0.2-4.8% for the M segment and 0.2-9.7% for the L segment. Most nucleotide substitutions were synonymous and most deduced amino acid substitutions were conservative, probably due to strong stabilizing selection operating at the protein level. Based on both sequence markers and phylogenetic clustering, the S, M and L sequences could be assigned to two groups, 'A' and 'B'. Notably, not all bank voles carried S, M and L sequences belonging to the same group, i.e. SAMALA or SBMBLB.. A substantial proportion (8/40, 20%) of the newly characterized PUUV strains possessed reassortant genomes such as SBMALA, SAMBLB or SBMALB. These results suggest that at least some of the PUUV reassortants are viable and can survive in the presence of their parental strains."