939 resultados para temporal molecular evolution
Resumo:
The interaction of man with viral agents was possibly a key factor shaping human evolution, culture and civilization from its outset. Evidence of the effect of disease, since the early stages of human speciation, through pre-historical times to the present suggest that the types of viruses associated with man changed in time. As human populations progressed technologically, they grew in numbers and density. As a consequence different viruses found suitable conditions to thrive and establish long-lasting associations with man. Although not all viral agents cause disease and some may in fact be considered beneficial, the present situation of overpopulation, poverty and ecological inbalance may have devastating effets on human progress. Recently emerged diseases causing massive pandemics (eg., HIV-1 and HCV, dengue, etc.) are becoming formidable challenges, which may have a direct impact on the fate of our species.
Resumo:
Electron microscopic analysis of heteroduplexes between the most distantly related Xenopus vitellogenin genes (A genes X B genes) has revealed the distribution of homologous regions that have been preferentially conserved after the duplication events that gave rise to the multigene family in Xenopus laevis. DNA sequence analysis was limited to the region downstream of the transcription initiation site of the Xenopus genes A1, B1 and B2 and a comparison with the Xenopus A2 and the major chicken vitellogenin gene is presented. Within the coding regions of the first three exons, nucleotide substitutions resulting in amino acid changes accumulate at a rate similar to that observed in globin genes. This suggests that the duplication event which led to the formation of the A and B ancestral genes in Xenopus laevis occurred about 150 million years ago. Homologous exons of the A1-A2 and B1-B2 gene pairs, which formed about 30 million years ago, show a quite similar sequence divergence. In contrast, A1-A2 homologous introns seem to have evolved much faster than their B1-B2 counterparts.
Resumo:
Abstract Arbuscular mycorrhizal fungi (AMF) form symbiosis with roots of approximately 80% of known land plants. These fungi play a key role in the ecology and adaptation of plants to various ecosystems.by increasing the plant resources for various nutrients. Despite their important ecological role, we still have poor understanding of their genetic structure and their molecular evolution. The work presented in this thesis aims to isolate and analyse AMF genes with various molecular techniques, in order to obtain new insights about their genetics, phylogeny and molecular evolution. Some AMF genes were shown through phylogenetic analyses to be more related with plants or mycoparasites than with other fungal organisms. These results led to the prediction that lateral gene transfers (LGT) occurred between AMF and plants during their long-term co-évolution. By phylogenetic and molecular analyses, in the chapter 2 I demonstrate that the hypothesis of LGT is most likely a consequence of analyses carried out on contaminant non AMF-DNA. In addition, various features characteristic of AMF genes have been determined, allowing researchers to scan their own sequence databases for potential non-AMF contaminants. Phylogenetic relationships of AMF with other fungi has been mostly analysed using molecular markers of ribosomal origin. In chapter 2 I successfully isolated gene encoding α- and ß-tubulins from several AMF genera. Consequently, phylogenetic analyses showed that AMF possess an unexpected relationship with ancestral aquatic fungi (chytrids). These results are consistent with the prediction stating that AMF may have played an important role in the colonisation of land by green plants through the establishment of a symbiosis and after the divergence of AMF from aquatic ancestors. In Chapter 4 I tried to isolate the entire AMF gene family encoding P-Type II ATPases, in order to determine their molecular evolution with the fungal kingdom. These genes were further analysed to detect the level of sequence polymorphism that is present within an AMF population. The results obtained show that mutational events previously thought as occurring only among divergent evolutionary lineages (gene duplications, indel mutations in coding regions) can occur within a single population of AMF. These results have far reaching consequences for our understanding of the genetics and ecology of AMF. Résumé Les champignons endomycorrhiziens arbusculaires (CEA) forment une symbiose racinaire avec environ 80% des plantes vasculaires connues. Ces champignons possèdent un rôle important dans l'écologie et l'adaptation des plantes au sein de différents écosystèmes en .augmentant leurs ressources en nutriments. Le travail présenté dans cette thèse se propose d'isoler et d'analyser certains gènes de CEA avec différentes techniques moléculaires à fin d'obtenir de pÌus amples informations concernant l'évolution moléculaire, la phylogénie et leur diversité génétique à diverses échelles taxonomiques. Certaines analyses phylogénétiques des CEA ont conduit à l'hypothèse que des transferts horizontaux de gènes (THG) ont pu avoir lieu durant leur longue co-évolution avec les plantes vasculaires. Dans le chapitre 2 de cette thèse nous démontrons par analyses moléculaire et phylogénétique que l'hypothèse de THG est une conséquence de contaminations à partir d'ADN de plante ou d'autres micro-organismes. De plus, de nombreuses caractéristiques moléculaires de CEA ont pu être déterminées, permettant la mise en place d'un plan à suivre lors de l'analyse de gènes de CEA dans les études futures. Les relations évolutives des. CEA avec d'autres champignons ont été analysées majoritairement à l'aide de marqueurs moléculaires d'origine ribosomiale. Dans les chapitres 2 et 3 j'ai isolé des gènes codant pour l'a- et la ß-tubuline chez différents genres, de CEA. Les analyses phylogénétiques ont démontré une parenté entre les CEA et des champignons aquatiques ancestraux (chytrides). Ces résultats sont en accord avec l'hypothèse selon laquelle les CEA ont probablement joué un rôle primordial dans l'établissement des plantes sur terre à travers une symbiose et suite à leur évolution à partir d'ancêtres vivant dans des milieux aquatiques: Dans le chapitre 4 j'ai isolé une entière famille de gènes chez les CEA codant des ATPases de la membrane plasmique, et étudié leur évolution moléculaire dans le règne des champignons. Ces mêmes gènes ont été analysés ultérieurement à fin de déterminer le degré de polymorphisme de séquence qui peut être présent au sein d'une population de CEA. Les résultats obtenus montrent que des évènements mutationnels considérés comme apparaissant exclusivement dans des lignées évolutives très divergentes (duplication de gènes, insertions/délétions dans des régions transcrites du génome) ont lieu sein d'une même population de CEA. Cette découverte a un impact important sur nos connaissances concernant la génétique des populations des CEA et leur écologie.
Resumo:
Owing to its special mode of evolution and central role in the adaptive immune system, the major histocompatibility complex (MHC) has become the focus of diverse disciplines such as immunology, evolutionary ecology, and molecular evolution. MHC evolution has been studied extensively in diverse vertebrate lineages over the last few decades, and it has been suggested that birds differ from the established mammalian norm. Mammalian MHC genes evolve independently, and duplication history (i.e., orthology) can usually be traced back within lineages. In birds, this has been observed in only 3 pairs of closely related species. Here we report strong evidence for the persistence of orthology of MHC genes throughout an entire avian order. Phylogenetic reconstructions of MHC class II B genes in 14 species of owls trace back orthology over tens of thousands of years in exon 3. Moreover, exon 2 sequences from several species show closer relationships than sequences within species, resembling transspecies evolution typically observed in mammals. Thus, although previous studies suggested that long-term evolutionary dynamics of the avian MHC was characterized by high rates of concerted evolution, resulting in rapid masking of orthology, our results question the generality of this conclusion. The owl MHC thus opens new perspectives for a more comprehensive understanding of avian MHC evolution.
Resumo:
Mammals are characterized by specific phenotypic traits that include lactation, hair, and relatively large brains with unique structures. Individual mammalian lineages have, in turn, evolved characteristic traits that distinguish them from others. These include obvious anatom¬ical differences but also differences related to reproduction, life span, cognitive abilities, be¬havior. and disease susceptibility. However, the molecular basis of the diverse mammalian phenotypes and the selective pressures that shaped their evolution remain largely unknown. In the first part of my thesis, I analyzed the genetic factors associated with the origin of a unique mammalian phenotype lactation and I studied the selective pressures that forged the transition from oviparity to viviparity. Using a comparative genomics approach and evolutionary simulations, I showed that the emergence of lactation, as well as the appear¬ance of the casein gene family, significantly reduced selective pressure on the major egg-yolk proteins (the vitellogenin family). This led to a progressive loss of vitellogenins, which - in oviparous species - act as storage proteins for lipids, amino acids, phosphorous and calcium in the isolated egg. The passage to internal fertilization and placentation in therian mam¬mals rendered vitellogenins completely dispensable, which ended in the loss of the whole gene family in this lineage. As illustrated by the vitellogenin study, changes in gene content are one possible underlying factor for the evolution of mammalian-specific phenotypes. However, more subtle genomic changes, such as mutations in protein-coding sequences, can also greatly affect the phenotypes. In particular, it was proposed that changes at the level of gene reg¬ulation could underlie many (or even most) phenotypic differences between species. In the second part of my thesis, I participated in a major comparative study of mammalian tissue transcriptomes, with the goal of understanding how evolutionary forces affected expression patterns in the past 200 million years of mammalian evolution. I showed that, while com¬parisons of gene expressions are in agreement with the known species phylogeny, the rate of expression evolution varies greatly among lineages. Species with low effective population size, such as monotremes and hominoids, showed significantly accelerated rates of gene expression evolution. The most likely explanation for the high rate of gene expression evolution in these lineages is the accumulation of mildly deleterious mutations in regulatory regions, due to the low efficiency of purifying selection. Thus, our observations are in agreement with the nearly neutral theory of molecular evolution. I also describe substantial differences in evolutionary rates between tissues, with brain being the most constrained (especially in primates) and testis significantly accelerated. The rate of gene expression evolution also varies significantly between chromosomes. In particular, I observed an acceleration of gene expression changes on the X chromosome, probably as a result of adaptive processes associated with the origin of therian sex chromosomes. Lastly, I identified several individual genes as well as co-regulated expression modules that have undergone lineage specific expression changes and likely under¬lie various phenotypic innovations in mammals. The methods developed during my thesis, as well as the comprehensive gene content analyses and transcriptomics datasets made available by our group, will likely prove to be useful for further exploratory analyses of the diverse mammalian phenotypes.
Resumo:
Pseudomonas protegens is a biocontrol rhizobacterium with a plant-beneficial and an insect pathogenic lifestyle, but it is not understood how the organism switches between the two states. Here, we focus on understanding the function and possible evolution of a molecular sensor that enables P. protegens to detect the insect environment and produce a potent insecticidal toxin specifically during insect infection but not on roots. By using quantitative single cell microscopy and mutant analysis, we provide evidence that the sensor histidine kinase FitF is a key regulator of insecticidal toxin production. Our experimental data and bioinformatic analyses indicate that FitF shares a sensing domain with DctB, a histidine kinase regulating carbon uptake in Proteobacteria. This suggested that FitF has acquired its specificity through domain shuffling from a common ancestor. We constructed a chimeric DctB-FitF protein and showed that it is indeed functional in regulating toxin expression in P. protegens. The shuffling event and subsequent adaptive modifications of the recruited sensor domain were critical for the microorganism to express its potent insect toxin in the observed host-specific manner. Inhibition of the FitF sensor during root colonization could explain the mechanism by which P. protegens differentiates between the plant and insect host. Our study establishes FitF of P. protegens as a prime model for molecular evolution of sensor proteins and bacterial pathogenicity.
Resumo:
BACKGROUND: Complete mitochondrial genome sequences have become important tools for the study of genome architecture, phylogeny, and molecular evolution. Despite the rapid increase in available mitogenomes, the taxonomic sampling often poorly reflects phylogenetic diversity and is often also biased to represent deeper (family-level) evolutionary relationships. RESULTS: We present the first fully sequenced ant (Hymenoptera: Formicidae) mitochondrial genomes. We sampled four mitogenomes from three species of fire ants, genus Solenopsis, which represent various evolutionary depths. Overall, ant mitogenomes appear to be typical of hymenopteran mitogenomes, displaying a general A+T-bias. The Solenopsis mitogenomes are slightly more compact than other hymentoperan mitogenomes (~15.5 kb), retaining all protein coding genes, ribosomal, and transfer RNAs. We also present evidence of recombination between the mitogenomes of the two conspecific Solenopsis mitogenomes. Finally, we discuss potential ways to improve the estimation of phylogenies using complete mitochondrial genome sequences. CONCLUSIONS: The ant mitogenome presents an important addition to the continued efforts in studying hymenopteran mitogenome architecture, evolution, and phylogenetics. We provide further evidence that the sampling across many taxonomic levels (including conspecifics and congeners) is useful and important to gain detailed insights into mitogenome evolution. We also discuss ways that may help improve the use of mitogenomes in phylogenetic analyses by accounting for non-stationary and non-homogeneous evolution among branches.
Resumo:
With the advancement of high-throughput sequencing and dramatic increase of available genetic data, statistical modeling has become an essential part in the field of molecular evolution. Statistical modeling results in many interesting discoveries in the field, from detection of highly conserved or diverse regions in a genome to phylogenetic inference of species evolutionary history Among different types of genome sequences, protein coding regions are particularly interesting due to their impact on proteins. The building blocks of proteins, i.e. amino acids, are coded by triples of nucleotides, known as codons. Accordingly, studying the evolution of codons leads to fundamental understanding of how proteins function and evolve. The current codon models can be classified into three principal groups: mechanistic codon models, empirical codon models and hybrid ones. The mechanistic models grasp particular attention due to clarity of their underlying biological assumptions and parameters. However, they suffer from simplified assumptions that are required to overcome the burden of computational complexity. The main assumptions applied to the current mechanistic codon models are (a) double and triple substitutions of nucleotides within codons are negligible, (b) there is no mutation variation among nucleotides of a single codon and (c) assuming HKY nucleotide model is sufficient to capture essence of transition- transversion rates at nucleotide level. In this thesis, I develop a framework of mechanistic codon models, named KCM-based model family framework, based on holding or relaxing the mentioned assumptions. Accordingly, eight different models are proposed from eight combinations of holding or relaxing the assumptions from the simplest one that holds all the assumptions to the most general one that relaxes all of them. The models derived from the proposed framework allow me to investigate the biological plausibility of the three simplified assumptions on real data sets as well as finding the best model that is aligned with the underlying characteristics of the data sets. -- Avec l'avancement de séquençage à haut débit et l'augmentation dramatique des données géné¬tiques disponibles, la modélisation statistique est devenue un élément essentiel dans le domaine dé l'évolution moléculaire. Les résultats de la modélisation statistique dans de nombreuses découvertes intéressantes dans le domaine de la détection, de régions hautement conservées ou diverses dans un génome de l'inférence phylogénétique des espèces histoire évolutive. Parmi les différents types de séquences du génome, les régions codantes de protéines sont particulièrement intéressants en raison de leur impact sur les protéines. Les blocs de construction des protéines, à savoir les acides aminés, sont codés par des triplets de nucléotides, appelés codons. Par conséquent, l'étude de l'évolution des codons mène à la compréhension fondamentale de la façon dont les protéines fonctionnent et évoluent. Les modèles de codons actuels peuvent être classés en trois groupes principaux : les modèles de codons mécanistes, les modèles de codons empiriques et les hybrides. Les modèles mécanistes saisir une attention particulière en raison de la clarté de leurs hypothèses et les paramètres biologiques sous-jacents. Cependant, ils souffrent d'hypothèses simplificatrices qui permettent de surmonter le fardeau de la complexité des calculs. Les principales hypothèses retenues pour les modèles actuels de codons mécanistes sont : a) substitutions doubles et triples de nucleotides dans les codons sont négligeables, b) il n'y a pas de variation de la mutation chez les nucléotides d'un codon unique, et c) en supposant modèle nucléotidique HKY est suffisant pour capturer l'essence de taux de transition transversion au niveau nucléotidique. Dans cette thèse, je poursuis deux objectifs principaux. Le premier objectif est de développer un cadre de modèles de codons mécanistes, nommé cadre KCM-based model family, sur la base de la détention ou de l'assouplissement des hypothèses mentionnées. En conséquence, huit modèles différents sont proposés à partir de huit combinaisons de la détention ou l'assouplissement des hypothèses de la plus simple qui détient toutes les hypothèses à la plus générale qui détend tous. Les modèles dérivés du cadre proposé nous permettent d'enquêter sur la plausibilité biologique des trois hypothèses simplificatrices sur des données réelles ainsi que de trouver le meilleur modèle qui est aligné avec les caractéristiques sous-jacentes des jeux de données. Nos expériences montrent que, dans aucun des jeux de données réelles, tenant les trois hypothèses mentionnées est réaliste. Cela signifie en utilisant des modèles simples qui détiennent ces hypothèses peuvent être trompeuses et les résultats de l'estimation inexacte des paramètres. Le deuxième objectif est de développer un modèle mécaniste de codon généralisée qui détend les trois hypothèses simplificatrices, tandis que d'informatique efficace, en utilisant une opération de matrice appelée produit de Kronecker. Nos expériences montrent que sur un jeux de données choisis au hasard, le modèle proposé de codon mécaniste généralisée surpasse autre modèle de codon par rapport à AICc métrique dans environ la moitié des ensembles de données. En outre, je montre à travers plusieurs expériences que le modèle général proposé est biologiquement plausible.
Resumo:
Evolution of proteins after whole-genome duplicationGene and genome duplication are considered major mechanisms in the creation of newfunctions in genomes, or in the refinement of networks by the division of function amongmore genes. In animals, the best demonstrated whole genome duplication occurred at theorigin of Teleost fishes. This makes fishes an ideal model to study the consequences ofgenome duplication, particularly since we have a good sampling of genome sequences,abundant functional information, and a very well studied outgroup: the tetrapodes (includinghuman). More specifically, I studied the consequences of duplication on proteins usingevolutionary models to infer adaptive events. I analysed the influence of positive selection invertebrate genes, by contrasting singleton genes and duplicated genes. The conclusion of theanalyses was threefold: (i) positive selection affects diverse phylogenetic branches anddiverse gene categories during vertebrate evolution; (ii) it concerns only a small proportion ofsites (1%-5%); and (iii) whole genome duplication had no detectable impact on theprevalence of this positive selection.I also studied evolution at the amino acid level with different methods to detect functionalshifts (covarion process and constant-but-different process). As in my previous research, Ifound similar numbers of functional shifts between duplicates and between orthologs.The accepted framework for studies of molecular evolution is that orthologs share the samefunction, whereas the function of paralogs diverges. This framework gives a special place togene duplication in evolution, as the main mechanism for generating novelty. With myprevious results showing that duplication and speciation are not so different, we investigatedthe literature to question the evidence for similar or divergent evolution of gene function afterduplication relative to speciation genes. This led us to propose a more rigorous design offuture studies of gene duplication.Finally, based on my automated protocol, we built a database of positive selection invertebrates' genes, Selectome. This database is freely available on the web and will helpfuture evolutionary as well as biochemical studies.
Resumo:
Background: Hox and ParaHox gene clusters are thought to have resulted from the duplication of a ProtoHox gene cluster early in metazoan evolution. However, the origin and evolution of the other genes belonging to the extended Hox group of homeobox-containing genes, that is, Mox and Evx, remains obscure. We constructed phylogenetic trees with mouse, amphioxus and Drosophila extended Hox and other related Antennapedia-type homeobox gene sequences and analyzed the linkage data available for such genes.Results: We claim that neither Mox nor Evx is a Hox or ParaHox gene. We propose a scenariothat reconciles phylogeny with linkage data, in which an Evx/Mox ancestor gene linked to aProtoHox cluster was involved in a segmental tandem duplication event that generated an arrayof all Hox-like genes, referred to as the `coupled¿ cluster. A chromosomal breakage within thiscluster explains the current composition of the extended Hox cluster (with Evx, Hox and Moxgenes) and the ParaHox cluster.Conclusions: Most studies dealing with the origin and evolution of Hox and ParaHox clustershave not included the Hox-related genes Mox and Evx. Our phylogenetic analyses and theavailable linkage data in mammalian genomes support an evolutionary scenario in which anancestor of Evx and Mox was linked to the ProtoHox cluster, and that a tandem duplication of alarge genomic region early in metazoan evolution generated the Hox and ParaHox clusters, plusthe cluster-neighbors Evx and Mox. The large `coupled¿ Hox-like cluster EvxHox/MoxParaHox wassubsequently broken, thus grouping the Mox and Evx genes to the Hox clusters, and isolating theParaHox cluster.
Resumo:
Phenotypic plasticity allows organisms to produce alternative phenotypes under different conditions and represents one of the most important ways by which organisms adaptively respond to the environment. However, the relationship between phenotypic plasticity and molecular evolution remains poorly understood. We addressed this issue by investigating the evolution of genes associated with phenotypically plastic castes, sexes, and developmental stages of the fire ant Solenopsis invicta. We first determined if genes associated with phenotypic plasticity in S. invicta evolved at a rapid rate, as predicted under theoretical models. We found that genes differentially expressed between S. invicta castes, sexes, and developmental stages all exhibited elevated rates of evolution compared with ubiquitously expressed genes. We next investigated the evolutionary history of genes associated with the production of castes. Surprisingly, we found that orthologs of caste-biased genes in S. invicta and the social bee Apis mellifera evolved rapidly in lineages without castes. Thus, in contrast to some theoretical predictions, our results suggest that rapid rates of molecular evolution may not arise primarily as a consequence of phenotypic plasticity. Instead, genes evolving under relaxed purifying selection may more readily adopt new forms of biased expression during the evolution of alternate phenotypes. These results suggest that relaxed selective constraint on protein-coding genes is an important and underappreciated element in the evolutionary origin of phenotypic plasticity.
Resumo:
Assessing the contribution of promoters and coding sequences to gene evolution is an important step toward discovering the major genetic determinants of human evolution. Many specific examples have revealed the evolutionary importance of cis-regulatory regions. However, the relative contribution of regulatory and coding regions to the evolutionary process and whether systemic factors differentially influence their evolution remains unclear. To address these questions, we carried out an analysis at the genome scale to identify signatures of positive selection in human proximal promoters. Next, we examined whether genes with positively selected promoters (Prom+ genes) show systemic differences with respect to a set of genes with positively selected protein-coding regions (Cod+ genes). We found that the number of genes in each set was not significantly different (8.1% and 8.5%, respectively). Furthermore, a functional analysis showed that, in both cases, positive selection affects almost all biological processes and only a few genes of each group are located in enriched categories, indicating that promoters and coding regions are not evolutionarily specialized with respect to gene function. On the other hand, we show that the topology of the human protein network has a different influence on the molecular evolution of proximal promoters and coding regions. Notably, Prom+ genes have an unexpectedly high centrality when compared with a reference distribution (P = 0.008, for Eigenvalue centrality). Moreover, the frequency of Prom+ genes increases from the periphery to the center of the protein network (P = 0.02, for the logistic regression coefficient). This means that gene centrality does not constrain the evolution of proximal promoters, unlike the case with coding regions, and further indicates that the evolution of proximal promoters is more efficient in the center of the protein network than in the periphery. These results show that proximal promoters have had a systemic contribution to human evolution by increasing the participation of central genes in the evolutionary process.
Resumo:
MOTIVATION: Comparative analyses of gene expression data from different species have become an important component of the study of molecular evolution. Thus methods are needed to estimate evolutionary distances between expression profiles, as well as a neutral reference to estimate selective pressure. Divergence between expression profiles of homologous genes is often calculated with Pearson's or Euclidean distance. Neutral divergence is usually inferred from randomized data. Despite being widely used, neither of these two steps has been well studied. Here, we analyze these methods formally and on real data, highlight their limitations and propose improvements. RESULTS: It has been demonstrated that Pearson's distance, in contrast to Euclidean distance, leads to underestimation of the expression similarity between homologous genes with a conserved uniform pattern of expression. Here, we first extend this study to genes with conserved, but specific pattern of expression. Surprisingly, we find that both Pearson's and Euclidean distances used as a measure of expression similarity between genes depend on the expression specificity of those genes. We also show that the Euclidean distance depends strongly on data normalization. Next, we show that the randomization procedure that is widely used to estimate the rate of neutral evolution is biased when broadly expressed genes are abundant in the data. To overcome this problem, we propose a novel randomization procedure that is unbiased with respect to expression profiles present in the datasets. Applying our method to the mouse and human gene expression data suggests significant gene expression conservation between these species. CONTACT: marc.robinson-rechavi@unil.ch; sven.bergmann@unil.ch SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Resumo:
BACKGROUND: The bacterial flagellum is the most important organelle of motility in bacteria and plays a key role in many bacterial lifestyles, including virulence. The flagellum also provides a paradigm of how hierarchical gene regulation, intricate protein-protein interactions and controlled protein secretion can result in the assembly of a complex multi-protein structure tightly orchestrated in time and space. As if to stress its importance, plants and animals produce receptors specifically dedicated to the recognition of flagella. Aside from motility, the flagellum also moonlights as an adhesion and has been adapted by humans as a tool for peptide display. Flagellar sequence variation constitutes a marker with widespread potential uses for studies of population genetics and phylogeny of bacterial species. RESULTS: We sequenced the complete flagellin gene (flaA) in 18 different species and subspecies of Aeromonas. Sequences ranged in size from 870 (A. allosaccharophila) to 921 nucleotides (A. popoffii). The multiple alignment displayed 924 sites, 66 of which presented alignment gaps. The phylogenetic tree revealed the existence of two groups of species exhibiting different FlaA flagellins (FlaA1 and FlaA2). Maximum likelihood models of codon substitution were used to analyze flaA sequences. Likelihood ratio tests suggested a low variation in selective pressure among lineages, with an omega ratio of less than 1 indicating the presence of purifying selection in almost all cases. Only one site under potential diversifying selection was identified (isoleucine in position 179). However, 17 amino acid positions were inferred as sites that are likely to be under positive selection using the branch-site model. Ancestral reconstruction revealed that these 17 amino acids were among the amino acid changes detected in the ancestral sequence. CONCLUSION: The models applied to our set of sequences allowed us to determine the possible evolutionary pathway followed by the flaA gene in Aeromonas, suggesting that this gene have probably been evolving independently in the two groups of Aeromonas species since the divergence of a distant common ancestor after one or several episodes of positive selection. REVIEWERS: This article was reviewed by Alexey Kondrashov, John Logsdon and Olivier Tenaillon (nominated by Laurence D Hurst).
Resumo:
Picornaviruses are the most common human viruses and the identification of the picornaviruses is nowadays based on molecular techniques, for example, reverse transcriptase polymerase chain reaction (RT-PCR). One aim of this thesis was to improve the identification of picornaviruses, especially rhino- and enteroviruses, with a real-time assay format and, also, to improve the differentiation of the viruses with genus-specific locked nucleic acid (LNA) probes. Another aim was to identify and study the causative agent of the enterovirus epidemics that appeared in Finland during seasons 2008-2010. In this thesis, the first version of picornavirus qRT-PCR with a melting curve analysis was used in a study of rhinovirus transmission within families with a rhinovirus positive index child where rhinovirus infection was monitored in all family members. In conclusion, rhinoviruses spread effectively within families causing mostly symptomatic infections in children and asymptomatic infections in adults. To improve the differentiation between rhino- and enterovirus the picornavirus qRT-PCR was modified with LNA-incorporated probes. The LNA probes were validated with picornavirus prototypes and different clinical specimen types. The LNA probe-based picornavirus qRT-PCR was able to differentiate all rhino- and enteroviruses correctly, which makes it suitable for diagnostic use. Moreover, in this thesis enterovirus outbreaks were studied with a well-observed method to create a strain-specific qRT-PCR from the typing region VP1 protein. In a hand-foot-and-mouth-disease (HFMD) outbreak in 2008, the causative agent was identified as CV-A6 and when the molecular evolution of the new HFMD CV-A6 strain was studied it was found that CV-A6 was the emerging agent for HFMD and onychomadesis. Furthermore, unusual E-30 meningitis epidemics that apeared during seasons 2009 and 2010 were studied with strain-specific qRT-PCR. The E-30 affected mostly adolescents and was probably spread in sports teams.