990 resultados para Gene tree
Resumo:
Quest for Orthologs (QfO) is a community effort with the goal to improve and benchmark orthology predictions. As quality assessment assumes prior knowledge on species phylogenies, we investigated the congruency between existing species trees by comparing the relationships of 147 QfO reference organisms from six Tree of Life (ToL)/species tree projects: The National Center for Biotechnology Information (NCBI) taxonomy, Opentree of Life, the sequenced species/species ToL, the 16S ribosomal RNA (rRNA) database, and trees published by Ciccarelli et al. (Ciccarelli FD, et al. 2006. Toward automatic reconstruction of a highly resolved tree of life. Science 311:1283-1287) and by Huerta-Cepas et al. (Huerta-Cepas J, Marcet-Houben M, Gabaldon T. 2014. A nested phylogenetic reconstruction approach provides scalable resolution in the eukaryotic Tree Of Life. PeerJ PrePrints 2:223) Our study reveals that each species tree suggests a different phylogeny: 87 of the 146 (60%) possible splits of a dichotomous and rooted tree are congruent, while all other splits are incongruent in at least one of the species trees. Topological differences are observed not only at deep speciation events, but also within younger clades, such as Hominidae, Rodentia, Laurasiatheria, or rosids. The evolutionary relationships of 27 archaea and bacteria are highly inconsistent. By assessing 458,108 gene trees from 65 genomes, we show that consistent species topologies are more often supported by gene phylogenies than contradicting ones. The largest concordant species tree includes 77 of the QfO reference organisms at the most. Results are summarized in the form of a consensus ToL (http://swisstree.vital-it.ch/species_tree) that can serve different benchmarking purposes.
Resumo:
Plasmodium falciparum is the parasite responsible for the most acute form of malaria in humans. Recently, the serine repeat antigen (SERA) in P. falciparum has attracted attention as a potential vaccine and drug target, and it has been shown to be a member of a large gene family. To clarify the relationships among the numerous P. falciparum SERAs and to identify orthologs to SERA5 and SERA6 in Plasmodium species affecting rodents, gene trees were inferred from nucleotide and amino acid sequence data for 33 putative SERA homologs in seven different species. (A distance method for nucleotide sequences that is specifically designed to accommodate differing GC content yielded results that were largely compatible with the amino acid tree. Standard-distance and maximum-likelihood methods for nucleotide sequences, on the other hand, yielded gene trees that differed in important respects.) To infer the pattern of duplication, speciation, and gene loss events in the SERA gene family history, the resulting gene trees were then "reconciled" with two competing Plasmodium species tree topologies that have been identified by previous phylogenetic studies. Parsimony of reconciliation was used as a criterion for selecting a gene tree/species tree pair and provided (1) support for one of the two species trees and for the core topology of the amino acid-derived gene tree, (2) a basis for critiquing fine detail in a poorly resolved region of the gene tree, (3) a set of predicted "missing genes" in some species, (4) clarification of the relationship among the P. falciparum SERA, and (5) some information about SERA5 and SERA6 orthologs in the rodent malaria parasites. Parsimony of reconciliation and a second criterion--implied mutational pattern at two key active sites in the SERA proteins-were also seen to be useful supplements to standard "bootstrap" analysis for inferred topologies.
Resumo:
Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)
Resumo:
The taxonomic position of a bacterium isolated from water samples from the Rio Negro, in Amazon, Brazil, was determined by using a polyphasic approach. The organism formed a distinct phyletic line in the Chromobacterium 16S rRNA gene tree and had chemotaxonomic and morphological properties consistent with its classification in this genus. It was found to be closely related to Chromobacterium vaccinii DSM 25150(T) (98.6 % 16S rRNA gene similarity) and shared 98.5 % 16S rRNA gene similarity with Chromobacterium piscinae LGM 3947(T). DNA-DNA relatedness studies showed that isolate CBMAI 310(T) belongs to distinct genomic species. The isolate was readily distinguished from the type strain of these species using a combination of phenotypic and chemotaxonomic properties. Thus, based on genotypic and phenotypic data, it is proposed that isolate CBMAI 310(T) (=DSM 26508(T)) be classified in the genus Chromobacterium as the type strain of a novel species, namely, Chromobacterium amazonense sp. nov.
Resumo:
Background: The common vampire bat Desmodus rotundus is an excellent model organism for studying ecological vicariance in the Neotropics due to its broad geographic range and its preference for forested areas as roosting sites. With the objective of testing for Pleistocene ecological vicariance, we sequenced a mitocondrial DNA (mtDNA) marker and two nuclear markers (RAG2 and DRB) to try to understand how Pleistocene glaciations affected the distribution of intraspecific lineages in this bat. Results: Five reciprocally monophyletic clades were evident in the mitochondrial gene tree, and in most cases with high bootstrap support: Central America (CA), Amazon and Cerrado (AMC), Pantanal (PAN), Northern Atlantic Forest (NAF) and Southern Atlantic Forest (SAF). The Atlantic forest clades formed a monophyletic clade with high bootstrap support, creating an east/west division for this species in South America. On the one hand, all coalescent and non-coalescent estimates point to a Pleistocene time of divergence between the clades. On the other hand, the nuclear markers showed extensive sharing of haplotypes between distant localities, a result compatible with male-biased gene flow. In order to test if the disparity between the mitochondrial and nuclear markers was due to the difference in mutation rate and effective size, we performed a coalescent simulation to examine the feasibility that, given the time of separation between the observed lineages, even with a gene flow rate close to zero, there would not be reciprocal monophyly for a neutral nuclear marker. We used the observed values of theta and an estimated mutation rate for the nuclear marker gene to perform 1000 iterations of the simulation. The results of this simulation were inconclusive: the number of iterations with and without reciprocal monophyly of one or more clades are similar. Conclusions: We therefore conclude that the pattern exhibited by the common vampire bat, with marked geographical structure for a mitochondrial marker and no phylogeographic structure for nuclear markers is compatible with a historical scenario of complete isolation of refuge-like populations during the Pleistocene. The results on demographic history on this species is compatible with the Carnaval-Moritz model of Pleistocene vicariance, with demographic expansions in the southern Atlantic forest.
Resumo:
IntroductionMicrosporidia constitute the most common black fly pathogens, although the species' diversity, seasonal occurrence and transmission mechanisms remain poorly understood. Infections by this agent are often chronic and non-lethal, but they can cause reduced fecundity and decreased longevity. The objective of this study was to identify microsporidia infecting Simulium (Chirostilbia) pertinax (Kollar, 1832) larvae from Caraguatatuba, State of São Paulo, Brazil, by molecular and morphological characterization.MethodsLarvae were collected at a single point in a stream in a rural area of the city and were kept under artificial aeration until analysis. Polydispyrenia spp. infection was characterized by the presence of at least 32 mononuclear spores measuring 6.9 ± 1.0 × 5.0 ± 0.7µm in persistent sporophorous vesicles. Similarly, Amblyospora spp. were characterized by the presence of eight uninucleate spores measuring 4.5 × 3.5µm in sporophorous vesicles.ResultsThe molecular analysis confirmed the presence of microsporidian DNA in the 8 samples (prevalence of 0.51%). Six samples (Brazilian larvae) were related to Polydispyrenia simulii and Caudospora palustris reference sequences but in separate clusters. One sample was clustered with Amblyospora spp. Edhazardia aedis was the positive control taxon.ConclusionsSamples identified as Polydispyrenia spp. and Amblyospora spp. were grouped with P. simulii and Amblyospora spp., respectively, corroborating previous results. However, the 16S gene tree showed a considerable distance between the black fly-infecting Amblyospora spp. and the mosquito-infecting spp. This distance suggests that these two groups are not congeneric. Additional genomic region evaluation is necessary to obtain a coherent phylogeny for this group.
Resumo:
[Français] Une fraction importante des génomes eucaryotes est constituée de Gènes Répétés en Tandem (GRT). Un mécanisme fondamental dans l’évolution des GRT est la recombinaison inégale durant la méiose, entrainant la duplication locale (en tandem) de segments chromosomiques contenant un ou plusieurs gènes adjacents. Différents algorithmes ont été proposés pour inférer une histoire de duplication en tandem pour un cluster de GRT. Cependant, leur utilisation est limitée dans la pratique, car ils ne tiennent pas compte d’autres événements évolutifs pourtant fréquents, comme les inversions, les duplications inversées et les délétions. Cette thèse propose différentes approches algorithmiques permettant d’intégrer ces événements dans le modèle de duplication en tandem classique. Nos contributions sont les suivantes: • Intégrer les inversions dans un modèle de duplication en tandem simple (duplication d’un gène à la fois) et proposer un algorithme exact permettant de calculer le nombre minimal d’inversions s’étant produites dans l’évolution d’un cluster de GRT. • Généraliser ce modèle pour l’étude d’un ensemble de clusters orthologues dans plusieurs espèces. • Proposer un algorithme permettant d’inférer l’histoire évolutive d’un cluster de GRT en tenant compte des duplications en tandem, duplications inversées, inversions et délétions de segments chromosomiques contenant un ou plusieurs gènes adjacents.
Resumo:
Une réconciliation entre un arbre de gènes et un arbre d’espèces décrit une histoire d’évolution des gènes homologues en termes de duplications et pertes de gènes. Pour inférer une réconciliation pour un arbre de gènes et un arbre d’espèces, la parcimonie est généralement utilisée selon le nombre de duplications et/ou de pertes. Les modèles de réconciliation sont basés sur des critères probabilistes ou combinatoires. Le premier article définit un modèle combinatoire simple et général où les duplications et les pertes sont clairement identifiées et la réconciliation parcimonieuse n’est pas la seule considérée. Une architecture de toutes les réconciliations est définie et des algorithmes efficaces (soit de dénombrement, de génération aléatoire et d’exploration) sont développés pour étudier les propriétés combinatoires de l’espace de toutes les réconciliations ou seulement les plus parcimonieuses. Basée sur le processus classique nommé naissance-et-mort, un algorithme qui calcule la vraisemblance d’une réconciliation a récemment été proposé. Le deuxième article utilise cet algorithme avec les outils combinatoires décrits ci-haut pour calculer efficacement (soit approximativement ou exactement) les probabilités postérieures des réconciliations localisées dans le sous-espace considéré. Basé sur des taux réalistes (selon un modèle probabiliste) de duplication et de perte et sur des données réelles/simulées de familles de champignons, nos résultats suggèrent que la masse probabiliste de toute l’espace des réconciliations est principalement localisée autour des réconciliations parcimonieuses. Dans un contexte d’approximation de la probabilité d’une réconciliation, notre approche est une alternative intéressante face aux méthodes MCMC et peut être meilleure qu’une approche sophistiquée, efficace et exacte pour calculer la probabilité d’une réconciliation donnée. Le problème nommé Gene Tree Parsimony (GTP) est d’inférer un arbre d’espèces qui minimise le nombre de duplications et/ou de pertes pour un ensemble d’arbres de gènes. Basé sur une approche qui explore tout l’espace des arbres d’espèces pour les génomes considérés et un calcul efficace des coûts de réconciliation, le troisième article décrit un algorithme de Branch-and-Bound pour résoudre de façon exacte le problème GTP. Lorsque le nombre de taxa est trop grand, notre algorithme peut facilement considérer des relations prédéfinies entre ensembles de taxa. Nous avons testé notre algorithme sur des familles de gènes de 29 eucaryotes.
Resumo:
Les gènes sont les parties du génome qui codent pour les protéines. Les gènes d’une ou plusieurs espèces peuvent être regroupés en "familles", en fonction de leur similarité de séquence. Cependant, pour connaître les relations fonctionnelles entre ces copies de gènes, la similarité de séquence ne suffit pas. Pour cela, il est important d’étudier l’évolution d’une famille par duplications et pertes afin de pouvoir distinguer entre gènes orthologues, des copies ayant évolué par spéciation et susceptibles d’avoir conservé une fonction commune, et gènes paralogues, des copies ayant évolué par duplication qui ont probablement développé des nouvelles fonctions. Étant donnée une famille de gènes présents dans n espèces différentes, un arbre de gènes (obtenu par une méthode phylogénétique classique), et un arbre phylogénétique pour les n espèces, la "réconciliation" est l’approche la plus courante permettant d’inférer une histoire d’évolution de cette famille par duplications, spéciations et pertes. Le degré de confiance accordé à l’histoire inférée est directement relié au degré de confiance accordé à l’arbre de gènes lui-même. Il est donc important de disposer d’une méthode préliminaire de correction d’arbres de gènes. Ce travail introduit une méthodologie permettant de "corriger" un arbre de gènes : supprimer le minimum de feuilles "mal placées" afin d’obtenir un arbre dont les sommets de duplications (inférés par la réconciliation) sont tous des sommets de "duplications apparentes" et obtenir ainsi un arbre de gènes en "accord" avec la phylogénie des espèces. J’introduis un algorithme exact pour des arbres d’une certaine classe, et une heuristique pour le cas général.
Resumo:
In conventional phylogeographic studies, historical demographic processes are elucidated from the geographical distribution of individuals represented on an inferred gene tree. However, the interpretation of gene trees in this context can be difficult as the same demographic/geographical process can randomly lead to multiple different genealogies. Likewise, the same gene trees can arise under different demographic models. This problem has led to the emergence of many statistical methods for making phylogeographic inferences. A popular phylogeographic approach based on nested clade analysis is challenged by the fact that a certain amount of the interpretation of the data is left to the subjective choices of the user, and it has been argued that the method performs poorly in simulation studies. More rigorous statistical methods based on coalescence theory have been developed. However, these methods may also be challenged by computational problems or poor model choice. In this review, we will describe the development of statistical methods in phylogeographic analysis, and discuss some of the challenges facing these methods.
Resumo:
Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)
Resumo:
Toadlets of the genus Brachycephalus are endemic to the Atlantic rainforests of southeastern and southern Brazil. The 14 species currently described have snout-vent lengths less than 18. mm and are thought to have evolved through miniaturization: an evolutionary process leading to an extremely small adult body size. Here, we present the first comprehensive phylogenetic analysis for Brachycephalus, using a multilocus approach based on two nuclear (Rag-1 and Tyr) and three mitochondrial (Cyt b, 12S, and 16S rRNA) gene regions. Phylogenetic relationships were inferred using a partitioned Bayesian analysis of concatenated sequences and the hierarchical Bayesian method (BEST) that estimates species trees based on the multispecies coalescent model. Individual gene trees showed conflict and also varied in resolution. With the exception of the mitochondrial gene tree, no gene tree was completely resolved. The concatenated gene tree was completely resolved and is identical in topology and degree of statistical support to the individual mtDNA gene tree. On the other hand, the BEST species tree showed reduced significant node support relative to the concatenate tree and recovered a basal trichotomy, although some bipartitions were significantly supported at the tips of the species tree. Comparison of the log likelihoods for the concatenated and BEST trees suggests that the method implemented in BEST explains the multilocus data for Brachycephalus better than the Bayesian analysis of concatenated data. Landmark-based geometric morphometrics revealed marked variation in cranial shape between the species of Brachycephalus. In addition, a statistically significant association was demonstrated between variation in cranial shape and genetic distances estimated from the mtDNA and nuclear loci. Notably, B. ephippium and B. garbeana that are predicted to be sister-species in the individual and concatenated gene trees and the BEST species tree share an evolutionary novelty, the hyperossified dorsal plate. © 2011 Elsevier Inc.
Resumo:
Right whales carry large populations of three ‘whale lice’ (Cyamus ovalis, Cyamus gracilis, Cyamus erraticus) that have no other hosts. We used sequence variation in the mitochondrial COI gene to ask (i) whether cyamid population structures might reveal associations among right whale individuals and subpopulations, (ii) whether the divergences of the three nominally conspecific cyamid species on North Atlantic, North Pacific, and southern right whales (Eubalaena glacialis, Eubalaena japonica, Eubalaena australis) might indicate their times of separation, and (iii) whether the shapes of cyamid gene trees might contain information about changes in the population sizes of right whales. We found high levels of nucleotide diversity but almost no population structure within oceans, indicating large effective population sizes and high rates of transfer between whales and subpopulations. North Atlantic and Southern Ocean populations of all three species are reciprocally monophyletic, and North Pacific C. erraticus is well separated from North Atlantic and southern C. erraticus. Mitochondrial clock calibrations suggest that these divergences occurred around 6 million years ago (Ma), and that the Eubalaena mitochondrial clock is very slow. North Pacific C. ovalis forms a clade inside the southern C. ovalis gene tree, implying that at least one right whale has crossed the equator in the Pacific Ocean within the last 1–2 million years (Myr). Low-frequency polymorphisms are more common than expected under neutrality for populations of constant size, but there is no obvious signal of rapid, interspecifically congruent expansion of the kind that would be expected if North Atlantic or southern right whales had experienced a prolonged population bottleneck within the last 0.5 Myr.
Resumo:
Verrucosispora isolate AB-18-032(T), the abyssomicin- and proximicin-producing actinomycete, has chemotaxonomic and morphological properties consistent with its classification in the genus Verrucosispora. The organism formed a distinct phyletic line in the Verrucosispora 16S rRNA gene tree sharing similarities of 99.7%, 98.7% and 98.9% with Verrucosispora gifhornensis DSM 44337(T), Verrucosispora lutea YIM 013(T) and Verrucosispora sediminis MS 426(T), respectively. It was readily distinguished from the two latter species using a range of phenotypic features and from V. gifhornensis DSM 44337(T), its nearest phylogenetic neighbor, by a DNA G+C content of 65.5 mol% obtained by thermal denaturation and fluorometry and DNA:DNA relatedness values of 64.0% and 65.0% using renaturation and fluorometric methods, respectively. It is apparent from the combined genotypic and phenotypic data that strain AB-18-032(T) should be classified in the genus Verrucosispora as a new species. The name Verrucosispora maris sp. nov. is proposed for this taxon with isolate AB-18-032(T) (= DSM 45365(T) = NRRL B-24793(T)) as the type strain.
Resumo:
We describe a new species of Bothrops from Vitoria Island, off the coast of Sao Paulo, southeastern Brazil. The new species differs from the mainland coastal populations of B. jararaca mostly in its smaller and stouter body, number and form of scales, and hemipenial morphology. From B. insularis and B. alcatraz, both related species endemic to islands in southeastern Brazil, B. otavioi sp. nov. differs mainly in its body form and number of scales. The new species has the twist common mitochondrial haplotype for mainland populations of B. jararaca, which is also found in B. alcatraz. A mitochondrial genealogy (gene tree) shows the new species nested within the northern clade of B. jararaca. This genealogical pattern can be explained by a recent speciation event for B. otavioi sp. nov. The isolation of insular species of Bothrops from continental ancestor populations are probably related to the same vicariant process, the oscillations of sea level during the Pleistocene. The new species feeds on small hylid frogs, and attains sexual maturity at 388 mm snout-vent length (SVL; males) and 692 mm SVL (females). Bothrops facial sp. nov. is endemic to Vitoria Island, and should be listed as critically endangered because it is known from only a single area (an island), its geographic range covers less than 100 km(2), and there is a projected continuing decline in the quality of its habitat because of increasing human settlement.