913 resultados para Tree Alignment
Resumo:
The starting point of this article is the question "How to retrieve fingerprints of rhythm in written texts?" We address this problem in the case of Brazilian and European Portuguese. These two dialects of Modern Portuguese share the same lexicon and most of the sentences they produce are superficially identical. Yet they are conjectured, on linguistic grounds, to implement different rhythms. We show that this linguistic question can be formulated as a problem of model selection in the class of variable length Markov chains. To carry on this approach, we compare texts from European and Brazilian Portuguese. These texts are previously encoded according to some basic rhythmic features of the sentences which can be automatically retrieved. This is an entirely new approach from the linguistic point of view. Our statistical contribution is the introduction of the smallest maximizer criterion which is a constant free procedure for model selection. As a by-product, this provides a solution for the problem of optimal choice of the penalty constant when using the BIC to select a variable length Markov chain. Besides proving the consistency of the smallest maximizer criterion when the sample size diverges, we also make a simulation study comparing our approach with both the standard BIC selection and the Peres-Shields order estimation. Applied to the linguistic sample constituted for our case study, the smallest maximizer criterion assigns different context-tree models to the two dialects of Portuguese. The features of the selected models are compatible with current conjectures discussed in the linguistic literature.
Resumo:
The circumscription of genera belonging to tribe Bignonieae (Bignoniaceae) has traditionally been complex, with only a few genera having stable circumscriptions in the various classification systems proposed for the tribe. The genus Lundia, for instance, is well characterized by a series of morphological synapomorphies and its circumscription has remained quite stable throughout its history. Despite the stable circumscription of Lundia, the circumscription of species within the genus has remained problematic. This study aims to reconstruct the phylogeny of Lundia in order to refine species circumscriptions, gain a better understanding of relationships between taxa, and identify potential morphological synapomorphies for species and major clades. We sampled 26 accessions representing 13 species of Lundia, and 5 outgroups, and reconstructed the phylogeny of the genus using a chloroplast (ndhF) and a nuclear marker (PepC). Data derived from sequences of the individual loci were analyzed using parsimony and Bayesian inference, and the combined molecular dataset was analyzed with Bayesian methods. The monophyly of Lundia nitidula, a species with a particularly complex circumscription, was tested using Shimodaira-Hasegawa (SH) test and the approximately unbiased test for phylogenetic tree selection (AU test). In addition, 40 morphological characters were mapped onto the tree that resulted from the analysis of the combined molecular dataset in order to identify morphological synapomorphies of individual species and major clades. Lundia and most species currently recognized within the genus were strongly supported as monophyletic in all analyses. One species, Lundia nitidula, was not resolved as monophyletic, but the monophyly of this species was not rejected by the AU and SH tests. Lundia sect. Eriolundia is resolved as paraphyletic in all analyses, while Lundia sect. Eulundia is monophyletic and supported by the same morphological characters traditionally used to circumscribe this section. The phylogeny of Lundia contributed important information for a better circumscription of species and served as basis the taxonomic revision of the genus.
Resumo:
The gecko genus Phyllopezus occurs across South America's open biomes: Cerrado, Seasonally Dry Tropical Forests (SDTF, including Caatinga), and Chaco. We generated a multi-gene dataset and estimated phylogenetic relationships among described Phyllopezus taxa and related species. We included exemplars from both described Phyllopezus pollicaris subspecies, P. p. pollicaris and P. p. przewalskii. Phylogenies from the concatenated data as well as species trees constructed from individual gene trees were largely congruent. All phylogeny reconstruction methods showed Bogertia lutzae as the sister species of Phyllopezus maranjonensis, rendering Phyllopezus paraphyletic. We synonymized the monotypic genus Bogertia with Phyllopezus to maintain a taxonomy that is isomorphic with phylogenetic history. We recovered multiple, deeply divergent, cryptic lineages within P. pollicaris. These cryptic lineages possessed mtDNA distances equivalent to distances among other gekkotan sister taxa. Described P. pollicaris subspecies are not reciprocally monophyletic and current subspecific taxonomy does not accurately reflect evolutionary relationships among cryptic lineages. We highlight the conservation significance of these results in light of the ongoing habitat loss in South America's open biomes. (C) 2011 Elsevier Inc. All rights reserved.
Resumo:
This paper presents a survey of evolutionary algorithms that are designed for decision-tree induction. In this context, most of the paper focuses on approaches that evolve decision trees as an alternate heuristics to the traditional top-down divide-and-conquer approach. Additionally, we present some alternative methods that make use of evolutionary algorithms to improve particular components of decision-tree classifiers. The paper's original contributions are the following. First, it provides an up-to-date overview that is fully focused on evolutionary algorithms and decision trees and does not concentrate on any specific evolutionary approach. Second, it provides a taxonomy, which addresses works that evolve decision trees and works that design decision-tree components by the use of evolutionary algorithms. Finally, a number of references are provided that describe applications of evolutionary algorithms for decision-tree induction in different domains. At the end of this paper, we address some important issues and open questions that can be the subject of future research.
Resumo:
Two new species of Gastrotheca are described from northeastern Minas Gerais and southern Bahia, in the Atlantic Forest of Brazil. Data on morphology, calls, mitochondrial, and nuclear DNA are provided. Allied to G. fissipes and G. megacephala, the new taxa provide evidence for a higher diversity of species of Gastrotheca than previously thought at the Atlantic Forest. The data also suggest that G. pulchra, another Atlantic Forest taxon, is more closely related to non-Atlantic Forest species than to the remaining analyzed Brazilian Gastrotheca species. This implies that the Gastrotheca at the Brazilian coastal forests have at least two independent origins.
Resumo:
Background: This paper addresses the prediction of the free energy of binding of a drug candidate with enzyme InhA associated with Mycobacterium tuberculosis. This problem is found within rational drug design, where interactions between drug candidates and target proteins are verified through molecular docking simulations. In this application, it is important not only to correctly predict the free energy of binding, but also to provide a comprehensible model that could be validated by a domain specialist. Decision-tree induction algorithms have been successfully used in drug-design related applications, specially considering that decision trees are simple to understand, interpret, and validate. There are several decision-tree induction algorithms available for general-use, but each one has a bias that makes it more suitable for a particular data distribution. In this article, we propose and investigate the automatic design of decision-tree induction algorithms tailored to particular drug-enzyme binding data sets. We investigate the performance of our new method for evaluating binding conformations of different drug candidates to InhA, and we analyze our findings with respect to decision tree accuracy, comprehensibility, and biological relevance. Results: The empirical analysis indicates that our method is capable of automatically generating decision-tree induction algorithms that significantly outperform the traditional C4.5 algorithm with respect to both accuracy and comprehensibility. In addition, we provide the biological interpretation of the rules generated by our approach, reinforcing the importance of comprehensible predictive models in this particular bioinformatics application. Conclusions: We conclude that automatically designing a decision-tree algorithm tailored to molecular docking data is a promising alternative for the prediction of the free energy from the binding of a drug candidate with a flexible-receptor.
Resumo:
Fungi are disease-causing agents in plants and affect crops of economic importance. One control method is to induce resistance in the host by using biological control with hypovirulent phytopathogenic fungi. Here, we report the detection of a mycovirus in a strain of Colletotrichum gloeosporioides causing anthracnose of cashew tree. The strain C. gloeosporioides URM 4903 was isolated from a cashew tree (Anacardium occidentale) in Igarassu, PE, Brazil. After nucleic acid extraction and electrophoresis, the band corresponding to a possible double-stranded RNA (dsRNA) was purified by cellulose column chromatography. Nine extrachromosomal bands were obtained. Enzymatic digestion with DNAse I and Nuclease S1 had no effect on these bands, indicating their dsRNA nature. Transmission electron microscopic examination of extracts from this strain showed the presence of isometric particles (30-35 nm in diameter). These data strongly suggest the infection of this C. gloeosporioides strain by a dsRNA mycovirus. Once the hypovirulence of this strain is confirmed, the strain may be used for the biological control of cashew anthracnose.
Resumo:
For many tree species, mating system analyses have indicated potential variations in the selfing rate and paternity correlation among fruits within individuals, among individuals within populations, among populations, and from one flowering event to another. In this study, we used eight microsatellite markers to investigate mating systems at two hierarchical levels (fruits within individuals and individuals within populations) for the insect pollinated Neotropical tree Tabebuia roseo-alba. We found that T. roseo-alba has a mixed mating system with predominantly outcrossed mating. The outcrossing rates at the population level were similar across two T. roseo-alba populations; however, the rates varied considerably among individuals within populations. The correlated paternity results at different hierarchical levels showed that there is a high probability of shared paternal parentage when comparing seeds within fruits and among fruits within plants and full-sibs occur in much higher proportion within fruits than among fruits. Significant levels of fixation index were found in both populations and biparental inbreeding is believed to be the main cause of the observed inbreeding. The number of pollen donors contributing to mating was low. Furthermore, open-pollinated seeds varied according to relatedness, including half-sibs, full-sibs, self-sibs and self- half-sibs. In both populations, the effective population size within a family (seed-tree and its offspring) was lower than expected for panmictic populations. Thus, seeds for ex situ conservation genetics, progeny tests and reforestation must be collected from a large number of seed-trees to guarantee an adequate effective population in the sample.
Resumo:
A recent review of the homology concept in cladistics is critiqued in light of the historical literature. Homology as a notion relevant to the recognition of clades remains equivalent to synapomorphy. Some symplesiomorphies are homologies inasmuch as they represent synapomorphies of more inclusive taxa; others are complementary character states that do not imply any shared evolutionary history among the taxa that exhibit the state. Undirected character-state change (as characters optimized on an unrooted tree) is a necessary but not sufficient test of homology, because the addition of a root may alter parsimonious reconstructions. Primary and secondary homology are defended as realistic representations of discovery procedures in comparative biology, recognizable even in Direct Optimization. The epistemological relationship between homology as evidence and common ancestry as explanation is again emphasized. An alternative definition of homology is proposed. (c) The Willi Hennig Society 2012.
Resumo:
Background: Tuberculosis (TB) remains a public health issue worldwide. The lack of specific clinical symptoms to diagnose TB makes the correct decision to admit patients to respiratory isolation a difficult task for the clinician. Isolation of patients without the disease is common and increases health costs. Decision models for the diagnosis of TB in patients attending hospitals can increase the quality of care and decrease costs, without the risk of hospital transmission. We present a predictive model for predicting pulmonary TB in hospitalized patients in a high prevalence area in order to contribute to a more rational use of isolation rooms without increasing the risk of transmission. Methods: Cross sectional study of patients admitted to CFFH from March 2003 to December 2004. A classification and regression tree (CART) model was generated and validated. The area under the ROC curve (AUC), sensitivity, specificity, positive and negative predictive values were used to evaluate the performance of model. Validation of the model was performed with a different sample of patients admitted to the same hospital from January to December 2005. Results: We studied 290 patients admitted with clinical suspicion of TB. Diagnosis was confirmed in 26.5% of them. Pulmonary TB was present in 83.7% of the patients with TB (62.3% with positive sputum smear) and HIV/AIDS was present in 56.9% of patients. The validated CART model showed sensitivity, specificity, positive predictive value and negative predictive value of 60.00%, 76.16%, 33.33%, and 90.55%, respectively. The AUC was 79.70%. Conclusions: The CART model developed for these hospitalized patients with clinical suspicion of TB had fair to good predictive performance for pulmonary TB. The most important variable for prediction of TB diagnosis was chest radiograph results. Prospective validation is still necessary, but our model offer an alternative for decision making in whether to isolate patients with clinical suspicion of TB in tertiary health facilities in countries with limited resources.
Resumo:
We tested the early performance of 16 native early-, mid-, and late-successional tree species in response to four intensities of grass removal in an abandoned cattle pasture dominated by the introduced, invasive African grass, Cynodon plectostachyus, within the Lacandon rainforest region, southeast Mexico. The increase in grass removals significantly improved the performance of many species, especially of early-and mid-successional species, while performance of late-successional species was relatively poor and did not differ significantly among treatments. Good site preparation and at least one additional grass removal four months after seedling transplant were found to be essential; additional grass removals led to improved significantly performance of saplings in most cases. In order to evaluate the potential of transplanting tree seedlings successfully in abandoned tropical pastures, we developed a "planting risk index", combining field performance measurements and plantation cost estimations. Our results showed a great potential for establishing restoration plantings with many early-and mid-successional species. Although planting risk of late-successional species was considered high, certain species showed some possibilities of acclimation after 18 months and should be considered in future plantation arrangements in view of their long-term contributions to biodiversity maintenance and also to human welfare through delivery of ecosystem services. Conducting a planting risk analysis can help avoid failure of restoration strategies involving simultaneous planting of early-, mid-, and late-successional tree species. This in turn will improve cost-effectiveness of initial interventions in large-scale, long-term restoration programs.
Resumo:
Invasive species are known to affect native species in a variety of ways, but the effect of acoustic invaders has not been examined previously. We simulated an invasion of the acoustic niche by exposing calling native male white-banded tree frogs (Hypsiboas albomarginatus) to recorded invasive American bullfrog (Lithobates catesbeianus) calls. In response, tree frogs immediately shifted calls to significantly higher frequencies. In the post-stimulus period, they continued to use higher frequencies while also decreasing signal duration. Acoustic signals are the primary basis of mate selection in many anurans, suggesting that such changes could negatively affect the reproductive success of native species. The effects of bullfrog vocalizations on acoustic communities are expected to be especially severe due to their broad frequency band, which masks the calls of multiple species simultaneously.
Resumo:
This study extends the current knowledge regarding the use of plants for the passive accumulation of anthropogenic PAHs that are present in the atmospheric total suspended particles (TSP) in the tropics and sub-tropics. It is of major relevance because the anthropic emissions of TSP containing PAHs are significant in these regions, but their monitoring is still scarce. We compared the biomonitor efficiency of Lolium multiflorum 'Lema' and tropical tree species (Tibouchina pukka and Psidium guajava 'Paluma') that were growing in an intensely TSP-polluted site in Cubatao (SE Brazil), and established the species with the highest potential for alternative monitoring of PAHs. PAHs present in the TSP indicated that the region is impacted by various emission sources. L. multiflorum showed a greater efficiency for the accumulation of PAH compounds on their leaves than the tropical trees. The linear regression between the logBCF and logKoa revealed that L. multiflorum is an efficient biomonitor of the profile of light and heavy PAHs present in the particulate phase of the atmosphere during dry weather and mild temperatures. The grass should be used only for indicating the PAHs with higher molecular weight in warmer and wetter periods. (C) 2012 Elsevier Inc. All rights reserved.
Resumo:
XML similarity evaluation has become a central issue in the database and information communities, its applications ranging over document clustering, version control, data integration and ranked retrieval. Various algorithms for comparing hierarchically structured data, XML documents in particular, have been proposed in the literature. Most of them make use of techniques for finding the edit distance between tree structures, XML documents being commonly modeled as Ordered Labeled Trees. Yet, a thorough investigation of current approaches led us to identify several similarity aspects, i.e., sub-tree related structural and semantic similarities, which are not sufficiently addressed while comparing XML documents. In this paper, we provide an integrated and fine-grained comparison framework to deal with both structural and semantic similarities in XML documents (detecting the occurrences and repetitions of structurally and semantically similar sub-trees), and to allow the end-user to adjust the comparison process according to her requirements. Our framework consists of four main modules for (i) discovering the structural commonalities between sub-trees, (ii) identifying sub-tree semantic resemblances, (iii) computing tree-based edit operations costs, and (iv) computing tree edit distance. Experimental results demonstrate higher comparison accuracy with respect to alternative methods, while timing experiments reflect the impact of semantic similarity on overall system performance.
Resumo:
PURPOSE: To compare the role of transitory latex and sylastic (R) implants in tympanoplasty on the closure of tympanic perforations. METHODS: A randomized double-blind prospective study was conducted on 107 patients with chronic otitis media submitted to underlay tympanoplasty and divided at random into three groups: control with no transitory implant, latex membrane group, and sylastic (R) membrane group. RESULTS: Greater graft vascularization occurred in the latex membrane group (p<0.05). Good biocompatibility was obtained with the use of the latex and silicone implants, with no effect on the occurrence of infection, otorrhea or otorragy. CONCLUSION: The use of a transitory latex implant induced greater graft vascularization, with a biocompatible interaction with the tissue of the human tympanic membrane.