75 resultados para Support vector machine
Resumo:
Several popular Machine Learning techniques are originally designed for the solution of two-class problems. However, several classification problems have more than two classes. One approach to deal with multiclass problems using binary classifiers is to decompose the multiclass problem into multiple binary sub-problems disposed in a binary tree. This approach requires a binary partition of the classes for each node of the tree, which defines the tree structure. This paper presents two algorithms to determine the tree structure taking into account information collected from the used dataset. This approach allows the tree structure to be determined automatically for any multiclass dataset.
Resumo:
Robotic mapping is the process of automatically constructing an environment representation using mobile robots. We address the problem of semantic mapping, which consists of using mobile robots to create maps that represent not only metric occupancy but also other properties of the environment. Specifically, we develop techniques to build maps that represent activity and navigability of the environment. Our approach to semantic mapping is to combine machine learning techniques with standard mapping algorithms. Supervised learning methods are used to automatically associate properties of space to the desired classification patterns. We present two methods, the first based on hidden Markov models and the second on support vector machines. Both approaches have been tested and experimentally validated in two problem domains: terrain mapping and activity-based mapping.
Resumo:
Recently, we have built a classification model that is capable of assigning a given sesquiterpene lactone (STL) into exactly one tribe of the plant family Asteraceae from which the STL has been isolated. Although many plant species are able to biosynthesize a set of peculiar compounds, the occurrence of the same secondary metabolites in more than one tribe of Asteraceae is frequent. Building on our previous work, in this paper, we explore the possibility of assigning an STL to more than one tribe (class) simultaneously. When an object may belong to more than one class simultaneously, it is called multilabeled. In this work, we present a general overview of the techniques available to examine multilabeled data. The problem of evaluating the performance of a multilabeled classifier is discussed. Two particular multilabeled classification methods-cross-training with support vector machines (ct-SVM) and multilabeled k-nearest neighbors (M-L-kNN)were applied to the classification of the STLs into seven tribes from the plant family Asteraceae. The results are compared to a single-label classification and are analyzed from a chemotaxonomic point of view. The multilabeled approach allowed us to (1) model the reality as closely as possible, (2) improve our understanding of the relationship between the secondary metabolite profiles of different Asteraceae tribes, and (3) significantly decrease the number of plant sources to be considered for finding a certain STL. The presented classification models are useful for the targeted collection of plants with the objective of finding plant sources of natural compounds that are biologically active or possess other specific properties of interest.
Resumo:
This paper proposes a filter-based algorithm for feature selection. The filter is based on the partitioning of the set of features into clusters. The number of clusters, and consequently the cardinality of the subset of selected features, is automatically estimated from data. The computational complexity of the proposed algorithm is also investigated. A variant of this filter that considers feature-class correlations is also proposed for classification problems. Empirical results involving ten datasets illustrate the performance of the developed algorithm, which in general has obtained competitive results in terms of classification accuracy when compared to state of the art algorithms that find clusters of features. We show that, if computational efficiency is an important issue, then the proposed filter May be preferred over their counterparts, thus becoming eligible to join a pool of feature selection algorithms to be used in practice. As an additional contribution of this work, a theoretical framework is used to formally analyze some properties of feature selection methods that rely on finding clusters of features. (C) 2011 Elsevier Inc. All rights reserved.
Dynamic Changes in the Mental Rotation Network Revealed by Pattern Recognition Analysis of fMRI Data
Resumo:
We investigated the temporal dynamics and changes in connectivity in the mental rotation network through the application of spatio-temporal support vector machines (SVMs). The spatio-temporal SVM [Mourao-Miranda, J., Friston, K. J., et al. (2007). Dynamic discrimination analysis: A spatial-temporal SVM. Neuroimage, 36, 88-99] is a pattern recognition approach that is suitable for investigating dynamic changes in the brain network during a complex mental task. It does not require a model describing each component of the task and the precise shape of the BOLD impulse response. By defining a time window including a cognitive event, one can use spatio-temporal fMRI observations from two cognitive states to train the SVM. During the training, the SVM finds the discriminating pattern between the two states and produces a discriminating weight vector encompassing both voxels and time (i.e., spatio-temporal maps). We showed that by applying spatio-temporal SVM to an event-related mental rotation experiment, it is possible to discriminate between different degrees of angular disparity (0 degrees vs. 20 degrees, 0 degrees vs. 60 degrees, and 0 degrees vs. 100 degrees), and the discrimination accuracy is correlated with the difference in angular disparity between the conditions. For the comparison with highest accuracy (08 vs. 1008), we evaluated how the most discriminating areas (visual regions, parietal regions, supplementary, and premotor areas) change their behavior over time. The frontal premotor regions became highly discriminating earlier than the superior parietal cortex. There seems to be a parcellation of the parietal regions with an earlier discrimination of the inferior parietal lobe in the mental rotation in relation to the superior parietal. The SVM also identified a network of regions that had a decrease in BOLD responses during the 100 degrees condition in relation to the 0 degrees condition (posterior cingulate, frontal, and superior temporal gyrus). This network was also highly discriminating between the two conditions. In addition, we investigated changes in functional connectivity between the most discriminating areas identified by the spatio-temporal SVM. We observed an increase in functional connectivity between almost all areas activated during the 100 degrees condition (bilateral inferior and superior parietal lobe, bilateral premotor area, and SMA) but not between the areas that showed a decrease in BOLD response during the 100 degrees condition.
Resumo:
Background: Cryptic species complexes are common among anophelines. Previous phylogenetic analysis based on the complete mtDNA COI gene sequences detected paraphyly in the Neotropical malaria vector Anopheles marajoara. The ""Folmer region"" detects a single taxon using a 3% divergence threshold. Methods: To test the paraphyletic hypothesis and examine the utility of the Folmer region, genealogical trees based on a concatenated (white + 3' COI sequences) dataset and pairwise differentiation of COI fragments were examined. The population structure and demographic history were based on partial COI sequences for 294 individuals from 14 localities in Amazonian Brazil. 109 individuals from 12 localities were sequenced for the nDNA white gene, and 57 individuals from 11 localities were sequenced for the ribosomal DNA (rDNA) internal transcribed spacer 2 (ITS2). Results: Distinct A. marajoara lineages were detected by combined genealogical analysis and were also supported among COI haplotypes using a median joining network and AMOVA, with time since divergence during the Pleistocene (< 100,000 ya). COI sequences at the 3' end were more variable, demonstrating significant pairwise differentiation (3.82%) compared to the more moderate 2.92% detected by the Folmer region. Lineage 1 was present in all localities, whereas lineage 2 was restricted mainly to the west. Mismatch distributions for both lineages were bimodal, likely due to multiple colonization events and spatial expansion (similar to 798 - 81,045 ya). There appears to be gene flow within, not between lineages, and a partial barrier was detected near Rio Jari in Amapa state, separating western and eastern populations. In contrast, both nDNA data sets (white gene sequences with or without the retention of the 4th intron, and ITS2 sequences and length) detected a single A. marajoara lineage. Conclusions: Strong support for combined data with significant differentiation detected in the COI and absent in the nDNA suggest that the divergence is recent, and detectable only by the faster evolving mtDNA. A within subgenus threshold of >2% may be more appropriate among sister taxa in cryptic anopheline complexes than the standard 3%. Differences in demographic history and climatic changes may have contributed to mtDNA lineage divergence in A. marajoara.
Resumo:
Species` potential distribution modelling consists of building a representation of the fundamental ecological requirements of a species from biotic and abiotic conditions where the species is known to occur. Such models can be valuable tools to understand the biogeography of species and to support the prediction of its presence/absence considering a particular environment scenario. This paper investigates the use of different supervised machine learning techniques to model the potential distribution of 35 plant species from Latin America. Each technique was able to extract a different representation of the relations between the environmental conditions and the distribution profile of the species. The experimental results highlight the good performance of random trees classifiers, indicating this particular technique as a promising candidate for modelling species` potential distribution. (C) 2010 Elsevier Ltd. All rights reserved.
Resumo:
The aim of this study was to assess the Knoop hardness of three high viscous glass ionomer cements: G1 - Ketac Molar; G2 - Ketac Molar Easymix (3M ESPE) and G3 - Magic Glass ART (Vigodent). As a parallel goal, three different methods for insertion of Ketac Molar Easymix were tested: G4 - conventional spatula; G5 - commercial syringe (Centrix) and G6 - low-cost syringe. Ten specimens of each group were prepared and the Knoop hardness was determined 5 times on each specimen with a HM-124 hardness machine (25 g/30 s dwell time) after 24 h, 1 and 2 weeks. During the entire test period, the specimens were stored in liquid paraffin at 37ºC. Significant differences were found between G3 and G1/G2 (two-way ANOVA and Tukey's post hoc test; p<0.01). There was no significant difference in the results among the multiple ways of insertion. The glass ionomer cement Magic Glass ART showed the lowest hardness, while the insertion technique had no significant influence on hardness.
Resumo:
Phylogenetic relationships among species of the Myzorhynchella Section of Anopheles (Nyssorhynchus) were investigated using the nuclear ribosomal DNA second internal transcribed spacer (ITS2), the nuclear whitegene and mitochondrial cytochrome oxidase subunit I (COI) regions. The recently described Anopheles pristinus and resurrected Anopheles guarani were also included in the study. Bayesian phylogenetic analyses found Anopheles parvus to be the most distantly related species within the Section, a finding that is consistent with morphology. An. pristinus and An. guarani were clearly resolved from Anopheles antunesi and Anopheles lutzii, respectively. An. lutzii collected in the same mountain range as the type locality were found within a strongly supported clade, whereas individuals from the southern state of Rio Grande do Sul, tentatively identified as An. lutzii based on adult female external morphology, were distinct from An. lutzii, An. antunesi and from each other, and may therefore represent two new sympatric species. A more detailed examination of An. lutzii sensu latoalong its known geographic range is recommended to resolve these anomalous relationships.
Resumo:
Every year, autochthonous cases of Plasmodium vivax malaria occur in low-endemicity areas of Vale do Ribeira in the south-eastern part of the Atlantic Forest, state of São Paulo, where Anopheles cruzii and Anopheles bellator are considered the primary vectors. However, other species in the subgenus Nyssorhynchus of Anopheles (e.g., Anopheles marajoara) are abundant and may participate in the dynamics of malarial transmission in that region. The objectives of the present study were to assess the spatial distribution of An. cruzii, An. bellator and An. marajoara and to associate the presence of these species with malaria cases in the municipalities of the Vale do Ribeira. Potential habitat suitability modelling was applied to determine both the spatial distribution of An. cruzii, An. bellator and An. marajoara and to establish the density of each species. Poisson regression was utilized to associate malaria cases with estimated vector densities. As a result, An. cruzii was correlated with the forested slopes of the Serra do Mar, An. bellator with the coastal plain and An. marajoara with the deforested areas. Moreover, both An. marajoara and An. cruzii were positively associated with malaria cases. Considering that An. marajoara was demonstrated to be a primary vector of human Plasmodium in the rural areas of the state of Amapá, more attention should be given to the species in the deforested areas of the Atlantic Forest, where it might be a secondary vector.
Resumo:
This work proposes a new approach using a committee machine of artificial neural networks to classify masses found in mammograms as benign or malignant. Three shape factors, three edge-sharpness measures, and 14 texture measures are used for the classification of 20 regions of interest (ROIs) related to malignant tumors and 37 ROIs related to benign masses. A group of multilayer perceptrons (MLPs) is employed as a committee machine of neural network classifiers. The classification results are reached by combining the responses of the individual classifiers. Experiments involving changes in the learning algorithm of the committee machine are conducted. The classification accuracy is evaluated using the area A. under the receiver operating characteristics (ROC) curve. The A, result for the committee machine is compared with the A, results obtained using MLPs and single-layer perceptrons (SLPs), as well as a linear discriminant analysis (LDA) classifier Tests are carried out using the student's t-distribution. The committee machine classifier outperforms the MLP SLP, and LDA classifiers in the following cases: with the shape measure of spiculation index, the A, values of the four methods are, in order 0.93, 0.84, 0.75, and 0.76; and with the edge-sharpness measure of acutance, the values are 0.79, 0.70, 0.69, and 0.74. Although the features with which improvement is obtained with the committee machines are not the same as those that provided the maximal value of A(z) (A(z) = 0.99 with some shape features, with or without the committee machine), they correspond to features that are not critically dependent on the accuracy of the boundaries of the masses, which is an important result. (c) 2008 SPIE and IS&T.
Resumo:
Due to both the widespread and multipurpose use of document images and the current availability of a high number of document images repositories, robust information retrieval mechanisms and systems have been increasingly demanded. This paper presents an approach to support the automatic generation of relationships among document images by exploiting Latent Semantic Indexing (LSI) and Optical Character Recognition (OCR). We developed the LinkDI (Linking of Document Images) service, which extracts and indexes document images content, computes its latent semantics, and defines relationships among images as hyperlinks. LinkDI was experimented with document images repositories, and its performance was evaluated by comparing the quality of the relationships created among textual documents as well as among their respective document images. Considering those same document images, we ran further experiments in order to compare the performance of LinkDI when it exploits or not the LSI technique. Experimental results showed that LSI can mitigate the effects of usual OCR misrecognition, which reinforces the feasibility of LinkDI relating OCR output with high degradation.
Resumo:
We present the first spin alignment measurements for the K*(0)(892) and phi(1020) vector mesons produced at midrapidity with transverse momenta up to 5 GeV/c at root s(NN) = 200 GeV at RHIC. The diagonal spin-density matrix elements with respect to the reaction plane in Au+Au collisions are rho(00) = 0.32 +/- 0.04 (stat) +/- 0.09 (syst) for the K*(0) (0.8 < p(T) < 5.0 GeV/c) and rho(00) = 0.34 +/- 0.02 (stat) +/- 0.03 (syst) for the phi (0.4 < p(T) < 5.0 GeV/c) and are constant with transverse momentum and collision centrality. The data are consistent with the unpolarized expectation of 1/3 and thus no evidence is found for the transfer of the orbital angular momentum of the colliding system to the vector-meson spins. Spin alignments for K(*0) and phi in Au+Au collisions were also measured with respect to the particle's production plane. The phi result, rho(00) = 0.41 +/- 0.02 (stat) +/- 0.04 (syst), is consistent with that in p+p collisions, rho(00) = 0.39 +/- 0.03 (stat) +/- 0.06 (syst), also measured in this work. The measurements thus constrain the possible size of polarization phenomena in the production dynamics of vector mesons.
Resumo:
Background: In a number of malaria endemic regions, tourists and travellers face a declining risk of travel associated malaria, in part due to successful malaria control. Many millions of visitors to these regions are recommended, via national and international policy, to use chemoprophylaxis which has a well recognized morbidity profile. To evaluate whether current malaria chemo-prophylactic policy for travellers is cost effective when adjusted for endemic transmission risk and duration of exposure. a framework, based on partial cost-benefit analysis was used Methods: Using a three component model combining a probability component, a cost component and a malaria risk component, the study estimated health costs avoided through use of chemoprophylaxis and costs of disease prevention (including adverse events and pre-travel advice for visits to five popular high and low malaria endemic regions) and malaria transmission risk using imported malaria cases and numbers of travellers to malarious countries. By calculating the minimal threshold malaria risk below which the economic costs of chemoprophylaxis are greater than the avoided health costs we were able to identify the point at which chemoprophylaxis would be economically rational. Results: The threshold incidence at which malaria chemoprophylaxis policy becomes cost effective for UK travellers is an accumulated risk of 1.13% assuming a given set of cost parameters. The period a travellers need to remain exposed to achieve this accumulated risk varied from 30 to more than 365 days, depending on the regions intensity of malaria transmission. Conclusions: The cost-benefit analysis identified that chemoprophylaxis use was not a cost-effective policy for travellers to Thailand or the Amazon region of Brazil, but was cost-effective for travel to West Africa and for those staying longer than 45 days in India and Indonesia.
Resumo:
Yellow fever virus (YFV) was isolated from Haemagogus leucocelaenus mosquitoes during an epizootic in 2001 in the Rio Grande do Sul State in southern Brazil In October 2008 a yellow fever outbreak was reported there with nonhuman primate deaths and human cases This latter outbreak led to intensification of surveillance measures for early detection of YFV and support for vaccination programs We report entomologic surveillance in 2 municipalities that recorded nonhuman primate deaths Mosquitoes were collected at ground level identified and processed for virus isolation and molecular analyses Eight YFV strains were isolated (7 from pools of Hg leucocelaenus mosquitoes and another from Aedes serratus mosquitoes) 6 were sequenced and they grouped in the YFV South American genotype I The results confirmed the role of Hg leucocelaenus mosquitoes as the main YFV vector in southern Brazil and suggest that Ae serratus mosquitoes may have a potential role as a secondary vector