41 resultados para Support Vector Machines and Naive Bayes Classifier


Relevância:

100.00% 100.00%

Publicador:

Resumo:

Several real problems involve the classification of data into categories or classes. Given a data set containing data whose classes are known, Machine Learning algorithms can be employed for the induction of a classifier able to predict the class of new data from the same domain, performing the desired discrimination. Some learning techniques are originally conceived for the solution of problems with only two classes, also named binary classification problems. However, many problems require the discrimination of examples into more than two categories or classes. This paper presents a survey on the main strategies for the generalization of binary classifiers to problems with more than two classes, known as multiclass classification problems. The focus is on strategies that decompose the original multiclass problem into multiple binary subtasks, whose outputs are combined to obtain the final prediction.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Support vector machines (SVMs) were originally formulated for the solution of binary classification problems. In multiclass problems, a decomposition approach is often employed, in which the multiclass problem is divided into multiple binary subproblems, whose results are combined. Generally, the performance of SVM classifiers is affected by the selection of values for their parameters. This paper investigates the use of genetic algorithms (GAs) to tune the parameters of the binary SVMs in common multiclass decompositions. The developed GA may search for a set of parameter values common to all binary classifiers or for differentiated values for each binary classifier. (C) 2008 Elsevier B.V. All rights reserved.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Robotic mapping is the process of automatically constructing an environment representation using mobile robots. We address the problem of semantic mapping, which consists of using mobile robots to create maps that represent not only metric occupancy but also other properties of the environment. Specifically, we develop techniques to build maps that represent activity and navigability of the environment. Our approach to semantic mapping is to combine machine learning techniques with standard mapping algorithms. Supervised learning methods are used to automatically associate properties of space to the desired classification patterns. We present two methods, the first based on hidden Markov models and the second on support vector machines. Both approaches have been tested and experimentally validated in two problem domains: terrain mapping and activity-based mapping.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

We investigated the temporal dynamics and changes in connectivity in the mental rotation network through the application of spatio-temporal support vector machines (SVMs). The spatio-temporal SVM [Mourao-Miranda, J., Friston, K. J., et al. (2007). Dynamic discrimination analysis: A spatial-temporal SVM. Neuroimage, 36, 88-99] is a pattern recognition approach that is suitable for investigating dynamic changes in the brain network during a complex mental task. It does not require a model describing each component of the task and the precise shape of the BOLD impulse response. By defining a time window including a cognitive event, one can use spatio-temporal fMRI observations from two cognitive states to train the SVM. During the training, the SVM finds the discriminating pattern between the two states and produces a discriminating weight vector encompassing both voxels and time (i.e., spatio-temporal maps). We showed that by applying spatio-temporal SVM to an event-related mental rotation experiment, it is possible to discriminate between different degrees of angular disparity (0 degrees vs. 20 degrees, 0 degrees vs. 60 degrees, and 0 degrees vs. 100 degrees), and the discrimination accuracy is correlated with the difference in angular disparity between the conditions. For the comparison with highest accuracy (08 vs. 1008), we evaluated how the most discriminating areas (visual regions, parietal regions, supplementary, and premotor areas) change their behavior over time. The frontal premotor regions became highly discriminating earlier than the superior parietal cortex. There seems to be a parcellation of the parietal regions with an earlier discrimination of the inferior parietal lobe in the mental rotation in relation to the superior parietal. The SVM also identified a network of regions that had a decrease in BOLD responses during the 100 degrees condition in relation to the 0 degrees condition (posterior cingulate, frontal, and superior temporal gyrus). This network was also highly discriminating between the two conditions. In addition, we investigated changes in functional connectivity between the most discriminating areas identified by the spatio-temporal SVM. We observed an increase in functional connectivity between almost all areas activated during the 100 degrees condition (bilateral inferior and superior parietal lobe, bilateral premotor area, and SMA) but not between the areas that showed a decrease in BOLD response during the 100 degrees condition.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Several popular Machine Learning techniques are originally designed for the solution of two-class problems. However, several classification problems have more than two classes. One approach to deal with multiclass problems using binary classifiers is to decompose the multiclass problem into multiple binary sub-problems disposed in a binary tree. This approach requires a binary partition of the classes for each node of the tree, which defines the tree structure. This paper presents two algorithms to determine the tree structure taking into account information collected from the used dataset. This approach allows the tree structure to be determined automatically for any multiclass dataset.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Introduction: Internet users are increasingly using the worldwide web to search for information relating to their health. This situation makes it necessary to create specialized tools capable of supporting users in their searches. Objective: To apply and compare strategies that were developed to investigate the use of the Portuguese version of Medical Subject Headings (MeSH) for constructing an automated classifier for Brazilian Portuguese-language web-based content within or outside of the field of healthcare, focusing on the lay public. Methods: 3658 Brazilian web pages were used to train the classifier and 606 Brazilian web pages were used to validate it. The strategies proposed were constructed using content-based vector methods for text classification, such that Naive Bayes was used for the task of classifying vector patterns with characteristics obtained through the proposed strategies. Results: A strategy named InDeCS was developed specifically to adapt MeSH for the problem that was put forward. This approach achieved better accuracy for this pattern classification task (0.94 sensitivity, specificity and area under the ROC curve). Conclusions: Because of the significant results achieved by InDeCS, this tool has been successfully applied to the Brazilian healthcare search portal known as Busca Saude. Furthermore, it could be shown that MeSH presents important results when used for the task of classifying web-based content focusing on the lay public. It was also possible to show from this study that MeSH was able to map out mutable non-deterministic characteristics of the web. (c) 2010 Elsevier Inc. All rights reserved.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Background: Reactivation of p53 by either gene transfer or pharmacologic approaches may compensate for loss of p19Arf or excess mdm2 expression, common events in melanoma and glioma. In our previous work, we constructed the pCLPG retroviral vector where transgene expression is controlled by p53 through a p53-responsive promoter. The use of this vector to introduce p19Arf into tumor cells that harbor p53wt should yield viral expression of p19Arf which, in turn, would activate the endogenous p53 and result in enhanced vector expression and tumor suppression. Since nutlin-3 can activate p53 by blocking its interaction with mdm2, we explored the possibility that the combination of p19Arf gene transfer and nutlin-3 drug treatment may provide an additive benefit in stimulating p53 function. Methods: B16 (mouse melanoma) and C6 (rat glioma) cell lines, which harbor p53wt, were transduced with pCLPGp19 and these were additionally treated with nutlin-3 or the DNA damaging agent, doxorubicin. Viral expression was confirmed by Western, Northern and immunofluorescence assays. p53 function was assessed by reporter gene activity provided by a p53-responsive construct. Alterations in proliferation and viability were measured by colony formation, growth curve, cell cycle and MTT assays. In an animal model, B16 cells were treated with the pCLPGp19 virus and/or drugs before subcutaneous injection in C57BL/6 mice, observation of tumor progression and histopathologic analyses. Results: Here we show that the functional activation of endogenous p53wt in B16 was particularly challenging, but accomplished when combined gene transfer and drug treatments were applied, resulting in increased transactivation by p53, marked cell cycle alteration and reduced viability in culture. In an animal model, B16 cells treated with both p19Arf and nutlin-3 yielded increased necrosis and decreased BrdU marking. In comparison, C6 cells were quite susceptible to either treatment, yet p53 was further activated by the combination of p19Arf and nutlin-3. Conclusions: To the best of our knowledge, this is the first study to apply both p19Arf and nutlin-3 for the stimulation of p53 activity. These results support the notion that a p53 responsive vector may prove to be an interesting gene transfer tool, especially when combined with p53- activating agents, for the treatment of tumors that retain wild-type p53.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

This study evaluated two different support materials (polystyrene and expanded clay) for biohydrogen production in an anaerobic fluidized bed reactor (AFBR) treating synthetic wastewater containing glucose (4000 mg L(-1)). The AFBRs contained either polystyrene (R1) or expanded clay (R2) as support materials were inoculated with thermally pre-treated anaerobic sludge and operated at a temperature of 30 degrees C and a pH of approximately 5.5. The AFBRs were operated with a range of hydraulic retention times (HRTs) between 1 and 8 h. For R1 with an HRT of 2 h, the maximum hydrogen yield (HY) was 1.90 mol H(2) mol(-1) glucose, with 0.805 mg of biomass (as total volatile solids, or TVS) attached to each g of polystyrene. For R2 operated at an HRT of 2 h, the maximum HY was 2.59 mol H(2) moll glucose, with 1.100 mg of attached biomass (as TVS) g(-1) expanded clay. The highest hydrogen production rates (HPR) were 0.95 and 1.21 L h(-1) L(-1) for R1 and R2, respectively, using an HRT of 1 h. The H(2) content increased from 16-47% for R1 and from 22-51% for R2. No methane was detected in the biogas produced throughout the period of AFBR operation. These results show that the values of HY, HPR, H(2) content, and g of attached biomass g(-1) support material were all higher for AFBRs containing expanded clay than for reactors containing polystyrene. (C) 2010 Professor T. Nejat Veziroglu. Published by Elsevier Ltd. All rights reserved.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Here, we examine morphological changes in cortical thickness of patients with Alzheimer`s disease (AD) using image analysis algorithms for brain structure segmentation and study automatic classification of AD patients using cortical and volumetric data. Cortical thickness of AD patients (n = 14) was measured using MRI cortical surface-based analysis and compared with healthy subjects (n = 20). Data was analyzed using an automated algorithm for tissue segmentation and classification. A Support Vector Machine (SVM) was applied over the volumetric measurements of subcortical and cortical structures to separate AD patients from controls. The group analysis showed cortical thickness reduction in the superior temporal lobe, parahippocampal gyrus, and enthorhinal cortex in both hemispheres. We also found cortical thinning in the isthmus of cingulate gyrus and middle temporal gyrus at the right hemisphere, as well as a reduction of the cortical mantle in areas previously shown to be associated with AD. We also confirmed that automatic classification algorithms (SVM) could be helpful to distinguish AD patients from healthy controls. Moreover, the same areas implicated in the pathogenesis of AD were the main parameters driving the classification algorithm. While the patient sample used in this study was relatively small, we expect that using a database of regional volumes derived from MRI scans of a large number of subjects will increase the SVM power of AD patient identification.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Pattern recognition methods have been successfully applied in several functional neuroimaging studies. These methods can be used to infer cognitive states, so-called brain decoding. Using such approaches, it is possible to predict the mental state of a subject or a stimulus class by analyzing the spatial distribution of neural responses. In addition it is possible to identify the regions of the brain containing the information that underlies the classification. The Support Vector Machine (SVM) is one of the most popular methods used to carry out this type of analysis. The aim of the current study is the evaluation of SVM and Maximum uncertainty Linear Discrimination Analysis (MLDA) in extracting the voxels containing discriminative information for the prediction of mental states. The comparison has been carried out using fMRI data from 41 healthy control subjects who participated in two experiments, one involving visual-auditory stimulation and the other based on bimanual fingertapping sequences. The results suggest that MLDA uses significantly more voxels containing discriminative information (related to different experimental conditions) to classify the data. On the other hand, SVM is more parsimonious and uses less voxels to achieve similar classification accuracies. In conclusion, MLDA is mostly focused on extracting all discriminative information available, while SVM extracts the information which is sufficient for classification. (C) 2009 Elsevier Inc. All rights reserved.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Objective: To develop a model to predict the bleeding source and identify the cohort amongst patients with acute gastrointestinal bleeding (GIB) who require urgent intervention, including endoscopy. Patients with acute GIB, an unpredictable event, are most commonly evaluated and managed by non-gastroenterologists. Rapid and consistently reliable risk stratification of patients with acute GIB for urgent endoscopy may potentially improve outcomes amongst such patients by targeting scarce health-care resources to those who need it the most. Design and methods: Using ICD-9 codes for acute GIB, 189 patients with acute GIB and all. available data variables required to develop and test models were identified from a hospital medical records database. Data on 122 patients was utilized for development of the model and on 67 patients utilized to perform comparative analysis of the models. Clinical data such as presenting signs and symptoms, demographic data, presence of co-morbidities, laboratory data and corresponding endoscopic diagnosis and outcomes were collected. Clinical data and endoscopic diagnosis collected for each patient was utilized to retrospectively ascertain optimal management for each patient. Clinical presentations and corresponding treatment was utilized as training examples. Eight mathematical models including artificial neural network (ANN), support vector machine (SVM), k-nearest neighbor, linear discriminant analysis (LDA), shrunken centroid (SC), random forest (RF), logistic regression, and boosting were trained and tested. The performance of these models was compared using standard statistical analysis and ROC curves. Results: Overall the random forest model best predicted the source, need for resuscitation, and disposition with accuracies of approximately 80% or higher (accuracy for endoscopy was greater than 75%). The area under ROC curve for RF was greater than 0.85, indicating excellent performance by the random forest model Conclusion: While most mathematical models are effective as a decision support system for evaluation and management of patients with acute GIB, in our testing, the RF model consistently demonstrated the best performance. Amongst patients presenting with acute GIB, mathematical models may facilitate the identification of the source of GIB, need for intervention and allow optimization of care and healthcare resource allocation; these however require further validation. (c) 2007 Elsevier B.V. All rights reserved.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

This work proposes and discusses an approach for inducing Bayesian classifiers aimed at balancing the tradeoff between the precise probability estimates produced by time consuming unrestricted Bayesian networks and the computational efficiency of Naive Bayes (NB) classifiers. The proposed approach is based on the fundamental principles of the Heuristic Search Bayesian network learning. The Markov Blanket concept, as well as a proposed ""approximate Markov Blanket"" are used to reduce the number of nodes that form the Bayesian network to be induced from data. Consequently, the usually high computational cost of the heuristic search learning algorithms can be lessened, while Bayesian network structures better than NB can be achieved. The resulting algorithms, called DMBC (Dynamic Markov Blanket Classifier) and A-DMBC (Approximate DMBC), are empirically assessed in twelve domains that illustrate scenarios of particular interest. The obtained results are compared with NB and Tree Augmented Network (TAN) classifiers, and confinn that both proposed algorithms can provide good classification accuracies and better probability estimates than NB and TAN, while being more computationally efficient than the widely used K2 Algorithm.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

In this work, different reactions in vitro between an environmental bacterial isolate and fungal species were related. The Gram-positive bacteria had terminal and subterminal endospores, presented metabolic characteristics of mesophilic and acidophilic growth, halotolerance, positive to nitrate reduction and enzyme production, as caseinase and catalase. The analysis of partial sequences containing 400 to 700 bases of the 16S ribosomal RNA gene showed identity with the genus Bacillus. However, its identity as B. subtilis was confirmed after analyses of the rpoB, gyrA, and 16S rRNA near-full-length sequences. Strong inhibitory activity of environmental microorganisms, such as Penicillium sp, Aspergillus flavus, A. niger, and phytopathogens, such as Colletotrichum sp, Alternaria alternata, Fusarium solani and F. oxysporum f.sp vasinfectum, was shown on co-cultures with B. subtilis strain, particularly on Sabouraud dextrose agar (SDA) and DNase media. Red and red-ochre color pigments, probably phaeomelanins, were secreted by A. alternata and A. niger respectively after seven days of co-culture.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

In this study, 222 genome survey sequences were generated for Trypanosoma rangeli strain P07 isolated from an opossum (Didelphis albiventris) in Minas Gerais State, Brazil. T. rangeli sequences were compared by BLASTX (Basic Local Alignment Search Tool X) analysis with the assembled contigs of Leishmania braziliensis, Leishmania infantum, Leishmania major, Trypanosoma brucei, and Trypanosoma cruzi. Results revealed that 82% (182/222) of the sequences were associated with predicted proteins described, whereas 18% (40/222) of the sequences did not show significant identity with sequences deposited in databases, suggesting that they may represent T. rangeli-specific sequences. Among the 182 predicted sequences, 179 (80.6%) had the highest similarity with T. cruzi, 2 (0.9%) with T. brucei, and 1 (0.5%) with L. braziliensis. Computer analysis permitted the identification of members of various gene families described for trypanosomatids in the genome of T. rangeli, such as trans-sialidases, mucin-associated surface proteins, and major surface proteases (MSP or gp63). This is the first report identifying sequences of the MSP family in T. rangeli. Multiple sequence alignments showed that the predicted MSP of T. rangeli presented the typical characteristics of metalloproteases, such as the presence of the HEXXH motif, which corresponds to a region previously associated with the catalytic site of the enzyme, and various cysteine and proline residues, which are conserved among MSPs of different trypanosomatid species. Reverse transcriptase-polymerase chain reaction analysis revealed the presence of MSP transcripts in epimastigote forms of T. rangeli.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

During 2010, 15 adult ticks, identified as Amblyomma cajennense, were collected from horses in Cahuita and Turrialba districts, whereas 7 fleas, identified as Ctenocephalides felis, were collected from a dog in San Jose city, Costa Rica. In the laboratory, three A. cajennense specimens, two from Cahuita and one from Turrialba, were individually processed for rickettsial isolation in cell culture, as was a pool of seven fleas. Rickettsiae were successfully isolated and established in Vero cell culture from the three ticks and from a pool of seven fleas in C6/36 cell culture. The three tick isolates were genotypically identified as Rickettsia amblyommii, and the flea isolate was identified as Rickettsia felis through DNA sequencing of portions of the rickettsial genes gltA, ompA, and ompB of each isolate. In addition, other seven ticks were shown to contain rickettsial DNA. Polymerase chain reaction products of at least two of these ticks were sequenced and also showed to correspond to R. amblyommii. Overall, 66.7% (10/15) of the A. cajennense adult ticks were found to be infected with rickettsiae. This is the first report of a successful isolation in cell culture of R. amblyommii and R. felis from Central America.