20 results for Associative Classifiers
Abstract:
Fraud is a global problem that demands increasing attention as modern technology and communication expand rapidly. When statistical techniques are used to detect fraud, a critical factor is whether the detection model is accurate enough to classify each case correctly as fraudulent or legitimate. In this context, the concept of bootstrap aggregating (bagging) arises. The basic idea is to generate multiple classifiers by fitting the model to several replicated datasets, obtaining the predicted values from each fitted model, and then combining them into a single predictive classification in order to improve classification accuracy. In this paper, we present, for the first time, a study of the performance of discrete and continuous k-dependence probabilistic networks within the context of bagging predictors for classification. Through a large simulation study and various real datasets, we found that the probabilistic networks are a strong modeling option with high predictive capacity and a large improvement under the bagging procedure when compared to traditional techniques. (C) 2012 Elsevier Ltd. All rights reserved.
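The bagging idea described in this abstract can be illustrated with a minimal sketch. It does not use the k-dependence probabilistic networks studied in the paper: the base learner here is a one-feature decision stump, chosen only to keep the example short, and the names `fit_stump`, `bagging_fit`, `bagging_predict` and the toy data are assumptions made for illustration.

```python
import random
from collections import Counter


def fit_stump(rows):
    """Fit a one-feature threshold classifier on (x, label) rows, labels in {0, 1}.

    Stand-in base learner (an assumption), not the paper's probabilistic networks.
    """
    labels = {y for _, y in rows}
    if len(labels) == 1:
        # A bootstrap replicate may contain a single class: predict it always.
        only = labels.pop()
        return lambda x: only
    best = None
    for t in sorted({x for x, _ in rows}):
        for hi_label in (0, 1):
            # Rule: predict hi_label when x >= t, the other label otherwise.
            errors = sum((hi_label if x >= t else 1 - hi_label) != y
                         for x, y in rows)
            if best is None or errors < best[0]:
                best = (errors, t, hi_label)
    _, t, hi_label = best
    return lambda x: hi_label if x >= t else 1 - hi_label


def bagging_fit(rows, n_models=25, seed=0):
    """Fit one base classifier per bootstrap replicate of the training rows."""
    rng = random.Random(seed)
    models = []
    for _ in range(n_models):
        # Bootstrap replicate: sample len(rows) rows with replacement.
        replicate = [rng.choice(rows) for _ in rows]
        models.append(fit_stump(replicate))
    return models


def bagging_predict(models, x):
    """Combine the individual predictions by majority vote."""
    votes = Counter(model(x) for model in models)
    return votes.most_common(1)[0][0]


# Toy data (hypothetical): x is a transaction score, label 1 means fraudulent.
rows = [(1.0, 0), (2.0, 0), (3.0, 0), (4.0, 0),
        (6.0, 1), (7.0, 1), (8.0, 1), (9.0, 1)]
models = bagging_fit(rows)
print(bagging_predict(models, 0.5), bagging_predict(models, 9.5))
```

Majority voting is one common way to combine the replicated classifiers; averaging predicted probabilities is another, and the abstract does not say which the authors used.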
Abstract:
In multi-label classification, examples can be associated with multiple labels simultaneously. The task of learning from multi-label data can be addressed by methods that transform the multi-label classification problem into several single-label classification problems. The binary relevance approach is one of these methods: the multi-label learning task is decomposed into several independent binary classification problems, one for each label in the label set, and the final labels for each example are determined by aggregating the predictions of all binary classifiers. However, this approach fails to consider any dependency among the labels. Aiming to predict label combinations accurately, in this paper we propose a simple approach that enables the binary classifiers to discover existing label dependencies on their own. An experimental study using decision trees, a kernel method, and Naive Bayes as base-learning techniques shows the potential of the proposed approach to improve multi-label classification performance.
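The binary relevance decomposition described in this abstract can be sketched as follows. This shows only the baseline, which ignores label dependency; the authors' proposed extension is not reproduced because the abstract does not specify it. The 1-nearest-neighbour base learner, the function names and the toy data are assumptions made for illustration.

```python
def fit_1nn(X, y):
    """Hypothetical base learner: 1-nearest neighbour, squared Euclidean distance."""
    def predict(x):
        dists = [sum((a - b) ** 2 for a, b in zip(xi, x)) for xi in X]
        return y[dists.index(min(dists))]
    return predict


def binary_relevance_fit(X, Y, labels, fit_base=fit_1nn):
    """Train one independent binary classifier per label.

    Y holds one set of labels per example; each label gets a 0/1 target vector.
    """
    return {lab: fit_base(X, [int(lab in ys) for ys in Y]) for lab in labels}


def binary_relevance_predict(models, x):
    """Aggregate: the predicted label set is every label whose classifier says 1."""
    return {lab for lab, model in models.items() if model(x) == 1}


# Toy data (hypothetical): two features, two possible labels per example.
X = [(0.0, 0.0), (0.0, 1.0), (1.0, 0.0), (1.0, 1.0)]
Y = [set(), {"sports"}, {"politics"}, {"sports", "politics"}]
models = binary_relevance_fit(X, Y, labels=["sports", "politics"])
print(binary_relevance_predict(models, (0.1, 0.8)))
```

Because each classifier is trained on its own 0/1 target in isolation, nothing in this baseline lets the "sports" classifier see whether "politics" is present, which is the limitation the abstract points out.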
Abstract:
Background: One goal of gene expression profiling is to identify signature genes that robustly distinguish different types or grades of tumors. Several tumor classifiers based on expression profiling have been proposed using microarray techniques. Because the probabilistic models of microarray and SAGE technologies differ in important ways, suitable techniques are needed to select specific genes from SAGE measurements. Results: A new framework is proposed to select specific genes that distinguish different biological states based on the analysis of SAGE data. The framework applies the bolstered error to identify strong genes that separate the biological states in a feature space defined by the gene expression of a training set. Credibility intervals defined from a probabilistic model of SAGE measurements are used to identify, among all gene groups selected by the strong genes method, the genes that distinguish the different states most reliably. A score that takes into account the credibility and the bolstered error values is proposed to rank the groups of genes considered. Results obtained using SAGE data from gliomas are presented and corroborate the introduced methodology. Conclusion: A model representing counting data, such as SAGE, provides additional statistical information that allows a more robust analysis. This additional information, provided by the probabilistic model, is incorporated in the methodology described in the paper. The introduced method is suitable for identifying signature genes that lead to a good separation of the biological states using SAGE, and it may be adapted for other counting methods such as Massively Parallel Signature Sequencing (MPSS) or the recent Sequencing-By-Synthesis (SBS) technique. Some of the genes identified by the proposed method may be useful for building classifiers.
Abstract:
The striatum, the largest component of the basal ganglia, is usually subdivided into associative, motor and limbic components. However, the electrophysiological interactions between these three subsystems during behavior remain largely unknown. We hypothesized that the striatum might be particularly active during exploratory behavior, which is presumably associated with increased attention. We investigated the modulation of local field potentials (LFPs) in the striatum during attentive wakefulness in freely moving rats. To this end, we implanted microelectrodes into different parts of the striatum of Wistar rats, as well as into the motor, associative and limbic cortices. We then used electromyograms to identify motor activity and analyzed the instantaneous frequency, power spectra and partial directed coherence during exploratory behavior. We observed fine modulation in the theta frequency range of striatal LFPs in 92.5 ± 2.5% of all epochs of exploratory behavior. Concomitantly, the theta power spectrum increased in all striatal channels (P < 0.001), and coherence analysis revealed strong connectivity (coefficients >0.7) between the primary motor cortex and the rostral part of the caudatoputamen nucleus, as well as among all striatal channels (P < 0.001). In conclusion, we observed a pattern of strong theta band activation in the entire striatum during attentive wakefulness, as well as strong coherence between the motor cortex and the entire striatum. We suggest that this activation reflects the integration of motor, cognitive and limbic systems during attentive wakefulness.
Abstract:
This study aims to investigate and delimit the forms of interaction and social disputes present in the Brazilian Paralympic movement that relate to athlete classification processes, drawing on concepts from Pierre Bourdieu. The methodology was based on semi-structured interviews with four athletes (with physical or visual impairments, practicing various sports: swimming, goalball, rugby and wheelchair basketball) and four officials (two working in technical roles and two in administrative roles at the Comitê Paralímpico Brasileiro). Data analysis relied on the Discurso do Sujeito Coletivo (Collective Subject Discourse) method and its methodological tools (key expressions; central ideas; anchorages; discourse analysis instruments). The main results are: the classification protocols, as well as the work and training of new classifiers, are a source of social tension in this space; classifiers wield significant symbolic power in the subfield; other agents, such as coaches and athletes, have their chances of advancement reduced by unfavorable social conditions.