10 resultados para Supervised classifiers

em Biblioteca Digital da Produção Intelectual da Universidade de São Paulo


Relevância:

20.00% 20.00%

Publicador:

Resumo:

Semi-supervised learning is one of the important topics in machine learning, concerning with pattern classification where only a small subset of data is labeled. In this paper, a new network-based (or graph-based) semi-supervised classification model is proposed. It employs a combined random-greedy walk of particles, with competition and cooperation mechanisms, to propagate class labels to the whole network. Due to the competition mechanism, the proposed model has a local label spreading fashion, i.e., each particle only visits a portion of nodes potentially belonging to it, while it is not allowed to visit those nodes definitely occupied by particles of other classes. In this way, a "divide-and-conquer" effect is naturally embedded in the model. As a result, the proposed model can achieve a good classification rate while exhibiting low computational complexity order in comparison to other network-based semi-supervised algorithms. Computer simulations carried out for synthetic and real-world data sets provide a numeric quantification of the performance of the method.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Complex networks have been employed to model many real systems and as a modeling tool in a myriad of applications. In this paper, we use the framework of complex networks to the problem of supervised classification in the word disambiguation task, which consists in deriving a function from the supervised (or labeled) training data of ambiguous words. Traditional supervised data classification takes into account only topological or physical features of the input data. On the other hand, the human (animal) brain performs both low- and high-level orders of learning and it has facility to identify patterns according to the semantic meaning of the input data. In this paper, we apply a hybrid technique which encompasses both types of learning in the field of word sense disambiguation and show that the high-level order of learning can really improve the accuracy rate of the model. This evidence serves to demonstrate that the internal structures formed by the words do present patterns that, generally, cannot be correctly unveiled by only traditional techniques. Finally, we exhibit the behavior of the model for different weights of the low- and high-level classifiers by plotting decision boundaries. This study helps one to better understand the effectiveness of the model. Copyright (C) EPLA, 2012

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Semi-supervised learning techniques have gained increasing attention in the machine learning community, as a result of two main factors: (1) the available data is exponentially increasing; (2) the task of data labeling is cumbersome and expensive, involving human experts in the process. In this paper, we propose a network-based semi-supervised learning method inspired by the modularity greedy algorithm, which was originally applied for unsupervised learning. Changes have been made in the process of modularity maximization in a way to adapt the model to propagate labels throughout the network. Furthermore, a network reduction technique is introduced, as well as an extensive analysis of its impact on the network. Computer simulations are performed for artificial and real-world databases, providing a numerical quantitative basis for the performance of the proposed method.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The cost-effectiveness of a modified supervised toothbrushing program was compared to a conventional program. A total of 284 five-year-old children presenting at least one permanent molar with emerged/sound occlusal surface participated. In the control group, oral health education and dental plaque dying followed by toothbrushing with fluoride dentifrice was carried outfour times per year. With the test group, children also underwent professional cross-brushing on surfaces of first permanent molar rendered by a dental assistant five times per year. Enamel/dentin caries were recorded on buccal, occlusal and lingual surfaces of permanent molars for a period of 18 months. The incidence density (ID) ratio was estimated using Poisson's regression model. The ID was 50% lower among boys in the test group (p = 0.016). The cost of the modified program was US$ 1.79 per capita. The marginal cost-effectiveness ratio among boys was US$ 6.30 per avoided carie. The modified supervised toothbrushing program was shown to be cost-effective in the case of boys.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Surveillance Levels (SLs) are categories for medical patients (used in Brazil) that represent different types of medical recommendations. SLs are defined according to risk factors and the medical and developmental history of patients. Each SL is associated with specific educational and clinical measures. The objective of the present paper was to verify computer-aided, automatic assignment of SLs. The present paper proposes a computer-aided approach for automatic recommendation of SLs. The approach is based on the classification of information from patient electronic records. For this purpose, a software architecture composed of three layers was developed. The architecture is formed by a classification layer that includes a linguistic module and machine learning classification modules. The classification layer allows for the use of different classification methods, including the use of preprocessed, normalized language data drawn from the linguistic module. We report the verification and validation of the software architecture in a Brazilian pediatric healthcare institution. The results indicate that selection of attributes can have a great effect on the performance of the system. Nonetheless, our automatic recommendation of surveillance level can still benefit from improvements in processing procedures when the linguistic module is applied prior to classification. Results from our efforts can be applied to different types of medical systems. The results of systems supported by the framework presented in this paper may be used by healthcare and governmental institutions to improve healthcare services in terms of establishing preventive measures and alerting authorities about the possibility of an epidemic.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

This study aimed to estimate the owned dog and cat populations of Sao Paulo city using a complex sample with random selection in two stages. In each administrative district, six census sectors and 20 households in each sampled sector were visited from September 2006 to September 2009, totalizing 11,272 interviews. The human: dog ratio was 4.34 and the human: cat ratio was 19.33. The dog population was 2,507,401 and the cat population, 562,965. The dog population was composted of 52.7% males, while among the cat population it was 45.1%. The proportion of sterilized females (23.4% among dogs and 46.1% among cats) was higher than males (11.4% among dogs and 31.5% among cats). The mean age of dogs was 4.99 years, and for cats, 3.53 years. The proportion of restricted (without access to the street) dogs, 64.4%, was higher than restricted cats, 42.5%. The average number of animals/household was 1.60 for dogs and 1.69 for cats. The animal ownership is associated with cultural factors, therefore the characterization of canine and feline population is essential to implement an adequate animal population management program and zoonosis control.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Dimensionality reduction is employed for visual data analysis as a way to obtaining reduced spaces for high dimensional data or to mapping data directly into 2D or 3D spaces. Although techniques have evolved to improve data segregation on reduced or visual spaces, they have limited capabilities for adjusting the results according to user's knowledge. In this paper, we propose a novel approach to handling both dimensionality reduction and visualization of high dimensional data, taking into account user's input. It employs Partial Least Squares (PLS), a statistical tool to perform retrieval of latent spaces focusing on the discriminability of the data. The method employs a training set for building a highly precise model that can then be applied to a much larger data set very effectively. The reduced data set can be exhibited using various existing visualization techniques. The training data is important to code user's knowledge into the loop. However, this work also devises a strategy for calculating PLS reduced spaces when no training data is available. The approach produces increasingly precise visual mappings as the user feeds back his or her knowledge and is capable of working with small and unbalanced training sets.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Abstract Background Low back pain is a relevant public health problem, being an important cause of work absenteeism worldwide, as well as affecting the quality of life of sufferers and their individual functional performances. Supervised active physical routines and of cognitive-behavioral therapies are recommended for the treatment of chronic Low back pain, although evidence to support the effectiveness of different techniques is missing. Accordingly, the aim of this study is to contrast the effectiveness of two types of exercises, graded activity or supervised, in decreasing symptoms of chronic low back pain. Methods/design Sample will consist of 66 patients, blindly allocated into one of two groups: 1) Graded activity which, based on an operant approach, will use time-contingent methods aiming to increase participants’ activity levels; 2) Supervised exercise, where participants will be trained for strengthening, stretching, and motor control targeting different muscle groups. Interventions will last one hour, and will happen twice a week for 6 weeks. Outcomes (pain, disability, quality of life, global perceived effect, return to work, physical activity, physical capacity, and kinesiophobia) will be assessed at baseline, at treatment end, and three and six months after treatment end. Data collection will be conducted by an investigator blinded to treatment allocation. Discussion This project describes the randomisation method that will be used to compare the effectiveness of two different treatments for chronic low back pain: graded activity and supervised exercises. Since optimal approach for patients with chronic back pain have yet not been defined based on evidence, good quality studies on the subject are necessary. Trial registration NCT01719276

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Semi-supervised learning is a classification paradigm in which just a few labeled instances are available for the training process. To overcome this small amount of initial label information, the information provided by the unlabeled instances is also considered. In this paper, we propose a nature-inspired semi-supervised learning technique based on attraction forces. Instances are represented as points in a k-dimensional space, and the movement of data points is modeled as a dynamical system. As the system runs, data items with the same label cooperate with each other, and data items with different labels compete among them to attract unlabeled points by applying a specific force function. In this way, all unlabeled data items can be classified when the system reaches its stable state. Stability analysis for the proposed dynamical system is performed and some heuristics are proposed for parameter setting. Simulation results show that the proposed technique achieves good classification results on artificial data sets and is comparable to well-known semi-supervised techniques using benchmark data sets.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The strength and durability of materials produced from aggregates (e.g., concrete bricks, concrete, and ballast) are critically affected by the weathering of the particles, which is closely related to their mineral composition. It is possible to infer the degree of weathering from visual features derived from the surface of the aggregates. By using sound pattern recognition methods, this study shows that the characterization of the visual texture of particles, performed by using texture-related features of gray scale images, allows the effective differentiation between weathered and nonweathered aggregates. The selection of the most discriminative features is also performed by taking into account a feature ranking method. The evaluation of the methodology in the presence of noise suggests that it can be used in stone quarries for automatic detection of weathered materials.