3 resultados para Semi-supervised classification
em Universidade Federal do Rio Grande do Norte(UFRN)
Resumo:
The techniques of Machine Learning are applied in classification tasks to acquire knowledge through a set of data or information. Some learning methods proposed in literature are methods based on semissupervised learning; this is represented by small percentage of labeled data (supervised learning) combined with a quantity of label and non-labeled examples (unsupervised learning) during the training phase, which reduces, therefore, the need for a large quantity of labeled instances when only small dataset of labeled instances is available for training. A commom problem in semi-supervised learning is as random selection of instances, since most of paper use a random selection technique which can cause a negative impact. Much of machine learning methods treat single-label problems, in other words, problems where a given set of data are associated with a single class; however, through the requirement existent to classify data in a lot of domain, or more than one class, this classification as called multi-label classification. This work presents an experimental analysis of the results obtained using semissupervised learning in troubles of multi-label classification using reliability parameter as an aid in the classification data. Thus, the use of techniques of semissupervised learning and besides methods of multi-label classification, were essential to show the results
Resumo:
Data classification is a task with high applicability in a lot of areas. Most methods for treating classification problems found in the literature dealing with single-label or traditional problems. In recent years has been identified a series of classification tasks in which the samples can be labeled at more than one class simultaneously (multi-label classification). Additionally, these classes can be hierarchically organized (hierarchical classification and hierarchical multi-label classification). On the other hand, we have also studied a new category of learning, called semi-supervised learning, combining labeled data (supervised learning) and non-labeled data (unsupervised learning) during the training phase, thus reducing the need for a large amount of labeled data when only a small set of labeled samples is available. Thus, since both the techniques of multi-label and hierarchical multi-label classification as semi-supervised learning has shown favorable results with its use, this work is proposed and used to apply semi-supervised learning in hierarchical multi-label classication tasks, so eciently take advantage of the main advantages of the two areas. An experimental analysis of the proposed methods found that the use of semi-supervised learning in hierarchical multi-label methods presented satisfactory results, since the two approaches were statistically similar results
Resumo:
The municipality of Areia Branca is within the mesoregion of West Potiguar and within the microregion of Mossoró, covering an area of 357,58 km2. Covering an area of weakness in terms of environmental, housing, together with the municipality of Grossos-RN, the estuary of River Apodi-Mossoró. The municipality of Areia Branca has historically suffered from a lack of planning regarding the use and occupation of land as some economic activities, attracted by the extremely favorable natural conditions, have exploited their natural resources improperly. The aim of this study is to quantify and analyze the environmental degradation in the municipality. Thus initially was performed a characterization of land use using remote sensing, geoprocessing and geographic information system GIS in order to generate data and information on the municipal scale, which may serve as input to the environmental planning and land use planning in the region. From this perspective, were used a Landsat 5 image TM sensor for the year 2010. In the processing of this image was used SPRING 5.2 and applied a supervised classification using the classifier regions, which was employed Bhattacharya Distance method with a threshold at 30%. Thus was obtained the land use map that was analyzed the spatial distribution of different types of the use that is occurring in the city, identifying areas that are being used incorrectly and the main types of environmental degradation. And further, were applied the methodology proposed by Beltrame (1994), Physical Diagnosis Conservationist under some adaptations for quantifying the level of degradation or conservation study area. As results, the indexes were obtained for the parameters in the proposed methodology, allowing quantitatively analyze the degradation potential of each sector. From this perspective, considering a scale of 0 to 100, sector A and sector B had value 31.20 units of risk of physical deterioration. And the C sector, has shown its value - 34.64 units degradation risk and should be considered a priority in relation to the achievement of conservation actions