131 resultados para Typological Classification


Relevância:

20.00% 20.00%

Publicador:

Resumo:

The reliability of an induced classifier can be affected by several factors including the data oriented factors and the algorithm oriented factors. In some cases, the reliability could also be affected by knowledge oriented factors. In this paper, we analyze three special cases to examine the reliability of the discovered knowledge. Our case study results show that (1) in the cases of mining from low quality data, rough classification approach is more reliable than exact approach which in general tolerate to low quality data; (2) Without sufficient large size of the data, the reliability of the discovered knowledge will be decreased accordingly; (3) The reliability of point learning approach could easily be misled by noisy data. It will in most cases generate an unreliable interval and thus affect the reliability of the discovered knowledge. It is also reveals that the inexact field is a good learning strategy that could model the potentials and to improve the discovery reliability.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

This paper introduces a new technique in the investigation of object classification and illustrates the potential use of this technique for the analysis of a range of biological data, using avian morphometric data as an example. The nascent variable precision rough sets (VPRS) model is introduced and compared with the decision tree method ID3 (through a ‘leave n out’ approach), using the same dataset of morphometric measures of European barn swallows (Hirundo rustica) and assessing the accuracy of gender classification based on these measures. The results demonstrate that the VPRS model, allied with the use of a modern method of discretization of data, is comparable with the more traditional non-parametric ID3 decision tree method. We show that, particularly in small samples, the VPRS model can improve classification and to a lesser extent prediction aspects over ID3. Furthermore, through the ‘leave n out’ approach, some indication can be produced of the relative importance of the different morphometric measures used in this problem. In this case we suggest that VPRS has advantages over ID3, as it intelligently uses more of the morphometric data available for the data classification, whilst placing less emphasis on variables with low reliability. In biological terms, the results suggest that the gender of swallows can be determined with reasonable accuracy from morphometric data and highlight the most important variables in this process. We suggest that both analysis techniques are potentially useful for the analysis of a range of different types of biological datasets, and that VPRS in particular has potential for application to a range of biological circumstances.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

This article investigates the potential of a novel technique for object classification, called Classification and Ranking Belief Simplex (CaRBS), which is based on the Dempster-Shafer theory of evidence. As such, the classification of objects and the evidence from their characteristics have a level of ignorance associated with them. Its potential is exposited in the application of the classification of European barn swallows according to their gender. The classification of biological data in the presence of ignorance about such data sets is a common problem in biology. Comparisons of the results from CaRBS with those from multivariate discriminant analysis and neural networks are made. Also shown throughout the investigation is the interpretability of the results with the utilisation of the simplex plot method of representing data

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Regardless of the technical procedure used in signalling corporate collapse, the bottom line rests on the predictive power of the corresponding statistical model. In that regard, it is imperative to empirically test the model using a data sample of both collapsed and non-collapsed companies. A superior model is one that successfully classifies collapsed and non-collapsed companies in their respective categories with a high degree of accuracy. Empirical studies of this nature have thus far done one of two things. (1) Some have classified companies based on a specific statistical modelling process. (2) Some have classified companies based on two (sometimes – but rarely – more than two) independent statistical modelling processes for the purposes of comparing one with the other. In the latter case, the mindset of the researchers has been – invariably – to pitch one procedure against the other. This paper raises the question, why pitch one statistical process against another; why not make the two procedures work together? As such, this paper puts forward an innovative dual-classification scheme for signalling corporate collapse: dual in the sense that it relies on two statistical procedures concurrently. Using a data sample of Australian publicly listed companies, the proposed scheme is tested against the traditional approach taken thus far in the pertinent literature. The results demonstrate that the proposed dual-classification scheme signals collapse with a higher degree of accuracy.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

As published in the final reports by the College of Wooster and Sydney University, the dating and analysis of the excavated remains of the three churches and their associated finds at Pella, Jordan, display some puzzling aspects. This paper argues that, in these reports, two of the churches appear to have been dated up to a century later than they should have been, while the suggested date for the third now appears too early. This article examines the problem in the context of a new architectural and theological typology of churches in the East Mediterranean from the 4th to 6th centuries AD.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Automated classification of lung nodules is challenging because of the variation in shape and size of lung nodules, as well as their associated differences in their images. Ensemble based learners have demonstrated the potentialof good performance. Random forests are employed for pulmonary nodule classification where each tree in the forest produces a classification decision, and an integrated output is calculated. A classification aided by clustering approach is proposed to improve the lung nodule classification performance. Three experiments are performed using the LIDC lung image database of 32 cases. The classification performance and execution times are presented and discussed.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

This paper presents an innovative fusion based multi-classifier email classification on a ubiquitous multi-core architecture. Many approaches use text-based single classifiers or multiple weakly trained classifiers to identify spam messages from a large email corpus. We build upon our previous work on multi-core by apply our ubiquitous multi-core framework to run our fusion based multi-classifier architecture. By running each classifier process in parallel within their dedicated core, we greatly improve the performance of our proposed multi-classifier based filtering system. Our proposed architecture also provides a safeguard of user mailbox from different malicious attacks. Our experimental results show that we achieved an average of 30% speedup at the average cost of 1.4 ms. We also reduced the instance of false positive, which is one of the key challenges in spam filtering system, and increases email classification accuracy substantially compared with single classification techniques.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

In this paper we have proposed a spam filtering technique using (2+1)-tier classification approach. The main focus of this paper is to reduce the false positive (FP) rate which is considered as an important research issue in spam filtering. In our approach, firstly the email message will classify using first two tier classifiers and the outputs will appear to the analyzer. The analyzer will check the labeling of the output emails and send to the corresponding mailboxes based on labeling, for the case of identical prediction. If there are any misclassifications occurred by first two tier classifiers then tier-3 classifier will invoked by the analyzer and the tier-3 will take final decision. This technique reduced the analyzing complexity of our previous work. It has also been shown that the proposed technique gives better performance in terms of reducing false positive as well as better accuracy.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

It has been an important and challenging task to classify and evaluate the contents in wool blends. Quantitative characterisation of animal fibre scale patterns has attracted considerable attention, since it is the major evidence for identification and subsequent classification purpose. Although techniques such as imaging processing and linear demarcation functions have been used to identify unknown fibre type with some success, a more comprehensive approach is required to perform this task. In this paper, a new approach is presented, which employs non-linear demarcation functions by using an artificial neural network (ANN). Based on scale pattern features extracted by using image processing techniques the artificial neural network (ANN) model is to classify mohair and merino fibres. It is observed that the techniques developed in this work are very effective and have the potential to be applied to other animal fibres.

Relevância:

20.00% 20.00%

Publicador:

Relevância:

20.00% 20.00%

Publicador:

Resumo:

This considers the challenging task of cancer prediction based on microarray data for the medical community. The research was conducted on mostly common cancers (breast, colon, long, prostate and leukemia) microarray data analysis, and suggests the use of modern machine learning techniques to predict cancer.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

To develop an objective and repeatable method of identification and classification of animal fibres, two different integrated systems were developed to mimic the human brain's ability to undertake feature extraction and discrimination of animal fibres. Both integrated systems are basically composed of an image processing system and an artificial neural network system.