152 resultados para Data Mining, Rough Sets, Multi-Dimension, Association Rules, Constraint


Relevância:

100.00% 100.00%

Publicador:

Resumo:

Education is a complex systematic engineering, which is the guarantee of training high-quality talent, helping society make full use of educational outcomes and promote the healthy development of education. In the education, the students' score is a very important quantitative evaluation indicator, which can objectively reflect the effects of educational system and is an important basis to make lots of scientific decisions. This paper uses clustering algorithm and decision tree to comprehensively analyze the students' score, and obtains useful results. It can be observed that the results are valuable for the teaching and management.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

This thesis examined the application of data mining techniques to the issue of predicting pilling propensity of wool knitwear. Using real industrial data, a pilling propensity prediction tool with embedded trained support vector machines is developed to provide high accuracy prediction to wool knitwear even before the yarn is spun!

Relevância:

100.00% 100.00%

Publicador:

Resumo:

This paper reports on the preparation and management processes of inconsistent data on damage on residential houses in Victoria, Australia. There are no existing specific and fully relevant databases readily available except for the incomplete paper-based and electronic-based reports. Therefore, the extracting of information from the reports is complicated and time consuming in order to extract and include all the necessary information needed for analysis of damage on residential houses founded on expansive soils. Data mining is adopted to develop a database. Statistical methods and Artificial Intelligence methods are used to quantify the quality of data. The paper concludes that the development of such database could enable BHC to evaluate the usefulness of the reports prepared on the reported damage properties for further analysis.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The development and application of computational data mining techniques in financial fraud detection and business failure prediction has become a popular cross-disciplinary research area in recent times involving financial economists, forensic accountants and computational modellers. Some of the computational techniques popularly used in the context of - financial fraud detection and business failure prediction can also be effectively applied in the detection of fraudulent insurance claims and therefore, can be of immense practical value to the insurance industry. We provide a comparative analysis of prediction performance of a battery of data mining techniques using real-life automotive insurance fraud data. While the data we have used in our paper is US-based, the computational techniques we have tested can be adapted and generally applied to detect similar insurance frauds in other countries as well where an organized automotive insurance industry exists.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Musical preference has long been a research interest in the field of music education, and studies consistently confirm the importance of musical preference in one’s musical learning experiences. However, only a limited number of studies have been focussed on the field of early childhood education (e.g., Hargreaves, North, & Tarrant, 2006; Roulston, 2006). Further, among these limited early childhood studies, few of them discuss children’s musical preference in both the East and the West. There is very limited literature (e.g., Faulkner et al., 2010; Szymanska, 2012) which explores the data by using a data mining approach. This study aims to bridge the research gaps by examining children’s musical preference in Hong Kong and in South Australia by applying a data mining technique – Self Organising Maps (SOM), which is a clustering method that groups similar data objects together. The application of SOM is new in the field of early childhood education and also in the study of children’s musical preference. This paper specifically aims to expand a previous study (Yim & Ebbeck, 2009) by conducting deeper investigations into the existing datasets, for the purpose of uncovering insights that have not been identified through data mining approach.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Price promotions (also called discount promotions), i.e. short-term temporary price reductions for selected items (Hermann 1989), are frequently used in sales promotions. The main objective of price promotions is to boost sales and increase profits. Quantitative evaluation of the effects of price promotions (QEEPP) is essential and important for sales managers to analyse historical price promotions and informative for devising more effective promotional strategies in the future. However, most previous studies only provide insights into the effects of discount promotions from some specific prospectives, and no approaches have been proposed for comprehensive evaluation of the effects of discount promotions. For example, Hinkle [1965] discovered that price promotions in the off-season are more favourable, and the effects of price promotions are stronger for new products. Peckham [1973] found that price promotions have no impact on long-term trend. Blattberg et al. [1978] identified that different segments respond to price promotions in different ways. Rockney [1991] discovered three basic types of effects: effects on discounted items, effects on substitutes and effects on complementary items.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

This paper presents a novel data mining framework for the exploration and extraction of actionable knowledge from data generated by electricity meters. Although a rich source of information for energy consumption analysis, electricity meters produce a voluminous, fast-paced, transient stream of data that conventional approaches are unable to address entirely. In order to overcome these issues, it is important for a data mining framework to incorporate functionality for interim summarization and incremental analysis using intelligent techniques. The proposed Incremental Summarization and Pattern Characterization (ISPC) framework demonstrates this capability. Stream data is structured in a data warehouse based on key dimensions enabling rapid interim summarization. Independently, the IPCL algorithm incrementally characterizes patterns in stream data and correlates these across time. Eventually, characterized patterns are consolidated with interim summarization to facilitate an overall analysis and prediction of energy consumption trends. Results of experiments conducted using the actual data from electricity meters confirm applicability of the ISPC framework.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

In this paper we demonstrate our signature based detector for self-propagating worms. We use a set of worm and benign traffic traces of several endpoints to build benign and worm profiles. These profiles were arranged into separate n-ary trees. We also demonstrate our anomaly detector that was used to deal with tied matches between worm and benign trees. We analyzed the performance of each detector and also with their integration. Results show that our signature based detector can detect very high true positive. Meanwhile, the anomaly detector did not achieve high true positive. Both detectors, when used independently, suffer high false positive. However, when both detectors were integrated they maintained a high detection rate of true positive and minimized the false positive