145 resultados para trajectory mining


Relevância:

20.00% 20.00%

Publicador:

Resumo:

This paper reports on the preparation and management processes of inconsistent data on damage on residential houses in Victoria, Australia. There are no existing specific and fully relevant databases readily available except for the incomplete paper-based and electronic-based reports. Therefore, the extracting of information from the reports is complicated and time consuming in order to extract and include all the necessary information needed for analysis of damage on residential houses founded on expansive soils. Data mining is adopted to develop a database. Statistical methods and Artificial Intelligence methods are used to quantify the quality of data. The paper concludes that the development of such database could enable BHC to evaluate the usefulness of the reports prepared on the reported damage properties for further analysis.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Purpose – The purpose of this study is to examine the exposures of Australian gold mining firms in the highly volatile period from 1995 to 2000. This period has been characterized by significant changes in gold price due to bulk sale of gold by collective central banks. Specifically, the paper aims to investigate several firm-specific factors that are hypothesized to carry substantial influence on gold beta.

Design/methodology/approach – To estimate gold beta, we use the following multifactor model: Rg,t = a+ßgGPRt + ßxFXRt + ßmRm,t + Et , where Rg,t is the return on the gold stock Index at time t, GPRt is the gold price return denominated in US dollar at time t, FXRt is the foreign exchange return of Australian dollar in terms of US dollar at time t, Rm,t is the market return at time t, and Et is the random error term at time t.

Findings – The paper finds that the values of gold beta are consistently greater than one, implying the sensitive nature of firms’ stock returns to gold price changes. This also suggests that investors holding gold mining stock would receive higher percentage increases in stock returns from a percentage increase in gold price returns, as opposed to investors holding gold bullion. Furthermore, these values have changed substantially over time with significant changes in gold price volatility. The most important and consistent relationship that we find is the impact of firms’ hedging behavior on their respective gold betas. This is consistent with Tufano’s study. It implies that firms, which hedge a greater proportion of their gold reserves, are less sensitive to movements in gold prices. The finding therefore supports the risk management theory that hedging increases shareholder’s wealth. However, cash operating costs, cash reserves and the level of gold production seem to influence very little on the firms’ exposure to gold price changes.

Originality/value – This study is of interest and important to the stock mining companies and investors because the extent of the effect of gold price movements on the stock returns of gold mining companies has significant impacts on returns for both firms and investors especially in their risk management and investment decisions, respectively.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Researchers have been endeavoring to discover concise sets of episode rules instead of complete sets in sequences. Existing approaches, however, are not able to process complex sequences and can not guarantee the accuracy of resulting sets due to the violation of anti-monotonicity of the frequency metric. In some real applications, episode rules need to be extracted from complex sequences in which multiple items may appear in a time slot. This paper investigates the discovery of concise episode rules in complex sequences. We define a concise representation called non-derivable episode rules and formularize the mining problem. Adopting a novel anti-monotonic frequency metric, we then develop a fast approach to discover non-derivable episode rules in complex sequences. Experimental results demonstrate that the utility of the proposed approach substantially reduces the number of rules and achieves fast processing.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Class imbalance in textual data is one important factor that affects the reliability of text mining. For imbalanced textual data, conventional classifiers tend to have a strong performance bias, which results in high accuracy rate on the majority class but very low rate on the minorities. An extreme strategy for unbalanced learning is to discard the majority instances and apply one-class classification to the minority class. However, this could easily cause another type of bias, which increases the accuracy rate on minorities by sacrificing the majorities. This chapter aims to investigate approaches that reduce these two types of performance bias and improve the reliability of discovered classification rules. Experimental results show that the inexact field learning method and parameter optimized one class classifiers achieve more balanced performance than the standard approaches.

Relevância:

20.00% 20.00%

Publicador:

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The problem of extracting infrequent patterns from streams and building associations between these patterns is becoming increasingly relevant today as many events of interest such as attacks in network data or unusual stories in news data occur rarely. The complexity of the problem is compounded when a system is required to deal with data from multiple streams. To address these problems, we present a framework that combines the time based association mining with a pyramidal structure that allows a rolling analysis of the stream and maintains a synopsis of the data without requiring increasing memory resources. We apply the algorithms and show the usefulness of the techniques. © 2007 Crown Copyright.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Many previous approaches to frequent episode discovery only accept simple sequences. Although a recent approach has been able to nd frequent episodes from complex sequences, the discovered sets are neither condensed nor accurate. This paper investigates the discovery of condensed sets of frequent episodes from complex sequences. We adopt a novel anti-monotonic frequency measure based on non-redundant occurrences, and dene a condensed set, nDaCF (the set of non-derivable approximately closed frequent episodes) within a given maximal error bound of support. We then introduce a series of effective pruning strategies, and develop a method, nDaCF-Miner, for discovering nDaCF sets. Experimental results show that, when the error bound is somewhat high, the discovered nDaCF sets are two orders of magnitude smaller than complete sets, and nDaCF-miner is more efficient than previous mining approaches. In addition, the nDaCF sets are more accurate than the sets found by previous approaches.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The development and application of computational data mining techniques in financial fraud detection and business failure prediction has become a popular cross-disciplinary research area in recent times involving financial economists, forensic accountants and computational modellers. Some of the computational techniques popularly used in the context of - financial fraud detection and business failure prediction can also be effectively applied in the detection of fraudulent insurance claims and therefore, can be of immense practical value to the insurance industry. We provide a comparative analysis of prediction performance of a battery of data mining techniques using real-life automotive insurance fraud data. While the data we have used in our paper is US-based, the computational techniques we have tested can be adapted and generally applied to detect similar insurance frauds in other countries as well where an organized automotive insurance industry exists.