A study on the accuracy of frequency measures and its impact on knowledge discovery in single sequences


Author(s): Gan, Min; Dai, Honghua
Contributor(s)

Fan, Wei

Hsu, Wynne

Webb, Geoffrey I.

Liu, Bing

Zhang, Chengqi

Gunopulos, Dimitrios

Wu, Xindong

Date(s)

01/01/2010

Abstract

In knowledge discovery in single sequences, different results could be discovered from the same sequence when different frequency measures are adopted. It is natural to raise such questions as (1) do these frequency measures reflect actual frequencies accurately? (2) what impacts do frequency measures have on discovered knowledge? (3) are discovered results accurate and reliable? and (4) which measures are appropriate for reflecting frequencies accurately? In this paper, taking three major factors (anti-monotonicity, maximum-frequency and window-width restriction) into account, we identify inaccuracies inherent in seven existing frequency measures, and investigate their impacts on the soundness and completeness of two kinds of knowledge, frequent episodes and episode rules, discovered from single sequences. In order to obtain more accurate frequencies and knowledge, we provide three recommendations for defining appropriate frequency measures. Following the recommendations, we introduce a more appropriate frequency measure. Empirical evaluation reveals the inaccuracies and verifies our findings. 

Identifier

http://hdl.handle.net/10536/DRO/DU:30035409

Language(s)

eng

Publisher

IEEE Computer Society

Relation

http://dro.deakin.edu.au/eserv/DU:30035409/dai-astudy-2010.pdf

http://dro.deakin.edu.au/eserv/DU:30035409/dai-astudyreview-2010.pdf

http://dro.deakin.edu.au/eserv/DU:30035409/dai-icdmwconference-2010.pdf

http://ieeexplore.ieee.org/xpls/abs_all.jsp?tp=

Rights

2010, IEEE

Keywords #frequency measures #episodes #single sequences #knowledge discovery

Type

Conference Paper