Subsequence frequency measurement and its impact on knowledge discovery in single sequences


Author(s): Gan, Min; Dai, Honghua
Contributor(s)

Dai, Honghua

Liu, James N. K.

Smirnov, Evgueni

Date(s)

01/01/2012

Abstract

Subsequence frequency measurement is a basic and essential problem in knowledge discovery in single sequences. Frequency-based knowledge discovery in single sequences tends to be unreliable, since different result sets may be obtained from the same sequence when different frequency metrics are adopted. In this chapter, we investigate subsequence frequency measurement and its impact on the reliability of knowledge discovery in single sequences. We analyse seven previous frequency metrics, identify their inherent inaccuracies, and explore their impact on two kinds of knowledge discovered from single sequences: frequent episodes and episode rules. We further give three suggestions for frequency metrics and introduce a new frequency metric to improve reliability. Empirical evaluation reveals the inaccuracies and verifies our findings.

Identifier

http://hdl.handle.net/10536/DRO/DU:30043152

Language(s)

eng

Publisher

Springer

Relation

http://dro.deakin.edu.au/eserv/DU:30043152/gan-subsequencefrequency-2012.pdf

http://dro.deakin.edu.au/eserv/DU:30043152/gan-subsequencefrequency-evidence-2012.pdf

http://dx.doi.org/10.1007/978-1-4614-1903-7_14

Rights

2012, Springer Science+Business Media, LLC

Keywords

#subsequence frequency measurement #frequency metrics #frequency based knowledge discovery

Type

Book Chapter