A fast algorithm for finding frequent episodes in event streams


Autoria(s): Laxman, Srivatsan; Sastry, PS; Unnikrishnan, KP
Data(s)

01/08/2007

Resumo

Frequent episode discovery is a popular framework for mining data available as a long sequence of events. An episode is essentially a short ordered sequence of event types and the frequency of an episode is some suitable measure of how often the episode occurs in the data sequence. Recently,we proposed a new frequency measure for episodes based on the notion of non-overlapped occurrences of episodes in the event sequence, and showed that, such a definition, in addition to yielding computationally efficient algorithms, has some important theoretical properties in connecting frequent episode discovery with HMM learning. This paper presents some new algorithms for frequent episode discovery under this non-overlapped occurrences-based frequency definition. The algorithms presented here are better (by a factor of N, where N denotes the size of episodes being discovered) in terms of both time and space complexities when compared to existing methods for frequent episode discovery. We show through some simulation experiments, that our algorithms are very efficient. The new algorithms presented here have arguably the least possible orders of spaceand time complexities for the task of frequent episode discovery.

Formato

application/pdf

Identificador

http://eprints.iisc.ernet.in/41513/1/A_Fast_Algorithm.pdf

Laxman, Srivatsan and Sastry, PS and Unnikrishnan, KP (2007) A fast algorithm for finding frequent episodes in event streams. In: KDD '07 Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining, Aug. 2007, New York, NY.

Publicador

ACM Press

Relação

http://dl.acm.org/citation.cfm?id=1281238

http://eprints.iisc.ernet.in/41513/

Palavras-Chave #Electrical Engineering
Tipo

Conference Paper

PeerReviewed