A Two-stage information filtering based on rough decision rule and pattern mining


Autoria(s): Zhou, Xujuan; Li, Yuefeng; Bruza, Peter; Xu, Yue; Lau, Raymond
Data(s)

01/11/2010

Resumo

Information Overload and Mismatch are two fundamental problems affecting the effectiveness of information filtering systems. Even though both term-based and patternbased approaches have been proposed to address the problems of overload and mismatch, neither of these approaches alone can provide a satisfactory solution to address these problems. This paper presents a novel two-stage information filtering model which combines the merits of term-based and pattern-based approaches to effectively filter sheer volume of information. In particular, the first filtering stage is supported by a novel rough analysis model which efficiently removes a large number of irrelevant documents, thereby addressing the overload problem. The second filtering stage is empowered by a semantically rich pattern taxonomy mining model which effectively fetches incoming documents according to the specific information needs of a user, thereby addressing the mismatch problem. The experimental results based on the RCV1 corpus show that the proposed twostage filtering model significantly outperforms the both termbased and pattern-based information filtering models.

Formato

application/pdf

Identificador

http://eprints.qut.edu.au/39353/

Publicador

Academy Publisher

Relação

http://eprints.qut.edu.au/39353/1/3637-9089-1-PB.pdf

DOI:10.4304/jetwi.2.4.326-332

Zhou, Xujuan, Li, Yuefeng, Bruza, Peter, Xu, Yue, & Lau, Raymond (2010) A Two-stage information filtering based on rough decision rule and pattern mining. Journal of Emerging Technologies in Web Intelligence, 2(4), pp. 326-332.

Direitos

Copyright 2010 Academy Publisher

Fonte

Computer Science; Faculty of Science and Technology; Information Systems

Palavras-Chave #Information Filtering #User Profiles #Rough Set Theory #Pattern Mining
Tipo

Journal Article