Rough sets based reasoning and pattern mining for a two-stage information filtering system


Autoria(s): Zhou, Xujuan; Li, Yuefeng; Bruza, Peter D.; Xu, Yue; Lau, Raymond Y.K.
Data(s)

2010

Resumo

This paper presents a novel two-stage information filtering model which combines the merits of term-based and pattern- based approaches to effectively filter sheer volume of information. In particular, the first filtering stage is supported by a novel rough analysis model which efficiently removes a large number of irrelevant documents, thereby addressing the overload problem. The second filtering stage is empowered by a semantically rich pattern taxonomy mining model which effectively fetches incoming documents according to the specific information needs of a user, thereby addressing the mismatch problem. The experiments have been conducted to compare the proposed two-stage filtering (T-SM) model with other possible "term-based + pattern-based" or "term-based + term-based" IF models. The results based on the RCV1 corpus show that the T-SM model significantly outperforms other types of "two-stage" IF models.

Identificador

http://eprints.qut.edu.au/42071/

Publicador

ACM

Relação

DOI:10.1145/1871437.1871639

Zhou, Xujuan, Li, Yuefeng, Bruza, Peter D., Xu, Yue, & Lau, Raymond Y.K. (2010) Rough sets based reasoning and pattern mining for a two-stage information filtering system. In Proceedings of the 19th ACM International Conference on Information and Knowledge Management, ACM, Fairmont Royal York, Toronto, pp. 1429-1432.

Direitos

Copyright 2010 ACM

Fonte

Faculty of Science and Technology

Palavras-Chave #080600 INFORMATION SYSTEMS #filtering model #term-based, pattern- based #rough analysis
Tipo

Conference Paper