Mining Negative Relevance Feedback for Information Filtering


Autoria(s): Li, Yuefeng; Algarni, Abdulmohsen; Wu, Sheng-Tang; Xu, Yue
Data(s)

15/09/2009

Resumo

It is a big challenge to clearly identify the boundary between positive and negative streams. Several attempts have used negative feedback to solve this challenge; however, there are two issues for using negative relevance feedback to improve the effectiveness of information filtering. The first one is how to select constructive negative samples in order to reduce the space of negative documents. The second issue is how to decide noisy extracted features that should be updated based on the selected negative samples. This paper proposes a pattern mining based approach to select some offenders from the negative documents, where an offender can be used to reduce the side effects of noisy features. It also classifies extracted features (i.e., terms) into three categories: positive specific terms, general terms, and negative specific terms. In this way, multiple revising strategies can be used to update extracted features. An iterative learning algorithm is also proposed to implement this approach on RCV1, and substantial experiments show that the proposed approach achieves encouraging performance.

Formato

application/pdf

Identificador

http://eprints.qut.edu.au/29276/

Publicador

IEEE Computer Society Conference Publications

Relação

http://eprints.qut.edu.au/29276/1/c29276.pdf

DOI:10.1109/WI-IAT.2009.103

Li, Yuefeng, Algarni, Abdulmohsen, Wu, Sheng-Tang, & Xu, Yue (2009) Mining Negative Relevance Feedback for Information Filtering. In Proceedings of the 2009 IEEE/WIC/ACM International Joint Conferences on Web Intelligence and Intelligent Agent Technologies (WI-IAT 2009), IEEE Computer Society Conference Publications, University of Milano, Milan, pp. 606-613.

http://purl.org/au-research/grants/ARC/DP0988007

Direitos

Copyright 2009 IEEE

Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.

Fonte

Faculty of Science and Technology

Palavras-Chave #080109 Pattern Recognition and Data Mining #080704 Information Retrieval and Web Search #Information Filtering #Text mining #Algorithm #Negative feedback #Pattern mining
Tipo

Conference Paper