Semantic labelling for document feature patterns using ontological subjects


Autoria(s): Tao, Xiaohui; Li, Yuefeng; Liu, Bin; Shen, Yan
Contribuinte(s)

Zhong, Ning

Gong, Zhiguo

Data(s)

04/12/2012

Resumo

Finding and labelling semantic features patterns of documents in a large, spatial corpus is a challenging problem. Text documents have characteristics that make semantic labelling difficult; the rapidly increasing volume of online documents makes a bottleneck in finding meaningful textual patterns. Aiming to deal with these issues, we propose an unsupervised documnent labelling approach based on semantic content and feature patterns. A world ontology with extensive topic coverage is exploited to supply controlled, structured subjects for labelling. An algorithm is also introduced to reduce dimensionality based on the study of ontological structure. The proposed approach was promisingly evaluated by compared with typical machine learning methods including SVMs, Rocchio, and kNN.

Identificador

http://eprints.qut.edu.au/58294/

Publicador

IEEE Computer Society Conference Publishing Services (CPS)

Relação

DOI:10.1109/WI-IAT.2012.47

Tao, Xiaohui, Li, Yuefeng, Liu, Bin, & Shen, Yan (2012) Semantic labelling for document feature patterns using ontological subjects. In Zhong, Ning & Gong, Zhiguo (Eds.) 2012 IEEE/WIC/ACM International Conference on Web Intelligence, IEEE Computer Society Conference Publishing Services (CPS), Macau, China, pp. 530-534.

Direitos

Copyright 2012 IEEE

Fonte

School of Electrical Engineering & Computer Science; Science & Engineering Faculty

Palavras-Chave #080109 Pattern Recognition and Data Mining #Text classification #Ontology Learning #Feature selection
Tipo

Conference Paper