Text mining with semantic annotation : using enriched text representation for entity-oriented retrieval, semantic relation identification and text clustering


Autoria(s): Hou, Jun
Data(s)

2014

Resumo

This project is a step forward in the study of text mining where enhanced text representation with semantic information plays a significant role. It develops effective methods of entity-oriented retrieval, semantic relation identification and text clustering utilizing semantically annotated data. These methods are based on enriched text representation generated by introducing semantic information extracted from Wikipedia into the input text data. The proposed methods are evaluated against several start-of-art benchmarking methods on real-life data-sets. In particular, this thesis improves the performance of entity-oriented retrieval, identifies different lexical forms for an entity relation and handles clustering documents with multiple feature spaces.

Formato

application/pdf

Identificador

http://eprints.qut.edu.au/79206/

Publicador

Queensland University of Technology

Relação

http://eprints.qut.edu.au/79206/1/Jun_Hou_Thesis.pdf

Hou, Jun (2014) Text mining with semantic annotation : using enriched text representation for entity-oriented retrieval, semantic relation identification and text clustering. PhD thesis, Queensland University of Technology.

Fonte

School of Electrical Engineering & Computer Science; Science & Engineering Faculty

Palavras-Chave #Text Mining #Semantic Annotation #Entity-oriented Retrieval #Semantic Relation Identification #Clustering #Cluster Ensemble Learning #High-Order Co-Clustering #Multiple Subspace Learning #Concept-based Retrieval #Open Information Extraction
Tipo

Thesis