Automatically Annotating Structured Web Data Using a SVM-Based Multiclass Classifier


Autoria(s): Weng, Daiyue; Hong, Jun; Bell, David A.
Data(s)

2014

Resumo

In this paper, we propose a new learning approach to Web data annotation, where a support vector machine-based multiclass classifier is trained to assign labels to data items. For data record extraction, a data section re-segmentation algorithm based on visual and content features is introduced to improve the performance of Web data record extraction. We have implemented the proposed approach and tested it with a large set of Web query result pages in different domains. Our experimental results show that our proposed approach is highly effective and efficient.

Identificador

http://pure.qub.ac.uk/portal/en/publications/automatically-annotating-structured-web-data-using-a-svmbased-multiclass-classifier(45cf32fb-f74b-4547-accf-9efec2da260f).html

http://dx.doi.org/10.1007/978-3-319-11749-2_9

Idioma(s)

eng

Publicador

Springer

Direitos

info:eu-repo/semantics/restrictedAccess

Fonte

Weng , D , Hong , J & Bell , D A 2014 , Automatically Annotating Structured Web Data Using a SVM-Based Multiclass Classifier . in Web Information Systems Engineering - WISE 2014: 15th International Conference, Thessaloniki, Greece, October 12-14, 2014, Proceedings, Part I . Lecture Notes in Computer Science , vol. 8786 , Springer , pp. 115 - 124 . DOI: 10.1007/978-3-319-11749-2_9

Tipo

contributionToPeriodical