Automatically Annotating Structured Web Data Using a SVM-Based Multiclass Classifier
Data(s) |
2014
|
---|---|
Resumo |
In this paper, we propose a new learning approach to Web data annotation, where a support vector machine-based multiclass classifier is trained to assign labels to data items. For data record extraction, a data section re-segmentation algorithm based on visual and content features is introduced to improve the performance of Web data record extraction. We have implemented the proposed approach and tested it with a large set of Web query result pages in different domains. Our experimental results show that our proposed approach is highly effective and efficient. |
Identificador | |
Idioma(s) |
eng |
Publicador |
Springer |
Direitos |
info:eu-repo/semantics/restrictedAccess |
Fonte |
Weng , D , Hong , J & Bell , D A 2014 , Automatically Annotating Structured Web Data Using a SVM-Based Multiclass Classifier . in Web Information Systems Engineering - WISE 2014: 15th International Conference, Thessaloniki, Greece, October 12-14, 2014, Proceedings, Part I . Lecture Notes in Computer Science , vol. 8786 , Springer , pp. 115 - 124 . DOI: 10.1007/978-3-319-11749-2_9 |
Tipo |
contributionToPeriodical |