Information extraction from web services : a comparison of Tokenisation algorithms


Author(s): Metke-Jimenez, Alejandro; Raymond, Kerry; MacColl, Ian
Date(s)

09/08/2011

Abstract

Most web service discovery systems use keyword-based search algorithms. Although these are partially successful, they sometimes fail to satisfy users' information needs. This has given rise to several semantics-based approaches that aim to go beyond simple attribute matching and capture the semantics of services. However, the results reported in the literature vary, and in many cases they are worse than the results obtained by keyword-based systems. We believe that the accuracy of the mechanisms used to extract tokens from the non-natural-language sections of WSDL files directly affects the performance of these techniques, because some of them can be more sensitive to noise. In this paper, three existing tokenisation algorithms are evaluated, and a new algorithm that outperforms all the algorithms found in the literature is introduced.
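
To make the token extraction problem concrete, the sketch below splits a WSDL-style identifier on separators and on case and digit boundaries. It is a naive, hypothetical baseline written for illustration only: the function name `tokenise` and the regular expression are assumptions, and it is not one of the three algorithms evaluated in the paper, nor the new algorithm the paper introduces.

```python
import re
from typing import List

# Hypothetical baseline for illustration; not an algorithm from the paper.
# Alternatives are tried in order at each position in the identifier.
_TOKEN = re.compile(
    r"[A-Z]+(?=[A-Z][a-z])"  # acronym directly before a capitalised word ("ZIP" in "ZIPCode")
    r"|[A-Z]?[a-z]+"         # optional capital followed by lowercase letters ("get", "Weather")
    r"|[A-Z]+"               # remaining run of capitals ("XML")
    r"|[0-9]+"               # run of digits ("2")
)


def tokenise(identifier: str) -> List[str]:
    """Split an identifier into lowercase tokens.

    Characters that match none of the alternatives (underscores,
    hyphens, dots) are skipped and therefore act as separators.
    """
    return [match.group(0).lower() for match in _TOKEN.finditer(identifier)]


if __name__ == "__main__":
    for name in ("getWeatherByZIPCode", "HTTP_Request2XML"):
        print(name, "->", tokenise(name))
```

With these inputs the sketch yields `['get', 'weather', 'by', 'zip', 'code']` and `['http', 'request', '2', 'xml']`. Identifiers with no case or separator cues at all, such as an all-lowercase concatenation of words, are returned as a single token, which is the kind of noise the abstract suggests can affect downstream techniques.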

Format

application/pdf

Identifier

http://eprints.qut.edu.au/43885/

Relation

http://eprints.qut.edu.au/43885/1/MetkeTok2011.pdf

DOI:10.5220/0003698000120023

Metke-Jimenez, Alejandro, Raymond, Kerry, & MacColl, Ian (2011) Information extraction from web services : a comparison of Tokenisation algorithms. In SKY2011 Workshop : Discovery and Representation of Runnable Knowledge, 26 October 2011, Paris.

Rights

Copyright 2011 [Please consult the authors]

Source

Computer Science; Faculty of Science and Technology; Smart Services CRC

Keywords

#080300 COMPUTER SOFTWARE

Type

Conference Paper