Hierarchical rule generalisation for speaker identification in fiction books


Autoria(s): Glass, Kevin; Bangay, Shaun
Contribuinte(s)

Bishop, Judith

Kourie, Derrick

Data(s)

01/01/2006

Resumo

This paper presents a hierarchical pattern matching and generalisation technique which is applied to the problem of locating the correct speaker of quoted speech found in fiction books. Patterns from a training set are generalised to create a small number of rules, which can be used to identify items of interest within the text. The pattern matching technique is applied to finding the Speech-Verb, Actor and Speaker of quotes found in ction books. The technique performs well over the training data, resulting in rule-sets many times smaller than the training set, but providing very high accuracy. While the rule-set generalised from one book is less effective when applied to different books than an approach based on hand coded heuristics, performance is comparable when testing on data closely related to the training set.<br />

Identificador

http://hdl.handle.net/10536/DRO/DU:30039201

Idioma(s)

eng

Publicador

South African Institute for Computer Scientists and Information Technologists

Relação

http://dro.deakin.edu.au/eserv/DU:30039201/bangay-hierachicalrule-2006.pdf

http://www.cs.ru.ac.za/research/g05g1909/papers/GLASS_rule_generalisatio.pdf

Direitos

2006, SAICSIT

Palavras-Chave #pattern matching #machine Learning #generalisation
Tipo

Conference Paper