Finite-State Spell-Checking with Weighted Language and Error Models


Autoria(s): Pirinen, Tommi; Linden, Krister
Contribuinte(s)

University of Helsinki, Käyttäytymistieteellisen tiedekunnan kanslia

University of Helsinki, Department of Modern Languages

Data(s)

01/05/2010

Resumo

In this paper we present simple methods for construction and evaluation of finite-state spell-checking tools using an existing finite-state lexical automaton, freely available finite-state tools and Internet corpora acquired from projects such as Wikipedia. As an example, we use a freely available open-source implementation of Finnish morphology, made with traditional finite-state morphology tools, and demonstrate rapid building of Northern Sámi and English spell checkers from tools and resources available from the Internet.

Identificador

http://hdl.handle.net/10138/29358

Idioma(s)

eng

Relação

Proceedings of LREC 2010 Workshop on Creation and use of basic lexical resources for less-resourced languages

Fonte

Pirinen , T & Linden , K 2010 , ' Finite-State Spell-Checking with Weighted Language and Error Models : Building and Evaluating Spell-Checkers with Wikipedia as Corpus ' in Proceedings of LREC 2010 : Workshop on Creation and use of basic lexical resources for less-resourced languages .

Palavras-Chave #612 Languages and Literature
Tipo

A4 Article in conference publication (refereed)

info:eu-repo/semantics/conferencePaper

http://purl.org/eprint/status/NonPeerReviewed