Part-of-Speech Tagging using Parallel Weighted Finite-State Transducers


Author(s): Silfverberg, Miikka; Linden, Krister
Contributor(s)

University of Helsinki, Department of Modern Languages

Date(s)

01/08/2010

Abstract

We use parallel weighted finite-state transducers to implement a part-of-speech tagger, which obtains state-of-the-art accuracy when used to tag the Europarl corpora for Finnish, Swedish and English. Our system consists of a weighted lexicon and a guesser combined with a bigram model factored into two weighted transducers. We use both lemmas and tag sequences in the bigram model, which guarantees reliable bigram estimates.

Format

12

Identifier

http://hdl.handle.net/10138/29357

Language(s)

eng

Relation

Proceedings of IceTAL 2010: 7th International Conference on Natural Language Processing

Source

Silfverberg, M & Linden, K 2010, 'Part-of-Speech Tagging using Parallel Weighted Finite-State Transducers' in Proceedings of IceTAL 2010: 7th International Conference on Natural Language Processing.

Keywords

#612 Languages and Literature

Type

A4 Article in conference publication (refereed)

info:eu-repo/semantics/conferencePaper

http://purl.org/eprint/status/NonPeerReviewed