Part-of-Speech Tagging using Parallel Weighted Finite-State Transducers
Contributor(s) |
University of Helsinki, Department of Modern Languages University of Helsinki, Department of Modern Languages |
---|---|
Date(s) |
01/08/2010
|
Abstract |
We use parallel weighted finite-state transducers to implement a part-of-speech tagger, which obtains state-of-the-art accuracy when used to tag the Europarl corpora for Finnish, Swedish and English. Our system consists of a weighted lexicon and a guesser combined with a bigram model factored into two weighted transducers. We use both lemmas and tag sequences in the bigram model, which guarantees reliable bigram estimates. |
Format |
12 |
Identifier | |
Language(s) |
eng |
Relation |
Proceedings of IceTAL 2010 7th International Conference on Natural Language Processing |
Source |
Silfverberg, M & Linden, K 2010, 'Part-of-Speech Tagging using Parallel Weighted Finite-State Transducers' in Proceedings of IceTAL 2010: 7th International Conference on Natural Language Processing. |
Keywords | #612 Languages and Literature |
Type |
A4 Article in conference publication (refereed) info:eu-repo/semantics/conferencePaper http://purl.org/eprint/status/NonPeerReviewed |
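
The abstract describes combining a weighted lexicon and a guesser with a bigram model. The sketch below illustrates only the general idea of adding lexical and bigram weights along a path and keeping the cheapest one; it is not the authors' implementation and uses no transducer library. It assumes weights are negative log probabilities (tropical semiring), uses tag bigrams only (the paper also uses lemmas), and every name and number in it (`LEXICON`, `BIGRAMS`, `GUESS`, `UNSEEN`, `tag`) is hypothetical.

```python
# Toy weights, all hypothetical: negative log probabilities, so weights add
# along a path and the best tagging is the one with the smallest total.
LEXICON = {
    "the": {"DET": 0.1},
    "can": {"NOUN": 1.6, "VERB": 0.4},
    "rusts": {"NOUN": 1.2, "VERB": 0.5},
}

# Weights for (previous tag, current tag); "<s>" marks the sentence start.
BIGRAMS = {
    ("<s>", "DET"): 0.3, ("<s>", "NOUN"): 1.0, ("<s>", "VERB"): 2.0,
    ("DET", "NOUN"): 0.2, ("DET", "VERB"): 3.0,
    ("NOUN", "VERB"): 0.5, ("NOUN", "NOUN"): 1.5,
    ("VERB", "NOUN"): 1.0, ("VERB", "VERB"): 2.5,
}

UNSEEN = 10.0                        # penalty for tag pairs missing from BIGRAMS
GUESS = {"NOUN": 3.0, "VERB": 3.5}   # stand-in "guesser" for unknown words


def tag(words):
    """Return (total weight, tag sequence) of the cheapest tagging."""
    # best[t] = (weight, tags) of the cheapest path that ends in tag t
    best = {"<s>": (0.0, [])}
    for word in words:
        analyses = LEXICON.get(word.lower(), GUESS)
        new_best = {}
        for t, lexical_weight in analyses.items():
            # Extend every surviving path with tag t: path weight plus
            # bigram weight plus lexical weight.
            candidates = [
                (w + BIGRAMS.get((prev, t), UNSEEN) + lexical_weight, path + [t])
                for prev, (w, path) in best.items()
            ]
            new_best[t] = min(candidates, key=lambda c: c[0])
        best = new_best
    return min(best.values(), key=lambda c: c[0])


weight, tags = tag(["the", "can", "rusts"])
print(tags, round(weight, 2))
```

With these toy weights the lexicon alone would prefer the verb reading of "can", but the high weight on the DET-to-VERB bigram makes the noun reading cheaper overall, which is the kind of interaction between lexical and contextual weights the abstract refers to.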