Using syntactic and semantic features for classifying mo- dal values in the portuguese language
Data(s) |
06/02/2017
06/02/2017
01/04/2016
|
---|---|
Resumo |
This paper presents a study made in a field poorly explored in the Portuguese language – modality and its automatic tagging. Our main goal was to find a set of attributes for the creation of automatic tag- gers with improved performance over the bag-of-words (bow) approach. The performance was measured using precision, recall and F1. Because it is a relatively unexplored field, the study covers the creation of the corpus (composed by eleven verbs), the use of a parser to extract syntac- tic and semantic information from the sentences and a machine learning approach to identify modality values. Based on three different sets of attributes – from trigger itself and the trigger’s path (from the parse tree) and context – the system creates a tagger for each verb achiev- ing (in almost every verb) an improvement in F1 when compared to the traditional bow approach. |
Identificador |
ao Sequeira, Teresa Gon ̧calves, Paulo Quaresma, Am ́alia Mendes, and Iris Hendrickx. Using syntactic and semantic features for classifying mo- dal values in the portuguese language. In CICLing-16, 17th international Conference on Intelligent Text Processing and Computational Linguistics, Lecture Notes in Computer Science. Springer, 2016. http://hdl.handle.net/10174/20650 nd tcg@uevora.pt pq@uevora.pt nd nd 498 |
Idioma(s) |
por |
Publicador |
Springer |
Direitos |
restrictedAccess |
Tipo |
article |