Alignment-based extraction of multiword expressions
Contribuinte(s) |
UNIVERSIDADE DE SÃO PAULO |
---|---|
Data(s) |
20/10/2012
20/10/2012
2010
|
Resumo |
Due to idiosyncrasies in their syntax, semantics or frequency, Multiword Expressions (MWEs) have received special attention from the NLP community, as the methods and techniques developed for the treatment of simplex words are not necessarily suitable for them. This is certainly the case for the automatic acquisition of MWEs from corpora. A lot of effort has been directed to the task of automatically identifying them, with considerable success. In this paper, we propose an approach for the identification of MWEs in a multilingual context, as a by-product of a word alignment process, that not only deals with the identification of possible MWE candidates, but also associates some multiword expressions with semantics. The results obtained indicate the feasibility and low costs in terms of tools and resources demanded by this approach, which could, for example, facilitate and speed up lexicographic work. Brazilian agencies FAPESP[02/13207-8] Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP) Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq) CNPq[550388/2005-2] SEBRAE Financiadora de Estudos e Projetos (FINEP) SEBRAE/FINEP[1194/07] Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES) CAPES[CAPES/COFECUB 548/07] |
Identificador |
LANGUAGE RESOURCES AND EVALUATION, v.44, n.1/Fev, p.59-77, 2010 1574-020X http://producao.usp.br/handle/BDPI/28760 10.1007/s10579-009-9097-9 |
Idioma(s) |
eng |
Publicador |
SPRINGER |
Relação |
Language Resources and Evaluation |
Direitos |
restrictedAccess Copyright SPRINGER |
Palavras-Chave | #Automatic identification #Word alignment #Machine translation #Terminology #Multiword expressions #Lexical acquisition #Statistical methods #Computer Science, Interdisciplinary Applications |
Tipo |
article original article publishedVersion |