Alignment-based extraction of multiword expressions


Autoria(s): CASELI, Helena de Medeiros; RAMISCH, Carlos; NUNES, Maria das Gracas Volpe; VILLAVICENCIO, Aline
Contribuinte(s)

UNIVERSIDADE DE SÃO PAULO

Data(s)

20/10/2012

20/10/2012

2010

Resumo

Due to idiosyncrasies in their syntax, semantics or frequency, Multiword Expressions (MWEs) have received special attention from the NLP community, as the methods and techniques developed for the treatment of simplex words are not necessarily suitable for them. This is certainly the case for the automatic acquisition of MWEs from corpora. A lot of effort has been directed to the task of automatically identifying them, with considerable success. In this paper, we propose an approach for the identification of MWEs in a multilingual context, as a by-product of a word alignment process, that not only deals with the identification of possible MWE candidates, but also associates some multiword expressions with semantics. The results obtained indicate the feasibility and low costs in terms of tools and resources demanded by this approach, which could, for example, facilitate and speed up lexicographic work.

Brazilian agencies FAPESP[02/13207-8]

Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)

Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq)

CNPq[550388/2005-2]

SEBRAE

Financiadora de Estudos e Projetos (FINEP)

SEBRAE/FINEP[1194/07]

Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES)

CAPES[CAPES/COFECUB 548/07]

Identificador

LANGUAGE RESOURCES AND EVALUATION, v.44, n.1/Fev, p.59-77, 2010

1574-020X

http://producao.usp.br/handle/BDPI/28760

10.1007/s10579-009-9097-9

http://dx.doi.org/10.1007/s10579-009-9097-9

Idioma(s)

eng

Publicador

SPRINGER

Relação

Language Resources and Evaluation

Direitos

restrictedAccess

Copyright SPRINGER

Palavras-Chave #Automatic identification #Word alignment #Machine translation #Terminology #Multiword expressions #Lexical acquisition #Statistical methods #Computer Science, Interdisciplinary Applications
Tipo

article

original article

publishedVersion