Using metrics from complex networks to evaluate machine translation


Autoria(s): AMANCIO, D. R.; NUNES, M. G. V.; OLIVEIRA JUNIOR, Osvaldo Novais de; PARDO, T. A. S.; ANTIQUEIRA, L.; COSTA, Luciano da Fontoura
Contribuinte(s)

UNIVERSIDADE DE SÃO PAULO

Data(s)

20/10/2012

20/10/2012

2011

Resumo

Establishing metrics to assess machine translation (MT) systems automatically is now crucial owing to the widespread use of MT over the web. In this study we show that such evaluation can be done by modeling text as complex networks. Specifically, we extend our previous work by employing additional metrics of complex networks, whose results were used as input for machine learning methods and allowed MT texts of distinct qualities to be distinguished. Also shown is that the node-to-node mapping between source and target texts (English-Portuguese and Spanish-Portuguese pairs) can be improved by adding further hierarchical levels for the metrics out-degree, in-degree, hierarchical common degree, cluster coefficient, inter-ring degree, intra-ring degree and convergence ratio. The results presented here amount to a proof-of-principle that the possible capturing of a wider context with the hierarchical levels may be combined with machine learning methods to yield an approach for assessing the quality of MT systems. (C) 2010 Elsevier B.V. All rights reserved.

CNPq

Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq)

FAPESP[2010/00927-9]

Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)

Identificador

PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, v.390, n.1, p.131-142, 2011

0378-4371

http://producao.usp.br/handle/BDPI/29835

10.1016/j.physa.2010.08.052

http://dx.doi.org/10.1016/j.physa.2010.08.052

Idioma(s)

eng

Publicador

ELSEVIER SCIENCE BV

Relação

Physica A-statistical Mechanics and Its Applications

Direitos

restrictedAccess

Copyright ELSEVIER SCIENCE BV

Palavras-Chave #Machine translation #Evaluation #Complex networks #Machine learning #Physics, Multidisciplinary
Tipo

article

original article

publishedVersion