Using metrics from complex networks to evaluate machine translation
Contribuinte(s) |
UNIVERSIDADE DE SÃO PAULO |
---|---|
Data(s) |
20/10/2012
20/10/2012
2011
|
Resumo |
Establishing metrics to assess machine translation (MT) systems automatically is now crucial owing to the widespread use of MT over the web. In this study we show that such evaluation can be done by modeling text as complex networks. Specifically, we extend our previous work by employing additional metrics of complex networks, whose results were used as input for machine learning methods and allowed MT texts of distinct qualities to be distinguished. Also shown is that the node-to-node mapping between source and target texts (English-Portuguese and Spanish-Portuguese pairs) can be improved by adding further hierarchical levels for the metrics out-degree, in-degree, hierarchical common degree, cluster coefficient, inter-ring degree, intra-ring degree and convergence ratio. The results presented here amount to a proof-of-principle that the possible capturing of a wider context with the hierarchical levels may be combined with machine learning methods to yield an approach for assessing the quality of MT systems. (C) 2010 Elsevier B.V. All rights reserved. CNPq Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq) FAPESP[2010/00927-9] Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP) |
Identificador |
PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, v.390, n.1, p.131-142, 2011 0378-4371 http://producao.usp.br/handle/BDPI/29835 10.1016/j.physa.2010.08.052 |
Idioma(s) |
eng |
Publicador |
ELSEVIER SCIENCE BV |
Relação |
Physica A-statistical Mechanics and Its Applications |
Direitos |
restrictedAccess Copyright ELSEVIER SCIENCE BV |
Palavras-Chave | #Machine translation #Evaluation #Complex networks #Machine learning #Physics, Multidisciplinary |
Tipo |
article original article publishedVersion |