NLG-Metricverse: An End-to-End Library for Evaluating Natural Language Generation


Autoria(s): Zammarchi, Andrea
Contribuinte(s)

Carbonaro, Antonella

Frisoni, Giacomo

Data(s)

01/12/2022

Resumo

Driven by recent deep learning breakthroughs, natural language generation (NLG) models have been at the center of steady progress in the last few years. However, since our ability to generate human-indistinguishable artificial text lags behind our capacity to assess it, it is paramount to develop and apply even better automatic evaluation metrics. To facilitate researchers to judge the effectiveness of their models broadly, we suggest NLG-Metricverse—an end-to-end open-source library for NLG evaluation based on Python. This framework provides a living collection of NLG metrics in a unified and easy- to-use environment, supplying tools to efficiently apply, analyze, compare, and visualize them. This includes (i) the extensive support of heterogeneous automatic metrics with n-arity management, (ii) the meta-evaluation upon individual performance, metric-metric and metric-human correlations, (iii) graphical interpretations for helping humans better gain score intuitions, (iv) formal categorization and convenient documentation to accelerate metrics understanding. NLG-Metricverse aims to increase the comparability and replicability of NLG research, hopefully stimulating new contributions in the area.

Formato

application/pdf

Identificador

http://amslaurea.unibo.it/27407/1/Tesi_Andrea_Zammarchi.pdf

Zammarchi, Andrea (2022) NLG-Metricverse: An End-to-End Library for Evaluating Natural Language Generation. [Laurea], Università di Bologna, Corso di Studio in Ingegneria e scienze informatiche [L-DM270] - Cesena <http://amslaurea.unibo.it/view/cds/CDS8615/>

Idioma(s)

en

Publicador

Alma Mater Studiorum - Università di Bologna

Relação

http://amslaurea.unibo.it/27407/

Direitos

cc_by_nc_nd4

Palavras-Chave #Natural Language Processing,Natural Language Generation,Artificial Text Evaluation,Language Models,Evaluation Metrics #Ingegneria e scienze informatiche [L-DM270] - Cesena
Tipo

PeerReviewed

info:eu-repo/semantics/bachelorThesis