Embarrassingly easy document-level MT metrics: How to convert any pretrained metric into a document-level metric

Giorgos Vernikos; Brian Thompson; Prashant Mathur; Marcello Federico

Publication

Embarrassingly easy document-level MT metrics: How to convert any pretrained metric into a document-level metric

By Giorgos Vernikos, Brian Thompson, Prashant Mathur, Marcello Federico

2022

Download Copy BibTeX GitHub

Share

Download

Copy BibTeX

GitHub

Share

We present a very simple method for extending pretrained machine translation metrics to incorporate document-level context. We apply our method to four popular metrics: BERTScore, Prism, COMET, and the reference-free metric COMET-QE. We evaluate our document-level metrics on the MQM annotations from the WMT 2021 metrics shared task and find that the document-level metrics outperform their sentence-level counterparts in about 85% of the tested conditions, when excluding results on low-quality human references. Additionally, we show that our document-level extension of COMET-QE dramatically improves accuracy on discourse phenomena tasks, supporting our hypothesis that our document-level metrics are resolving ambiguities in the reference sentence by using additional context.

Embarrassingly easy document-level MT metrics: How to convert any pretrained metric into a document-level metric

Latest news

Work with us