MetricX-24: The Google Submission to the WMT 2024 Metrics Shared Task

Juraj Juraska, Daniel Deutsch, Mara Finkelstein, Markus Freitag

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

In this paper, we present the MetricX-24 submissions to the WMT24 Metrics Shared Task and provide details on the improvements we made over the previous version of MetricX. Our primary submission is a hybrid reference-based/-free metric, which can score a translation irrespective of whether it is given the source segment, the reference, or both. The metric is trained on previous WMT data in a two-stage fashion, first on the DA ratings only, then on a mixture of MQM and DA ratings. The training set in both stages is augmented with synthetic examples that we created to make the metric more robust to several common failure modes, such as fluent but unrelated translation, or undertranslation. We demonstrate the benefits of the individual modifications via an ablation study, and show a significant performance increase over MetricX-23 on the WMT23 MQM ratings, as well as our new synthetic challenge set.1

Original languageEnglish
Title of host publicationWMT 2024 - 9th Conference on Machine Translation, Proceedings of the Conference
EditorsBarry Haddow, Tom Kocmi, Philipp Koehn, Christof Monz
PublisherAssociation for Computational Linguistics
Pages492-504
Number of pages13
ISBN (Electronic)9798891761797
StatePublished - 2024
Externally publishedYes
Event9th Conference on Machine Translation, WMT 2024 - Miami, United States
Duration: 15 Nov 202416 Nov 2024

Publication series

NameConference on Machine Translation - Proceedings
Volume2024-November
ISSN (Electronic)2768-0983

Conference

Conference9th Conference on Machine Translation, WMT 2024
Country/TerritoryUnited States
CityMiami
Period15/11/2416/11/24

Bibliographical note

Publisher Copyright:
©2024 Association for Computational Linguistics.

ASJC Scopus subject areas

  • Language and Linguistics
  • Human-Computer Interaction
  • Software

Fingerprint

Dive into the research topics of 'MetricX-24: The Google Submission to the WMT 2024 Metrics Shared Task'. Together they form a unique fingerprint.

Cite this