MetricX-23: The Google Submission to the WMT 2023 Metrics Shared Task

Juraj Juraska, Mara Finkelstein, Daniel Deutsch, Aditya Siddhant, Mehdi Mirzazadeh, Markus Freitag

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

This report details the MetricX-23 submission to the WMT23 Metrics Shared Task and provides an overview of the experiments that informed which metrics were submitted. Our 3 submissions-each with a quality estimation (or reference-free) version-are all learned regression-based metrics that vary in the data used for training and which pretrained language model was used for initialization. We report results related to understanding (1) which supervised training data to use, (2) the impact of how the training labels are normalized, (3) the amount of synthetic training data to use, (4) how metric performance is related to model size, and (5) the effect of initializing the metrics with different pretrained language models. The most successful training recipe for MetricX employs two-stage fine-tuning on DA and MQM ratings, and includes synthetic training data. Finally, one important takeaway from our extensive experiments is that optimizing for both segment- and system-level performance at the same time is a challenging task.

Original languageEnglish
Title of host publicationProceedings of the 8th Conference on Machine Translation, WMT 2023
PublisherAssociation for Computational Linguistics
Pages754-765
Number of pages12
ISBN (Electronic)9798891760417
StatePublished - 2023
Externally publishedYes
Event8th Conference on Machine Translation, WMT 2023 - Singapore, Singapore
Duration: 6 Dec 20237 Dec 2023

Publication series

NameConference on Machine Translation - Proceedings
ISSN (Electronic)2768-0983

Conference

Conference8th Conference on Machine Translation, WMT 2023
Country/TerritorySingapore
CitySingapore
Period6/12/237/12/23

Bibliographical note

Publisher Copyright:
© 2023 Association for Computational Linguistics.

ASJC Scopus subject areas

  • Language and Linguistics
  • Human-Computer Interaction
  • Software

Fingerprint

Dive into the research topics of 'MetricX-23: The Google Submission to the WMT 2023 Metrics Shared Task'. Together they form a unique fingerprint.

Cite this