123

Preliminary Ranking of WMT25 General Machine Translation Systems

Main:35 Pages
Bibliography:2 Pages
1 Tables
Appendix:1 Pages
Abstract

We present the preliminary ranking of the WMT25 General Machine Translation Shared Task, in which MT systems have been evaluated using automatic metrics. As this ranking is based on automatic evaluations, it may be biased in favor of systems that employ re-ranking techniques, such as Quality Estimation re-ranking or Minimum Bayes Risk decoding. The official WMT25 ranking will be based on human evaluation, which is more reliable and will supersede the automatic ranking.

View on arXiv
Comments on this paper