ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2509.03148
171
3
v1v2 (latest)

Expanding the WMT24++ Benchmark with Rumantsch Grischun, Sursilvan, Sutsilvan, Surmiran, Puter, and Vallader

3 September 2025
Jannis Vamvas
Ignacio Pérez Prat
Not Battesta Soliva
Sandra Baltermia-Guetg
Andrina Beeli
Simona Beeli
Madlaina Capeder
Laura Decurtins
Gian Peder Gregori
Flavia Hobi
Gabriela Holderegger
Arina Lazzarini
Viviana Lazzarini
Walter Rosselli
Bettina Vital
Anna Rutkiewicz
Rico Sennrich
ArXiv (abs)PDFHTMLGithub
Main:6 Pages
3 Figures
Bibliography:3 Pages
18 Tables
Appendix:11 Pages
Abstract

The Romansh language, spoken in Switzerland, has limited resources for machine translation evaluation. In this paper, we present a benchmark for six varieties of Romansh: Rumantsch Grischun, a supra-regional variety, and five regional varieties: Sursilvan, Sutsilvan, Surmiran, Puter, and Vallader. Our reference translations were created by human translators based on the WMT24++ benchmark, which ensures parallelism with more than 55 other languages. An automatic evaluation of existing MT systems and LLMs shows that translation out of Romansh into German is handled relatively well for all the varieties, but translation into Romansh is still challenging.

View on arXiv
Comments on this paper