The FLORES-101 Evaluation Benchmark for Low-Resource and Multilingual Machine Translation

Transactions of the Association for Computational Linguistics (TACL), 2021
6 June 2021
Naman Goyal
Cynthia Gao
Vishrav Chaudhary
Peng-Jen Chen
Guillaume Wenzek
Da Ju
Sanjana Krishnan
Marc'Aurelio Ranzato
Francisco Guzman
Angela Fan

Papers citing "The FLORES-101 Evaluation Benchmark for Low-Resource and Multilingual Machine Translation"

50 / 244 papers shown
ELR-1000: A Community-Generated Dataset for Endangered Indic Indigenous Languages
Neha Joshi
Pamir Gogoi
Aasim Mirza
Aayush Jansari
Aditya Yadavalli
Ayushi Pandey
Arunima Shukla
Deepthi Sudharsan
Kalika Bali
Vivek Seshadri
67
0
0
30 Nov 2025
IndicParam: Benchmark to evaluate LLMs on low-resource Indic Languages
Ayush Maheshwari
Kaushal Sharma
Vivek Patel
Aditya Maheshwari
ELM
117
0
0
29 Nov 2025
Dealing with the Hard Facts of Low-Resource African NLP
Yacouba Diarra
Nouhoum Souleymane Coulibaly
Panga Azazia Kamaté
Madani Amadou Tall
Emmanuel Élisé Koné
Aymane Dembélé
Michael Leventhal
88
1
0
23 Nov 2025
Souper-Model: How Simple Arithmetic Unlocks State-of-the-Art LLM Performance
Shalini Maiti
Amar Budhiraja
Bhavul Gauri
Gaurav Chaurasia
Anton Protopopov
...
Michael Slater
Despoina Magka
Tatiana Shavrina
Roberta Raileanu
Yoram Bachrach
MoMe
159
0
0
17 Nov 2025
Ibom NLP: A Step Toward Inclusive Natural Language Processing for Nigeria's Minority Languages
Oluwadara Kalejaiye
Luel Hagos Beyene
David Ifeoluwa Adelani
Mmekut-Mfon Gabriel Edet
A. D. Akpan
E. Urua
Anietie U Andy
84
0
0
09 Nov 2025
Segmentation Beyond Defaults: Asymmetrical Byte Pair Encoding for Optimal Machine Translation Performance
Saumitra Yadav
Manish Shrivastava
161
0
0
05 Nov 2025
Charting the European LLM Benchmarking Landscape: A New Taxonomy and a Set of Best Practices
Špela Vintar
Taja Kuzman Pungeršek
Mojca Brglez
Nikola Ljubešić
183
0
0
28 Oct 2025
Are the LLMs Capable of Maintaining at Least the Language Genus?
Sandra Mitrović
David Kletz
Ljiljana Dolamic
Fabio Rinaldi
88
1
0
24 Oct 2025
Model-Aware Tokenizer Transfer
Mykola Haltiuk
Aleksander Smywiński-Pohl
113
0
0
24 Oct 2025
ARC-Encoder: learning compressed text representations for large language models
Hippolyte Pilchen
Edouard Grave
P. Pérez
LLMAG RALM AI4CE
168
1
0
23 Oct 2025
Zero-Shot Performance Prediction for Probabilistic Scaling Laws
Viktoria Schram
Markus Hiller
Daniel Beck
Trevor Cohn
132
0
0
19 Oct 2025
Active Model Selection for Large Language Models
Yavuz Durmazkeser
Patrik Okanovic
Andreas Kirsch
Torsten Hoefler
Nezihe Merve Gürel
127
0
0
10 Oct 2025
Multilingual Routing in Mixture-of-Experts
Lucas Bandarkar
Chenyuan Yang
Mohsen Fayyaz
Junlin Hu
Nanyun Peng
MoE
152
0
0
06 Oct 2025
Transcribe, Translate, or Transliterate: An Investigation of Intermediate Representations in Spoken Language Models
Tolúlọpé Ògúnrèmí
Christopher D. Manning
Dan Jurafsky
Karen Livescu
AuLLM
207
0
0
02 Oct 2025
Self-Speculative Biased Decoding for Faster Re-Translation
Linxiao Zeng
Haoyun Deng
Kangyuan Shu
Shizhen Wang
96
0
0
26 Sep 2025
SiniticMTError: A Machine Translation Dataset with Error Annotations for Sinitic Languages
Hannah Liu
Junghyun Min
Ethan Yue Heng Cheung
Shou-Yi Hung
Syed Mekael Wasti
...
Elsie Chan
Ka Ieng Charlotte Lo
Wing Yu Yip
Richard Tzong-Han Tsai
En-Shiun Annie Lee
95
1
0
24 Sep 2025
CS-FLEURS: A Massively Multilingual and Code-Switched Speech Dataset
Brian Yan
Injy Hamed
Shuichiro Shimizu
Vasista Lodagala
William Chen
...
Samuele Cornell
Eunjung Yeo
Kwanghee Choi
Carlos Carvalho
Karen Rosero
144
4
0
17 Sep 2025
Bhaasha, Bhasa, Zaban: A Survey for Low-Resourced Languages in South Asia - Current Stage and Challenges
Sampoorna Poria
Xiaolei Huang
200
0
0
15 Sep 2025
MoVoC: Morphology-Aware Subword Construction for Geez Script Languages
Hailay Teklehaymanot
Dren Fazlija
Wolfgang Nejdl
113
0
0
10 Sep 2025
What if I ask in alia lingua? Measuring Functional Similarity Across Languages
Debangan Mishra
Arihant Rastogi
Agyeya Negi
Shashwat Goel
Ponnurangam Kumaraguru
122
0
0
04 Sep 2025
Expanding the WMT24++ Benchmark with Rumantsch Grischun, Sursilvan, Sutsilvan, Surmiran, Puter, and Vallader
Jannis Vamvas
Ignacio Pérez Prat
Not Battesta Soliva
Sandra Baltermia-Guetg
Andrina Beeli
...
Viviana Lazzarini
Walter Rosselli
Bettina Vital
Anna Rutkiewicz
Rico Sennrich
108
1
0
03 Sep 2025
Languages Still Left Behind: Toward a Better Multilingual Machine Translation Benchmark
Chihiro Taguchi
Seng Mai
Keita Kurabe
Yusuke Sakai
Georgina Agyei
Soudabeh Eslami
David Chiang
ELM
52
0
0
28 Aug 2025
The Mediomatix Corpus: Parallel Data for Romansh Idioms via Comparable Schoolbooks
Zachary Hopton
Jannis Vamvas
Andrin Büchler
Anna Rutkiewicz
Rico Cathomas
Rico Sennrich
120
0
0
22 Aug 2025
ZPD-SCA: Unveiling the Blind Spots of LLMs in Assessing Students' Cognitive Abilities
Wenhan Dong
Zhen Sun
Yuemeng Zhao
Zifan Peng
Jun Wu
...
Xinlei He
Yu Wang
Ruiming Wang
Xinyi Huang
Lei Mo
160
0
0
20 Aug 2025
LoraxBench: A Multitask, Multilingual Benchmark Suite for 20 Indonesian Languages
Alham Fikri Aji
Trevor Cohn
114
0
0
17 Aug 2025
SEA-BED: Southeast Asia Embedding Benchmark
Wuttikorn Ponwitayarat
Raymond Ng
Jann Railey Montalan
Thura Aung
Jian Gang Ngui
...
Panuthep Tasawong
Erik Cambria
Ekapol Chuangsuwanich
Sarana Nutanong
Peerat Limkonchotiwat
162
1
0
17 Aug 2025
Utilizing Multilingual Encoders to Improve Large Language Models for Low-Resource Languages
Moratuwa Engineering Research Conference (MERCon), 2025
Imalsha Puranegedara
Themira Chathumina
Nisal Ranathunga
Nisansa de Silva
Surangika Ranathunga
Mokanarangan Thayaparan
219
0
0
12 Aug 2025
TopXGen: Topic-Diverse Parallel Data Generation for Low-Resource Machine Translation
A. Zebaze
Benoît Sagot
Rachel Bawden
104
1
0
12 Aug 2025
The TUB Sign Language Corpus Collection
International Conference on Intelligent Virtual Agents (IVA), 2025
Eleftherios Avramidis
Vera Czehmann
Fabian Deckert
Lorenz Hufe
Aljoscha Lipski
...
Tae Kwon Rhee
Mengqian Shi
Lennart Stölting
Fabrizio Nunnari
Sebastian Möller
SLR
190
0
0
07 Aug 2025
Uncertainty-driven Embedding Convolution
Sungjun Lim
Kangjun Noh
Youngjun Choi
Heeyoung Lee
Kyungwoo Song
BDL
279
0
0
28 Jul 2025
Seed-X: Building Strong Multilingual Translation LLM with 7B Parameters
Shanbo Cheng
Yu Bao
Qian Cao
Daigang Xu
Liyan Kang
...
Liehao Zou
Hang Li
Lu Lu
Yuping Wang
Yonghui Wu
LRM
308
9
0
18 Jul 2025
Translationese-index: Using Likelihood Ratios for Graded and Generalizable Measurement of Translationese
Yikang Liu
Wanyang Zhang
Yiming Wang
Jialong Tang
Pei Zhang
Baosong Yang
Fei Huang
Rui Wang
Hai Hu
134
0
0
16 Jul 2025
RedOne: Revealing Domain-specific LLM Post-Training in Social Networking Services
Fei Zhao
Chonggang Lu
Yue Wang
Zheyong Xie
Ziyan Liu
...
Jun Fan
Xiaolong Jiang
Weiting Liu
Boyang Wang
Shaosheng Cao
ALM
210
0
0
13 Jul 2025
Eka-Eval: An Evaluation Framework for Low-Resource Multilingual Large Language Models
Samridhi Raj Sinha
Rajvee Sheth
Abhishek Upperwal
Mayank Singh
ELM
173
0
0
02 Jul 2025
mSTEB: Massively Multilingual Evaluation of LLMs on Speech and Text Tasks
Luel Hagos Beyene
Vivek Verma
Min Ma
Jesujoba Oluwadara Alabi
Fabian David Schmidt
Joyce Nakatumba-Nabende
David Ifeoluwa Adelani
332
2
0
10 Jun 2025
Exploring the Impact of Temperature on Large Language Models: Hot or Cold?
Procedia Computer Science (PCS), 2025
Lujun Li
Lama Sleem
Niccolo Gentile
Geoffrey Nichil
Radu State
165
14
0
08 Jun 2025
Exploring In-context Example Generation for Machine Translation
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Dohyun Lee
Seungil Lee
Chanwoo Yang
Yujin Baek
Jaegul Choo
168
0
0
31 May 2025
Unsupervised Word-level Quality Estimation for Machine Translation Through the Lens of Annotators (Dis)agreement
Gabriele Sarti
Vilém Zouhar
Malvina Nissim
Arianna Bisazza
280
0
0
29 May 2025
Charting the Landscape of African NLP: Mapping Progress and Shaping the Road Ahead
Jesujoba Oluwadara Alabi
Michael A. Hedderich
David Ifeoluwa Adelani
Dietrich Klakow
477
6
0
27 May 2025
Multilingual Pretraining for Pixel Language Models
Ilker Kesen
Jonas F. Lotz
Ingo Ziegler
Phillip Rust
Desmond Elliott
MLLM VLM
339
1
0
27 May 2025
MOLE: Metadata Extraction and Validation in Scientific Papers Using LLMs
Zaid Alyafeai
Maged S. Al-Shaibani
Bernard Ghanem
289
4
0
26 May 2025
Building a Functional Machine Translation Corpus for Kpelle
Kweku Andoh Yamoah
Jackson Weako
Emmanuel J. Dorley
185
0
0
24 May 2025
NileChat: Towards Linguistically Diverse and Culturally Aware LLMs for Local Communities
Abdellah El Mekki
Houdaifa Atou
Omer Nacar
Shady Shehata
Muhammad Abdul-Mageed
286
7
0
23 May 2025
MAPS: A Multilingual Benchmark for Global Agent Performance and Security
Omer Hofman
Jonathan Brokman
Oren Rachmil
Shamik Bose
Vikas Pahuja
Toshiya Shimizu
Trisha Starostina
Kelly Marchisio
Seraphina Goldfarb-Tarrant
Roman Vainshtein
228
1
0
21 May 2025
Scaling Low-Resource MT via Synthetic Data Generation with LLMs
Ona de Gibert
Joseph Attieh
Teemu Vahtola
Mikko Aulamo
Zihao Li
Ananda Sreenidhi
Tiancheng Hu
Jörg Tiedemann
SyDa
344
4
0
20 May 2025
HausaNLP: Current Status, Challenges and Future Directions for Hausa Natural Language Processing
Shamsuddeen Hassan Muhammad
Ibrahim Said Ahmad
Idris Abdulmumin
Falalu Ibrahim Lawan
Babangida Sani
...
Sani Abdullahi Sani
Ali Usman Umar
T. Gwadabe
Kenneth Church
Vukosi Marivate
AI4TS
403
1
0
20 May 2025
MUG-Eval: A Proxy Evaluation Framework for Multilingual Generation Capabilities in Any Language
Seyoung Song
Seogyeong Jeong
Eunsu Kim
Jiho Jin
Dongkwan Kim
Jay Shin
Alice Oh
433
0
0
20 May 2025
From Unaligned to Aligned: Scaling Multilingual LLMs with Multi-Way Parallel Corpora
Yingli Shen
Wen Lai
Kaiyan Zhang
Kangyang Luo
Maosong Sun
371
2
0
20 May 2025
Granary: Speech Recognition and Translation Dataset in 25 European Languages
Nithin Rao Koluguri
Monica Sekoyan
George Zelenfroynd
Sasha Meister
Shuoyang Ding
...
Yifan Peng
Sara Papi
Marco Gaido
Alessio Brutti
Boris Ginsburg
244
6
0
19 May 2025
Unveiling Language-Specific Features in Large Language Models via Sparse Autoencoders
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Boyi Deng
Yidan Zhang
Baosong Yang
Fuli Feng
361
3
0
08 May 2025