The FLORES-101 Evaluation Benchmark for Low-Resource and Multilingual Machine Translation

Transactions of the Association for Computational Linguistics (TACL), 2021

6 June 2021

Francisco Guzman

Angela Fan

ArXiv (abs)PDF HTML HuggingFace (1 upvotes)

Papers citing "The FLORES-101 Evaluation Benchmark for Low-Resource and Multilingual Machine Translation"

50 / 244 papers shown

Bemba Speech Translation: Exploring a Low-Resource African LanguageInternational Workshop on Spoken Language Translation (IWSLT), 2025

Muhammad Hazim Al Farouq

Aman Kassahun Wassie

Yasmin Moslem

535

05 May 2025

FLUKE: A Linguistically-Driven and Task-Agnostic Framework for Robustness Evaluation

481

24 Apr 2025

MultiLoKo: a multilingual local knowledge benchmark for LLMs spanning 31 languages

Dieuwke Hupkes

Nikolay Bogoychev

966

14 Apr 2025

Can the capability of Large Language Models be described by human ability? A Meta Study

254

13 Apr 2025

Redefining Machine Translation on Social Network Services with Large Language Models

...

232

10 Apr 2025

GlotEval: A Test Suite for Massively Multilingual Evaluation of Large Language Models

...

246

05 Apr 2025

Rethinking Multilingual Continual Pretraining: Data Mixing for Adapting LLMs Across Languages and Resources

792

05 Apr 2025

Overcoming Vocabulary Constraints with Pixel-level Fallback

Jonas F. Lotz

Hendra Setiawan

Stephan Peitz

Yova Kementchedjhieva

310

02 Apr 2025

Large Language Models in Numberland: A Quick Test of Their Numerical Reasoning Abilities

Roussel Rahman

ReLM ELM LRM

249

31 Mar 2025

Is Small Language Model the Silver Bullet to Low-Resource Languages Machine Translation?

Tegawende F. Bissyande

Jacques Klein

358

31 Mar 2025

Whispering in Amharic: Fine-tuning Whisper for Low-resource Language

...

Shamsuddeen Hassan Muhammad

Henning Schreiber

Seid Muhie Yimam

320

24 Mar 2025

The Amazon Nova Family of Models: Technical Report and Model Card

...

273

17 Mar 2025

Arabizi vs LLMs: Can the Genie Understand the Language of Aladdin?

Perla Al Almaoui

Pierrette Bouillon

Simon Hengchen

328

28 Feb 2025

R1-T1: Fully Incentivizing Translation Capability in LLMs via Reasoning Learning

...

506

27 Feb 2025

NaijaNLP: A Survey of Nigerian Low-Resource Languages

Isa Inuwa-Dutse

355

27 Feb 2025

Science Across Languages: Assessing LLM Multilingual Translation of Scientific Papers

Hannah Calzi Kleidermacher

James Zou

669

25 Feb 2025

How do Multimodal Foundation Models Encode Text and Speech? An Analysis of Cross-Lingual and Cross-Modal RepresentationsNorth American Chapter of the Association for Computational Linguistics (NAACL), 2024

Hyunji Lee

Danni Liu

Supriti Sinhamahapatra

Jan Niehues

425

21 Feb 2025

D.Va: Validate Your Demonstration First Before You Use ItAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

389

20 Feb 2025

Batayan: A Filipino NLP benchmark for evaluating Large Language ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

Jann Railey Montalan

Jimson Paulo Layacan

David Demitri Africa

Richell Isaiah Flores

Michael T. Lopez II

Theresa Denise Magsajo

Anjanette Cayabyab

William-Chandra Tjhi

238

19 Feb 2025

DCAD-2000: A Multilingual Dataset across 2000+ Languages with Data Cleaning as Anomaly Detection

537

17 Feb 2025

DiSCo: Device-Server Collaborative LLM-Based Text Streaming ServicesAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

Ting Sun

Penghan Wang

Fan Lai

309

17 Feb 2025

Blessing of Multilinguality: A Systematic Analysis of Multilingual In-Context LearningAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

Yilei Tu

Andrew Xue

Freda Shi

394

17 Feb 2025

LayAlign: Enhancing Multilingual Reasoning in Large Language Models via Layer-Wise Adaptive Fusion and Alignment StrategyNorth American Chapter of the Association for Computational Linguistics (NAACL), 2025

293

17 Feb 2025

Beyond Literal Token Overlap: Token Alignability for MultilingualityNorth American Chapter of the Association for Computational Linguistics (NAACL), 2025

194

10 Feb 2025

BOUQuET: dataset, Benchmark and Open initiative for Universal Quality Evaluation in Translation

Omnilingual MT Team

Pierre Yves Andrews

Mikel Artetxe

Mariano Coria Meglioli

...

Albert Ventayol-Boada

Shireen Yates

469

06 Feb 2025

Multilingual Machine Translation with Open Large Language Models at Practical Scale: An Empirical StudyNorth American Chapter of the Association for Computational Linguistics (NAACL), 2025

452

04 Feb 2025

Overestimation in LLM Evaluation: A Controlled Large-Scale Study on Data Contamination's Impact on Machine Translation

Muhammed Yusuf Kocyigit

229

30 Jan 2025

Faster Machine Translation Ensembling with Reinforcement Learning and Competitive CorrectionNorth American Chapter of the Association for Computational Linguistics (NAACL), 2025

222

28 Jan 2025

Towards Multilingual LLM Evaluation for Baltic and Nordic languages: A study on Lithuanian History

877

17 Jan 2025

AFRIDOC-MT: Document-level MT Corpus for African Languages

Jesujoba Oluwadara Alabi

...

Shamsuddeen Hassan Muhammad

374

10 Jan 2025

Cross-Lingual Transfer of Debiasing and Detoxification in Multilingual LLMs: An Extensive InvestigationAnnual Meeting of the Association for Computational Linguistics (ACL), 2024

Vera Neplenbroek

Arianna Bisazza

Raquel Fernández

604

18 Dec 2024

Task-Oriented Dialog Systems for the Senegalese Wolof LanguageInternational Conference on Computational Linguistics (COLING), 2024

Derguene Mbaye

Moussa Diallo

261

15 Dec 2024

DRPruning: Efficient Large Language Model Pruning through Distributionally Robust OptimizationAnnual Meeting of the Association for Computational Linguistics (ACL), 2024

565

21 Nov 2024

Towards Building Large Scale Datasets and State-of-the-Art Automatic Speech Translation Systems for 14 Indian Languages

Mohammed Safi Ur Rahman Khan

Anoop Kunchukuttan

Mitesh M. Khapra

Mary Dabre

467

07 Nov 2024

MoCE: Adaptive Mixture of Contextualization Experts for Byte-based Neural Machine TranslationNorth American Chapter of the Association for Computational Linguistics (NAACL), 2024

Langlin Huang

Mengyu Bu

Yang Feng

246

03 Nov 2024

GrammaMT: Improving Machine Translation with Grammar-Informed In-Context LearningAnnual Meeting of the Association for Computational Linguistics (ACL), 2024

Rita Ramos

Everlyn Asiko Chimoto

Maartje ter Hoeve

Natalie Schluter

281

24 Oct 2024

Effective Self-Mining of In-Context Examples for Unsupervised Machine Translation with LLMsNorth American Chapter of the Association for Computational Linguistics (NAACL), 2024

Abdellah El Mekki

Muhammad Abdul-Mageed

LRM

255

14 Oct 2024

Adapters for Altering LLM Vocabularies: What Languages Benefit the Most?International Conference on Learning Representations (ICLR), 2024

HyoJung Han

235

12 Oct 2024

Beyond Correlation: Interpretable Evaluation of Machine Translation MetricsConference on Empirical Methods in Natural Language Processing (EMNLP), 2024

Stefano Perrella

Lorenzo Proietti

Pere-Lluís Huguet Cabot

Edoardo Barba

Roberto Navigli

275

07 Oct 2024

CiMaTe: Citation Count Prediction Effectively Leveraging the Main Text

Jun Hirako

Ryohei Sasano

Koichi Takeda

322

06 Oct 2024

AfriHuBERT: A self-supervised speech representation model for African languages

Jesujoba Oluwadara Alabi

431

30 Sep 2024

Characterizing and Efficiently Accelerating Multimodal Generation Model Inference

...

462

30 Sep 2024

BPE Gets Picky: Efficient Vocabulary Refinement During Tokenizer TrainingConference on Empirical Methods in Natural Language Processing (EMNLP), 2024

232

06 Sep 2024

Correcting FLORES Evaluation Dataset for Four African LanguagesConference on Machine Translation (WMT), 2024

Idris Abdulmumin

Sthembiso Mkhwanazi

Mahlatse S. Mbooi

Shamsuddeen Hassan Muhammad

311

01 Sep 2024

Large Language Models Might Not Care What You Are Saying: Prompt Format Beats Descriptions

492

16 Aug 2024

Misfitting With AI: How Blind People Verify and Contest AI ErrorsInternational ACM SIGACCESS Conference on Computers and Accessibility (ASSETS), 2024

213

13 Aug 2024

Evaluating the Translation Performance of Large Language Models Based on Euas-20

Yan Huang

Wei Liu

ELM

239

06 Aug 2024

Decoupled Vocabulary Learning Enables Zero-Shot Translation from Unseen LanguagesAnnual Meeting of the Association for Computational Linguistics (ACL), 2024

Carlos Mullov

Ngoc-Quan Pham

Alexander Waibel

217

05 Aug 2024

In-Context Example Selection via Similarity Search Improves Low-Resource Machine Translation

Joel Witzke

Benoît Sagot

Rachel Bawden

308

01 Aug 2024

Modular Sentence Encoders: Separating Language Specialization from Cross-Lingual Alignment

319

20 Jul 2024

All Papers

The FLORES-101 Evaluation Benchmark for Low-Resource and Multilingual Machine Translation

Papers citing "The FLORES-101 Evaluation Benchmark for Low-Resource and Multilingual Machine Translation"