Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
All Papers
0 / 0 papers shown
Home
Papers
2106.03193
Cited By
The FLORES-101 Evaluation Benchmark for Low-Resource and Multilingual Machine Translation
Transactions of the Association for Computational Linguistics (TACL), 2021
6 June 2021
Naman Goyal
Cynthia Gao
Vishrav Chaudhary
Peng-Jen Chen
Guillaume Wenzek
Da Ju
Sanjan Krishnan
MarcÁurelio Ranzato
Francisco Guzman
Angela Fan
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (1 upvotes)
Papers citing
"The FLORES-101 Evaluation Benchmark for Low-Resource and Multilingual Machine Translation"
50 / 244 papers shown
Bemba Speech Translation: Exploring a Low-Resource African Language
International Workshop on Spoken Language Translation (IWSLT), 2025
Muhammad Hazim Al Farouq
Aman Kassahun Wassie
Yasmin Moslem
535
0
0
05 May 2025
FLUKE: A Linguistically-Driven and Task-Agnostic Framework for Robustness Evaluation
Yulia Otmakhova
Hung Thinh Truong
Rahmad Mahendra
Zenan Zhai
Rongxin Zhu
Daniel Beck
Jey Han Lau
ELM
481
0
0
24 Apr 2025
MultiLoKo: a multilingual local knowledge benchmark for LLMs spanning 31 languages
Dieuwke Hupkes
Nikolay Bogoychev
966
12
0
14 Apr 2025
Can the capability of Large Language Models be described by human ability? A Meta Study
Mingrui Zan
Yunquan Zhang
Boyang Zhang
Fangming Liu
Daning Cheng
ELM
LM&MA
254
1
0
13 Apr 2025
Redefining Machine Translation on Social Network Services with Large Language Models
Hongcheng Guo
Fei Zhao
Shaosheng Cao
Xinze Lyu
Ziqiang Liu
...
Boyang Wang
Hui Yuan
Chonggang Lu
Zhe Xu
Yao Hu
232
3
0
10 Apr 2025
GlotEval: A Test Suite for Massively Multilingual Evaluation of Large Language Models
Hengyu Luo
Zihao Li
Joseph Attieh
Sawal Devkota
Ona de Gibert
...
Mengjie Wang
Mengjie Wang
Samea Yusofi
Fei Yuan
Jörg Tiedemann
ELM
246
1
0
05 Apr 2025
Rethinking Multilingual Continual Pretraining: Data Mixing for Adapting LLMs Across Languages and Resources
Zihao Li
Shaoxiong Ji
Hengyu Luo
Jörg Tiedemann
CLL
792
4
0
05 Apr 2025
Overcoming Vocabulary Constraints with Pixel-level Fallback
Jonas F. Lotz
Hendra Setiawan
Stephan Peitz
Yova Kementchedjhieva
310
4
0
02 Apr 2025
Large Language Models in Numberland: A Quick Test of Their Numerical Reasoning Abilities
Roussel Rahman
ReLM
ELM
LRM
249
3
0
31 Mar 2025
Is Small Language Model the Silver Bullet to Low-Resource Languages Machine Translation?
Yewei Song
Lujun Li
Cedric Lothritz
Saad Ezzini
Lama Sleem
Niccolo Gentile
Radu State
Tegawende F. Bissyande
Jacques Klein
358
4
0
31 Mar 2025
Whispering in Amharic: Fine-tuning Whisper for Low-resource Language
Dawit Ketema Gete
Bedru Yimam Ahamed
Tadesse Destaw Belay
Yohannes Ayana Ejigu
Sukairaj Hafiz Imam
...
Umma Aliyu Musa
Martin Semmann
Shamsuddeen Hassan Muhammad
Henning Schreiber
Seid Muhie Yimam
320
3
0
24 Mar 2025
The Amazon Nova Family of Models: Technical Report and Model Card
Amazon AGI
Aaron Langford
A. Shah
Abhanshu Gupta
Abhimanyu Bhatter
...
Benjamin Biggs
Benjamin Ott
Bhanu Vinzamuri
Bharath Venkatesh
Bhavana Ganesh
273
47
0
17 Mar 2025
Arabizi vs LLMs: Can the Genie Understand the Language of Aladdin?
Perla Al Almaoui
Pierrette Bouillon
Simon Hengchen
328
0
0
28 Feb 2025
R1-T1: Fully Incentivizing Translation Capability in LLMs via Reasoning Learning
Minggui He
Yilun Liu
Shimin Tao
Yuanchang Luo
Hongyong Zeng
...
Daimeng Wei
Weibin Meng
Hao Yang
Boxing Chen
Osamu Yoshie
LRM
506
14
0
27 Feb 2025
NaijaNLP: A Survey of Nigerian Low-Resource Languages
Isa Inuwa-Dutse
355
2
0
27 Feb 2025
Science Across Languages: Assessing LLM Multilingual Translation of Scientific Papers
Hannah Calzi Kleidermacher
James Zou
669
5
0
25 Feb 2025
How do Multimodal Foundation Models Encode Text and Speech? An Analysis of Cross-Lingual and Cross-Modal Representations
North American Chapter of the Association for Computational Linguistics (NAACL), 2024
Hyunji Lee
Danni Liu
Supriti Sinhamahapatra
Jan Niehues
425
5
0
21 Feb 2025
D.Va: Validate Your Demonstration First Before You Use It
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Qi Zhang
Zhiqing Xiao
Ruixuan Xiao
Lirong Gao
Junbo Zhao
389
0
0
20 Feb 2025
Batayan: A Filipino NLP benchmark for evaluating Large Language Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Jann Railey Montalan
Jimson Paulo Layacan
David Demitri Africa
Richell Isaiah Flores
Michael T. Lopez II
Theresa Denise Magsajo
Anjanette Cayabyab
William-Chandra Tjhi
238
3
0
19 Feb 2025
DCAD-2000: A Multilingual Dataset across 2000+ Languages with Data Cleaning as Anomaly Detection
Yingli Shen
Wen Lai
Kaiyan Zhang
Xueren Zhang
Kangyang Luo
Kangyang Luo
Maosong Sun
537
2
0
17 Feb 2025
DiSCo: Device-Server Collaborative LLM-Based Text Streaming Services
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Ting Sun
Penghan Wang
Fan Lai
309
3
0
17 Feb 2025
Blessing of Multilinguality: A Systematic Analysis of Multilingual In-Context Learning
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Yilei Tu
Andrew Xue
Freda Shi
394
1
0
17 Feb 2025
LayAlign: Enhancing Multilingual Reasoning in Large Language Models via Layer-Wise Adaptive Fusion and Alignment Strategy
North American Chapter of the Association for Computational Linguistics (NAACL), 2025
Zhiwen Ruan
Yixia Li
He Zhu
Longyue Wang
Weihua Luo
Kaifu Zhang
Yuxiao Chen
Guanhua Chen
293
5
0
17 Feb 2025
Beyond Literal Token Overlap: Token Alignability for Multilinguality
North American Chapter of the Association for Computational Linguistics (NAACL), 2025
Katharina Hämmerl
Tomasz Limisiewicz
Jindrich Libovický
Kangyang Luo
194
3
0
10 Feb 2025
BOUQuET: dataset, Benchmark and Open initiative for Universal Quality Evaluation in Translation
Omnilingual MT Team
Pierre Yves Andrews
Mikel Artetxe
Mariano Coria Meglioli
Marta R. Costa-jussá
...
Eduardo Sánchez
Ioannis Tsiamas
Arina Turkatenko
Albert Ventayol-Boada
Shireen Yates
469
1
0
06 Feb 2025
Multilingual Machine Translation with Open Large Language Models at Practical Scale: An Empirical Study
North American Chapter of the Association for Computational Linguistics (NAACL), 2025
Menglong Cui
Pengzhi Gao
Wei Liu
Jian Luan
Sijin Yu
LRM
452
30
0
04 Feb 2025
Overestimation in LLM Evaluation: A Controlled Large-Scale Study on Data Contamination's Impact on Machine Translation
Muhammed Yusuf Kocyigit
Eleftheria Briakou
Daniel Deutsch
Jiaming Luo
Colin Cherry
Markus Freitag
229
5
0
30 Jan 2025
Faster Machine Translation Ensembling with Reinforcement Learning and Competitive Correction
North American Chapter of the Association for Computational Linguistics (NAACL), 2025
Kritarth Prasad
Mohammadi Zaki
Pratik Rakesh Singh
Pankaj Wasnik
222
3
0
28 Jan 2025
Towards Multilingual LLM Evaluation for Baltic and Nordic languages: A study on Lithuanian History
Yevhen Kostiuk
O. Vitman
Łukasz Gagała
Artur Kiulian
ELM
877
1
0
17 Jan 2025
AFRIDOC-MT: Document-level MT Corpus for African Languages
Jesujoba Oluwadara Alabi
Israel Abebe Azime
Miaoran Zhang
C. España-Bonet
Rachel Bawden
...
Shamsuddeen Hassan Muhammad
Neo Putini
David O. Ademuyiwa
Andrew Caines
Dietrich Klakow
374
2
0
10 Jan 2025
Cross-Lingual Transfer of Debiasing and Detoxification in Multilingual LLMs: An Extensive Investigation
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Vera Neplenbroek
Arianna Bisazza
Raquel Fernández
604
3
0
18 Dec 2024
Task-Oriented Dialog Systems for the Senegalese Wolof Language
International Conference on Computational Linguistics (COLING), 2024
Derguene Mbaye
Moussa Diallo
261
1
0
15 Dec 2024
DRPruning: Efficient Large Language Model Pruning through Distributionally Robust Optimization
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Hexuan Deng
Wenxiang Jiao
Xuebo Liu
Min Zhang
Zhaopeng Tu
Zhaopeng Tu
VLM
565
1
0
21 Nov 2024
Towards Building Large Scale Datasets and State-of-the-Art Automatic Speech Translation Systems for 14 Indian Languages
Sparsh Jain
Ashwin Sankar
Devilal Choudhary
Dhairya Suman
Nikhil Narasimhan
Mohammed Safi Ur Rahman Khan
Anoop Kunchukuttan
Mitesh M. Khapra
Mary Dabre
467
2
0
07 Nov 2024
MoCE: Adaptive Mixture of Contextualization Experts for Byte-based Neural Machine Translation
North American Chapter of the Association for Computational Linguistics (NAACL), 2024
Langlin Huang
Mengyu Bu
Yang Feng
246
0
0
03 Nov 2024
GrammaMT: Improving Machine Translation with Grammar-Informed In-Context Learning
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Rita Ramos
Everlyn Asiko Chimoto
Maartje ter Hoeve
Natalie Schluter
281
5
0
24 Oct 2024
Effective Self-Mining of In-Context Examples for Unsupervised Machine Translation with LLMs
North American Chapter of the Association for Computational Linguistics (NAACL), 2024
Abdellah El Mekki
Muhammad Abdul-Mageed
LRM
255
1
0
14 Oct 2024
Adapters for Altering LLM Vocabularies: What Languages Benefit the Most?
International Conference on Learning Representations (ICLR), 2024
HyoJung Han
Akiko Eriguchi
Haoran Xu
Hieu T. Hoang
Marine Carpuat
Huda Khayrallah
VLM
235
8
0
12 Oct 2024
Beyond Correlation: Interpretable Evaluation of Machine Translation Metrics
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Stefano Perrella
Lorenzo Proietti
Pere-Lluís Huguet Cabot
Edoardo Barba
Roberto Navigli
275
8
0
07 Oct 2024
CiMaTe: Citation Count Prediction Effectively Leveraging the Main Text
Jun Hirako
Ryohei Sasano
Koichi Takeda
322
4
0
06 Oct 2024
AfriHuBERT: A self-supervised speech representation model for African languages
Jesujoba Oluwadara Alabi
Xuechen Liu
Dietrich Klakow
Junichi Yamagishi
VLM
431
11
0
30 Sep 2024
Characterizing and Efficiently Accelerating Multimodal Generation Model Inference
Yejin Lee
Anna Y. Sun
Basil Hosmer
Bilge Acun
Can Balioglu
...
Ram Pasunuru
Scott Yih
Sravya Popuri
Xing Liu
Carole-Jean Wu
462
4
0
30 Sep 2024
BPE Gets Picky: Efficient Vocabulary Refinement During Tokenizer Training
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Pavel Chizhov
Catherine Arnett
Elizaveta Korotkova
Ivan P. Yamshchikov
232
14
0
06 Sep 2024
Correcting FLORES Evaluation Dataset for Four African Languages
Conference on Machine Translation (WMT), 2024
Idris Abdulmumin
Sthembiso Mkhwanazi
Mahlatse S. Mbooi
Shamsuddeen Hassan Muhammad
Ibrahim Said Ahmad
Neo Putini
Miehleketo Mathebula
Matimba Shingange
T. Gwadabe
Vukosi Marivate
311
12
0
01 Sep 2024
Large Language Models Might Not Care What You Are Saying: Prompt Format Beats Descriptions
Chenming Tang
Zhixiang Wang
Hao Sun
Yunfang Wu
LRM
492
1
0
16 Aug 2024
Misfitting With AI: How Blind People Verify and Contest AI Errors
International ACM SIGACCESS Conference on Computers and Accessibility (ASSETS), 2024
Rahaf Alharbi
P. Lor
Jaylin Herskovitz
S. Schoenebeck
Robin Brewer
213
24
0
13 Aug 2024
Evaluating the Translation Performance of Large Language Models Based on Euas-20
Yan Huang
Wei Liu
ELM
239
3
0
06 Aug 2024
Decoupled Vocabulary Learning Enables Zero-Shot Translation from Unseen Languages
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Carlos Mullov
Ngoc-Quan Pham
Alexander Waibel
217
2
0
05 Aug 2024
In-Context Example Selection via Similarity Search Improves Low-Resource Machine Translation
Joel Witzke
Benoît Sagot
Rachel Bawden
308
21
0
01 Aug 2024
Modular Sentence Encoders: Separating Language Specialization from Cross-Lingual Alignment
Yongxin Huang
Kexin Wang
Goran Glavaš
Iryna Gurevych
319
2
0
20 Jul 2024
Previous
1
2
3
4
5
Next