ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2412.04261
  4. Cited By
Aya Expanse: Combining Research Breakthroughs for a New Multilingual
  Frontier

Aya Expanse: Combining Research Breakthroughs for a New Multilingual Frontier

5 December 2024
John Dang
Shivalika Singh
Daniel D'souza
Arash Ahmadian
Alejandro Salamanca
Madeline Smith
Aidan Peppin
Sungjin Hong
Manoj Govindassamy
Terrence Zhao
Sandra Kublik
Meor Amer
Viraat Aryabumi
Jon Ander Campos
Yi Chern Tan
Tom Kocmi
Florian Strub
Nathan Grinsztajn
Yannis Flet-Berliac
Acyr Locatelli
Hangyu Lin
Dwarak Talupuru
Bharat Venkitesh
David Cairuz
Bowen Yang
Tim Chung
Wei-Yin Ko
Sylvie Shang Shi
Amir Shukayev
Sammie Bae
Aleksandra Piktus
Roman Castagné
Felipe Cruz-Salinas
Eddie Kim
Lucas Crawhall-Stein
Adrien Morisot
Sudip Roy
Phil Blunsom
Ivan Zhang
Aidan Gomez
Nick Frosst
Marzieh Fadaee
Beyza Ermis
Ahmet Üstün
Sara Hooker
    ELMOSLMMoE
ArXiv (abs)PDFHTMLHuggingFace (4 upvotes)

Papers citing "Aya Expanse: Combining Research Breakthroughs for a New Multilingual Frontier"

48 / 48 papers shown
Title
Estonian WinoGrande Dataset: Comparative Analysis of LLM Performance on Human and Machine Translation
Estonian WinoGrande Dataset: Comparative Analysis of LLM Performance on Human and Machine Translation
Marii Ojastu
Hele-Andra Kuulmets
Aleksei Dorkin
Marika Borovikova
Dage Särg
Kairit Sirts
50
0
0
21 Nov 2025
Global PIQA: Evaluating Physical Commonsense Reasoning Across 100+ Languages and Cultures
Global PIQA: Evaluating Physical Commonsense Reasoning Across 100+ Languages and Cultures
T. Chang
Catherine Arnett
Abdelrahman Eldesokey
Abdelrahman Sadallah
Abeer Kashar
...
Francesco Orabona
Francesco Periti
Gbenga Kayode Solomon
Gia Nghia Ngo
Gloria Udhehdhe-oze
LRMELM
88
0
0
28 Oct 2025
Quality-Aware Translation Tagging in Multilingual RAG system
Quality-Aware Translation Tagging in Multilingual RAG system
Hoyeon Moon
Byeolhee Kim
Nikhil Verma
VLM
148
0
0
27 Oct 2025
Model-Aware Tokenizer Transfer
Model-Aware Tokenizer Transfer
Mykola Haltiuk
Aleksander Smywiński-Pohl
72
0
0
24 Oct 2025
Enhancing Reasoning Skills in Small Persian Medical Language Models Can Outperform Large-Scale Data Training
Enhancing Reasoning Skills in Small Persian Medical Language Models Can Outperform Large-Scale Data Training
Mehrdad Ghassabi
Sadra Hakim
Hamidreza Baradaran Kashani
Pedram Rostami
ReLMLRM
256
0
0
22 Oct 2025
DETree: DEtecting Human-AI Collaborative Texts via Tree-Structured Hierarchical Representation Learning
DETree: DEtecting Human-AI Collaborative Texts via Tree-Structured Hierarchical Representation Learning
Yongxin He
Shan Zhang
Yixuan Cao
Lei Ma
Ping Luo
DeLMO
116
0
0
20 Oct 2025
Finetuning LLMs for EvaCun 2025 token prediction shared task
Finetuning LLMs for EvaCun 2025 token prediction shared task
Josef Jon
Ondrej Bojar
32
0
0
17 Oct 2025
From tests to effect sizes: Quantifying uncertainty and statistical variability in multilingual and multitask NLP evaluation benchmarks
From tests to effect sizes: Quantifying uncertainty and statistical variability in multilingual and multitask NLP evaluation benchmarks
Jonne Sälevä
Duygu Ataman
Constantine Lignos
60
0
0
26 Sep 2025
Introducing OmniGEC: A Silver Multilingual Dataset for Grammatical Error Correction
Roman Kovalchuk
Mariana Romanyshyn
Petro Ivaniuk
SyDa
72
0
0
18 Sep 2025
Debiasing Multilingual LLMs in Cross-lingual Latent Space
Debiasing Multilingual LLMs in Cross-lingual Latent Space
Qiwei Peng
Guimin Hu
Yekun Chai
Anders Søgaard
60
1
0
25 Aug 2025
Understanding Subword Compositionality of Large Language Models
Understanding Subword Compositionality of Large Language Models
Qiwei Peng
Yekun Chai
Anders Søgaard
60
1
0
25 Aug 2025
Cetvel: A Unified Benchmark for Evaluating Language Understanding, Generation and Cultural Capacity of LLMs for Turkish
Cetvel: A Unified Benchmark for Evaluating Language Understanding, Generation and Cultural Capacity of LLMs for Turkish
Yakup Abrek Er
.Ilker Kesen
Gözde Gül Şahin
Aykut Erdem
ELMVLM
96
0
0
22 Aug 2025
mSCoRe: a $M$ultilingual and Scalable Benchmark for $S$kill-based $Co$mmonsense $Re$asoning
mSCoRe: a MMMultilingual and Scalable Benchmark for SSSkill-based CoCoCommonsense ReReReasoning
Nghia Trung Ngo
Franck Dernoncourt
T. Nguyen
LRM
98
0
0
13 Aug 2025
Sacred or Synthetic? Evaluating LLM Reliability and Abstention for Religious Questions
Sacred or Synthetic? Evaluating LLM Reliability and Abstention for Religious Questions
Farah Atif
Nursultan Askarbekuly
Kareem Darwish
Monojit Choudhury
40
0
0
04 Aug 2025
BALSAM: A Platform for Benchmarking Arabic Large Language Models
BALSAM: A Platform for Benchmarking Arabic Large Language Models
Rawan N. Al-Matham
Kareem Darwish
Raghad Al-Rasheed
Waad Alshammari
Muneera Alhoshan
...
Sultana Alghurabi
Atikah Alzeghayer
Afrah Altamimi
Abdullah Alfaifi
Abdulrahman AlOsaimy
ELM
142
2
0
30 Jul 2025
Where to show Demos in Your Prompt: A Positional Bias of In-Context Learning
Where to show Demos in Your Prompt: A Positional Bias of In-Context Learning
Kwesi Cobbina
Tianyi Zhou
86
2
0
30 Jul 2025
A Benchmark Dataset and Evaluation Framework for Vietnamese Large Language Models in Customer Support
A Benchmark Dataset and Evaluation Framework for Vietnamese Large Language Models in Customer Support
Long Nguyen
Truong P. Hua
T. Nguyen
T. Pham
Nam K. Ngo
An X. Nguyen
Nghi D. M. Pham
Nghia H. Nguyen
Tho Quan
84
0
0
30 Jul 2025
Multilingual LLMs Are Not Multilingual Thinkers: Evidence from Hindi Analogy Evaluation
Multilingual LLMs Are Not Multilingual Thinkers: Evidence from Hindi Analogy Evaluation
Ashray Gupta
Rohan Joseph
Sunny Rai
ELMLRM
93
1
0
17 Jul 2025
Treasure Hunt: Real-time Targeting of the Long Tail using Training-Time Markers
Treasure Hunt: Real-time Targeting of the Long Tail using Training-Time Markers
Daniel D'souza
Julia Kreutzer
Adrien Morisot
Ahmet Üstün
Sara Hooker
158
0
0
17 Jun 2025
They want to pretend not to understand: The Limits of Current LLMs in Interpreting Implicit Content of Political Discourse
They want to pretend not to understand: The Limits of Current LLMs in Interpreting Implicit Content of Political DiscourseAnnual Meeting of the Association for Computational Linguistics (ACL), 2025
Walter Paci
Alessandro Panunzi
Sandro Pezzelle
78
0
0
07 Jun 2025
Massively Multilingual Adaptation of Large Language Models Using Bilingual Translation Data
Massively Multilingual Adaptation of Large Language Models Using Bilingual Translation Data
Shaoxiong Ji
Zihao Li
Jaakko Paavola
Indraneil Paul
Hengyu Luo
Jörg Tiedemann
CLL
168
3
0
31 May 2025
M-Wanda: Improving One-Shot Pruning for Multilingual LLMs
M-Wanda: Improving One-Shot Pruning for Multilingual LLMs
Rochelle Choenni
Ivan Titov
155
1
0
27 May 2025
NileChat: Towards Linguistically Diverse and Culturally Aware LLMs for Local Communities
NileChat: Towards Linguistically Diverse and Culturally Aware LLMs for Local Communities
Abdellah El Mekki
Houdaifa Atou
Omer Nacar
Shady Shehata
Muhammad Abdul-Mageed
182
6
0
23 May 2025
EXECUTE: A Multilingual Benchmark for LLM Token Understanding
EXECUTE: A Multilingual Benchmark for LLM Token UnderstandingAnnual Meeting of the Association for Computational Linguistics (ACL), 2025
Lukas Edman
Helmut Schmid
Kangyang Luo
117
0
0
23 May 2025
Fann or Flop: A Multigenre, Multiera Benchmark for Arabic Poetry Understanding in LLMs
Fann or Flop: A Multigenre, Multiera Benchmark for Arabic Poetry Understanding in LLMs
Wafa Alghallabi
Ritesh Thawkar
Sara Ghaboura
Ketan More
Omkar Thawakar
Hisham Cholakkal
Salman Khan
Rao Muhammad Anwer
342
6
0
23 May 2025
MAPS: A Multilingual Benchmark for Global Agent Performance and Security
MAPS: A Multilingual Benchmark for Global Agent Performance and Security
Omer Hofman
Jonathan Brokman
Oren Rachmil
Shamik Bose
Vikas Pahuja
Toshiya Shimizu
Trisha Starostina
Kelly Marchisio
Seraphina Goldfarb-Tarrant
Roman Vainshtein
164
1
0
21 May 2025
Leveraging Online Data to Enhance Medical Knowledge in a Small Persian Language Model
Leveraging Online Data to Enhance Medical Knowledge in a Small Persian Language Model
Mehrdad Ghassabi
Pedram Rostami
Hamidreza Baradaran Kashani
Amirhossein Poursina
Zahra Kazemi
Milad Tavakoli
LM&MA
437
2
0
21 May 2025
Language Specific Knowledge: Do Models Know Better in X than in English?
Language Specific Knowledge: Do Models Know Better in X than in English?
Ishika Agarwal
Nimet Beyza Bozdag
Dilek Hakkani-Tur
168
1
0
21 May 2025
Krikri: Advancing Open Large Language Models for Greek
Krikri: Advancing Open Large Language Models for Greek
Dimitris Roussis
Leon Voukoutis
Georgios Paraskevopoulos
Sokratis Sofianopoulos
Prokopis Prokopidis
Vassilis Papavasileiou
Athanasios Katsamanis
Stelios Piperidis
Vassilis Katsouros
ALM
278
5
0
19 May 2025
Improving Multilingual Capabilities with Cultural and Local Knowledge in Large Language Models While Enhancing Native Performance
Improving Multilingual Capabilities with Cultural and Local Knowledge in Large Language Models While Enhancing Native Performance
Ram Mohan Rao Kadiyala
Siddartha Pullakhandam
Siddhant Gupta
Drishti Sharma
Jebish Purbey
Kanwal Mehreen
Muhammad Arham
Suman Debnath
Hamza Farooq
324
1
0
13 Apr 2025
MultiBLiMP 1.0: A Massively Multilingual Benchmark of Linguistic Minimal Pairs
MultiBLiMP 1.0: A Massively Multilingual Benchmark of Linguistic Minimal Pairs
Jaap Jumelet
Leonie Weissweiler
Joakim Nivre
Arianna Bisazza
243
21
0
03 Apr 2025
Zero-shot Benchmarking: A Framework for Flexible and Scalable Automatic Evaluation of Language Models
Zero-shot Benchmarking: A Framework for Flexible and Scalable Automatic Evaluation of Language Models
José P. Pombal
Nuno M. Guerreiro
Ricardo Rei
André F. T. Martins
ALM
426
7
0
01 Apr 2025
MKA: Leveraging Cross-Lingual Consensus for Model Abstention
MKA: Leveraging Cross-Lingual Consensus for Model Abstention
Sharad Duwal
228
0
0
31 Mar 2025
UNITYAI-GUARD: Pioneering Toxicity Detection Across Low-Resource Indian Languages
UNITYAI-GUARD: Pioneering Toxicity Detection Across Low-Resource Indian Languages
Himanshu Beniwal
Reddybathuni Venkat
Rohit Kumar
Birudugadda Srivibhav
Daksh Jain
Pavan Doddi
Eshwar Dhande
Adithya Ananth
Kuldeep
Heer Kubadia
146
1
0
29 Mar 2025
Command R7B Arabic: A Small, Enterprise Focused, Multilingual, and Culturally Aware Arabic LLM
Command R7B Arabic: A Small, Enterprise Focused, Multilingual, and Culturally Aware Arabic LLM
Yazeed Alnumay
Alexandre Barbet
Anna Bialas
William Darling
Shaan Desai
...
Stephanie Howe
Olivia Lasche
Justin Lee
Anirudh Shrinivason
Jennifer Tracey
263
3
0
18 Mar 2025
Enhancing LLM Reasoning with Iterative DPO: A Comprehensive Empirical Investigation
Enhancing LLM Reasoning with Iterative DPO: A Comprehensive Empirical Investigation
Songjun Tu
Jiahao Lin
Xiangyu Tian
Qichao Zhang
Linjing Li
...
Nan Xu
Wei He
Xiangyuan Lan
Shihong Deng
Dongbin Zhao
LRM
381
13
0
17 Mar 2025
High-Dimensional Interlingual Representations of Large Language Models
High-Dimensional Interlingual Representations of Large Language Models
Bryan Wilie
Samuel Cahyawijaya
Junxian He
Pascale Fung
391
0
0
14 Mar 2025
CULEMO: Cultural Lenses on Emotion -- Benchmarking LLMs for Cross-Cultural Emotion Understanding
CULEMO: Cultural Lenses on Emotion -- Benchmarking LLMs for Cross-Cultural Emotion UnderstandingAnnual Meeting of the Association for Computational Linguistics (ACL), 2025
Tadesse Destaw Belay
Ahmed Haj Ahmed
Alvin Grissom II
Iqra Ameer
Grigori Sidorov
Olga Kolesnikova
Seid Muhie Yimam
343
3
0
12 Mar 2025
MiLiC-Eval: Benchmarking Multilingual LLMs for China's Minority Languages
MiLiC-Eval: Benchmarking Multilingual LLMs for China's Minority LanguagesAnnual Meeting of the Association for Computational Linguistics (ACL), 2025
Chen Zhang
Mingxu Tao
Zhiyuan Liao
Yansong Feng
263
2
0
03 Mar 2025
Batayan: A Filipino NLP benchmark for evaluating Large Language Models
Batayan: A Filipino NLP benchmark for evaluating Large Language ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2025
Jann Railey Montalan
Jimson Paulo Layacan
David Demitri Africa
Richell Isaiah Flores
Michael T. Lopez II
Theresa Denise Magsajo
Anjanette Cayabyab
William-Chandra Tjhi
158
2
0
19 Feb 2025
DCAD-2000: A Multilingual Dataset across 2000+ Languages with Data Cleaning as Anomaly Detection
DCAD-2000: A Multilingual Dataset across 2000+ Languages with Data Cleaning as Anomaly Detection
Yingli Shen
Wen Lai
Kaiyan Zhang
Xueren Zhang
Kangyang Luo
Kangyang Luo
Maosong Sun
349
2
0
17 Feb 2025
Blessing of Multilinguality: A Systematic Analysis of Multilingual In-Context Learning
Blessing of Multilinguality: A Systematic Analysis of Multilingual In-Context LearningAnnual Meeting of the Association for Computational Linguistics (ACL), 2025
Yilei Tu
Andrew Xue
Freda Shi
246
1
0
17 Feb 2025
BenchMAX: A Comprehensive Multilingual Evaluation Suite for Large Language Models
BenchMAX: A Comprehensive Multilingual Evaluation Suite for Large Language Models
Xu Huang
Wenhao Zhu
Hanxu Hu
Bin Wang
Lei Li
Shujian Huang
Fei Yuan
ELM
399
8
0
11 Feb 2025
BOUQuET: dataset, Benchmark and Open initiative for Universal Quality Evaluation in Translation
BOUQuET: dataset, Benchmark and Open initiative for Universal Quality Evaluation in Translation
Omnilingual MT Team
Pierre Yves Andrews
Mikel Artetxe
Mariano Coria Meglioli
Marta R. Costa-jussá
...
Eduardo Sánchez
Ioannis Tsiamas
Arina Turkatenko
Albert Ventayol-Boada
Shireen Yates
357
1
0
06 Feb 2025
Multilingual Machine Translation with Open Large Language Models at Practical Scale: An Empirical Study
Multilingual Machine Translation with Open Large Language Models at Practical Scale: An Empirical StudyNorth American Chapter of the Association for Computational Linguistics (NAACL), 2025
Menglong Cui
Pengzhi Gao
Wei Liu
Jian Luan
Bin Wang
LRM
344
23
0
04 Feb 2025
The Reality of AI and Biorisk
The Reality of AI and BioriskConference on Fairness, Accountability and Transparency (FAccT), 2024
Aidan Peppin
Anka Reuel
Stephen Casper
Elliot Jones
A. Strait
...
Sanmi Koyejo
Marie Pellat
Rishi Bommasani
Nick Frosst
Sara Hooker
236
6
0
03 Jan 2025
M-RewardBench: Evaluating Reward Models in Multilingual Settings
M-RewardBench: Evaluating Reward Models in Multilingual SettingsAnnual Meeting of the Association for Computational Linguistics (ACL), 2024
Srishti Gureja
Lester James V. Miranda
Shayekh Bin Islam
Rishabh Maheshwary
Drishti Sharma
Gusti Winata
Nathan Lambert
Sebastian Ruder
Sara Hooker
Marzieh Fadaee
LRM
338
40
0
20 Oct 2024
CaLMQA: Exploring culturally specific long-form question answering across 23 languages
CaLMQA: Exploring culturally specific long-form question answering across 23 languages
Shane Arora
Marzena Karpinska
Hung-Ting Chen
Ipsita Bhattacharjee
Mohit Iyyer
Eunsol Choi
HILM
349
22
0
25 Jun 2024
1