ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2003.05002
  4. Cited By
TyDi QA: A Benchmark for Information-Seeking Question Answering in
  Typologically Diverse Languages

TyDi QA: A Benchmark for Information-Seeking Question Answering in Typologically Diverse Languages

Transactions of the Association for Computational Linguistics (TACL), 2020
10 March 2020
J. Clark
Eunsol Choi
Michael Collins
Dan Garrette
Tom Kwiatkowski
Vitaly Nikolaev
J. Palomaki
ArXiv (abs)PDFHTML

Papers citing "TyDi QA: A Benchmark for Information-Seeking Question Answering in Typologically Diverse Languages"

50 / 491 papers shown
M4-RAG: A Massive-Scale Multilingual Multi-Cultural Multimodal RAG
M4-RAG: A Massive-Scale Multilingual Multi-Cultural Multimodal RAG
David Anugraha
Patrick Amadeus Irawan
A. Singh
En-Shiun Annie Lee
Genta Indra Winata
VLM
196
1
0
05 Dec 2025
mmJEE-Eval: A Bilingual Multimodal Benchmark for Evaluating Scientific Reasoning in Vision-Language Models
mmJEE-Eval: A Bilingual Multimodal Benchmark for Evaluating Scientific Reasoning in Vision-Language Models
Arka Mukherjee
Shreya Ghosh
LRM
200
0
0
12 Nov 2025
Rethinking what Matters: Effective and Robust Multilingual Realignment for Low-Resource Languages
Rethinking what Matters: Effective and Robust Multilingual Realignment for Low-Resource Languages
Quang Phuoc Nguyen
David Anugraha
Felix Gaschi
Jun Bin Cheng
En-Shiun Annie Lee
220
0
0
09 Nov 2025
Mixtures of SubExperts for Large Language Continual Learning
Mixtures of SubExperts for Large Language Continual Learning
Haeyong Kang
CLLKELMMoE
269
0
0
09 Nov 2025
Iterative Layer-wise Distillation for Efficient Compression of Large Language Models
Iterative Layer-wise Distillation for Efficient Compression of Large Language Models
Grigory Kovalev
M. Tikhomirov
151
0
0
07 Nov 2025
Do You Know About My Nation? Investigating Multilingual Language Models' Cultural Literacy Through Factual Knowledge
Do You Know About My Nation? Investigating Multilingual Language Models' Cultural Literacy Through Factual Knowledge
Eshaan Tanwar
Anwoy Chatterjee
Michael Stephen Saxon
Alon Albalak
William Wang
Tanmoy Chakraborty
169
3
0
01 Nov 2025
DialectalArabicMMLU: Benchmarking Dialectal Capabilities in Arabic and Multilingual Language Models
DialectalArabicMMLU: Benchmarking Dialectal Capabilities in Arabic and Multilingual Language Models
Malik H. Altakrori
Nizar Habash
Abdelhakim Freihat
Younes Samih
Kirill Chirkunov
Muhammed AbuOdeh
Radu Florian
Teresa Lynn
Preslav Nakov
Alham Fikri Aji
ELM
279
4
0
31 Oct 2025
Rethinking Cross-lingual Alignment: Balancing Transfer and Cultural Erasure in Multilingual LLMs
Rethinking Cross-lingual Alignment: Balancing Transfer and Cultural Erasure in Multilingual LLMs
HyoJung Han
Sweta Agrawal
Eleftheria Briakou
140
2
0
29 Oct 2025
Can LLMs Write Faithfully? An Agent-Based Evaluation of LLM-generated Islamic Content
Can LLMs Write Faithfully? An Agent-Based Evaluation of LLM-generated Islamic Content
Abdullah Mushtaq
Rafay Naeem
Ezieddin Elmahjub
Ibrahim Ghaznavi
Shawqi Al-Maliki
M. Abdallah
Ala I. Al-Fuqaha
Junaid Qadir
ELM
207
1
0
28 Oct 2025
Global PIQA: Evaluating Physical Commonsense Reasoning Across 100+ Languages and Cultures
Global PIQA: Evaluating Physical Commonsense Reasoning Across 100+ Languages and Cultures
T. Chang
Catherine Arnett
Abdelrahman Eldesokey
Abdelrahman Sadallah
Abeer Kashar
...
Francesco Orabona
Francesco Periti
Gbenga Kayode Solomon
Gia Nghia Ngo
Gloria Udhehdhe-oze
LRMELM
231
4
0
28 Oct 2025
Quality-Aware Translation Tagging in Multilingual RAG system
Quality-Aware Translation Tagging in Multilingual RAG system
Hoyeon Moon
Byeolhee Kim
Nikhil Verma
VLM
249
2
0
27 Oct 2025
LM-mixup: Text Data Augmentation via Language Model based Mixup
LM-mixup: Text Data Augmentation via Language Model based Mixup
Zhijie Deng
Zhouan Shen
Ling Li
Yao Zhou
Zhaowei Zhu
Yanji He
Wei Wang
Jiaheng Wei
145
0
0
23 Oct 2025
ssToken: Self-modulated and Semantic-aware Token Selection for LLM Fine-tuning
ssToken: Self-modulated and Semantic-aware Token Selection for LLM Fine-tuning
Xiaohan Qin
Xiaoxing Wang
Ning Liao
Cancheng Zhang
Xiangdong Zhang
Mingquan Feng
Jingzhi Wang
Junchi Yan
178
2
0
21 Oct 2025
ChiKhaPo: A Large-Scale Multilingual Benchmark for Evaluating Lexical Comprehension and Generation in Large Language Models
ChiKhaPo: A Large-Scale Multilingual Benchmark for Evaluating Lexical Comprehension and Generation in Large Language Models
Emily Chang
Niyati Bafna
ELM
196
0
0
19 Oct 2025
Evaluating Arabic Large Language Models: A Survey of Benchmarks, Methods, and Gaps
Evaluating Arabic Large Language Models: A Survey of Benchmarks, Methods, and Gaps
Ahmed Alzubaidi
Shaikha Alsuwaidi
Basma El Amel Boussaha
Leen AlQadi
Omar Alkaabi
Mohammed Alyafeai
Hamza Alobeidli
Hakim Hacid
ELM
216
4
0
15 Oct 2025
Bolster Hallucination Detection via Prompt-Guided Data Augmentation
Bolster Hallucination Detection via Prompt-Guided Data Augmentation
Wenyun Li
Zheng Zhang
Dongmei Jiang
Xiangyuan Lan
HILM
226
0
0
13 Oct 2025
Type and Complexity Signals in Multilingual Question Representations
Type and Complexity Signals in Multilingual Question Representations
Robin Kokot
Wessel Poelman
137
0
0
07 Oct 2025
CDT: A Comprehensive Capability Framework for Large Language Models Across Cognition, Domain, and Task
CDT: A Comprehensive Capability Framework for Large Language Models Across Cognition, Domain, and Task
Haosi Mo
Xinyu Ma
Xuebo Liu
Yang Li
Yu Li
Jie Liu
Min Zhang
ELM
165
0
0
29 Sep 2025
Anecdoctoring: Automated Red-Teaming Across Language and Place
Anecdoctoring: Automated Red-Teaming Across Language and Place
Alejandro Cuevas
Saloni Dash
Bharat Kumar Nayak
Dan Vann
Madeleine I. G. Daepp
160
2
0
23 Sep 2025
DRISHTIKON: A Multimodal Multilingual Benchmark for Testing Language Models' Understanding on Indian Culture
DRISHTIKON: A Multimodal Multilingual Benchmark for Testing Language Models' Understanding on Indian Culture
Arijit Maji
Raghvendra Kumar
Akash Ghosh
Anushka
Nemil Shah
Abhilekh Borah
Vanshika Shah
Nishant Mishra
Sriparna Saha
VLM
258
7
0
23 Sep 2025
Uncertainty in Semantic Language Modeling with PIXELS
Uncertainty in Semantic Language Modeling with PIXELS
Stefania Radu
Marco Zullich
Matias Valdenegro-Toro
201
0
0
23 Sep 2025
Breaking Token Into Concepts: Exploring Extreme Compression in Token Representation Via Compositional Shared Semantics
Breaking Token Into Concepts: Exploring Extreme Compression in Token Representation Via Compositional Shared Semantics
Kavin R V
Pawan Goyal
111
0
0
22 Sep 2025
Probabilistic Token Alignment for Large Language Model Fusion
Probabilistic Token Alignment for Large Language Model Fusion
Runjia Zeng
James Liang
Cheng Han
Zhiwen Cao
Jiahao Liu
...
Yingjie Victor Chen
Lifu Huang
Tong Geng
Qifan Wang
Dongfang Liu
209
3
0
21 Sep 2025
HARP: Hallucination Detection via Reasoning Subspace Projection
HARP: Hallucination Detection via Reasoning Subspace Projection
Junjie Hu
Gang Tu
ShengYu Cheng
Jinxin Li
Jinting Wang
Rui Chen
Zhilong Zhou
Dongbo Shan
265
0
0
15 Sep 2025
MultiWikiQA: A Reading Comprehension Benchmark in 300+ Languages
MultiWikiQA: A Reading Comprehension Benchmark in 300+ Languages
Dan Saattrup Smart
RALM
442
5
0
04 Sep 2025
AraHalluEval: A Fine-grained Hallucination Evaluation Framework for Arabic LLMs
AraHalluEval: A Fine-grained Hallucination Evaluation Framework for Arabic LLMs
Aisha Alansari
Hamzah Luqman
HILMLRM
273
6
0
04 Sep 2025
Implicit Reasoning in Large Language Models: A Comprehensive Survey
Implicit Reasoning in Large Language Models: A Comprehensive Survey
Jindong Li
Yali Fu
Li Fan
Jiahong Liu
Yao Shu
Chengwei Qin
Menglin Yang
Irwin King
Rex Ying
OffRLLRMAI4CE
285
25
0
02 Sep 2025
The Gold Medals in an Empty Room: Diagnosing Metalinguistic Reasoning in LLMs with Camlang
The Gold Medals in an Empty Room: Diagnosing Metalinguistic Reasoning in LLMs with Camlang
Fenghua Liu
Yulong Chen
Yixuan Liu
Zhujun Jin
Solomon Tsai
Ming Zhong
ReLMLRM
249
1
0
30 Aug 2025
CultranAI at PalmX 2025: Data Augmentation for Cultural Knowledge Representation
CultranAI at PalmX 2025: Data Augmentation for Cultural Knowledge Representation
Hunzalah Hassan Bhatti
Youssef Ahmed
Md. Arid Hasan
Firoj Alam
270
2
0
24 Aug 2025
Quantifying Language Disparities in Multilingual Large Language Models
Quantifying Language Disparities in Multilingual Large Language Models
Songbo Hu
Ivan Vulić
Anna Korhonen
155
4
0
23 Aug 2025
M3TQA: Massively Multilingual Multitask Table Question Answering
M3TQA: Massively Multilingual Multitask Table Question Answering
Daixin Shu
Jian Yang
Zhenhe Wu
Xianjie Wu
Xianfu Cheng
...
Hualei Zhu
Wei Zhang
G. Zhang
Jiaheng Liu
Zhoujun Li
LMTD
235
2
0
22 Aug 2025
XLQA: A Benchmark for Locale-Aware Multilingual Open-Domain Question Answering
XLQA: A Benchmark for Locale-Aware Multilingual Open-Domain Question Answering
Keon-Woo Roh
Yeong-Joon Ju
Seong-Whan Lee
ELM
224
2
0
22 Aug 2025
SEA-BED: How Do Embedding Models Represent Southeast Asian Languages?
SEA-BED: How Do Embedding Models Represent Southeast Asian Languages?
Wuttikorn Ponwitayarat
Raymond Ng
Jann Railey Montalan
Thura Aung
Jian Gang Ngui
...
Panuthep Tasawong
Erik Cambria
Ekapol Chuangsuwanich
Sarana Nutanong
Peerat Limkonchotiwat
FedML
229
2
0
17 Aug 2025
LoraxBench: A Multitask, Multilingual Benchmark Suite for 20 Indonesian Languages
LoraxBench: A Multitask, Multilingual Benchmark Suite for 20 Indonesian Languages
Alham Fikri Aji
Trevor Cohn
159
2
0
17 Aug 2025
Two-Stage Quranic QA via Ensemble Retrieval and Instruction-Tuned Answer Extraction
Two-Stage Quranic QA via Ensemble Retrieval and Instruction-Tuned Answer Extraction
Mohamed Basem
Islam Oshallah
Ali Hamdi
Khaled Shaban
Hozaifa Kassab
RALM
274
0
0
09 Aug 2025
TASE: Token Awareness and Structured Evaluation for Multilingual Language Models
TASE: Token Awareness and Structured Evaluation for Multilingual Language Models
Chenzhuo Zhao
Xinda Wang
Yue Huang
Junting Lu
Ziqian Liu
LRM
144
1
0
07 Aug 2025
Forgetting: A New Mechanism Towards Better Large Language Model Fine-tuning
Forgetting: A New Mechanism Towards Better Large Language Model Fine-tuning
Ali Taheri Ghahrizjani
Alireza Taban
Qizhou Wang
Shanshan Ye
Tongliang Liu
Tongliang Liu
Bo Han
CLLKELMMULRM
396
3
0
06 Aug 2025
MegaWika 2: A More Comprehensive Multilingual Collection of Articles and their Sources
MegaWika 2: A More Comprehensive Multilingual Collection of Articles and their Sources
Samuel Barham
Chandler May
Benjamin Van Durme
SyDa
334
3
0
05 Aug 2025
HeQ: a Large and Diverse Hebrew Reading Comprehension Benchmark
HeQ: a Large and Diverse Hebrew Reading Comprehension BenchmarkConference on Empirical Methods in Natural Language Processing (EMNLP), 2025
Amir D. N. Cohen
Hilla Merhav
Yoav Goldberg
Reut Tsarfaty
167
11
0
03 Aug 2025
Enhanced Arabic Text Retrieval with Attentive Relevance Scoring
Enhanced Arabic Text Retrieval with Attentive Relevance Scoring
Salah Eddine Bekhouche
Azeddine Benlamoudi
Yazid Bounab
Fadi Dornaika
Abdenour Hadid
260
2
0
31 Jul 2025
CUS-QA: Local-Knowledge-Oriented Open-Ended Question Answering Dataset
CUS-QA: Local-Knowledge-Oriented Open-Ended Question Answering Dataset
Jindrich Libovický
Jindřich Helcl
Andrei-Alexandru Manea
Gianluca Vico
296
3
0
30 Jul 2025
HW-MLVQA: Elucidating Multilingual Handwritten Document Understanding with a Comprehensive VQA Benchmark
HW-MLVQA: Elucidating Multilingual Handwritten Document Understanding with a Comprehensive VQA Benchmark
Aniket Pal
Ajoy Mondal
Minesh Mathew
C. V. Jawahar
VLM
135
1
0
21 Jul 2025
BlockFFN: Towards End-Side Acceleration-Friendly Mixture-of-Experts with Chunk-Level Activation Sparsity
BlockFFN: Towards End-Side Acceleration-Friendly Mixture-of-Experts with Chunk-Level Activation Sparsity
Chenyang Song
Weilin Zhao
Xu Han
Chaojun Xiao
Yingfa Chen
Yuxuan Li
Zhiyuan Liu
Maosong Sun
MoE
340
2
0
11 Jul 2025
SANSKRITI: A Comprehensive Benchmark for Evaluating Language Models' Knowledge of Indian Culture
SANSKRITI: A Comprehensive Benchmark for Evaluating Language Models' Knowledge of Indian CultureAnnual Meeting of the Association for Computational Linguistics (ACL), 2025
Arijit Maji
Raghvendra Kumar
Akash Ghosh
Anushka
Sriparna Saha
ELM
381
12
0
18 Jun 2025
ClusterUCB: Efficient Gradient-Based Data Selection for Targeted Fine-Tuning of LLMs
ClusterUCB: Efficient Gradient-Based Data Selection for Targeted Fine-Tuning of LLMs
Zige Wang
Qi Zhu
Fei Mi
Minghui Xu
Ruochun Jin
Wenjing Yang
291
2
0
12 Jun 2025
Shaking to Reveal: Perturbation-Based Detection of LLM Hallucinations
Shaking to Reveal: Perturbation-Based Detection of LLM Hallucinations
Jinyuan Luo
Zhen Fang
Shouqing Yang
Seongheon Park
Ling Chen
AAMLHILM
299
1
0
03 Jun 2025
Data Pruning by Information Maximization
Data Pruning by Information MaximizationInternational Conference on Learning Representations (ICLR), 2025
Haoru Tan
Sitong Wu
Wei Huang
Shizhen Zhao
Xiaojuan Qi
374
13
0
02 Jun 2025
Charting the Landscape of African NLP: Mapping Progress and Shaping the Road Ahead
Charting the Landscape of African NLP: Mapping Progress and Shaping the Road Ahead
Jesujoba Oluwadara Alabi
Michael A. Hedderich
David Ifeoluwa Adelani
Dietrich Klakow
563
12
0
27 May 2025
Efficient Data Selection at Scale via Influence Distillation
Efficient Data Selection at Scale via Influence Distillation
Mahdi Nikdan
Vincent Cohen-Addad
Dan Alistarh
Vahab Mirrokni
TDI
447
9
0
25 May 2025
ProDS: Preference-oriented Data Selection for Instruction Tuning
ProDS: Preference-oriented Data Selection for Instruction Tuning
Wenya Guo
Zhengkun Zhang
Xumeng Liu
Ying Zhang
Ziyu Lu
Haoze Zhu
Xubo Liu
Ruxue Yan
330
1
0
19 May 2025
1234...8910
Next
Page 1 of 10