ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2003.05002
  4. Cited By
TyDi QA: A Benchmark for Information-Seeking Question Answering in
  Typologically Diverse Languages

TyDi QA: A Benchmark for Information-Seeking Question Answering in Typologically Diverse Languages

10 March 2020
J. Clark
Eunsol Choi
Michael Collins
Dan Garrette
Tom Kwiatkowski
Vitaly Nikolaev
J. Palomaki
ArXivPDFHTML

Papers citing "TyDi QA: A Benchmark for Information-Seeking Question Answering in Typologically Diverse Languages"

50 / 400 papers shown
Title
LEAD: Iterative Data Selection for Efficient LLM Instruction Tuning
LEAD: Iterative Data Selection for Efficient LLM Instruction Tuning
Xiaotian Lin
Yanlin Qi
Yizhang Zhu
Themis Palpanas
Chengliang Chai
Nan Tang
Yuyu Luo
21
0
0
12 May 2025
IndicSQuAD: A Comprehensive Multilingual Question Answering Dataset for Indic Languages
IndicSQuAD: A Comprehensive Multilingual Question Answering Dataset for Indic Languages
Sharvi Endait
Ruturaj Ghatage
Aditya Kulkarni
Rajlaxmi Patil
Raviraj Joshi
32
0
0
06 May 2025
Enhancing LLM Language Adaption through Cross-lingual In-Context Pre-training
Enhancing LLM Language Adaption through Cross-lingual In-Context Pre-training
Linjuan Wu
H. Wei
Huan Lin
Tianhao Li
Baosong Yang
Weiming Lu
26
0
0
29 Apr 2025
Building Russian Benchmark for Evaluation of Information Retrieval Models
Building Russian Benchmark for Evaluation of Information Retrieval Models
Grigory Kovalev
M. Tikhomirov
Evgeny Kozhevnikov
Max Kornilov
Natalia V. Loukachevitch
26
0
0
17 Apr 2025
MultiLoKo: a multilingual local knowledge benchmark for LLMs spanning 31 languages
MultiLoKo: a multilingual local knowledge benchmark for LLMs spanning 31 languages
Dieuwke Hupkes
Nikolay Bogoychev
97
0
0
14 Apr 2025
Can the capability of Large Language Models be described by human ability? A Meta Study
Can the capability of Large Language Models be described by human ability? A Meta Study
Mingrui Zan
Yunquan Zhang
Boyang Zhang
Fangming Liu
Daning Cheng
ELM
LM&MA
55
0
0
13 Apr 2025
HalluShift: Measuring Distribution Shifts towards Hallucination Detection in LLMs
HalluShift: Measuring Distribution Shifts towards Hallucination Detection in LLMs
Sharanya Dasgupta
Sujoy Nath
Arkaprabha Basu
Pourya Shamsolmoali
Swagatam Das
HILM
60
0
0
13 Apr 2025
Improving Multilingual Retrieval-Augmented Language Models through Dialectic Reasoning Argumentations
Improving Multilingual Retrieval-Augmented Language Models through Dialectic Reasoning Argumentations
Leonardo Ranaldi
Federico Ranaldi
Fabio Massimo Zanzotto
Barry Haddow
Alexandra Birch
RALM
LRM
38
0
0
07 Apr 2025
Multilingual Retrieval-Augmented Generation for Knowledge-Intensive Task
Multilingual Retrieval-Augmented Generation for Knowledge-Intensive Task
Leonardo Ranaldi
Barry Haddow
Alexandra Birch
RALM
63
1
0
04 Apr 2025
Neuronal Activation States as Sample Embeddings for Data Selection in Task-Specific Instruction Tuning
Neuronal Activation States as Sample Embeddings for Data Selection in Task-Specific Instruction Tuning
Da Ma
Gonghu Shang
Zhi Chen
L. Qin
Yijie Luo
Lei Pan
Shuai Fan
L. Chen
Kai Yu
36
0
0
19 Mar 2025
Command R7B Arabic: A Small, Enterprise Focused, Multilingual, and Culturally Aware Arabic LLM
Command R7B Arabic: A Small, Enterprise Focused, Multilingual, and Culturally Aware Arabic LLM
Yazeed Alnumay
Alexandre Barbet
Anna Bialas
William Darling
Shaan Desai
...
Stephanie Howe
Olivia Lasche
Justin Lee
Anirudh Shrinivason
Jennifer Tracey
86
0
0
18 Mar 2025
Fragile Mastery: Are Domain-Specific Trade-Offs Undermining On-Device Language Models?
Fragile Mastery: Are Domain-Specific Trade-Offs Undermining On-Device Language Models?
Basab Jha
Firoj Paudel
37
0
0
16 Mar 2025
Domain Adaptation for Japanese Sentence Embeddings with Contrastive Learning based on Synthetic Sentence Generation
Zihao Chen
H. Handa
Miho Ohsaki
Kimiaki Shirahama
57
0
0
12 Mar 2025
Large-Scale Data Selection for Instruction Tuning
Hamish Ivison
Muru Zhang
Faeze Brahman
Pang Wei Koh
Pradeep Dasigi
ALM
71
1
0
03 Mar 2025
Granite Embedding Models
Granite Embedding Models
Parul Awasthy
Aashka Trivedi
Yulong Li
Mihaela A. Bornea
David D. Cox
...
Sukriti Sharma
Avirup Sil
Kate Soule
Arafat Sultan
Radu Florian
RALM
56
1
0
27 Feb 2025
Few-Shot Multilingual Open-Domain QA from 5 Examples
Few-Shot Multilingual Open-Domain QA from 5 Examples
Fan Jiang
Tom Drummond
Trevor Cohn
48
0
0
27 Feb 2025
Where Are We? Evaluating LLM Performance on African Languages
Where Are We? Evaluating LLM Performance on African Languages
Ife Adebara
Hawau Olamide Toyin
Nahom Tesfu Ghebremichael
AbdelRahim Elmadany
Muhammad Abdul-Mageed
52
0
0
26 Feb 2025
NusaAksara: A Multimodal and Multilingual Benchmark for Preserving Indonesian Indigenous Scripts
NusaAksara: A Multimodal and Multilingual Benchmark for Preserving Indonesian Indigenous Scripts
Muhammad Farid Adilazuarda
M. Wijanarko
Lucky Susanto
Khumaisa Nuráini
Derry Wijaya
Alham Fikri Aji
52
0
0
25 Feb 2025
Language Models' Factuality Depends on the Language of Inquiry
Language Models' Factuality Depends on the Language of Inquiry
Tushar Aggarwal
Kumar Tanmay
Ayush Agrawal
Kumar Ayush
Hamid Palangi
Paul Pu Liang
HILM
KELM
71
0
0
25 Feb 2025
Reasoning with Latent Thoughts: On the Power of Looped Transformers
Reasoning with Latent Thoughts: On the Power of Looped Transformers
Nikunj Saunshi
Nishanth Dikkala
Zhiyuan Li
Sanjiv Kumar
Sashank J. Reddi
OffRL
LRM
AI4CE
56
10
0
24 Feb 2025
Pay Attention to Real World Perturbations! Natural Robustness Evaluation in Machine Reading Comprehension
Pay Attention to Real World Perturbations! Natural Robustness Evaluation in Machine Reading Comprehension
Yulong Wu
Viktor Schlegel
R. Batista-Navarro
AAML
36
0
0
23 Feb 2025
Multilingual Non-Factoid Question Answering with Answer Paragraph Selection
Multilingual Non-Factoid Question Answering with Answer Paragraph Selection
Ritwik Mishra
Sreeram Vennam
R. Shah
Ponnurangam Kumaraguru
93
0
0
20 Feb 2025
Sailor2: Sailing in South-East Asia with Inclusive Multilingual LLMs
Sailor2: Sailing in South-East Asia with Inclusive Multilingual LLMs
Longxu Dou
Qian Liu
Fan Zhou
Changyu Chen
Zili Wang
...
Tianyu Pang
Chao Du
Xinyi Wan
Wei Lu
Min Lin
89
1
0
18 Feb 2025
Balanced Multi-Factor In-Context Learning for Multilingual Large Language Models
Balanced Multi-Factor In-Context Learning for Multilingual Large Language Models
Masahiro Kaneko
Alham Fikri Aji
Timothy Baldwin
67
0
0
17 Feb 2025
QLESS: A Quantized Approach for Data Valuation and Selection in Large Language Model Fine-Tuning
QLESS: A Quantized Approach for Data Valuation and Selection in Large Language Model Fine-Tuning
Moses Ananta
Muhammad Farid Adilazuarda
Zayd Muhammad Kawakibi Zuhri
Ayu Purwarianti
Alham Fikri Aji
MQ
57
0
0
03 Feb 2025
ALoFTRAG: Automatic Local Fine Tuning for Retrieval Augmented Generation
ALoFTRAG: Automatic Local Fine Tuning for Retrieval Augmented Generation
Peter Devine
33
0
0
21 Jan 2025
Algorithm for Semantic Network Generation from Texts of Low Resource Languages Such as Kiswahili
Algorithm for Semantic Network Generation from Texts of Low Resource Languages Such as Kiswahili
B. Wanjawa
Lawrence Muchemi
Evans Miriti
46
0
0
17 Jan 2025
Language Fusion for Parameter-Efficient Cross-lingual Transfer
Language Fusion for Parameter-Efficient Cross-lingual Transfer
Philipp Borchert
Ivan Vulić
Marie-Francine Moens
Jochen De Weerdt
36
0
0
12 Jan 2025
Optimized Quran Passage Retrieval Using an Expanded QA Dataset and
  Fine-Tuned Language Models
Optimized Quran Passage Retrieval Using an Expanded QA Dataset and Fine-Tuned Language Models
Mohamed Basem
Islam Oshallah
Baraa Hikal
Ali Hamdi
Ammar Mohamed
RALM
71
1
0
16 Dec 2024
Codenames as a Benchmark for Large Language Models
Codenames as a Benchmark for Large Language Models
Matthew Stephenson
Matthew Sidji
Benoît Ronval
LLMAG
LRM
ELM
103
1
0
16 Dec 2024
SailCompass: Towards Reproducible and Robust Evaluation for Southeast
  Asian Languages
SailCompass: Towards Reproducible and Robust Evaluation for Southeast Asian Languages
Jia Guo
Longxu Dou
Guangtao Zeng
Stanley Kok
Wei Lu
Qian Liu
ELM
LRM
70
1
0
02 Dec 2024
SEEKR: Selective Attention-Guided Knowledge Retention for Continual
  Learning of Large Language Models
SEEKR: Selective Attention-Guided Knowledge Retention for Continual Learning of Large Language Models
Jinghan He
Haiyun Guo
Kuan Zhu
Zihan Zhao
Ming Tang
J. T. Wang
KELM
28
1
0
09 Nov 2024
Sparsing Law: Towards Large Language Models with Greater Activation Sparsity
Sparsing Law: Towards Large Language Models with Greater Activation Sparsity
Yuqi Luo
Chenyang Song
Xu Han
Y. Chen
Chaojun Xiao
Zhiyuan Liu
Maosong Sun
47
3
0
04 Nov 2024
SandboxAQ's submission to MRL 2024 Shared Task on Multi-lingual
  Multi-task Information Retrieval
SandboxAQ's submission to MRL 2024 Shared Task on Multi-lingual Multi-task Information Retrieval
Isidora Chara Tourni
Sayontan Ghosh
Brenda Miao
Constantijn van der Poel
LRM
28
0
0
28 Oct 2024
MrT5: Dynamic Token Merging for Efficient Byte-level Language Models
MrT5: Dynamic Token Merging for Efficient Byte-level Language Models
Julie Kallini
Shikhar Murty
Christopher D. Manning
Christopher Potts
Róbert Csordás
30
2
0
28 Oct 2024
A Little Help Goes a Long Way: Efficient LLM Training by Leveraging
  Small LMs
A Little Help Goes a Long Way: Efficient LLM Training by Leveraging Small LMs
A. S. Rawat
Veeranjaneyulu Sadhanala
Afshin Rostamizadeh
Ayan Chakrabarti
Wittawat Jitkrittum
...
Rakesh Shivanna
Sashank J. Reddi
A. Menon
Rohan Anil
Sanjiv Kumar
28
2
0
24 Oct 2024
VoiceBench: Benchmarking LLM-Based Voice Assistants
VoiceBench: Benchmarking LLM-Based Voice Assistants
Yiming Chen
Xianghu Yue
Chen Zhang
Xiaoxue Gao
R. Tan
H. Li
ELM
AuLLM
34
18
0
22 Oct 2024
Influential Language Data Selection via Gradient Trajectory Pursuit
Influential Language Data Selection via Gradient Trajectory Pursuit
Zhiwei Deng
Tao Li
Yang Li
24
1
0
22 Oct 2024
Susu Box or Piggy Bank: Assessing Cultural Commonsense Knowledge between
  Ghana and the U.S
Susu Box or Piggy Bank: Assessing Cultural Commonsense Knowledge between Ghana and the U.S
Christabel Acquaye
Haozhe An
Rachel Rudinger
27
4
0
21 Oct 2024
SwaQuAD-24: QA Benchmark Dataset in Swahili
SwaQuAD-24: QA Benchmark Dataset in Swahili
Alfred Malengo Kondoro
22
0
0
18 Oct 2024
MIRAGE-Bench: Automatic Multilingual Benchmark Arena for Retrieval-Augmented Generation Systems
MIRAGE-Bench: Automatic Multilingual Benchmark Arena for Retrieval-Augmented Generation Systems
Nandan Thakur
Suleman Kazi
Ge Luo
Jimmy J. Lin
Amin Ahmad
VLM
RALM
26
7
0
17 Oct 2024
TSDS: Data Selection for Task-Specific Model Finetuning
TSDS: Data Selection for Task-Specific Model Finetuning
Zifan Liu
Amin Karbasi
Theodoros Rekatsinas
29
3
0
15 Oct 2024
BanglaQuAD: A Bengali Open-domain Question Answering Dataset
BanglaQuAD: A Bengali Open-domain Question Answering Dataset
Md. Rony
Sudipto Kumar Shaha
Rakib Al Hasan
Sumon Kanti Dey
Amzad Hossain Rafi
Amzad Hossain Rafi
Ashraf Hasan Sirajee
Jens Lehmann
32
1
0
14 Oct 2024
State of NLP in Kenya: A Survey
State of NLP in Kenya: A Survey
Cynthia Jayne Amol
Everlyn Asiko Chimoto
Rose Delilah Gesicho
Antony M. Gitau
Naome A. Etori
...
Catherine Gitau
Antony Ndolo
Lilian D. A. Wanzare
Albert Njoroge Kahira
Ronald Tombe
21
1
0
13 Oct 2024
MEXA: Multilingual Evaluation of English-Centric LLMs via Cross-Lingual
  Alignment
MEXA: Multilingual Evaluation of English-Centric LLMs via Cross-Lingual Alignment
Amir Hossein Kargaran
Ali Modarressi
Nafiseh Nikeghbal
Jana Diesner
François Yvon
Hinrich Schütze
ELM
44
3
0
08 Oct 2024
Cross-lingual Transfer for Automatic Question Generation by Learning
  Interrogative Structures in Target Languages
Cross-lingual Transfer for Automatic Question Generation by Learning Interrogative Structures in Target Languages
Seonjeong Hwang
Yunsu Kim
Gary Geunbae Lee
19
0
0
04 Oct 2024
Residual Policy Learning for Perceptive Quadruped Control Using
  Differentiable Simulation
Residual Policy Learning for Perceptive Quadruped Control Using Differentiable Simulation
Jing Yuan Luo
Yunlong Song
Victor Klemm
Fan Shi
Davide Scaramuzza
Marco Hutter
31
1
0
04 Oct 2024
Distilling an End-to-End Voice Assistant Without Instruction Training
  Data
Distilling an End-to-End Voice Assistant Without Instruction Training Data
William B. Held
Ella Li
Michael Joseph Ryan
Weiyan Shi
Yanzhe Zhang
Diyi Yang
AuLLM
36
8
0
03 Oct 2024
MoS: Unleashing Parameter Efficiency of Low-Rank Adaptation with Mixture
  of Shards
MoS: Unleashing Parameter Efficiency of Low-Rank Adaptation with Mixture of Shards
Sheng Wang
Liheng Chen
Pengan Chen
Jingwei Dong
Boyang Xue
Jiyue Jiang
Lingpeng Kong
Chuan Wu
MoE
29
7
0
01 Oct 2024
Towards Robust Extractive Question Answering Models: Rethinking the
  Training Methodology
Towards Robust Extractive Question Answering Models: Rethinking the Training Methodology
Son Quoc Tran
Matt Kretchmar
OOD
19
0
0
29 Sep 2024
12345678
Next