ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2003.05002
  4. Cited By
TyDi QA: A Benchmark for Information-Seeking Question Answering in
  Typologically Diverse Languages

TyDi QA: A Benchmark for Information-Seeking Question Answering in Typologically Diverse Languages

10 March 2020
J. Clark
Eunsol Choi
Michael Collins
Dan Garrette
Tom Kwiatkowski
Vitaly Nikolaev
J. Palomaki
ArXivPDFHTML

Papers citing "TyDi QA: A Benchmark for Information-Seeking Question Answering in Typologically Diverse Languages"

50 / 400 papers shown
Title
Realistic Evaluation of Model Merging for Compositional Generalization
Realistic Evaluation of Model Merging for Compositional Generalization
Derek Tam
Yash Kant
Brian Lester
Igor Gilitschenski
Colin Raffel
MoMe
23
5
0
26 Sep 2024
HaloScope: Harnessing Unlabeled LLM Generations for Hallucination
  Detection
HaloScope: Harnessing Unlabeled LLM Generations for Hallucination Detection
Xuefeng Du
Chaowei Xiao
Yixuan Li
HILM
29
16
0
26 Sep 2024
Unlocking Markets: A Multilingual Benchmark to Cross-Market Question
  Answering
Unlocking Markets: A Multilingual Benchmark to Cross-Market Question Answering
Yifei Yuan
Yang Deng
Anders Søgaard
Mohammad Aliannejadi
23
0
0
24 Sep 2024
AraDiCE: Benchmarks for Dialectal and Cultural Capabilities in LLMs
AraDiCE: Benchmarks for Dialectal and Cultural Capabilities in LLMs
Basel Mousi
Nadir Durrani
Fatema Ahmad
Md. Arid Hasan
Maram Hasanain
Tameem Kabbani
Fahim Dalvi
Shammur A. Chowdhury
Firoj Alam
43
8
0
17 Sep 2024
L3Cube-IndicQuest: A Benchmark Question Answering Dataset for Evaluating
  Knowledge of LLMs in Indic Context
L3Cube-IndicQuest: A Benchmark Question Answering Dataset for Evaluating Knowledge of LLMs in Indic Context
Pritika Rohera
Chaitrali Ginimav
Akanksha Salunke
Gayatri Sawant
Raviraj Joshi
35
3
0
13 Sep 2024
ContextCite: Attributing Model Generation to Context
ContextCite: Attributing Model Generation to Context
Benjamin Cohen-Wang
Harshay Shah
Kristian Georgiev
Aleksander Madry
LRM
30
18
0
01 Sep 2024
Into the Unknown Unknowns: Engaged Human Learning through Participation
  in Language Model Agent Conversations
Into the Unknown Unknowns: Engaged Human Learning through Participation in Language Model Agent Conversations
Yucheng Jiang
Yijia Shao
Dekun Ma
Sina J. Semnani
Monica S. Lam
LLMAG
32
14
0
27 Aug 2024
Predictability and Causality in Spanish and English Natural Language
  Generation
Predictability and Causality in Spanish and English Natural Language Generation
Andrea Busto-Castiñeira
Francisco J. González Castaño
Silvia García-Méndez
Francisco de Arriba-Pérez
CML
46
1
0
26 Aug 2024
Extend Model Merging from Fine-Tuned to Pre-Trained Large Language
  Models via Weight Disentanglement
Extend Model Merging from Fine-Tuned to Pre-Trained Large Language Models via Weight Disentanglement
Le Yu
Bowen Yu
Haiyang Yu
Fei Huang
Yongbin Li
MoMe
27
5
0
06 Aug 2024
SeaLLMs 3: Open Foundation and Chat Multilingual Large Language Models
  for Southeast Asian Languages
SeaLLMs 3: Open Foundation and Chat Multilingual Large Language Models for Southeast Asian Languages
Wenxuan Zhang
Hou Pong Chan
Yiran Zhao
Mahani Aljunied
Jianyu Wang
...
Zhiqiang Hu
Weiwen Xu
Yew Ken Chia
Xin Li
Li Bing
LRM
52
7
0
29 Jul 2024
mGTE: Generalized Long-Context Text Representation and Reranking Models
  for Multilingual Text Retrieval
mGTE: Generalized Long-Context Text Representation and Reranking Models for Multilingual Text Retrieval
Xin Zhang
Yanzhao Zhang
Dingkun Long
Wen Xie
Ziqi Dai
...
Pengjun Xie
Fei Huang
Meishan Zhang
Wenjie Li
Min Zhang
35
73
0
29 Jul 2024
Revolutionizing Bridge Operation and maintenance with LLM-based Agents:
  An Overview of Applications and Insights
Revolutionizing Bridge Operation and maintenance with LLM-based Agents: An Overview of Applications and Insights
Xinyu-Chen
Lianzhen-Zhang
LLMAG
AI4CE
37
1
0
14 Jul 2024
Model Surgery: Modulating LLM's Behavior Via Simple Parameter Editing
Model Surgery: Modulating LLM's Behavior Via Simple Parameter Editing
Huanqian Wang
Yang Yue
Rui Lu
Jingxin Shi
Andrew Zhao
Shenzhi Wang
Shiji Song
Gao Huang
LM&Ro
KELM
44
6
0
11 Jul 2024
Data, Data Everywhere: A Guide for Pretraining Dataset Construction
Data, Data Everywhere: A Guide for Pretraining Dataset Construction
Jupinder Parmar
Shrimai Prabhumoye
Joseph Jennings
Bo Liu
Aastha Jhunjhunwala
Zhilin Wang
M. Patwary
M. Shoeybi
Bryan Catanzaro
39
5
0
08 Jul 2024
LLMBox: A Comprehensive Library for Large Language Models
LLMBox: A Comprehensive Library for Large Language Models
Tianyi Tang
Yiwen Hu
Bingqian Li
Wenyang Luo
Zijing Qin
...
Chunxuan Xia
Junyi Li
Kun Zhou
Wayne Xin Zhao
Ji-Rong Wen
31
1
0
08 Jul 2024
A Principled Framework for Evaluating on Typologically Diverse Languages
A Principled Framework for Evaluating on Typologically Diverse Languages
Esther Ploeger
Wessel Poelman
Andreas Holck Høeg-Petersen
Anders Schlichtkrull
Miryam de Lhoneux
Johannes Bjerva
36
1
0
06 Jul 2024
A Systematic Survey and Critical Review on Evaluating Large Language
  Models: Challenges, Limitations, and Recommendations
A Systematic Survey and Critical Review on Evaluating Large Language Models: Challenges, Limitations, and Recommendations
Md Tahmid Rahman Laskar
Sawsan Alqahtani
M Saiful Bari
Mizanur Rahman
Mohammad Abdullah Matin Khan
...
Chee Wei Tan
Md. Rizwan Parvez
Enamul Hoque
Shafiq R. Joty
Jimmy Huang
ELM
ALM
27
27
0
04 Jul 2024
The Art of Saying No: Contextual Noncompliance in Language Models
The Art of Saying No: Contextual Noncompliance in Language Models
Faeze Brahman
Sachin Kumar
Vidhisha Balachandran
Pradeep Dasigi
Valentina Pyatkin
...
Jack Hessel
Yulia Tsvetkov
Noah A. Smith
Yejin Choi
Hannaneh Hajishirzi
65
20
0
02 Jul 2024
Survey on Knowledge Distillation for Large Language Models: Methods,
  Evaluation, and Application
Survey on Knowledge Distillation for Large Language Models: Methods, Evaluation, and Application
Chuanpeng Yang
Wang Lu
Yao Zhu
Yidong Wang
Qian Chen
Chenlong Gao
Bingjie Yan
Yiqiang Chen
ALM
KELM
44
22
0
02 Jul 2024
Retrieval-augmented generation in multilingual settings
Retrieval-augmented generation in multilingual settings
Nadezhda Chirkova
David Rau
Hervé Déjean
Thibault Formal
S. Clinchant
Vassilina Nikoulina
RALM
30
15
0
01 Jul 2024
BERGEN: A Benchmarking Library for Retrieval-Augmented Generation
BERGEN: A Benchmarking Library for Retrieval-Augmented Generation
David Rau
Hervé Déjean
Nadezhda Chirkova
Thibault Formal
Shuai Wang
Vassilina Nikoulina
S. Clinchant
45
10
0
01 Jul 2024
CaLMQA: Exploring culturally specific long-form question answering
  across 23 languages
CaLMQA: Exploring culturally specific long-form question answering across 23 languages
Shane Arora
Marzena Karpinska
Hung-Ting Chen
Ipsita Bhattacharjee
Mohit Iyyer
Eunsol Choi
HILM
43
11
0
25 Jun 2024
M2Lingual: Enhancing Multilingual, Multi-Turn Instruction Alignment in Large Language Models
M2Lingual: Enhancing Multilingual, Multi-Turn Instruction Alignment in Large Language Models
Rishabh Maheshwary
Vikas Yadav
Hoang Nguyen
Khyati Mahajan
Sathwik Tejaswi Madhusudhan
40
3
0
24 Jun 2024
Teaching LLMs to Abstain across Languages via Multilingual Feedback
Teaching LLMs to Abstain across Languages via Multilingual Feedback
Shangbin Feng
Weijia Shi
Yike Wang
Wenxuan Ding
Orevaoghene Ahia
Shuyue Stella Li
Vidhisha Balachandran
Sunayana Sitaram
Yulia Tsvetkov
65
4
0
22 Jun 2024
Data-Centric AI in the Age of Large Language Models
Data-Centric AI in the Age of Large Language Models
Xinyi Xu
Zhaoxuan Wu
Rui Qiao
Arun Verma
Yao Shu
...
Xiaoqiang Lin
Wenyang Hu
Zhongxiang Dai
Pang Wei Koh
Bryan Kian Hsiang Low
ALM
40
2
0
20 Jun 2024
On the Evaluation Practices in Multilingual NLP: Can Machine Translation
  Offer an Alternative to Human Translations?
On the Evaluation Practices in Multilingual NLP: Can Machine Translation Offer an Alternative to Human Translations?
Rochelle Choenni
Sara Rajaee
Christof Monz
Ekaterina Shutova
31
1
0
20 Jun 2024
Towards Robust Evaluation: A Comprehensive Taxonomy of Datasets and
  Metrics for Open Domain Question Answering in the Era of Large Language
  Models
Towards Robust Evaluation: A Comprehensive Taxonomy of Datasets and Metrics for Open Domain Question Answering in the Era of Large Language Models
Akchay Srivastava
Atif Memon
ELM
40
1
0
19 Jun 2024
Is It Good Data for Multilingual Instruction Tuning or Just Bad
  Multilingual Evaluation for Large Language Models?
Is It Good Data for Multilingual Instruction Tuning or Just Bad Multilingual Evaluation for Large Language Models?
Pinzhen Chen
Simon Yu
Zhicheng Guo
Barry Haddow
ELM
46
1
0
18 Jun 2024
Datasets for Multilingual Answer Sentence Selection
Datasets for Multilingual Answer Sentence Selection
Matteo Gabburo
S. Campese
Federico Agostini
Alessandro Moschitti
36
0
0
14 Jun 2024
mCSQA: Multilingual Commonsense Reasoning Dataset with Unified Creation
  Strategy by Language Models and Humans
mCSQA: Multilingual Commonsense Reasoning Dataset with Unified Creation Strategy by Language Models and Humans
Yusuke Sakai
Hidetaka Kamigaito
Taro Watanabe
LRM
38
2
0
06 Jun 2024
eXmY: A Data Type and Technique for Arbitrary Bit Precision Quantization
eXmY: A Data Type and Technique for Arbitrary Bit Precision Quantization
Aditya Agrawal
Matthew Hedlund
Blake A. Hechtman
MQ
23
4
0
22 May 2024
Long Context Alignment with Short Instructions and Synthesized Positions
Long Context Alignment with Short Instructions and Synthesized Positions
Wenhao Wu
Yizhong Wang
Yao Fu
Xiang Yue
Dawei Zhu
Sujian Li
SyDa
35
18
0
07 May 2024
UQA: Corpus for Urdu Question Answering
UQA: Corpus for Urdu Question Answering
Samee Arif
Sualeha Farid
Awais Athar
Agha Ali Raza
32
4
0
02 May 2024
HFT: Half Fine-Tuning for Large Language Models
HFT: Half Fine-Tuning for Large Language Models
Tingfeng Hui
Zhenyu Zhang
Shuohuan Wang
Weiran Xu
Yu Sun
Hua-Hong Wu
CLL
37
4
0
29 Apr 2024
TIGQA:An Expert Annotated Question Answering Dataset in Tigrinya
TIGQA:An Expert Annotated Question Answering Dataset in Tigrinya
Hailay Teklehaymanot
Dren Fazlija
Niloy Ganguly
Gourab K. Patro
Wolfgang Nejdl
27
0
0
26 Apr 2024
IndicGenBench: A Multilingual Benchmark to Evaluate Generation
  Capabilities of LLMs on Indic Languages
IndicGenBench: A Multilingual Benchmark to Evaluate Generation Capabilities of LLMs on Indic Languages
Harman Singh
Nitish Gupta
Shikhar Bharadwaj
Dinesh Tewari
Partha P. Talukdar
ELM
32
22
0
25 Apr 2024
Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language
  Models
Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models
Aitor Ormazabal
Che Zheng
Cyprien de Masson dÁutume
Dani Yogatama
Deyu Fu
...
Yazheng Yang
Yi Tay
Yuqi Wang
Zhongkai Zhu
Zhihui Xie
LRM
VLM
ReLM
28
47
0
18 Apr 2024
Language Model Cascades: Token-level uncertainty and beyond
Language Model Cascades: Token-level uncertainty and beyond
Neha Gupta
Harikrishna Narasimhan
Wittawat Jitkrittum
A. S. Rawat
A. Menon
Sanjiv Kumar
UQLM
41
42
0
15 Apr 2024
Rho-1: Not All Tokens Are What You Need
Rho-1: Not All Tokens Are What You Need
Zheng-Wen Lin
Zhibin Gou
Yeyun Gong
Xiao Liu
Yelong Shen
...
Chen Lin
Yujiu Yang
Jian Jiao
Nan Duan
Weizhu Chen
CLL
48
55
0
11 Apr 2024
CulturalTeaming: AI-Assisted Interactive Red-Teaming for Challenging
  LLMs' (Lack of) Multicultural Knowledge
CulturalTeaming: AI-Assisted Interactive Red-Teaming for Challenging LLMs' (Lack of) Multicultural Knowledge
Yu Ying Chiu
Amirhossein Ajalloeian
Maria Antoniak
Chan Young Park
Shuyue Stella Li
Mehar Bhatia
Sahithya Ravi
Yulia Tsvetkov
Vered Shwartz
Yejin Choi
36
20
0
10 Apr 2024
Multilingual Large Language Model: A Survey of Resources, Taxonomy and
  Frontiers
Multilingual Large Language Model: A Survey of Resources, Taxonomy and Frontiers
Libo Qin
Qiguang Chen
Yuhang Zhou
Zhi Chen
Yinghui Li
Lizi Liao
Min Li
Wanxiang Che
Philip S. Yu
LRM
47
36
0
07 Apr 2024
KazQAD: Kazakh Open-Domain Question Answering Dataset
KazQAD: Kazakh Open-Domain Question Answering Dataset
Rustem Yeshpanov
Pavel Efimov
Leonid Boytsov
Ardak Shalkarbayuli
Pavel Braslavski
RALM
28
4
0
06 Apr 2024
Sailor: Open Language Models for South-East Asia
Sailor: Open Language Models for South-East Asia
Longxu Dou
Qian Liu
Guangtao Zeng
Jia Guo
Jiahui Zhou
Wei Lu
Min-Bin Lin
LRM
32
7
0
04 Apr 2024
An Efficient Approach for Studying Cross-Lingual Transfer in
  Multilingual Language Models
An Efficient Approach for Studying Cross-Lingual Transfer in Multilingual Language Models
Fahim Faisal
Antonios Anastasopoulos
27
0
0
29 Mar 2024
COIG-CQIA: Quality is All You Need for Chinese Instruction Fine-tuning
COIG-CQIA: Quality is All You Need for Chinese Instruction Fine-tuning
Yuelin Bai
Xinrun Du
Yiming Liang
Yonggang Jin
Ziqiang Liu
...
Chenghua Lin
Jie Fu
Min Yang
Shiwen Ni
Ge Zhang
ALM
40
32
0
26 Mar 2024
DIALECTBENCH: A NLP Benchmark for Dialects, Varieties, and
  Closely-Related Languages
DIALECTBENCH: A NLP Benchmark for Dialects, Varieties, and Closely-Related Languages
Fahim Faisal
Orevaoghene Ahia
Aarohi Srivastava
Kabir Ahuja
David Chiang
Yulia Tsvetkov
Antonios Anastasopoulos
56
27
0
16 Mar 2024
Komodo: A Linguistic Expedition into Indonesia's Regional Languages
Komodo: A Linguistic Expedition into Indonesia's Regional Languages
Louis Owen
Vishesh Tripathi
Abhay Kumar
Biddwan Ahmed
ELM
27
7
0
14 Mar 2024
Cost-Performance Optimization for Processing Low-Resource Language Tasks
  Using Commercial LLMs
Cost-Performance Optimization for Processing Low-Resource Language Tasks Using Commercial LLMs
Arijit Nag
Animesh Mukherjee
Niloy Ganguly
Soumen Chakrabarti
33
2
0
08 Mar 2024
A Measure for Transparent Comparison of Linguistic Diversity in
  Multilingual NLP Data Sets
A Measure for Transparent Comparison of Linguistic Diversity in Multilingual NLP Data Sets
Tanja Samardzic
Ximena Gutierrez-Vasques
Christian Bentz
Steven Moran
Olga Pelloni
29
4
0
06 Mar 2024
Evaluating the Elementary Multilingual Capabilities of Large Language
  Models with MultiQ
Evaluating the Elementary Multilingual Capabilities of Large Language Models with MultiQ
Carolin Holtermann
Paul Röttger
Timm Dill
Anne Lauscher
ELM
LRM
32
22
0
06 Mar 2024
Previous
12345678
Next