Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2003.05002
Cited By
TyDi QA: A Benchmark for Information-Seeking Question Answering in Typologically Diverse Languages
10 March 2020
J. Clark
Eunsol Choi
Michael Collins
Dan Garrette
Tom Kwiatkowski
Vitaly Nikolaev
J. Palomaki
Re-assign community
ArXiv
PDF
HTML
Papers citing
"TyDi QA: A Benchmark for Information-Seeking Question Answering in Typologically Diverse Languages"
50 / 400 papers shown
Title
Cross-Lingual QA as a Stepping Stone for Monolingual Open QA in Icelandic
Vésteinn Snæbjarnarson
H. Einarsson
22
5
0
05 Jul 2022
MIA 2022 Shared Task: Evaluating Cross-lingual Open-Retrieval Question Answering for 16 Diverse Languages
Akari Asai
Shayne Longpre
Jungo Kasai
Chia-Hsuan Lee
Rui Zhang
Junjie Hu
Ikuya Yamada
J. Clark
Eunsol Choi
LRM
17
14
0
02 Jul 2022
Questions Are All You Need to Train a Dense Passage Retriever
Devendra Singh Sachan
M. Lewis
Dani Yogatama
Luke Zettlemoyer
J. Pineau
Manzil Zaheer
RALM
19
53
0
21 Jun 2022
Square One Bias in NLP: Towards a Multi-Dimensional Exploration of the Research Manifold
Sebastian Ruder
Ivan Vulić
Anders Søgaard
33
25
0
20 Jun 2022
GAAMA 2.0: An Integrated System that Answers Boolean and Extractive Questions
Scott McCarley
Mihaela A. Bornea
Sara Rosenthal
Anthony Ferritto
Md Arafat Sultan
Avirup Sil
Radu Florian
12
1
0
16 Jun 2022
Evaluating the Diversity, Equity and Inclusion of NLP Technology: A Case Study for Indian Languages
Simran Khanuja
Sebastian Ruder
Partha P. Talukdar
32
16
0
25 May 2022
Overcoming Catastrophic Forgetting in Zero-Shot Cross-Lingual Generation
Tu Vu
Aditya Barua
Brian Lester
Daniel Matthew Cer
Mohit Iyyer
Noah Constant
CLL
16
64
0
25 May 2022
Investigating Information Inconsistency in Multilingual Open-Domain Question Answering
Shramay Palta
Haozhe An
Yifan Yang
Shuaiyi Huang
Maharshi Gor
34
1
0
25 May 2022
BanglaNLG and BanglaT5: Benchmarks and Resources for Evaluating Low-Resource Natural Language Generation in Bangla
Abhik Bhattacharjee
Tahmid Hasan
Wasi Uddin Ahmad
Rifat Shahriyar
AIMat
LM&MA
18
28
0
23 May 2022
Towards Debiasing Translation Artifacts
Koel Dutta Chowdhury
Rricha Jalota
C. España-Bonet
Josef van Genabith
23
6
0
16 May 2022
Beyond Static Models and Test Sets: Benchmarking the Potential of Pre-trained Models Across Tasks and Languages
Kabir Ahuja
Sandipan Dandapat
Sunayana Sitaram
Monojit Choudhury
LRM
39
16
0
12 May 2022
On the Economics of Multilingual Few-shot Learning: Modeling the Cost-Performance Trade-offs of Machine Translated and Manual Data
Kabir Ahuja
Monojit Choudhury
Sandipan Dandapat
19
3
0
12 May 2022
Multi Task Learning For Zero Shot Performance Prediction of Multilingual Models
Kabir Ahuja
Shanu Kumar
Sandipan Dandapat
Monojit Choudhury
9
25
0
12 May 2022
Enhancing Cross-lingual Transfer by Manifold Mixup
Huiyun Yang
Huadong Chen
Hao Zhou
Lei Li
AAML
10
44
0
09 May 2022
KenSwQuAD -- A Question Answering Dataset for Swahili Low Resource Language
B. Wanjawa
Lilian D. A. Wanzare
F. Indede
Owen McOnyango
Lawrence Muchemi
Edward Ombui
21
19
0
04 May 2022
Don't Blame the Annotator: Bias Already Starts in the Annotation Instructions
Mihir Parmar
Swaroop Mishra
Mor Geva
Chitta Baral
28
55
0
01 May 2022
Polyglot Prompt: Multilingual Multitask PrompTraining
Jinlan Fu
See-Kiong Ng
Pengfei Liu
17
7
0
29 Apr 2022
Por Qué Não Utiliser Alla Språk? Mixed Training with Gradient Optimization in Few-Shot Cross-Lingual Transfer
Haoran Xu
Kenton W. Murray
16
12
0
29 Apr 2022
On the Representation Collapse of Sparse Mixture of Experts
Zewen Chi
Li Dong
Shaohan Huang
Damai Dai
Shuming Ma
...
Payal Bajaj
Xia Song
Xian-Ling Mao
Heyan Huang
Furu Wei
MoMe
MoE
37
96
0
20 Apr 2022
WikiOmnia: generative QA corpus on the whole Russian Wikipedia
D. Pisarevskaya
Tatiana Shavrina
16
2
0
17 Apr 2022
Multilingual Event Linking to Wikidata
Adithya Pratapa
Rishubh Gupta
Teruko Mitamura
19
7
0
13 Apr 2022
The Impact of Cross-Lingual Adjustment of Contextual Word Representations on Zero-Shot Transfer
Pavel Efimov
Leonid Boytsov
E. Arslanova
Pavel Braslavski
22
7
0
13 Apr 2022
MuCoT: Multilingual Contrastive Training for Question-Answering in Low-resource Languages
Gokul Karthik Kumar
Abhishek Singh Gehlot
Sahal Shaji Mullappilly
Karthik Nandakumar
26
13
0
12 Apr 2022
Breaking Character: Are Subwords Good Enough for MRLs After All?
Omri Keren
Tal Avinari
Reut Tsarfaty
Omer Levy
25
15
0
10 Apr 2022
Probing Structured Pruning on Multilingual Pre-trained Models: Settings, Algorithms, and Efficiency
Yanyang Li
Fuli Luo
Runxin Xu
Songfang Huang
Fei Huang
Liwei Wang
25
3
0
06 Apr 2022
Towards Best Practices for Training Multilingual Dense Retrieval Models
Xinyu Crystina Zhang
Kelechi Ogueji
Xueguang Ma
Jimmy J. Lin
RALM
22
34
0
05 Apr 2022
PaLM: Scaling Language Modeling with Pathways
Aakanksha Chowdhery
Sharan Narang
Jacob Devlin
Maarten Bosma
Gaurav Mishra
...
Kathy Meier-Hellstern
Douglas Eck
J. Dean
Slav Petrov
Noah Fiedel
PILM
LRM
80
5,983
0
05 Apr 2022
Learning Disentangled Semantic Representations for Zero-Shot Cross-Lingual Transfer in Multilingual Machine Reading Comprehension
injuan Wu
Shaojuan Wu
Xiaowang Zhang
Deyi Xiong
Shizhan Chen
Zhiqiang Zhuang
Zhiyong Feng
14
13
0
03 Apr 2022
One Country, 700+ Languages: NLP Challenges for Underrepresented Languages and Dialects in Indonesia
Alham Fikri Aji
Genta Indra Winata
Fajri Koto
Samuel Cahyawijaya
Ade Romadhony
...
David Moeljadi
Radityo Eko Prasojo
Timothy Baldwin
Jey Han Lau
Sebastian Ruder
38
98
0
24 Mar 2022
XTREME-S: Evaluating Cross-lingual Speech Representations
Alexis Conneau
Ankur Bapna
Yu Zhang
Min Ma
Patrick von Platen
...
Orhan Firat
Michael Auli
Sebastian Ruder
Jason Riesa
Melvin Johnson
VLM
AILaw
ELM
48
22
0
21 Mar 2022
Meta-X
N
L
G
_{NLG}
N
L
G
: A Meta-Learning Approach Based on Language Clustering for Zero-Shot Cross-Lingual Transfer and Generation
Kaushal Kumar Maurya
M. Desarkar
9
8
0
19 Mar 2022
Combining Static and Contextualised Multilingual Embeddings
Katharina Hämmerl
Jindrich Libovický
Alexander M. Fraser
23
10
0
17 Mar 2022
MCoNaLa: A Benchmark for Code Generation from Multiple Natural Languages
Zhiruo Wang
Grace Cuenca
Shuyan Zhou
Frank F. Xu
Graham Neubig
19
49
0
16 Mar 2022
Tevatron: An Efficient and Flexible Toolkit for Dense Retrieval
Luyu Gao
Xueguang Ma
Jimmy J. Lin
Jamie Callan
22
75
0
11 Mar 2022
IndicNLG Benchmark: Multilingual Datasets for Diverse NLG Tasks in Indic Languages
Aman Kumar
Himani Shrotriya
P. Sahu
Raj Dabre
Ratish Puduppully
Anoop Kunchukuttan
Amogh Mishra
Mitesh M. Khapra
Pratyush Kumar
38
38
0
10 Mar 2022
What are the best systems? New perspectives on NLP Benchmarking
Pierre Colombo
Nathan Noiry
Ekhine Irurozki
Stéphan Clémençon
27
28
0
08 Feb 2022
Pirá: A Bilingual Portuguese-English Dataset for Question-Answering about the Ocean
André F. A. Paschoal
Paulo Pirozelli
Valdinei Freire
K. V. Delgado
S. M. Peres
...
Flávio Nakasato
A. Oliveira
A. Brandão
A. H. R. Costa
Fabio Gagliardi Cozman
RALM
19
11
0
04 Feb 2022
Cross-Lingual Dialogue Dataset Creation via Outline-Based Generation
Olga Majewska
E. Razumovskaia
E. Ponti
Ivan Vulić
Anna Korhonen
30
28
0
31 Jan 2022
Leaf: Multiple-Choice Question Generation
Kristiyan Vachev
Momchil Hardalov
Georgi Karadzhov
Georgi Georgiev
Ivan Koychev
Preslav Nakov
AI4Ed
25
21
0
22 Jan 2022
A Survey on non-English Question Answering Dataset
Andrea Chandra
Affandy Fahrizain
Ibrahim
Simon Willyanto Laufried
16
10
0
27 Dec 2021
Unsupervised Dense Information Retrieval with Contrastive Learning
Gautier Izacard
Mathilde Caron
Lucas Hosseini
Sebastian Riedel
Piotr Bojanowski
Armand Joulin
Edouard Grave
RALM
24
807
0
16 Dec 2021
Learning to Transpile AMR into SPARQL
Mihaela A. Bornea
Ramón Fernández Astudillo
Tahira Naseem
Nandana Mihindukulasooriya
Ibrahim Abdelaziz
Pavan Kapanipathi
Radu Florian
Salim Roukos
34
6
0
15 Dec 2021
Do Answers to Boolean Questions Need Explanations? Yes
Sara Rosenthal
Mihaela A. Bornea
Avirup Sil
Radu Florian
Scott McCarley
17
4
0
14 Dec 2021
Extending the WILDS Benchmark for Unsupervised Adaptation
Shiori Sagawa
Pang Wei Koh
Tony Lee
Irena Gao
Sang Michael Xie
...
Kate Saenko
Tatsunori Hashimoto
Sergey Levine
Chelsea Finn
Percy Liang
OOD
11
97
0
09 Dec 2021
Semantic Search as Extractive Paraphrase Span Detection
Jenna Kanerva
Hanna Kitti
Li-Hsin Chang
Teemu Vahtola
Mathias Creutz
Filip Ginter
19
2
0
09 Dec 2021
Question Answering Survey: Directions, Challenges, Datasets, Evaluation Matrices
Hariom A. Pandya
Brijesh S. Bhatt
36
27
0
07 Dec 2021
Dataset Geography: Mapping Language Data to Language Users
Fahim Faisal
Yinkai Wang
Antonios Anastasopoulos
54
23
0
07 Dec 2021
Zero-Shot Cross-Lingual Machine Reading Comprehension via Inter-sentence Dependency Graph
Liyan Xu
Xuchao Zhang
Bo Zong
Yanchi Liu
Wei Cheng
Jingchao Ni
Haifeng Chen
Liang Zhao
Jinho D. Choi
32
4
0
01 Dec 2021
Enhancing Multilingual Language Model with Massive Multilingual Knowledge Triples
Linlin Liu
Xin Li
Ruidan He
Lidong Bing
Shafiq R. Joty
Luo Si
KELM
35
18
0
22 Nov 2021
Cross-lingual Adaption Model-Agnostic Meta-Learning for Natural Language Understanding
Qianying Liu
Fei Cheng
Sadao Kurohashi
9
1
0
10 Nov 2021
Previous
1
2
3
4
5
6
7
8
Next