2006.09462
Cited By
Selective Question Answering under Domain Shift
16 June 2020
Amita Kamath
Robin Jia
Percy Liang
OOD
Papers citing "Selective Question Answering under Domain Shift"
38 / 38 papers shown
Title
Don't lie to your friends: Learning what you know from collaborative self-play
Jacob Eisenstein
Reza Aghajani
Adam Fisch
Dheeru Dua
Fantine Huot
Mirella Lapata
Vicky Zayats
Jonathan Berant
72
0
0
18 Mar 2025
Teaching LLMs to Abstain across Languages via Multilingual Feedback
Shangbin Feng
Weijia Shi
Yike Wang
Wenxuan Ding
Orevaoghene Ahia
Shuyue Stella Li
Vidhisha Balachandran
Sunayana Sitaram
Yulia Tsvetkov
69
4
0
22 Jun 2024
Can Tool-augmented Large Language Models be Aware of Incomplete Conditions?
Seungbin Yang
ChaeHun Park
Taehee Kim
Jaegul Choo
44
2
0
18 Jun 2024
Detecting Multimodal Situations with Insufficient Context and Abstaining from Baseless Predictions
Junzhang Liu
Zhecan Wang
Hammad A. Ayyubi
Haoxuan You
Chris Thomas
Rui Sun
Shih-Fu Chang
Kai-Wei Chang
39
0
0
18 May 2024
Conformal Alignment: Knowing When to Trust Foundation Models with Guarantees
Yu Gui
Ying Jin
Zhimei Ren
MedIm
36
18
0
16 May 2024
Distinguishing the Knowable from the Unknowable with Language Models
Gustaf Ahdritz
Tian Qin
Nikhil Vyas
Boaz Barak
Benjamin L. Edelman
26
18
0
05 Feb 2024
Narrowing the Knowledge Evaluation Gap: Open-Domain Question Answering with Multi-Granularity Answers
G. Yona
Roee Aharoni
Mor Geva
ELM
38
11
0
09 Jan 2024
Latent Feature-based Data Splits to Improve Generalisation Evaluation: A Hate Speech Detection Case Study
Maike Züfle
Verna Dankers
Ivan Titov
37
0
0
16 Nov 2023
Llamas Know What GPTs Don't Show: Surrogate Models for Confidence Estimation
Vaishnavi Shrivastava
Percy Liang
Ananya Kumar
18
28
0
15 Nov 2023
Using Weak Supervision and Data Augmentation in Question Answering
Chumki Basu
Binyuan Hui
Allen McIntosh
Wei Wang
J. Wullert
OOD
46
0
0
28 Sep 2023
Estimating Large Language Model Capabilities without Labeled Test Data
Harvey Yiyun Fu
Qinyuan Ye
Albert Xu
Xiang Ren
Robin Jia
21
8
0
24 May 2023
Distinguish Sense from Nonsense: Out-of-Scope Detection for Virtual Assistants
Cheng Qian
Haode Qi
Gengyu Wang
L. Kunc
Saloni Potdar
25
4
0
16 Jan 2023
A Call to Reflect on Evaluation Practices for Failure Detection in Image Classification
Paul F. Jaeger
Carsten T. Lüth
Lukas Klein
Till J. Bungert
UQCV
26
35
0
28 Nov 2022
Can Open-Domain QA Reader Utilize External Knowledge Efficiently like Humans?
Neeraj Varshney
Man Luo
Chitta Baral
RALM
19
11
0
23 Nov 2022
Rich Knowledge Sources Bring Complex Knowledge Conflicts: Recalibrating Models to Reflect Conflicting Evidence
Hung-Ting Chen
Michael J.Q. Zhang
Eunsol Choi
RALM
HILM
38
92
0
25 Oct 2022
Knowledge Transfer from Answer Ranking to Answer Generation
Matteo Gabburo
Rik Koncel-Kedziorski
Siddhant Garg
Luca Soldaini
Alessandro Moschitti
25
7
0
23 Oct 2022
Pseudo-OOD training for robust language models
Dhanasekar Sundararaman
Nikhil Mehta
Lawrence Carin
20
0
0
17 Oct 2022
State-of-the-art generalisation research in NLP: A taxonomy and review
Dieuwke Hupkes
Mario Giulianelli
Verna Dankers
Mikel Artetxe
Yanai Elazar
...
Leila Khalatbari
Maria Ryskina
Rita Frieske
Ryan Cotterell
Zhijing Jin
114
93
0
06 Oct 2022
Honest Students from Untrusted Teachers: Learning an Interpretable Question-Answering Pipeline from a Pretrained Language Model
Jacob Eisenstein
D. Andor
Bernd Bohnet
Michael Collins
David M. Mimno
LRM
189
24
0
05 Oct 2022
Understanding Prior Bias and Choice Paralysis in Transformer-based Language Representation Models through Four Experimental Probes
Ke Shen
M. Kejriwal
27
4
0
03 Oct 2022
Improving alignment of dialogue agents via targeted human judgements
Amelia Glaese
Nat McAleese
Maja Trębacz
John Aslanides
Vlad Firoiu
...
John F. J. Mellor
Demis Hassabis
Koray Kavukcuoglu
Lisa Anne Hendricks
G. Irving
ALM
AAML
227
500
0
28 Sep 2022
Using contradictions improves question answering systems
Étienne Fortier-Dubois
Domenic Rosati
13
0
0
28 Sep 2022
On the Relation between Sensitivity and Accuracy in In-context Learning
Yanda Chen
Chen Zhao
Zhou Yu
Kathleen McKeown
He He
182
77
0
16 Sep 2022
Calibrated Selective Classification
Adam Fisch
Tommi Jaakkola
Regina Barzilay
23
16
0
25 Aug 2022
Augmenting Softmax Information for Selective Classification with Out-of-Distribution Data
Guoxuan Xia
C. Bouganis
OODD
16
27
0
15 Jul 2022
Re-Examining Calibration: The Case of Question Answering
Chenglei Si
Chen Zhao
Sewon Min
Jordan L. Boyd-Graber
61
30
0
25 May 2022
On the Limitations of Dataset Balancing: The Lost Battle Against Spurious Correlations
Roy Schwartz
Gabriel Stanovsky
29
24
0
27 Apr 2022
Uncertainty Estimation for Language Reward Models
Adam Gleave
G. Irving
UQLM
29
31
0
14 Mar 2022
Active Learning Over Multiple Domains in Natural Language Tasks
Shayne Longpre
Julia Reisler
E. G. Huang
Yi Lu
Andrew J. Frank
Nikhil Ramesh
Chris DuBois
OOD
19
13
0
01 Feb 2022
Attacking Open-domain Question Answering by Injecting Misinformation
Liangming Pan
Wenhu Chen
Min-Yen Kan
W. Wang
HILM
AAML
193
22
0
15 Oct 2021
Can Explanations Be Useful for Calibrating Black Box Models?
Xi Ye
Greg Durrett
FAtt
24
25
0
14 Oct 2021
GOLD: Improving Out-of-Scope Detection in Dialogues using Data Augmentation
Derek Chen
Zhou Yu
19
31
0
07 Sep 2021
FaVIQ: FAct Verification from Information-seeking Questions
Jungsoo Park
Sewon Min
Jaewoo Kang
Luke Zettlemoyer
Hannaneh Hajishirzi
HILM
32
37
0
05 Jul 2021
How Robust are Model Rankings: A Leaderboard Customization Approach for Equitable Evaluation
Swaroop Mishra
Anjana Arunkumar
26
24
0
10 Jun 2021
Can NLI Models Verify QA Systems' Predictions?
Jifan Chen
Eunsol Choi
Greg Durrett
23
54
0
18 Apr 2021
Challenges in Information-Seeking QA: Unanswerable Questions and Paragraph Retrieval
Akari Asai
Eunsol Choi
RALM
37
51
0
22 Oct 2020
Simple and Scalable Predictive Uncertainty Estimation using Deep Ensembles
Balaji Lakshminarayanan
Alexander Pritzel
Charles Blundell
UQCV
BDL
276
5,660
0
05 Dec 2016
Dropout as a Bayesian Approximation: Representing Model Uncertainty in Deep Learning
Y. Gal
Zoubin Ghahramani
UQCV
BDL
285
9,136
0
06 Jun 2015