Selective Question Answering under Domain Shift

16 June 2020

Robin Jia

Papers citing "Selective Question Answering under Domain Shift"

38 / 38 papers shown

Title
Don't lie to your friends: Learning what you know from collaborative self-play Jacob Eisenstein Reza Aghajani Adam Fisch Dheeru Dua Fantine Huot Mirella Lapata Vicky Zayats Jonathan Berant 72 0 0 18 Mar 2025
Teaching LLMs to Abstain across Languages via Multilingual Feedback Shangbin Feng Weijia Shi Yike Wang Wenxuan Ding Orevaoghene Ahia Shuyue Stella Li Vidhisha Balachandran Sunayana Sitaram Yulia Tsvetkov 69 4 0 22 Jun 2024
Can Tool-augmented Large Language Models be Aware of Incomplete Conditions? Seungbin Yang chaeHun Park Taehee Kim Jaegul Choo 44 2 0 18 Jun 2024
Detecting Multimodal Situations with Insufficient Context and Abstaining from Baseless Predictions Junzhang Liu Zhecan Wang Hammad A. Ayyubi Haoxuan You Chris Thomas Rui Sun Shih-Fu Chang Kai-Wei Chang 39 0 0 18 May 2024
Conformal Alignment: Knowing When to Trust Foundation Models with Guarantees Yu Gui Ying Jin Zhimei Ren MedIm 36 18 0 16 May 2024
Distinguishing the Knowable from the Unknowable with Language Models Gustaf Ahdritz Tian Qin Nikhil Vyas Boaz Barak Benjamin L. Edelman 26 18 0 05 Feb 2024
Narrowing the Knowledge Evaluation Gap: Open-Domain Question Answering with Multi-Granularity Answers G. Yona Roee Aharoni Mor Geva ELM 38 11 0 09 Jan 2024
Latent Feature-based Data Splits to Improve Generalisation Evaluation: A Hate Speech Detection Case Study Maike Zufle Verna Dankers Ivan Titov 37 0 0 16 Nov 2023
Llamas Know What GPTs Don't Show: Surrogate Models for Confidence Estimation Vaishnavi Shrivastava Percy Liang Ananya Kumar 18 28 0 15 Nov 2023
Using Weak Supervision and Data Augmentation in Question Answering Chumki Basu Binyuan Hui Allen McIntosh Wei Wang J. Wullert OOD 46 0 0 28 Sep 2023
Estimating Large Language Model Capabilities without Labeled Test Data Harvey Yiyun Fu Qinyuan Ye Albert Xu Xiang Ren Robin Jia 21 8 0 24 May 2023
Distinguish Sense from Nonsense: Out-of-Scope Detection for Virtual Assistants Cheng Qian Haode Qi Gengyu Wang L. Kunc Saloni Potdar 25 4 0 16 Jan 2023
A Call to Reflect on Evaluation Practices for Failure Detection in Image Classification Paul F. Jaeger Carsten T. Lüth Lukas Klein Till J. Bungert UQCV 26 35 0 28 Nov 2022
Can Open-Domain QA Reader Utilize External Knowledge Efficiently like Humans? Neeraj Varshney Man Luo Chitta Baral RALM 19 11 0 23 Nov 2022
Rich Knowledge Sources Bring Complex Knowledge Conflicts: Recalibrating Models to Reflect Conflicting Evidence Hung-Ting Chen Michael J.Q. Zhang Eunsol Choi RALM HILM 38 92 0 25 Oct 2022
Knowledge Transfer from Answer Ranking to Answer Generation Matteo Gabburo Rik Koncel-Kedziorski Siddhant Garg Luca Soldaini Alessandro Moschitti 25 7 0 23 Oct 2022
Pseudo-OOD training for robust language models Dhanasekar Sundararaman Nikhil Mehta Lawrence Carin 20 0 0 17 Oct 2022
State-of-the-art generalisation research in NLP: A taxonomy and review Dieuwke Hupkes Mario Giulianelli Verna Dankers Mikel Artetxe Yanai Elazar ... Leila Khalatbari Maria Ryskina Rita Frieske Ryan Cotterell Zhijing Jin 114 93 0 06 Oct 2022
Honest Students from Untrusted Teachers: Learning an Interpretable Question-Answering Pipeline from a Pretrained Language Model Jacob Eisenstein D. Andor Bernd Bohnet Michael Collins David M. Mimno LRM 189 24 0 05 Oct 2022
Understanding Prior Bias and Choice Paralysis in Transformer-based Language Representation Models through Four Experimental Probes Ke Shen M. Kejriwal 27 4 0 03 Oct 2022
Improving alignment of dialogue agents via targeted human judgements Amelia Glaese Nat McAleese Maja Trkebacz John Aslanides Vlad Firoiu ... John F. J. Mellor Demis Hassabis Koray Kavukcuoglu Lisa Anne Hendricks G. Irving ALM AAML 227 500 0 28 Sep 2022
Using contradictions improves question answering systems Étienne Fortier-Dubois Domenic Rosati 13 0 0 28 Sep 2022
On the Relation between Sensitivity and Accuracy in In-context Learning Yanda Chen Chen Zhao Zhou Yu Kathleen McKeown He He 182 77 0 16 Sep 2022
Calibrated Selective Classification Adam Fisch Tommi Jaakkola Regina Barzilay 23 16 0 25 Aug 2022
Augmenting Softmax Information for Selective Classification with Out-of-Distribution Data Guoxuan Xia C. Bouganis OODD 16 27 0 15 Jul 2022
Re-Examining Calibration: The Case of Question Answering Chenglei Si Chen Zhao Sewon Min Jordan L. Boyd-Graber 61 30 0 25 May 2022
On the Limitations of Dataset Balancing: The Lost Battle Against Spurious Correlations Roy Schwartz Gabriel Stanovsky 29 24 0 27 Apr 2022
Uncertainty Estimation for Language Reward Models Adam Gleave G. Irving UQLM 29 31 0 14 Mar 2022
Active Learning Over Multiple Domains in Natural Language Tasks Shayne Longpre Julia Reisler E. G. Huang Yi Lu Andrew J. Frank Nikhil Ramesh Chris DuBois OOD 19 13 0 01 Feb 2022
Attacking Open-domain Question Answering by Injecting Misinformation Liangming Pan Wenhu Chen Min-Yen Kan W. Wang HILM AAML 193 22 0 15 Oct 2021
Can Explanations Be Useful for Calibrating Black Box Models? Xi Ye Greg Durrett FAtt 24 25 0 14 Oct 2021
GOLD: Improving Out-of-Scope Detection in Dialogues using Data Augmentation Derek Chen Zhou Yu 19 31 0 07 Sep 2021
FaVIQ: FAct Verification from Information-seeking Questions Jungsoo Park Sewon Min Jaewoo Kang Luke Zettlemoyer Hannaneh Hajishirzi HILM 32 37 0 05 Jul 2021
How Robust are Model Rankings: A Leaderboard Customization Approach for Equitable Evaluation Swaroop Mishra Anjana Arunkumar 26 24 0 10 Jun 2021
Can NLI Models Verify QA Systems' Predictions? Jifan Chen Eunsol Choi Greg Durrett 23 54 0 18 Apr 2021
Challenges in Information-Seeking QA: Unanswerable Questions and Paragraph Retrieval Akari Asai Eunsol Choi RALM 37 51 0 22 Oct 2020
Simple and Scalable Predictive Uncertainty Estimation using Deep Ensembles Balaji Lakshminarayanan Alexander Pritzel Charles Blundell UQCV BDL 276 5,660 0 05 Dec 2016
Dropout as a Bayesian Approximation: Representing Model Uncertainty in Deep Learning Y. Gal Zoubin Ghahramani UQCV BDL 285 9,136 0 06 Jun 2015