What Can We Learn from Collective Human Opinions on Natural Language Inference Data?

7 October 2020

Papers citing "What Can We Learn from Collective Human Opinions on Natural Language Inference Data?"

27 / 27 papers shown

Title
Always Tell Me The Odds: Fine-grained Conditional Probability Estimation Liaoyaqi Wang Zhengping Jiang Anqi Liu Benjamin Van Durme 61 0 0 02 May 2025
Validating LLM-as-a-Judge Systems in the Absence of Gold Labels Luke M. Guerdan Solon Barocas Kenneth Holstein Hanna M. Wallach Zhiwei Steven Wu Alexandra Chouldechova ALM ELM 203 0 0 13 Mar 2025
Fine-grained Fallacy Detection with Human Label Variation Alan Ramponi Agnese Daffara Sara Tonelli 54 1 0 20 Feb 2025
Training and Evaluating with Human Label Variation: An Empirical Study K. K. Meladel Mistica Timothy Baldwin Jey Han Lau 65 0 0 03 Feb 2025
LLMs as Workers in Human-Computational Algorithms? Replicating Crowdsourcing Pipelines with LLMs Tongshuang Wu Haiyi Zhu Maya Albayrak Alexis Axon Amanda Bertsch ... Ying-Jui Tseng Patricia Vaidos Zhijin Wu Wei Yu Wu Chenyang Yang 83 30 0 10 Jan 2025
Conformalized Credal Regions for Classification with Ambiguous Ground Truth Michele Caprio David Stutz Shuo Li Arnaud Doucet UQCV 64 4 0 07 Nov 2024
Modeling Future Conversation Turns to Teach LLMs to Ask Clarifying Questions Michael J.Q. Zhang W. Bradley Knox Eunsol Choi 48 3 0 17 Oct 2024
SemEval-2024 Shared Task 6: SHROOM, a Shared-task on Hallucinations and Related Observable Overgeneration Mistakes Timothee Mickus Elaine Zosa Raúl Vázquez Teemu Vahtola Jörg Tiedemann Vincent Segonne Alessandro Raganato Marianna Apidianaki HILM LRM 35 20 0 12 Mar 2024
Interpretation modeling: Social grounding of sentences by reasoning over their implicit moral judgments Liesbeth Allein Maria Mihaela Trucscva Marie-Francine Moens 22 1 0 27 Nov 2023
Collective Human Opinions in Semantic Textual Similarity Yuxia Wang Shimin Tao Ning Xie Hao-Yu Yang Timothy Baldwin Karin Verspoor 26 4 0 08 Aug 2023
No Strong Feelings One Way or Another: Re-operationalizing Neutrality in Natural Language Inference Animesh Nighojkar Antonio Laverghetta John Licato 28 4 0 16 Jun 2023
Deep Model Compression Also Helps Models Capture Ambiguity Hancheol Park Jong C. Park 27 1 0 12 Jun 2023
Understanding and Predicting Human Label Variation in Natural Language Inference through Explanation Nan-Jiang Jiang Chenhao Tan M. Marneffe 27 2 0 24 Apr 2023
Uncertainty-Aware Natural Language Inference with Stochastic Weight Averaging Aarne Talman H. Çelikkanat Sami Virpioja Markus Heinonen Jörg Tiedemann BDL UQCV 26 7 0 10 Apr 2023
Design Choices for Crowdsourcing Implicit Discourse Relations: Revealing the Biases Introduced by Task Design Valentina Pyatkin Frances Yung Merel C. J. Scholman Reut Tsarfaty Ido Dagan Vera Demberg 19 12 0 03 Apr 2023
Investigating Multi-source Active Learning for Natural Language Inference Ard Snijders Douwe Kiela Katerina Margatina 24 7 0 14 Feb 2023
Multi-Scales Data Augmentation Approach In Natural Language Inference For Artifacts Mitigation And Pre-Trained Model Optimization Zhenyu Lu 13 1 0 16 Dec 2022
The 'Problem' of Human Label Variation: On Ground Truth in Data, Modeling and Evaluation Barbara Plank 30 97 0 04 Nov 2022
Stop Measuring Calibration When Humans Disagree Joris Baan Wilker Aziz Barbara Plank Raquel Fernández 24 53 0 28 Oct 2022
Investigating Reasons for Disagreement in Natural Language Inference Nan-Jiang Jiang M. Marneffe 19 26 0 07 Sep 2022
ALLSH: Active Learning Guided by Local Sensitivity and Hardness Shujian Zhang Chengyue Gong Xingchao Liu Pengcheng He Weizhu Chen Mingyuan Zhou 25 26 0 10 May 2022
Reducing Model Jitter: Stable Re-training of Semantic Parsers in Production Environments Christopher Hidey Fei Liu Rahul Goel 21 4 0 10 Apr 2022
What Makes Reading Comprehension Questions Difficult? Saku Sugawara Nikita Nangia Alex Warstadt Sam Bowman ELM RALM 20 13 0 12 Mar 2022
Natural Language Deduction through Search over Statement Compositions Kaj Bostrom Zayne Sprague Swarat Chaudhuri Greg Durrett ReLM LRM 24 46 0 16 Jan 2022
Adversarially Constructed Evaluation Sets Are More Challenging, but May Not Be Fair Jason Phang Angelica Chen William Huang Samuel R. Bowman AAML 28 13 0 16 Nov 2021
Can Transformer Language Models Predict Psychometric Properties? Antonio Laverghetta Animesh Nighojkar Jamshidbek Mirzakhalov John Licato LM&MA 30 14 0 12 Jun 2021
ANLIzing the Adversarial Natural Language Inference Dataset Adina Williams Tristan Thrush Douwe Kiela AAML 166 45 0 24 Oct 2020