2010.03532
Cited By
What Can We Learn from Collective Human Opinions on Natural Language Inference Data?
7 October 2020
Yixin Nie
Xiang Zhou
Mohit Bansal
Papers citing
"What Can We Learn from Collective Human Opinions on Natural Language Inference Data?"
27 / 27 papers shown
Title
Always Tell Me The Odds: Fine-grained Conditional Probability Estimation
Liaoyaqi Wang
Zhengping Jiang
Anqi Liu
Benjamin Van Durme
61
0
0
02 May 2025
Validating LLM-as-a-Judge Systems in the Absence of Gold Labels
Luke M. Guerdan
Solon Barocas
Kenneth Holstein
Hanna M. Wallach
Zhiwei Steven Wu
Alexandra Chouldechova
ALM
ELM
203
0
0
13 Mar 2025
Fine-grained Fallacy Detection with Human Label Variation
Alan Ramponi
Agnese Daffara
Sara Tonelli
54
1
0
20 Feb 2025
Training and Evaluating with Human Label Variation: An Empirical Study
K. K.
Meladel Mistica
Timothy Baldwin
Jey Han Lau
65
0
0
03 Feb 2025
LLMs as Workers in Human-Computational Algorithms? Replicating Crowdsourcing Pipelines with LLMs
Tongshuang Wu
Haiyi Zhu
Maya Albayrak
Alexis Axon
Amanda Bertsch
...
Ying-Jui Tseng
Patricia Vaidos
Zhijin Wu
Wei Yu Wu
Chenyang Yang
83
30
0
10 Jan 2025
Conformalized Credal Regions for Classification with Ambiguous Ground Truth
Michele Caprio
David Stutz
Shuo Li
Arnaud Doucet
UQCV
64
4
0
07 Nov 2024
Modeling Future Conversation Turns to Teach LLMs to Ask Clarifying Questions
Michael J.Q. Zhang
W. Bradley Knox
Eunsol Choi
48
3
0
17 Oct 2024
SemEval-2024 Shared Task 6: SHROOM, a Shared-task on Hallucinations and Related Observable Overgeneration Mistakes
Timothee Mickus
Elaine Zosa
Raúl Vázquez
Teemu Vahtola
Jörg Tiedemann
Vincent Segonne
Alessandro Raganato
Marianna Apidianaki
HILM
LRM
35
20
0
12 Mar 2024
Interpretation modeling: Social grounding of sentences by reasoning over their implicit moral judgments
Liesbeth Allein
Maria Mihaela Trușcă
Marie-Francine Moens
22
1
0
27 Nov 2023
Collective Human Opinions in Semantic Textual Similarity
Yuxia Wang
Shimin Tao
Ning Xie
Hao-Yu Yang
Timothy Baldwin
Karin Verspoor
26
4
0
08 Aug 2023
No Strong Feelings One Way or Another: Re-operationalizing Neutrality in Natural Language Inference
Animesh Nighojkar
Antonio Laverghetta
John Licato
28
4
0
16 Jun 2023
Deep Model Compression Also Helps Models Capture Ambiguity
Hancheol Park
Jong C. Park
27
1
0
12 Jun 2023
Understanding and Predicting Human Label Variation in Natural Language Inference through Explanation
Nan-Jiang Jiang
Chenhao Tan
M. Marneffe
27
2
0
24 Apr 2023
Uncertainty-Aware Natural Language Inference with Stochastic Weight Averaging
Aarne Talman
H. Çelikkanat
Sami Virpioja
Markus Heinonen
Jörg Tiedemann
BDL
UQCV
26
7
0
10 Apr 2023
Design Choices for Crowdsourcing Implicit Discourse Relations: Revealing the Biases Introduced by Task Design
Valentina Pyatkin
Frances Yung
Merel C. J. Scholman
Reut Tsarfaty
Ido Dagan
Vera Demberg
19
12
0
03 Apr 2023
Investigating Multi-source Active Learning for Natural Language Inference
Ard Snijders
Douwe Kiela
Katerina Margatina
24
7
0
14 Feb 2023
Multi-Scales Data Augmentation Approach In Natural Language Inference For Artifacts Mitigation And Pre-Trained Model Optimization
Zhenyu Lu
13
1
0
16 Dec 2022
The 'Problem' of Human Label Variation: On Ground Truth in Data, Modeling and Evaluation
Barbara Plank
30
97
0
04 Nov 2022
Stop Measuring Calibration When Humans Disagree
Joris Baan
Wilker Aziz
Barbara Plank
Raquel Fernández
24
53
0
28 Oct 2022
Investigating Reasons for Disagreement in Natural Language Inference
Nan-Jiang Jiang
M. Marneffe
19
26
0
07 Sep 2022
ALLSH: Active Learning Guided by Local Sensitivity and Hardness
Shujian Zhang
Chengyue Gong
Xingchao Liu
Pengcheng He
Weizhu Chen
Mingyuan Zhou
25
26
0
10 May 2022
Reducing Model Jitter: Stable Re-training of Semantic Parsers in Production Environments
Christopher Hidey
Fei Liu
Rahul Goel
21
4
0
10 Apr 2022
What Makes Reading Comprehension Questions Difficult?
Saku Sugawara
Nikita Nangia
Alex Warstadt
Sam Bowman
ELM
RALM
20
13
0
12 Mar 2022
Natural Language Deduction through Search over Statement Compositions
Kaj Bostrom
Zayne Sprague
Swarat Chaudhuri
Greg Durrett
ReLM
LRM
24
46
0
16 Jan 2022
Adversarially Constructed Evaluation Sets Are More Challenging, but May Not Be Fair
Jason Phang
Angelica Chen
William Huang
Samuel R. Bowman
AAML
28
13
0
16 Nov 2021
Can Transformer Language Models Predict Psychometric Properties?
Antonio Laverghetta
Animesh Nighojkar
Jamshidbek Mirzakhalov
John Licato
LM&MA
30
14
0
12 Jun 2021
ANLIzing the Adversarial Natural Language Inference Dataset
Adina Williams
Tristan Thrush
Douwe Kiela
AAML
166
45
0
24 Oct 2020