ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2005.01348
  4. Cited By
The Sensitivity of Language Models and Humans to Winograd Schema
  Perturbations
v1v2 (latest)

The Sensitivity of Language Models and Humans to Winograd Schema Perturbations

Annual Meeting of the Association for Computational Linguistics (ACL), 2020
4 May 2020
Mostafa Abdou
Vinit Ravishankar
Maria Barrett
Yonatan Belinkov
Desmond Elliott
Anders Søgaard
    ReLMLRM
ArXiv (abs)PDFHTML

Papers citing "The Sensitivity of Language Models and Humans to Winograd Schema Perturbations"

21 / 21 papers shown
Not quite Sherlock Holmes: Language model predictions do not reliably differentiate impossible from improbable events
Not quite Sherlock Holmes: Language model predictions do not reliably differentiate impossible from improbable eventsAnnual Meeting of the Association for Computational Linguistics (ACL), 2025
J. Michaelov
Reeka Estacio
Zhien Zhang
Benjamin Bergen
ReLMLRM
250
2
0
07 Jun 2025
WinoWhat: A Parallel Corpus of Paraphrased WinoGrande Sentences with Common Sense Categorization
WinoWhat: A Parallel Corpus of Paraphrased WinoGrande Sentences with Common Sense Categorization
I. Gevers
Victor De Marez
Luna De Bruyne
Walter Daelemans
390
2
0
31 Mar 2025
WinoPron: Revisiting English Winogender Schemas for Consistency,
  Coverage, and Grammatical Case
WinoPron: Revisiting English Winogender Schemas for Consistency, Coverage, and Grammatical Case
Vagrant Gautam
Julius Steuer
Eileen Bingert
Ray Johns
Anne Lauscher
Dietrich Klakow
422
10
0
09 Sep 2024
Robust Pronoun Fidelity with English LLMs: Are they Reasoning,
  Repeating, or Just Biased?
Robust Pronoun Fidelity with English LLMs: Are they Reasoning, Repeating, or Just Biased?Transactions of the Association for Computational Linguistics (TACL), 2024
Vagrant Gautam
Eileen Bingert
D. Zhu
Anne Lauscher
Dietrich Klakow
410
18
0
04 Apr 2024
EvoGrad: A Dynamic Take on the Winograd Schema Challenge with Human
  Adversaries
EvoGrad: A Dynamic Take on the Winograd Schema Challenge with Human Adversaries
Jing Han Sun
Ali Emami
377
6
0
20 Feb 2024
CASE: Commonsense-Augmented Score with an Expanded Answer Space
CASE: Commonsense-Augmented Score with an Expanded Answer SpaceConference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Wenkai Chen
Sahithya Ravi
Vered Shwartz
244
0
0
03 Nov 2023
BRAINTEASER: Lateral Thinking Puzzles for Large Language Models
BRAINTEASER: Lateral Thinking Puzzles for Large Language ModelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Yifan Jiang
Filip Ilievski
Kaixin Ma
Zhivar Sourati
LRMReLM
395
17
0
08 Oct 2023
Causal interventions expose implicit situation models for commonsense
  language understanding
Causal interventions expose implicit situation models for commonsense language understandingAnnual Meeting of the Association for Computational Linguistics (ACL), 2023
Takateru Yamakoshi
James L. McClelland
A. Goldberg
Robert D. Hawkins
350
10
0
06 Jun 2023
Few-Shot Dialogue Summarization via Skeleton-Assisted Prompt Transfer in
  Prompt Tuning
Few-Shot Dialogue Summarization via Skeleton-Assisted Prompt Transfer in Prompt TuningConference of the European Chapter of the Association for Computational Linguistics (EACL), 2023
Kaige Xie
Tong Yu
Haoliang Wang
Junda Wu
Handong Zhao
Ruiyi Zhang
K. Mahadik
A. Nenkova
Mark O. Riedl
387
4
0
20 May 2023
Event knowledge in large language models: the gap between the impossible
  and the unlikely
Event knowledge in large language models: the gap between the impossible and the unlikelyCognitive Sciences (CS), 2022
Carina Kauf
Anna A. Ivanova
Giulia Rambelli
Emmanuele Chersoni
Jingyuan Selena She
Zawad Chowdhury
Evelina Fedorenko
Alessandro Lenci
579
92
0
02 Dec 2022
An Empirical Investigation of Commonsense Self-Supervision with
  Knowledge Graphs
An Empirical Investigation of Commonsense Self-Supervision with Knowledge Graphs
Jiarui Zhang
Filip Ilievski
Kaixin Ma
Jonathan M Francis
A. Oltramari
SSL
158
5
0
21 May 2022
Generalized Quantifiers as a Source of Error in Multilingual NLU
  Benchmarks
Generalized Quantifiers as a Source of Error in Multilingual NLU Benchmarks
Ruixiang Cui
Daniel Hershcovich
Anders Søgaard
304
17
0
22 Apr 2022
Testing the limits of natural language models for predicting human
  language judgments
Testing the limits of natural language models for predicting human language judgmentsNature Machine Intelligence (Nat. Mach. Intell.), 2022
Tal Golan
Matthew Siegelman
N. Kriegeskorte
Christopher A. Baldassano
346
21
0
07 Apr 2022
Hierarchical Interpretation of Neural Text Classification
Hierarchical Interpretation of Neural Text ClassificationComputational Linguistics (CL), 2022
Hanqi Yan
Lin Gui
Yulan He
397
17
0
20 Feb 2022
An Application of Pseudo-Log-Likelihoods to Natural Language Scoring
An Application of Pseudo-Log-Likelihoods to Natural Language Scoring
Darren Abramson
Ali Emami
274
3
0
23 Jan 2022
Towards Zero-shot Commonsense Reasoning with Self-supervised Refinement
  of Language Models
Towards Zero-shot Commonsense Reasoning with Self-supervised Refinement of Language Models
T. Klein
Moin Nabi
ReLMLRM
183
9
0
10 Sep 2021
Transformers in the loop: Polarity in neural models of language
Transformers in the loop: Polarity in neural models of languageAnnual Meeting of the Association for Computational Linguistics (ACL), 2021
Lisa Bylinina
Alexey Tikhonov
157
0
0
08 Sep 2021
John praised Mary because he? Implicit Causality Bias and Its
  Interaction with Explicit Cues in LMs
John praised Mary because he? Implicit Causality Bias and Its Interaction with Explicit Cues in LMsFindings (Findings), 2021
Yova Kementchedjhieva
Mark Anderson
Anders Søgaard
176
14
0
02 Jun 2021
A Semantic-based Method for Unsupervised Commonsense Question Answering
A Semantic-based Method for Unsupervised Commonsense Question AnsweringAnnual Meeting of the Association for Computational Linguistics (ACL), 2021
Yilin Niu
Fei Huang
Jiaming Liang
Wenkai Chen
Xiaoyan Zhu
Shiyu Huang
LRM
202
15
0
31 May 2021
Back to Square One: Artifact Detection, Training and Commonsense
  Disentanglement in the Winograd Schema
Back to Square One: Artifact Detection, Training and Commonsense Disentanglement in the Winograd SchemaConference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Yanai Elazar
Hongming Zhang
Yoav Goldberg
Dan Roth
ReLMLRM
393
46
0
16 Apr 2021
An Analysis of Dataset Overlap on Winograd-Style Tasks
An Analysis of Dataset Overlap on Winograd-Style Tasks
Ali Emami
Adam Trischler
Kaheer Suleman
Jackie C.K. Cheung
259
24
0
09 Nov 2020
1
Page 1 of 1