v1v2 (latest)

The Sensitivity of Language Models and Humans to Winograd Schema Perturbations

Annual Meeting of the Association for Computational Linguistics (ACL), 2020

4 May 2020

Papers citing "The Sensitivity of Language Models and Humans to Winograd Schema Perturbations"

21 / 21 papers shown

Not quite Sherlock Holmes: Language model predictions do not reliably differentiate impossible from improbable eventsAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

250

07 Jun 2025

WinoWhat: A Parallel Corpus of Paraphrased WinoGrande Sentences with Common Sense Categorization

390

31 Mar 2025

WinoPron: Revisiting English Winogender Schemas for Consistency, Coverage, and Grammatical Case

422

09 Sep 2024

Robust Pronoun Fidelity with English LLMs: Are they Reasoning, Repeating, or Just Biased?Transactions of the Association for Computational Linguistics (TACL), 2024

410

04 Apr 2024

EvoGrad: A Dynamic Take on the Winograd Schema Challenge with Human Adversaries

Jing Han Sun

Ali Emami

377

20 Feb 2024

CASE: Commonsense-Augmented Score with an Expanded Answer SpaceConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Wenkai Chen

Sahithya Ravi

Vered Shwartz

244

03 Nov 2023

BRAINTEASER: Lateral Thinking Puzzles for Large Language ModelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Kaixin Ma

395

08 Oct 2023

Causal interventions expose implicit situation models for commonsense language understandingAnnual Meeting of the Association for Computational Linguistics (ACL), 2023

350

06 Jun 2023

Few-Shot Dialogue Summarization via Skeleton-Assisted Prompt Transfer in Prompt TuningConference of the European Chapter of the Association for Computational Linguistics (EACL), 2023

387

20 May 2023

Event knowledge in large language models: the gap between the impossible and the unlikelyCognitive Sciences (CS), 2022

579

02 Dec 2022

An Empirical Investigation of Commonsense Self-Supervision with Knowledge Graphs

Kaixin Ma

158

21 May 2022

Generalized Quantifiers as a Source of Error in Multilingual NLU Benchmarks

Ruixiang Cui

Daniel Hershcovich

Anders Søgaard

304

22 Apr 2022

Testing the limits of natural language models for predicting human language judgmentsNature Machine Intelligence (Nat. Mach. Intell.), 2022

Tal Golan

Matthew Siegelman

N. Kriegeskorte

Christopher A. Baldassano

346

07 Apr 2022

Hierarchical Interpretation of Neural Text ClassificationComputational Linguistics (CL), 2022

Hanqi Yan

Lin Gui

Yulan He

397

20 Feb 2022

An Application of Pseudo-Log-Likelihoods to Natural Language Scoring

Darren Abramson

Ali Emami

274

23 Jan 2022

Towards Zero-shot Commonsense Reasoning with Self-supervised Refinement of Language Models

T. Klein

Moin Nabi

ReLM LRM

183

10 Sep 2021

Transformers in the loop: Polarity in neural models of languageAnnual Meeting of the Association for Computational Linguistics (ACL), 2021

Lisa Bylinina

Alexey Tikhonov

157

08 Sep 2021

John praised Mary because he? Implicit Causality Bias and Its Interaction with Explicit Cues in LMsFindings (Findings), 2021

Yova Kementchedjhieva

Mark Anderson

Anders Søgaard

176

02 Jun 2021

A Semantic-based Method for Unsupervised Commonsense Question AnsweringAnnual Meeting of the Association for Computational Linguistics (ACL), 2021

Fei Huang

202

31 May 2021

Back to Square One: Artifact Detection, Training and Commonsense Disentanglement in the Winograd SchemaConference on Empirical Methods in Natural Language Processing (EMNLP), 2021

393

16 Apr 2021

An Analysis of Dataset Overlap on Winograd-Style Tasks

259

09 Nov 2020