Back to Square One: Artifact Detection, Training and Commonsense Disentanglement in the Winograd Schema

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021
16 April 2021
Yanai Elazar
Hongming Zhang
Yoav Goldberg
Dan Roth
ReLM, LRM

Papers citing "Back to Square One: Artifact Detection, Training and Commonsense Disentanglement in the Winograd Schema"

31 / 31 papers shown
Estonian WinoGrande Dataset: Comparative Analysis of LLM Performance on Human and Machine Translation
Marii Ojastu
Hele-Andra Kuulmets
Aleksei Dorkin
Marika Borovikova
Dage Särg
Kairit Sirts
151
0
0
21 Nov 2025
WinoWhat: A Parallel Corpus of Paraphrased WinoGrande Sentences with Common Sense Categorization
I. Gevers
Victor De Marez
Luna De Bruyne
Walter Daelemans
232
1
0
31 Mar 2025
MASS: Overcoming Language Bias in Image-Text Matching
AAAI Conference on Artificial Intelligence (AAAI), 2025
Jiwan Chung
Seungwon Lim
Sangkyu Lee
Youngjae Yu
VLM
197
0
0
20 Jan 2025
WinoPron: Revisiting English Winogender Schemas for Consistency, Coverage, and Grammatical Case
Vagrant Gautam
Julius Steuer
Eileen Bingert
Ray Johns
Anne Lauscher
Dietrich Klakow
332
7
0
09 Sep 2024
Thai Winograd Schemas: A Benchmark for Thai Commonsense Reasoning
Phakphum Artkaew
LRM
147
0
0
28 May 2024
Picturing Ambiguity: A Visual Twist on the Winograd Schema Challenge
Brendan Park
Madeline Janecek
Naser Ezzati-Jivan
Yifeng Li
Ali Emami
210
2
0
25 May 2024
Robust Pronoun Fidelity with English LLMs: Are they Reasoning, Repeating, or Just Biased?
Transactions of the Association for Computational Linguistics (TACL), 2024
Vagrant Gautam
Eileen Bingert
D. Zhu
Anne Lauscher
Dietrich Klakow
272
13
0
04 Apr 2024
EvoGrad: A Dynamic Take on the Winograd Schema Challenge with Human Adversaries
Jing Han Sun
Ali Emami
236
6
0
20 Feb 2024
Experimental Contexts Can Facilitate Robust Semantic Property Inference in Language Models, but Inconsistently
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Kanishka Misra
Allyson Ettinger
Kyle Mahowald
250
5
0
12 Jan 2024
CLOMO: Counterfactual Logical Modification with Large Language Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Yinya Huang
Ruixin Hong
Hongming Zhang
Wei Shao
Zhicheng YANG
Dong Yu
Changshui Zhang
Xiaodan Liang
Linqi Song
LRM
187
11
0
29 Nov 2023
CompA: Addressing the Gap in Compositional Reasoning in Audio-Language Models
International Conference on Learning Representations (ICLR), 2023
Sreyan Ghosh
Ashish Seth
Sonal Kumar
Utkarsh Tyagi
Chandra Kiran Reddy Evuru
S. Ramaneswaran
S. Sakshi
Oriol Nieto
R. Duraiswami
Dinesh Manocha
AuLLM, VLM, CoGe
417
44
0
12 Oct 2023
BRAINTEASER: Lateral Thinking Puzzles for Large Language Models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Yifan Jiang
Filip Ilievski
Kaixin Ma
Zhivar Sourati
LRM, ReLM
317
14
0
08 Oct 2023
PronounFlow: A Hybrid Approach for Calibrating Pronouns in Sentences
Nicos Isaak
111
1
0
29 Aug 2023
Causal interventions expose implicit situation models for commonsense language understanding
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Takateru Yamakoshi
James L. McClelland
A. Goldberg
Robert D. Hawkins
295
7
0
06 Jun 2023
Few-shot Fine-tuning vs. In-context Learning: A Fair Comparison and Evaluation
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Marius Mosbach
Tiago Pimentel
Haiqin Yang
Dietrich Klakow
Yanai Elazar
293
171
0
26 May 2023
Abductive Commonsense Reasoning Exploiting Mutually Exclusive Explanations
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Wenting Zhao
Justin T. Chiu
Claire Cardie
Alexander M. Rush
LRM
240
25
0
24 May 2023
Event knowledge in large language models: the gap between the impossible and the unlikely
Cognitive Sciences (CS), 2022
Carina Kauf
Anna A. Ivanova
Giulia Rambelli
Emmanuele Chersoni
Jingyuan Selena She
Zawad Chowdhury
Evelina Fedorenko
Alessandro Lenci
461
86
0
02 Dec 2022
Measuring Reliability of Large Language Models through Semantic Consistency
Harsh Raj
Domenic Rosati
Subhabrata Majumdar
HILM
218
37
0
10 Nov 2022
Validity Assessment of Legal Will Statements as Natural Language Inference
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
A. Kwak
Jacob O. Israelsen
Clayton T. Morrison
Derek E. Bambauer
Mihai Surdeanu
AILaw
123
4
0
30 Oct 2022
CIKQA: Learning Commonsense Inference with a Unified Knowledge-in-the-loop QA Paradigm
Findings (Findings), 2022
Hongming Zhang
Yintong Huo
Yanai Elazar
Yangqiu Song
Yoav Goldberg
Dan Roth
LRM
201
3
0
12 Oct 2022
State-of-the-art generalisation research in NLP: A taxonomy and review
Nature Machine Intelligence (Nat. Mach. Intell.), 2022
Dieuwke Hupkes
Mario Giulianelli
Verna Dankers
Mikel Artetxe
Yanai Elazar
...
Leila Khalatbari
Maria Ryskina
Rita Frieske
Robert Bamler
Zhijing Jin
548
130
0
06 Oct 2022
Measuring Causal Effects of Data Statistics on Language Model's 'Factual' Predictions
Yanai Elazar
Nora Kassner
Haiqin Yang
Amir Feder
Abhilasha Ravichander
Marius Mosbach
Yonatan Belinkov
Hinrich Schütze
Yoav Goldberg
CML, SyDa, MILM
212
61
0
28 Jul 2022
longhorns at DADC 2022: How many linguists does it take to fool a Question Answering model? A systematic approach to adversarial attacks
Venelin Kovatchev
Trina Chatterjee
Venkata S Govindarajan
Jifan Chen
Eunsol Choi
...
K. Erk
Matthew Lease
Junyi Jessy Li
Yating Wu
Kyle Mahowald
AAML, ELM
180
11
0
29 Jun 2022
On the Paradox of Learning to Reason from Data
International Joint Conference on Artificial Intelligence (IJCAI), 2022
Honghua Zhang
Liunian Harold Li
Tao Meng
Kai-Wei Chang
Karen Ullrich
NAI, ReLM, OOD, LRM
320
132
0
23 May 2022
On the Limitations of Dataset Balancing: The Lost Battle Against Spurious Correlations
Roy Schwartz
Gabriel Stanovsky
168
31
0
27 Apr 2022
Testing the Ability of Language Models to Interpret Figurative Language
North American Chapter of the Association for Computational Linguistics (NAACL), 2022
Emmy Liu
Chenxuan Cui
Kenneth Zheng
Graham Neubig
ELM, LRM
228
94
0
26 Apr 2022
Explanation Graph Generation via Pre-trained Language Models: An Empirical Study with Contrastive Learning
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Swarnadeep Saha
Prateek Yadav
Joey Tianyi Zhou
163
10
0
11 Apr 2022
Winoground: Probing Vision and Language Models for Visio-Linguistic Compositionality
Computer Vision and Pattern Recognition (CVPR), 2022
Tristan Thrush
Ryan Jiang
Max Bartolo
Amanpreet Singh
Adina Williams
Douwe Kiela
Candace Ross
CoGe
354
510
0
07 Apr 2022
A Theoretically Grounded Benchmark for Evaluating Machine Commonsense
Henrique M. Dinis Santos
Ke Shen
Alice M. Mulvehill
Yasaman Razeghi
D. McGuinness
Mayank Kejriwal
ELM, LRM
201
6
0
23 Mar 2022
The Defeat of the Winograd Schema Challenge
Artificial Intelligence (AIJ), 2022
Vid Kocijan
E. Davis
Thomas Lukasiewicz
G. Marcus
L. Morgenstern
260
47
0
07 Jan 2022
ASER: Towards Large-scale Commonsense Knowledge Acquisition via Higher-order Selectional Preference over Eventualities
Artificial Intelligence (AI), 2021
Hongming Zhang
Xin Liu
Haojie Pan
Hao Ke
Jiefu Ou
Tianqing Fang
Yangqiu Song
179
50
0
05 Apr 2021