v1v2 (latest)

Back to Square One: Artifact Detection, Training and Commonsense Disentanglement in the Winograd Schema

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021

16 April 2021

Papers citing "Back to Square One: Artifact Detection, Training and Commonsense Disentanglement in the Winograd Schema"

31 / 31 papers shown

Title
Estonian WinoGrande Dataset: Comparative Analysis of LLM Performance on Human and Machine Translation Marii Ojastu Hele-Andra Kuulmets Aleksei Dorkin Marika Borovikova Dage Särg Kairit Sirts 151 0 0 21 Nov 2025
WinoWhat: A Parallel Corpus of Paraphrased WinoGrande Sentences with Common Sense Categorization I. Gevers Victor De Marez Luna De Bruyne Walter Daelemans 232 1 0 31 Mar 2025
MASS: Overcoming Language Bias in Image-Text MatchingAAAI Conference on Artificial Intelligence (AAAI), 2025 Jiwan Chung Seungwon Lim Sangkyu Lee Youngjae Yu VLM 197 0 0 20 Jan 2025
WinoPron: Revisiting English Winogender Schemas for Consistency, Coverage, and Grammatical Case Vagrant Gautam Julius Steuer Eileen Bingert Ray Johns Anne Lauscher Dietrich Klakow 332 7 0 09 Sep 2024
Thai Winograd Schemas: A Benchmark for Thai Commonsense Reasoning Phakphum Artkaew LRM 147 0 0 28 May 2024
Picturing Ambiguity: A Visual Twist on the Winograd Schema Challenge Brendan Park Madeline Janecek Naser Ezzati-Jivan Yifeng Li Ali Emami 210 2 0 25 May 2024
Robust Pronoun Fidelity with English LLMs: Are they Reasoning, Repeating, or Just Biased?Transactions of the Association for Computational Linguistics (TACL), 2024 Vagrant Gautam Eileen Bingert D. Zhu Anne Lauscher Dietrich Klakow 272 13 0 04 Apr 2024
EvoGrad: A Dynamic Take on the Winograd Schema Challenge with Human Adversaries Jing Han Sun Ali Emami 236 6 0 20 Feb 2024
Experimental Contexts Can Facilitate Robust Semantic Property Inference in Language Models, but InconsistentlyConference on Empirical Methods in Natural Language Processing (EMNLP), 2024 Kanishka Misra Allyson Ettinger Kyle Mahowald 250 5 0 12 Jan 2024
CLOMO: Counterfactual Logical Modification with Large Language ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2023 Yinya Huang Ruixin Hong Hongming Zhang Wei Shao Zhicheng YANG Dong Yu Changshui Zhang Xiaodan Liang Linqi Song LRM 187 11 0 29 Nov 2023
CompA: Addressing the Gap in Compositional Reasoning in Audio-Language ModelsInternational Conference on Learning Representations (ICLR), 2023 Sreyan Ghosh Ashish Seth Sonal Kumar Utkarsh Tyagi Chandra Kiran Reddy Evuru S. Ramaneswaran S. Sakshi Oriol Nieto R. Duraiswami Dinesh Manocha AuLLM VLM CoGe 417 44 0 12 Oct 2023
BRAINTEASER: Lateral Thinking Puzzles for Large Language ModelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2023 Yifan Jiang Filip Ilievski Kaixin Ma Zhivar Sourati LRM ReLM 317 14 0 08 Oct 2023
PronounFlow: A Hybrid Approach for Calibrating Pronouns in Sentences Nicos Isaak 111 1 0 29 Aug 2023
Causal interventions expose implicit situation models for commonsense language understandingAnnual Meeting of the Association for Computational Linguistics (ACL), 2023 Takateru Yamakoshi James L. McClelland A. Goldberg Robert D. Hawkins 295 7 0 06 Jun 2023
Few-shot Fine-tuning vs. In-context Learning: A Fair Comparison and EvaluationAnnual Meeting of the Association for Computational Linguistics (ACL), 2023 Marius Mosbach Tiago Pimentel Haiqin Yang Dietrich Klakow Yanai Elazar 293 171 0 26 May 2023
Abductive Commonsense Reasoning Exploiting Mutually Exclusive ExplanationsAnnual Meeting of the Association for Computational Linguistics (ACL), 2023 Wenting Zhao Justin T. Chiu Claire Cardie Alexander M. Rush LRM 240 25 0 24 May 2023
Event knowledge in large language models: the gap between the impossible and the unlikelyCognitive Sciences (CS), 2022 Carina Kauf Anna A. Ivanova Giulia Rambelli Emmanuele Chersoni Jingyuan Selena She Zawad Chowdhury Evelina Fedorenko Alessandro Lenci 461 86 0 02 Dec 2022
Measuring Reliability of Large Language Models through Semantic Consistency Harsh Raj Domenic Rosati Subhabrata Majumdar HILM 218 37 0 10 Nov 2022
Validity Assessment of Legal Will Statements as Natural Language InferenceConference on Empirical Methods in Natural Language Processing (EMNLP), 2022 A. Kwak Jacob O. Israelsen Clayton T. Morrison Derek E. Bambauer Mihai Surdeanu AILaw 123 4 0 30 Oct 2022
CIKQA: Learning Commonsense Inference with a Unified Knowledge-in-the-loop QA ParadigmFindings (Findings), 2022 Hongming Zhang Yintong Huo Yanai Elazar Yangqiu Song Yoav Goldberg Dan Roth LRM 201 3 0 12 Oct 2022
State-of-the-art generalisation research in NLP: A taxonomy and reviewNature Machine Intelligence (Nat. Mach. Intell.), 2022 Dieuwke Hupkes Mario Giulianelli Verna Dankers Mikel Artetxe Yanai Elazar ... Leila Khalatbari Maria Ryskina Rita Frieske Robert Bamler Zhijing Jin 548 130 0 06 Oct 2022
Measuring Causal Effects of Data Statistics on Language Model's `Factual' Predictions Yanai Elazar Nora Kassner Haiqin Yang Amir Feder Abhilasha Ravichander Marius Mosbach Yonatan Belinkov Hinrich Schütze Yoav Goldberg CML SyDa MILM 212 61 0 28 Jul 2022
longhorns at DADC 2022: How many linguists does it take to fool a Question Answering model? A systematic approach to adversarial attacks Venelin Kovatchev Trina Chatterjee Venkata S Govindarajan Jifan Chen Eunsol Choi ... K. Erk Matthew Lease Junyi Jessy Li Yating Wu Kyle Mahowald AAML ELM 180 11 0 29 Jun 2022
On the Paradox of Learning to Reason from DataInternational Joint Conference on Artificial Intelligence (IJCAI), 2022 Honghua Zhang Liunian Harold Li Tao Meng Kai-Wei Chang Karen Ullrich NAI ReLM OOD LRM 320 132 0 23 May 2022
On the Limitations of Dataset Balancing: The Lost Battle Against Spurious Correlations Roy Schwartz Gabriel Stanovsky 168 31 0 27 Apr 2022
Testing the Ability of Language Models to Interpret Figurative LanguageNorth American Chapter of the Association for Computational Linguistics (NAACL), 2022 Emmy Liu Chenxuan Cui Kenneth Zheng Graham Neubig ELM LRM 228 94 0 26 Apr 2022
Explanation Graph Generation via Pre-trained Language Models: An Empirical Study with Contrastive LearningAnnual Meeting of the Association for Computational Linguistics (ACL), 2022 Swarnadeep Saha Prateek Yadav Joey Tianyi Zhou 163 10 0 11 Apr 2022
Winoground: Probing Vision and Language Models for Visio-Linguistic CompositionalityComputer Vision and Pattern Recognition (CVPR), 2022 Tristan Thrush Ryan Jiang Max Bartolo Amanpreet Singh Adina Williams Douwe Kiela Candace Ross CoGe 354 510 0 07 Apr 2022
A Theoretically Grounded Benchmark for Evaluating Machine Commonsense Henrique M. Dinis Santos Ke Shen Alice M. Mulvehill Yasaman Razeghi D. McGuinness Mayank Kejriwal ELM LRM 201 6 0 23 Mar 2022
The Defeat of the Winograd Schema ChallengeArtificial Intelligence (AIJ), 2022 Vid Kocijan E. Davis Thomas Lukasiewicz G. Marcus L. Morgenstern 260 47 0 07 Jan 2022
ASER: Towards Large-scale Commonsense Knowledge Acquisition via Higher-order Selectional Preference over EventualitiesArtificial Intelligence (AI), 2021 Hongming Zhang Xin Liu Haojie Pan Hao Ke Jiefu Ou Tianqing Fang Yangqiu Song 179 50 0 05 Apr 2021