
| Title | Venue |
|---|---|
| Human vs. Muppet: A Conservative Estimate of Human Performance on the GLUE Benchmark | Annual Meeting of the Association for Computational Linguistics (ACL), 2019 |
| BoolQ: Exploring the Surprising Difficulty of Natural Yes/No Questions | North American Chapter of the Association for Computational Linguistics (NAACL), 2019 |
| MCScript2.0: A Machine Comprehension Corpus Focused on Script Events and Participants | International Workshop on Semantic Evaluation (SemEval), 2019 |
| Performance Analysis of Deep Learning Workloads on Leading-edge Systems | International Workshop on Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems (PMBS), 2019 |
| HellaSwag: Can a Machine Really Finish Your Sentence? | Annual Meeting of the Association for Computational Linguistics (ACL), 2019 |
| Story Ending Prediction by Transferable BERT | International Joint Conference on Artificial Intelligence (IJCAI), 2019 |
| ERNIE: Enhanced Language Representation with Informative Entities | Annual Meeting of the Association for Computational Linguistics (ACL), 2019 |
| SuperGLUE: A Stickier Benchmark for General-Purpose Language Understanding Systems | Neural Information Processing Systems (NeurIPS), 2019 |
| Enabling Robots to Understand Incomplete Natural Language Instructions Using Commonsense Reasoning | IEEE International Conference on Robotics and Automation (ICRA), 2019 |
| DROP: A Reading Comprehension Benchmark Requiring Discrete Reasoning Over Paragraphs | North American Chapter of the Association for Computational Linguistics (NAACL), 2019 |
| Still a Pain in the Neck: Evaluating Text Representations on Lexical Composition | Transactions of the Association for Computational Linguistics (TACL), 2019 |