HellaSwag: Can a Machine Really Finish Your Sentence?

Annual Meeting of the Association for Computational Linguistics (ACL), 2019

19 May 2019

Yejin Choi

Papers citing "HellaSwag: Can a Machine Really Finish Your Sentence?"

50 / 2,254 papers shown

The Stability-Efficiency Dilemma: Investigating Sequence Length Warmup for Training GPT ModelsNeural Information Processing Systems (NeurIPS), 2021

Conglong Li

Minjia Zhang

Yuxiong He

326

13 Aug 2021

Goal-Oriented Script ConstructionInternational Conference on Natural Language Generation (INLG), 2021

Qing Lyu

Li Zhang

Chris Callison-Burch

209

28 Jul 2021

QA Dataset Explosion: A Taxonomy of NLP Resources for Question Answering and Reading ComprehensionACM Computing Surveys (CSUR), 2021

Anna Rogers

Matt Gardner

Isabelle Augenstein

377

191

27 Jul 2021

HTLM: Hyper-Text Pre-Training and Prompting of Language ModelsInternational Conference on Learning Representations (ICLR), 2021

Luke Zettlemoyer

VLM VPVLM AI4TS AI4CE

214

14 Jul 2021

All That's 'Human' Is Not Gold: Evaluating Human Evaluation of Generated TextAnnual Meeting of the Association for Computational Linguistics (ACL), 2021

565

490

30 Jun 2021

Learning Stable Classifiers by Transferring Unstable FeaturesInternational Conference on Machine Learning (ICML), 2021

340

15 Jun 2021

Improving Paraphrase Detection with the Adversarial Paraphrasing TaskAnnual Meeting of the Association for Computational Linguistics (ACL), 2021

Animesh Nighojkar

John Licato

159

14 Jun 2021

ImaginE: An Imagination-Based Automatic Evaluation Metric for Natural Language GenerationFindings (Findings), 2021

147

10 Jun 2021

Bayesian Attention Belief NetworksInternational Conference on Machine Learning (ICML), 2021

Shujian Zhang

Xinjie Fan

Bo Chen

Mingyuan Zhou

BDL

261

09 Jun 2021

PROST: Physical Reasoning of Objects through Space and TimeFindings (Findings), 2021

Stéphane Aroca-Ouellette

172

07 Jun 2021

MedNLI Is Not Immune: Natural Language Inference Artifacts in the Clinical DomainAnnual Meeting of the Association for Computational Linguistics (ACL), 2021

Christine Herlihy

Rachel Rudinger

137

02 Jun 2021

COM2SENSE: A Commonsense Reasoning Benchmark with Complementary SentencesFindings (Findings), 2021

Shikhar Singh

Nuan Wen

Yu Hou

Pegah Alipoormolabashi

252

02 Jun 2021

On the Efficacy of Adversarial Data Collection for Question Answering: Results from a Large-Scale Randomized StudyAnnual Meeting of the Association for Computational Linguistics (ACL), 2021

Douwe Kiela

166

02 Jun 2021

Comparing Test Sets with Item Response TheoryAnnual Meeting of the Association for Computational Linguistics (ACL), 2021

178

01 Jun 2021

What Ingredients Make for an Effective Crowdsourcing Protocol for Difficult NLU Data Collection Tasks?Annual Meeting of the Association for Computational Linguistics (ACL), 2021

282

01 Jun 2021

PIGLeT: Language Grounding Through Neuro-Symbolic Interaction in a 3D WorldAnnual Meeting of the Association for Computational Linguistics (ACL), 2021

Yejin Choi

316

01 Jun 2021

Predict then Interpolate: A Simple Algorithm to Learn Stable ClassifiersInternational Conference on Machine Learning (ICML), 2021

Yujia Bao

Shiyu Chang

Regina Barzilay

178

26 May 2021

Measuring Coding Challenge Competence With APPS

...

1.2K

924

20 May 2021

Go Beyond Plain Fine-tuning: Improving Pretrained Models for Social CommonsenseSpoken Language Technology Workshop (SLT), 2021

Ting-Yun Chang

Yang Liu

Karthik Gopalakrishnan

131

12 May 2021

Incorporating Commonsense Knowledge Graph in Pretrained Models for Social Commonsense TasksWorkshop on Knowledge Extraction and Integration for Deep Learning Architectures; Deep Learning Inside Out (DEELIO), 2020

Ting-Yun Chang

Yang Liu

Karthik Gopalakrishnan

177

12 May 2021

Unreasonable Effectiveness of Rule-Based Heuristics in Solving Russian SuperGLUE Tasks

149

03 May 2021

RoFormer: Enhanced Transformer with Rotary Position Embedding

877

4,081

20 Apr 2021

CrossFit: A Few-shot Learning Challenge for Cross-task Generalization in NLPConference on Empirical Methods in Natural Language Processing (EMNLP), 2021

Qinyuan Ye

Bill Yuchen Lin

Xiang Ren

645

195

18 Apr 2021

Surface Form Competition: Why the Highest Probability Answer Isn't Always RightConference on Empirical Methods in Natural Language Processing (EMNLP), 2021

Ari Holtzman

Peter West

Vered Schwartz

Yejin Choi

Luke Zettlemoyer

LRM

721

268

16 Apr 2021

What to Pre-Train on? Efficient Intermediate Task SelectionConference on Empirical Methods in Natural Language Processing (EMNLP), 2021

266

107

16 Apr 2021

ExplaGraphs: An Explanation Graph Generation Task for Structured Commonsense ReasoningConference on Empirical Methods in Natural Language Processing (EMNLP), 2021

340

15 Apr 2021

AR-LSAT: Investigating Analytical Reasoning of Text

377

14 Apr 2021

UNICORN on RAINBOW: A Universal Commonsense Reasoning Model on a New Multitask BenchmarkAAAI Conference on Artificial Intelligence (AAAI), 2021

Yejin Choi

284

147

24 Mar 2021

Automatic Generation of Contrast Sets from Scene Graphs: Probing the Compositional Consistency of GQANorth American Chapter of the Association for Computational Linguistics (NAACL), 2021

Gabriel Stanovsky

198

17 Mar 2021

IIE-NLP-Eyas at SemEval-2021 Task 4: Enhancing PLM for ReCAM with Special Tokens, Re-Ranking, Siamese Encoders and Back TranslationInternational Workshop on Semantic Evaluation (SemEval), 2021

136

25 Feb 2021

Muppet: Massive Multi-task Representations with Pre-FinetuningConference on Empirical Methods in Natural Language Processing (EMNLP), 2021

Luke Zettlemoyer

197

290

26 Jan 2021

English Machine Reading Comprehension Datasets: A SurveyConference on Empirical Methods in Natural Language Processing (EMNLP), 2021

271

25 Jan 2021

Benchmarking Knowledge-Enhanced Commonsense Question Answering via Knowledge-to-Text TransformationAAAI Conference on Artificial Intelligence (AAAI), 2021

Xianpei Han

186

04 Jan 2021

DynaSent: A Dynamic Benchmark for Sentiment AnalysisAnnual Meeting of the Association for Computational Linguistics (ACL), 2020

Christopher Potts

Zhengxuan Wu

Atticus Geiger

Douwe Kiela

474

30 Dec 2020

Exploring and Analyzing Machine Commonsense Benchmarks

Henrique M. Dinis Santos

130

21 Dec 2020

Learning from others' mistakes: Avoiding dataset biases without modeling themInternational Conference on Learning Representations (ICLR), 2020

304

123

02 Dec 2020

An Enhanced Knowledge Injection Model for Commonsense GenerationInternational Conference on Computational Linguistics (COLING), 2020

Xuanjing Huang

259

01 Dec 2020

A Data-Driven Study of Commonsense Knowledge using the ConceptNet Knowledge Base

Ke Shen

Mayank Kejriwal

183

28 Nov 2020

Do Fine-tuned Commonsense Language Models Really Generalize?

Mayank Kejriwal

Ke Shen

ELM LRM

139

18 Nov 2020

An Analysis of Dataset Overlap on Winograd-Style Tasks

195

09 Nov 2020

Knowledge-driven Data Construction for Zero-shot Evaluation in Commonsense Question Answering

Kaixin Ma

173

07 Nov 2020

Underspecification Presents Challenges for Credibility in Modern Machine Learning

...

448

766

06 Nov 2020

"where is this relationship going?": Understanding Relationship Trajectories in Narrative Text

Keen You

Dan Goldwasser

240

29 Oct 2020

Analogous Process Structure Induction for Sub-event Sequence PredictionConference on Empirical Methods in Natural Language Processing (EMNLP), 2020

154

16 Oct 2020

What is More Likely to Happen Next? Video-and-Language Future Event Prediction

208

15 Oct 2020

Natural Language Rationales with Full-Stack Visual Reasoning: From Pixels to Semantic Frames to Commonsense Graphs

Yejin Choi

235

15 Oct 2020

Asking Crowdworkers to Write Entailment Examples: The Best of Bad Options

Clara Vania

Ruijie Chen

Samuel R. Bowman

265

13 Oct 2020

COMET-ATOMIC 2020: On Symbolic and Neural Commonsense Knowledge Graphs

Keisuke Sakaguchi

Yejin Choi

294

443

12 Oct 2020

Social Commonsense Reasoning with Multi-Head Knowledge Attention

Debjit Paul

Anette Frank

LRM

139

12 Oct 2020

Intrinsic Probing through Dimension Selection

Lucas Torroba Hennigen

Adina Williams

Robert Bamler

212

06 Oct 2020