ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1905.07830
  4. Cited By
HellaSwag: Can a Machine Really Finish Your Sentence?

HellaSwag: Can a Machine Really Finish Your Sentence?

Annual Meeting of the Association for Computational Linguistics (ACL), 2019
19 May 2019
Rowan Zellers
Ari Holtzman
Yonatan Bisk
Ali Farhadi
Yejin Choi
ArXiv (abs)PDFHTML

Papers citing "HellaSwag: Can a Machine Really Finish Your Sentence?"

50 / 2,254 papers shown
The Stability-Efficiency Dilemma: Investigating Sequence Length Warmup
  for Training GPT Models
The Stability-Efficiency Dilemma: Investigating Sequence Length Warmup for Training GPT ModelsNeural Information Processing Systems (NeurIPS), 2021
Conglong Li
Minjia Zhang
Yuxiong He
326
51
0
13 Aug 2021
Goal-Oriented Script Construction
Goal-Oriented Script ConstructionInternational Conference on Natural Language Generation (INLG), 2021
Qing Lyu
Li Zhang
Chris Callison-Burch
209
36
0
28 Jul 2021
QA Dataset Explosion: A Taxonomy of NLP Resources for Question Answering
  and Reading Comprehension
QA Dataset Explosion: A Taxonomy of NLP Resources for Question Answering and Reading ComprehensionACM Computing Surveys (CSUR), 2021
Anna Rogers
Matt Gardner
Isabelle Augenstein
377
191
0
27 Jul 2021
HTLM: Hyper-Text Pre-Training and Prompting of Language Models
HTLM: Hyper-Text Pre-Training and Prompting of Language ModelsInternational Conference on Learning Representations (ICLR), 2021
Armen Aghajanyan
Dmytro Okhonko
M. Lewis
Mandar Joshi
Hu Xu
Gargi Ghosh
Luke Zettlemoyer
VLMVPVLMAI4TSAI4CE
214
80
0
14 Jul 2021
All That's 'Human' Is Not Gold: Evaluating Human Evaluation of Generated
  Text
All That's 'Human' Is Not Gold: Evaluating Human Evaluation of Generated TextAnnual Meeting of the Association for Computational Linguistics (ACL), 2021
Elizabeth Clark
Tal August
Sofia Serrano
Nikita Haduong
Suchin Gururangan
Noah A. Smith
DeLMO
565
490
0
30 Jun 2021
Learning Stable Classifiers by Transferring Unstable Features
Learning Stable Classifiers by Transferring Unstable FeaturesInternational Conference on Machine Learning (ICML), 2021
Yujia Bao
Shiyu Chang
Regina Barzilay
OOD
340
8
0
15 Jun 2021
Improving Paraphrase Detection with the Adversarial Paraphrasing Task
Improving Paraphrase Detection with the Adversarial Paraphrasing TaskAnnual Meeting of the Association for Computational Linguistics (ACL), 2021
Animesh Nighojkar
John Licato
159
40
0
14 Jun 2021
ImaginE: An Imagination-Based Automatic Evaluation Metric for Natural
  Language Generation
ImaginE: An Imagination-Based Automatic Evaluation Metric for Natural Language GenerationFindings (Findings), 2021
Wanrong Zhu
Xinze Wang
An Yan
Miguel P. Eckstein
Wenjie Wang
147
7
0
10 Jun 2021
Bayesian Attention Belief Networks
Bayesian Attention Belief NetworksInternational Conference on Machine Learning (ICML), 2021
Shujian Zhang
Xinjie Fan
Bo Chen
Mingyuan Zhou
BDL
261
36
0
09 Jun 2021
PROST: Physical Reasoning of Objects through Space and Time
PROST: Physical Reasoning of Objects through Space and TimeFindings (Findings), 2021
Stéphane Aroca-Ouellette
Cory Paik
Alessandro Roncone
Katharina Kann
LRM
172
53
0
07 Jun 2021
MedNLI Is Not Immune: Natural Language Inference Artifacts in the
  Clinical Domain
MedNLI Is Not Immune: Natural Language Inference Artifacts in the Clinical DomainAnnual Meeting of the Association for Computational Linguistics (ACL), 2021
Christine Herlihy
Rachel Rudinger
137
28
0
02 Jun 2021
COM2SENSE: A Commonsense Reasoning Benchmark with Complementary
  Sentences
COM2SENSE: A Commonsense Reasoning Benchmark with Complementary SentencesFindings (Findings), 2021
Shikhar Singh
Nuan Wen
Yu Hou
Pegah Alipoormolabashi
Te-Lin Wu
Xuezhe Ma
Nanyun Peng
LRM
252
64
0
02 Jun 2021
On the Efficacy of Adversarial Data Collection for Question Answering:
  Results from a Large-Scale Randomized Study
On the Efficacy of Adversarial Data Collection for Question Answering: Results from a Large-Scale Randomized StudyAnnual Meeting of the Association for Computational Linguistics (ACL), 2021
Divyansh Kaushik
Douwe Kiela
Zachary Chase Lipton
Anuj Kumar
AAML
166
38
0
02 Jun 2021
Comparing Test Sets with Item Response Theory
Comparing Test Sets with Item Response TheoryAnnual Meeting of the Association for Computational Linguistics (ACL), 2021
Clara Vania
Phu Mon Htut
William Huang
Dhara Mungra
Richard Yuanzhe Pang
Jason Phang
Haokun Liu
Kyunghyun Cho
Sam Bowman
178
51
0
01 Jun 2021
What Ingredients Make for an Effective Crowdsourcing Protocol for
  Difficult NLU Data Collection Tasks?
What Ingredients Make for an Effective Crowdsourcing Protocol for Difficult NLU Data Collection Tasks?Annual Meeting of the Association for Computational Linguistics (ACL), 2021
Nikita Nangia
Saku Sugawara
H. Trivedi
Alex Warstadt
Clara Vania
Sam Bowman
282
36
0
01 Jun 2021
PIGLeT: Language Grounding Through Neuro-Symbolic Interaction in a 3D
  World
PIGLeT: Language Grounding Through Neuro-Symbolic Interaction in a 3D WorldAnnual Meeting of the Association for Computational Linguistics (ACL), 2021
Rowan Zellers
Ari Holtzman
Matthew E. Peters
Roozbeh Mottaghi
Aniruddha Kembhavi
Ali Farhadi
Yejin Choi
316
77
0
01 Jun 2021
Predict then Interpolate: A Simple Algorithm to Learn Stable Classifiers
Predict then Interpolate: A Simple Algorithm to Learn Stable ClassifiersInternational Conference on Machine Learning (ICML), 2021
Yujia Bao
Shiyu Chang
Regina Barzilay
178
22
0
26 May 2021
Measuring Coding Challenge Competence With APPS
Measuring Coding Challenge Competence With APPS
Dan Hendrycks
Steven Basart
Saurav Kadavath
Mantas Mazeika
Akul Arora
...
Collin Burns
Samir Puranik
Horace He
Basel Alomair
Jacob Steinhardt
ELMAIMatALM
1.2K
924
0
20 May 2021
Go Beyond Plain Fine-tuning: Improving Pretrained Models for Social
  Commonsense
Go Beyond Plain Fine-tuning: Improving Pretrained Models for Social CommonsenseSpoken Language Technology Workshop (SLT), 2021
Ting-Yun Chang
Yang Liu
Karthik Gopalakrishnan
Behnam Hedayatnia
Pei Zhou
Dilek Z. Hakkani-Tür
ReLMVLMAI4MHLRM
131
1
0
12 May 2021
Incorporating Commonsense Knowledge Graph in Pretrained Models for
  Social Commonsense Tasks
Incorporating Commonsense Knowledge Graph in Pretrained Models for Social Commonsense TasksWorkshop on Knowledge Extraction and Integration for Deep Learning Architectures; Deep Learning Inside Out (DEELIO), 2020
Ting-Yun Chang
Yang Liu
Karthik Gopalakrishnan
Behnam Hedayatnia
Pei Zhou
Dilek Z. Hakkani-Tür
ReLMAI4MHLRM
177
38
0
12 May 2021
Unreasonable Effectiveness of Rule-Based Heuristics in Solving Russian
  SuperGLUE Tasks
Unreasonable Effectiveness of Rule-Based Heuristics in Solving Russian SuperGLUE Tasks
Tatiana Iazykova
Denis Kapelyushnik
Olga Bystrova
Andrey Kutuzov
ELM
149
1
0
03 May 2021
RoFormer: Enhanced Transformer with Rotary Position Embedding
RoFormer: Enhanced Transformer with Rotary Position Embedding
Jianlin Su
Yu Lu
Shengfeng Pan
Ahmed Murtadha
Bo Wen
Yunfeng Liu
877
4,081
0
20 Apr 2021
CrossFit: A Few-shot Learning Challenge for Cross-task Generalization in
  NLP
CrossFit: A Few-shot Learning Challenge for Cross-task Generalization in NLPConference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Qinyuan Ye
Bill Yuchen Lin
Xiang Ren
645
195
0
18 Apr 2021
Surface Form Competition: Why the Highest Probability Answer Isn't
  Always Right
Surface Form Competition: Why the Highest Probability Answer Isn't Always RightConference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Ari Holtzman
Peter West
Vered Schwartz
Yejin Choi
Luke Zettlemoyer
LRM
721
268
0
16 Apr 2021
What to Pre-Train on? Efficient Intermediate Task Selection
What to Pre-Train on? Efficient Intermediate Task SelectionConference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Clifton A. Poth
Jonas Pfeiffer
Andreas Rucklé
Iryna Gurevych
266
107
0
16 Apr 2021
ExplaGraphs: An Explanation Graph Generation Task for Structured
  Commonsense Reasoning
ExplaGraphs: An Explanation Graph Generation Task for Structured Commonsense ReasoningConference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Swarnadeep Saha
Prateek Yadav
Lisa Bauer
Joey Tianyi Zhou
LRM
340
69
0
15 Apr 2021
AR-LSAT: Investigating Analytical Reasoning of Text
AR-LSAT: Investigating Analytical Reasoning of Text
Wanjun Zhong
Siyuan Wang
Duyu Tang
Zenan Xu
Daya Guo
Jiahai Wang
Jian Yin
Ming Zhou
Nan Duan
ELM
377
55
0
14 Apr 2021
UNICORN on RAINBOW: A Universal Commonsense Reasoning Model on a New
  Multitask Benchmark
UNICORN on RAINBOW: A Universal Commonsense Reasoning Model on a New Multitask BenchmarkAAAI Conference on Artificial Intelligence (AAAI), 2021
Nicholas Lourie
Ronan Le Bras
Chandra Bhagavatula
Yejin Choi
LRM
284
147
0
24 Mar 2021
Automatic Generation of Contrast Sets from Scene Graphs: Probing the
  Compositional Consistency of GQA
Automatic Generation of Contrast Sets from Scene Graphs: Probing the Compositional Consistency of GQANorth American Chapter of the Association for Computational Linguistics (NAACL), 2021
Yonatan Bitton
Gabriel Stanovsky
Roy Schwartz
Michael Elhadad
CoGe
198
33
0
17 Mar 2021
IIE-NLP-Eyas at SemEval-2021 Task 4: Enhancing PLM for ReCAM with
  Special Tokens, Re-Ranking, Siamese Encoders and Back Translation
IIE-NLP-Eyas at SemEval-2021 Task 4: Enhancing PLM for ReCAM with Special Tokens, Re-Ranking, Siamese Encoders and Back TranslationInternational Workshop on Semantic Evaluation (SemEval), 2021
Lei Shen
Luxi Xing
Wei Peng
Yue Hu
136
4
0
25 Feb 2021
Muppet: Massive Multi-task Representations with Pre-Finetuning
Muppet: Massive Multi-task Representations with Pre-FinetuningConference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Armen Aghajanyan
Anchit Gupta
Akshat Shrivastava
Xilun Chen
Luke Zettlemoyer
Sonal Gupta
197
290
0
26 Jan 2021
English Machine Reading Comprehension Datasets: A Survey
English Machine Reading Comprehension Datasets: A SurveyConference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Daria Dzendzik
Carl Vogel
Jennifer Foster
RALMAIMat
271
51
0
25 Jan 2021
Benchmarking Knowledge-Enhanced Commonsense Question Answering via
  Knowledge-to-Text Transformation
Benchmarking Knowledge-Enhanced Commonsense Question Answering via Knowledge-to-Text TransformationAAAI Conference on Artificial Intelligence (AAAI), 2021
Ning Bian
Xianpei Han
Bo Chen
Le Sun
ELM
186
48
0
04 Jan 2021
DynaSent: A Dynamic Benchmark for Sentiment Analysis
DynaSent: A Dynamic Benchmark for Sentiment AnalysisAnnual Meeting of the Association for Computational Linguistics (ACL), 2020
Christopher Potts
Zhengxuan Wu
Atticus Geiger
Douwe Kiela
474
85
0
30 Dec 2020
Exploring and Analyzing Machine Commonsense Benchmarks
Exploring and Analyzing Machine Commonsense Benchmarks
Henrique M. Dinis Santos
M. Gordon
Zhicheng Liang
Gretchen Forbush
D. McGuinness
130
4
0
21 Dec 2020
Learning from others' mistakes: Avoiding dataset biases without modeling
  them
Learning from others' mistakes: Avoiding dataset biases without modeling themInternational Conference on Learning Representations (ICLR), 2020
Victor Sanh
Thomas Wolf
Yonatan Belinkov
Alexander M. Rush
304
123
0
02 Dec 2020
An Enhanced Knowledge Injection Model for Commonsense Generation
An Enhanced Knowledge Injection Model for Commonsense GenerationInternational Conference on Computational Linguistics (COLING), 2020
Zhihao Fan
Yeyun Gong
Zhongyu Wei
Siyuan Wang
Ya-Chieh Huang
Jian Jiao
Xuanjing Huang
Nan Duan
Ruofei Zhang
259
30
0
01 Dec 2020
A Data-Driven Study of Commonsense Knowledge using the ConceptNet
  Knowledge Base
A Data-Driven Study of Commonsense Knowledge using the ConceptNet Knowledge Base
Ke Shen
Mayank Kejriwal
183
3
0
28 Nov 2020
Do Fine-tuned Commonsense Language Models Really Generalize?
Do Fine-tuned Commonsense Language Models Really Generalize?
Mayank Kejriwal
Ke Shen
ELMLRM
139
10
0
18 Nov 2020
An Analysis of Dataset Overlap on Winograd-Style Tasks
An Analysis of Dataset Overlap on Winograd-Style Tasks
Ali Emami
Adam Trischler
Kaheer Suleman
Jackie C.K. Cheung
195
23
0
09 Nov 2020
Knowledge-driven Data Construction for Zero-shot Evaluation in
  Commonsense Question Answering
Knowledge-driven Data Construction for Zero-shot Evaluation in Commonsense Question Answering
Kaixin Ma
Filip Ilievski
Jonathan M Francis
Yonatan Bisk
Eric Nyberg
A. Oltramari
173
6
0
07 Nov 2020
Underspecification Presents Challenges for Credibility in Modern Machine
  Learning
Underspecification Presents Challenges for Credibility in Modern Machine Learning
Alexander DÁmour
Katherine A. Heller
D. Moldovan
Ben Adlam
B. Alipanahi
...
Kellie Webster
Steve Yadlowsky
T. Yun
Xiaohua Zhai
D. Sculley
OffRL
448
766
0
06 Nov 2020
"where is this relationship going?": Understanding Relationship
  Trajectories in Narrative Text
"where is this relationship going?": Understanding Relationship Trajectories in Narrative Text
Keen You
Dan Goldwasser
240
5
0
29 Oct 2020
Analogous Process Structure Induction for Sub-event Sequence Prediction
Analogous Process Structure Induction for Sub-event Sequence PredictionConference on Empirical Methods in Natural Language Processing (EMNLP), 2020
Hongming Zhang
Muhao Chen
Haoyu Wang
Yangqiu Song
Dan Roth
AI4TS
154
49
0
16 Oct 2020
What is More Likely to Happen Next? Video-and-Language Future Event
  Prediction
What is More Likely to Happen Next? Video-and-Language Future Event Prediction
Jie Lei
Licheng Yu
Tamara L. Berg
Joey Tianyi Zhou
208
80
0
15 Oct 2020
Natural Language Rationales with Full-Stack Visual Reasoning: From
  Pixels to Semantic Frames to Commonsense Graphs
Natural Language Rationales with Full-Stack Visual Reasoning: From Pixels to Semantic Frames to Commonsense Graphs
Ana Marasović
Chandra Bhagavatula
J. S. Park
Ronan Le Bras
Noah A. Smith
Yejin Choi
ReLMLRM
235
64
0
15 Oct 2020
Asking Crowdworkers to Write Entailment Examples: The Best of Bad
  Options
Asking Crowdworkers to Write Entailment Examples: The Best of Bad Options
Clara Vania
Ruijie Chen
Samuel R. Bowman
265
12
0
13 Oct 2020
COMET-ATOMIC 2020: On Symbolic and Neural Commonsense Knowledge Graphs
COMET-ATOMIC 2020: On Symbolic and Neural Commonsense Knowledge Graphs
Jena D. Hwang
Chandra Bhagavatula
Ronan Le Bras
Jeff Da
Keisuke Sakaguchi
Antoine Bosselut
Yejin Choi
294
443
0
12 Oct 2020
Social Commonsense Reasoning with Multi-Head Knowledge Attention
Social Commonsense Reasoning with Multi-Head Knowledge Attention
Debjit Paul
Anette Frank
LRM
139
19
0
12 Oct 2020
Intrinsic Probing through Dimension Selection
Intrinsic Probing through Dimension Selection
Lucas Torroba Hennigen
Adina Williams
Robert Bamler
212
61
0
06 Oct 2020
Previous
123...43444546
Next
Page 44 of 46
Pageof 46