HellaSwag: Can a Machine Really Finish Your Sentence?

Annual Meeting of the Association for Computational Linguistics (ACL), 2019

19 May 2019

Yejin Choi

Papers citing "HellaSwag: Can a Machine Really Finish Your Sentence?"

50 / 2,252 papers shown

Clues Before Answers: Generation-Enhanced Multiple-Choice QA

North American Chapter of the Association for Computational Linguistics (NAACL), 2022

148

30 Apr 2022

Prompt Consistency for Zero-Shot Task Generalization

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022

Chunting Zhou

Junxian He

Xuezhe Ma

Taylor Berg-Kirkpatrick

Graham Neubig

VLM

358

29 Apr 2022

Learning to Split for Automatic Bias Detection

Yujia Bao

Regina Barzilay

215

28 Apr 2022

On the Limitations of Dataset Balancing: The Lost Battle Against Spurious Correlations

Roy Schwartz

Gabriel Stanovsky

188

27 Apr 2022

GPT-NeoX-20B: An Open-Source Autoregressive Language Model

...

368

949

14 Apr 2022

Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback

...

901

3,458

12 Apr 2022

What Language Model Architecture and Pretraining Objective Work Best for Zero-Shot Generalization?

International Conference on Machine Learning (ICML), 2022

282

215

12 Apr 2022

FoundationLayerNorm: Scaling BERT and GPT to 1,000 Layers

Dezhou Shen

AI4CE

09 Apr 2022

Checking HateCheck: a cross-functional analysis of behaviour-aware learning for hate speech detection

Pedro Henrique Luz de Araujo

Benjamin Roth

136

08 Apr 2022

PaLM: Scaling Language Modeling with Pathways

Journal of Machine Learning Research (JMLR), 2022

Sharan Narang

...

Kathy Meier-Hellstern

1.2K

7,418

05 Apr 2022

Training Compute-Optimal Large Language Models

...

792

2,613

29 Mar 2022

REx: Data-Free Residual Quantization Error Expansion

Neural Information Processing Systems (NeurIPS), 2022

338

28 Mar 2022

When Chosen Wisely, More Data Is What You Need: A Universal Sample-Efficient Strategy For Data Augmentation

Findings (Findings), 2022

Ehsan Kamalloo

Mehdi Rezagholizadeh

A. Ghodsi

200

17 Mar 2022

Show Me More Details: Discovering Hierarchies of Procedures from Semi-structured Web Data

Annual Meeting of the Association for Computational Linguistics (ACL), 2022

Graham Neubig

166

14 Mar 2022

Delta Tuning: A Comprehensive Study of Parameter Efficient Methods for Pre-trained Language Models

...

Jianfei Chen

Yang Liu

Jie Tang

Juan Li

Maosong Sun

350

225

14 Mar 2022

Efficient Language Modeling with Sparse all-MLP

Xian Li

182

14 Mar 2022

CoDA21: Evaluating Language Understanding Capabilities of NLP Models With Context-Definition Alignment

Annual Meeting of the Association for Computational Linguistics (ACL), 2022

131

11 Mar 2022

Training language models to follow instructions with human feedback

Neural Information Processing Systems (NeurIPS), 2022

Carroll L. Wainwright

...

2.1K

17,490

04 Mar 2022

A Survey of Knowledge-Intensive NLP with Pre-Trained Language Models

Xiaodong Liu

198

17 Feb 2022

Exploring the Limits of Domain-Adaptive Training for Detoxifying Large-Scale Language Models

Neural Information Processing Systems (NeurIPS), 2022

297

08 Feb 2022

Commonsense Knowledge Reasoning and Generation with Pre-trained Language Models: A Survey

AAAI Conference on Artificial Intelligence (AAAI), 2022

Prajjwal Bhargava

Vincent Ng

ReLM LRM

327

28 Jan 2022

Using DeepSpeed and Megatron to Train Megatron-Turing NLG 530B, A Large-Scale Generative Language Model

...

Yuxiong He

427

810

28 Jan 2022

WANLI: Worker and AI Collaboration for Natural Language Inference Dataset Creation

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022

Alisa Liu

Swabha Swayamdipta

Noah A. Smith

Yejin Choi

634

250

16 Jan 2022

CommonsenseQA 2.0: Exposing the Limits of AI through Gamification

Alon Talmor

Ori Yoran

Ronan Le Bras

Chandrasekhar Bhagavatula

Yoav Goldberg

Yejin Choi

Jonathan Berant

ELM

293

167

14 Jan 2022

Efficient Large Scale Language Modeling with Mixtures of Experts

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021

...

Luke Zettlemoyer

463

223

20 Dec 2021

Few-shot Learning with Multilingual Language Models

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021

...

Luke Zettlemoyer

Xian Li

353

354

20 Dec 2021

KGR^4: Retrieval, Retrospect, Refine and Rethink for Commonsense Generation

Dayiheng Liu

15 Dec 2021

GLaM: Efficient Scaling of Language Models with Mixture-of-Experts

...

680

1,045

13 Dec 2021

Human Parity on CommonsenseQA: Augmenting Self-Attention with External Attention

International Joint Conference on Artificial Intelligence (IJCAI), 2021

Xiaodong Liu

464

06 Dec 2021

MetaQA: Combining Expert Agents for Multi-Skill Question Answering

454

03 Dec 2021

A General Language Assistant as a Laboratory for Alignment

Deep Ganguli

...

443

966

01 Dec 2021

ExT5: Towards Extreme Multi-Task Scaling for Transfer Learning

...

297

230

22 Nov 2021

Adversarially Constructed Evaluation Sets Are More Challenging, but May Not Be Fair

166

16 Nov 2021

Uncertainty Calibration for Ensemble-Based Debiasing Methods

Liang Pang

147

07 Nov 2021

A Systematic Investigation of Commonsense Knowledge in Large Language Models

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021

Xiang Lorraine Li

A. Kuncoro

Jordan Hoffmann

Cyprien de Masson d'Autume

Phil Blunsom

Aida Nematzadeh

LRM

266

31 Oct 2021

MetaICL: Learning to Learn In Context

North American Chapter of the Association for Computational Linguistics (NAACL), 2021

Luke Zettlemoyer

672

573

29 Oct 2021

NormFormer: Improved Transformer Pretraining with Extra Normalization

Sam Shleifer

Jason Weston

Myle Ott

AI4CE

272

18 Oct 2021

Coherence boosting: When your pretrained language model is not paying enough attention

Nikolay Malkin

Zhen Wang

Nebojsa Jojic

RALM

209

15 Oct 2021

Jurassic is (almost) All You Need: Few-Shot Meaning-to-Text Generation for Open-Domain Dialogue

221

15 Oct 2021

SPoT: Better Frozen Model Adaptation through Soft Prompt Transfer

458

314

15 Oct 2021

Can Machines Learn Morality? The Delphi Experiment

...

Yejin Choi

333

152

14 Oct 2021

Does Vision-and-Language Pretraining Improve Lexical Grounding?

228

21 Sep 2021

Fine-Tuned Transformers Show Clusters of Similar Representations Across Layers

Jason Phang

Haokun Liu

Samuel R. Bowman

242

17 Sep 2021

Avoiding Inference Heuristics in Few-shot Prompt-based Finetuning

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021

217

09 Sep 2021

CREAK: A Dataset for Commonsense Reasoning over Entity Knowledge

252

03 Sep 2021

Finetuned Language Models Are Zero-Shot Learners

1.6K

4,587

03 Sep 2021

An Empirical Exploration in Quality Filtering of Text Data

Leo Gao

126

02 Sep 2021

Rethinking Why Intermediate-Task Fine-Tuning Works

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021

Ting-Yun Chang

Chi-Jen Lu

LRM

201

26 Aug 2021

The Stability-Efficiency Dilemma: Investigating Sequence Length Warmup for Training GPT Models

Neural Information Processing Systems (NeurIPS), 2021

Conglong Li

Minjia Zhang

Yuxiong He

326

13 Aug 2021

Goal-Oriented Script Construction

International Conference on Natural Language Generation (INLG), 2021

Qing Lyu

Li Zhang

Chris Callison-Burch

193

28 Jul 2021