Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1905.07830
Cited By
HellaSwag: Can a Machine Really Finish Your Sentence?
Annual Meeting of the Association for Computational Linguistics (ACL), 2019
19 May 2019
Rowan Zellers
Ari Holtzman
Yonatan Bisk
Ali Farhadi
Yejin Choi
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"HellaSwag: Can a Machine Really Finish Your Sentence?"
50 / 2,243 papers shown
Title
PaLM: Scaling Language Modeling with Pathways
Journal of machine learning research (JMLR), 2022
Aakanksha Chowdhery
Sharan Narang
Jacob Devlin
Maarten Bosma
Gaurav Mishra
...
Kathy Meier-Hellstern
Douglas Eck
J. Dean
Slav Petrov
Noah Fiedel
PILM
LRM
1.2K
7,351
0
05 Apr 2022
Training Compute-Optimal Large Language Models
Jordan Hoffmann
Sebastian Borgeaud
A. Mensch
Elena Buchatskaya
Trevor Cai
...
Karen Simonyan
Erich Elsen
Jack W. Rae
Oriol Vinyals
Laurent Sifre
AI4TS
756
2,585
0
29 Mar 2022
REx: Data-Free Residual Quantization Error Expansion
Neural Information Processing Systems (NeurIPS), 2022
Edouard Yvinec
Arnaud Dapgony
Matthieu Cord
Kévin Bailly
MQ
310
9
0
28 Mar 2022
When Chosen Wisely, More Data Is What You Need: A Universal Sample-Efficient Strategy For Data Augmentation
Findings (Findings), 2022
Ehsan Kamalloo
Mehdi Rezagholizadeh
A. Ghodsi
188
11
0
17 Mar 2022
Show Me More Details: Discovering Hierarchies of Procedures from Semi-structured Web Data
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Shuyan Zhou
Li Zhang
Yue Yang
Qing Lyu
Pengcheng Yin
Chris Callison-Burch
Graham Neubig
166
32
0
14 Mar 2022
Delta Tuning: A Comprehensive Study of Parameter Efficient Methods for Pre-trained Language Models
Ning Ding
Yujia Qin
Guang Yang
Fu Wei
Zonghan Yang
...
Jianfei Chen
Yang Liu
Jie Tang
Juan Li
Maosong Sun
322
225
0
14 Mar 2022
Efficient Language Modeling with Sparse all-MLP
Ping Yu
Mikel Artetxe
Myle Ott
Sam Shleifer
Hongyu Gong
Ves Stoyanov
Xian Li
MoE
174
15
0
14 Mar 2022
CoDA21: Evaluating Language Understanding Capabilities of NLP Models With Context-Definition Alignment
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Lutfi Kerem Senel
Timo Schick
Hinrich Schütze
ELM
ALM
125
6
0
11 Mar 2022
Training language models to follow instructions with human feedback
Neural Information Processing Systems (NeurIPS), 2022
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
2.0K
17,203
0
04 Mar 2022
A Survey of Knowledge-Intensive NLP with Pre-Trained Language Models
Da Yin
Li Dong
Hao Cheng
Xiaodong Liu
Kai-Wei Chang
Furu Wei
Jianfeng Gao
KELM
182
36
0
17 Feb 2022
Exploring the Limits of Domain-Adaptive Training for Detoxifying Large-Scale Language Models
Neural Information Processing Systems (NeurIPS), 2022
Wei Ping
Ming-Yu Liu
Chaowei Xiao
Peng Xu
M. Patwary
Mohammad Shoeybi
Yue Liu
Anima Anandkumar
Bryan Catanzaro
277
79
0
08 Feb 2022
Commonsense Knowledge Reasoning and Generation with Pre-trained Language Models: A Survey
AAAI Conference on Artificial Intelligence (AAAI), 2022
Prajjwal Bhargava
Vincent Ng
ReLM
LRM
323
73
0
28 Jan 2022
Using DeepSpeed and Megatron to Train Megatron-Turing NLG 530B, A Large-Scale Generative Language Model
Shaden Smith
M. Patwary
Brandon Norick
P. LeGresley
Samyam Rajbhandari
...
Mohammad Shoeybi
Yuxiong He
Michael Houston
Saurabh Tiwary
Bryan Catanzaro
MoE
391
807
0
28 Jan 2022
WANLI: Worker and AI Collaboration for Natural Language Inference Dataset Creation
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Alisa Liu
Swabha Swayamdipta
Noah A. Smith
Yejin Choi
573
250
0
16 Jan 2022
CommonsenseQA 2.0: Exposing the Limits of AI through Gamification
Alon Talmor
Ori Yoran
Ronan Le Bras
Chandrasekhar Bhagavatula
Yoav Goldberg
Yejin Choi
Jonathan Berant
ELM
284
166
0
14 Jan 2022
Efficient Large Scale Language Modeling with Mixtures of Experts
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Mikel Artetxe
Shruti Bhosale
Naman Goyal
Todor Mihaylov
Myle Ott
...
Jeff Wang
Luke Zettlemoyer
Mona T. Diab
Zornitsa Kozareva
Ves Stoyanov
MoE
450
220
0
20 Dec 2021
Few-shot Learning with Multilingual Language Models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Xi Lin
Todor Mihaylov
Mikel Artetxe
Tianlu Wang
Shuohui Chen
...
Luke Zettlemoyer
Zornitsa Kozareva
Mona T. Diab
Ves Stoyanov
Xian Li
BDL
ELM
LRM
301
351
0
20 Dec 2021
KGR^4: Retrieval, Retrospect, Refine and Rethink for Commonsense Generation
Xin Liu
Dayiheng Liu
Baosong Yang
Haibo Zhang
Junwei Ding
Wenqing Yao
Weihua Luo
Haiying Zhang
Jinsong Su
LRM
95
8
0
15 Dec 2021
GLaM: Efficient Scaling of Language Models with Mixture-of-Experts
Nan Du
Yanping Huang
Andrew M. Dai
Simon Tong
Dmitry Lepikhin
...
Kun Zhang
Quoc V. Le
Yonghui Wu
Zhiwen Chen
Claire Cui
ALM
MoE
637
1,035
0
13 Dec 2021
Human Parity on CommonsenseQA: Augmenting Self-Attention with External Attention
International Joint Conference on Artificial Intelligence (IJCAI), 2021
Yichong Xu
Chenguang Zhu
Shuohang Wang
Siqi Sun
Hao Cheng
Xiaodong Liu
Jianfeng Gao
Pengcheng He
Michael Zeng
Xuedong Huang
LRM
455
62
0
06 Dec 2021
MetaQA: Combining Expert Agents for Multi-Skill Question Answering
Haritz Puerto
Gözde Gül Sahin
Iryna Gurevych
LLMAG
394
27
0
03 Dec 2021
A General Language Assistant as a Laboratory for Alignment
Amanda Askell
Yuntao Bai
Anna Chen
Dawn Drain
Deep Ganguli
...
Tom B. Brown
Jack Clark
Sam McCandlish
C. Olah
Jared Kaplan
ALM
396
960
0
01 Dec 2021
ExT5: Towards Extreme Multi-Task Scaling for Transfer Learning
V. Aribandi
Yi Tay
Tal Schuster
J. Rao
H. Zheng
...
Jianmo Ni
Jai Gupta
Kai Hui
Sebastian Ruder
Donald Metzler
MoE
267
227
0
22 Nov 2021
Adversarially Constructed Evaluation Sets Are More Challenging, but May Not Be Fair
Jason Phang
Angelica Chen
William Huang
Samuel R. Bowman
AAML
158
14
0
16 Nov 2021
Uncertainty Calibration for Ensemble-Based Debiasing Methods
Ruibin Xiong
Yimeng Chen
Liang Pang
Xueqi Chen
Yanyan Lan
140
23
0
07 Nov 2021
A Systematic Investigation of Commonsense Knowledge in Large Language Models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Xiang Lorraine Li
A. Kuncoro
Jordan Hoffmann
Cyprien de Masson dÁutume
Phil Blunsom
Aida Nematzadeh
LRM
245
72
0
31 Oct 2021
MetaICL: Learning to Learn In Context
North American Chapter of the Association for Computational Linguistics (NAACL), 2021
Sewon Min
M. Lewis
Luke Zettlemoyer
Hannaneh Hajishirzi
LRM
628
572
0
29 Oct 2021
NormFormer: Improved Transformer Pretraining with Extra Normalization
Sam Shleifer
Jason Weston
Myle Ott
AI4CE
241
84
0
18 Oct 2021
Coherence boosting: When your pretrained language model is not paying enough attention
Nikolay Malkin
Zhen Wang
Nebojsa Jojic
RALM
178
42
0
15 Oct 2021
Jurassic is (almost) All You Need: Few-Shot Meaning-to-Text Generation for Open-Domain Dialogue
Lena Reed
Cecilia Li
Angela Ramirez
Liren Wu
M. Walker
185
8
0
15 Oct 2021
SPoT: Better Frozen Model Adaptation through Soft Prompt Transfer
Tu Vu
Brian Lester
Noah Constant
Rami Al-Rfou
Daniel Cer
VLM
LRM
430
313
0
15 Oct 2021
Can Machines Learn Morality? The Delphi Experiment
Liwei Jiang
Jena D. Hwang
Chandra Bhagavatula
Ronan Le Bras
Jenny T Liang
...
Yulia Tsvetkov
Oren Etzioni
Maarten Sap
Regina A. Rini
Yejin Choi
FaML
309
151
0
14 Oct 2021
Does Vision-and-Language Pretraining Improve Lexical Grounding?
Tian Yun
Chen Sun
Ellie Pavlick
VLM
CoGe
219
36
0
21 Sep 2021
Fine-Tuned Transformers Show Clusters of Similar Representations Across Layers
Jason Phang
Haokun Liu
Samuel R. Bowman
218
34
0
17 Sep 2021
Avoiding Inference Heuristics in Few-shot Prompt-based Finetuning
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Prasetya Ajie Utama
N. Moosavi
Victor Sanh
Iryna Gurevych
AAML
206
36
0
09 Sep 2021
CREAK: A Dataset for Commonsense Reasoning over Entity Knowledge
Yasumasa Onoe
Michael J.Q. Zhang
Eunsol Choi
Greg Durrett
HILM
230
94
0
03 Sep 2021
Finetuned Language Models Are Zero-Shot Learners
Jason W. Wei
Maarten Bosma
Vincent Zhao
Kelvin Guu
Adams Wei Yu
Brian Lester
Nan Du
Andrew M. Dai
Quoc V. Le
ALM
UQCV
1.3K
4,558
0
03 Sep 2021
An Empirical Exploration in Quality Filtering of Text Data
Leo Gao
126
12
0
02 Sep 2021
Rethinking Why Intermediate-Task Fine-Tuning Works
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Ting-Yun Chang
Chi-Jen Lu
LRM
196
32
0
26 Aug 2021
The Stability-Efficiency Dilemma: Investigating Sequence Length Warmup for Training GPT Models
Neural Information Processing Systems (NeurIPS), 2021
Conglong Li
Minjia Zhang
Yuxiong He
302
50
0
13 Aug 2021
Goal-Oriented Script Construction
International Conference on Natural Language Generation (INLG), 2021
Qing Lyu
Li Zhang
Chris Callison-Burch
193
35
0
28 Jul 2021
QA Dataset Explosion: A Taxonomy of NLP Resources for Question Answering and Reading Comprehension
ACM Computing Surveys (CSUR), 2021
Anna Rogers
Matt Gardner
Isabelle Augenstein
357
188
0
27 Jul 2021
HTLM: Hyper-Text Pre-Training and Prompting of Language Models
International Conference on Learning Representations (ICLR), 2021
Armen Aghajanyan
Dmytro Okhonko
M. Lewis
Mandar Joshi
Hu Xu
Gargi Ghosh
Luke Zettlemoyer
VLM
VPVLM
AI4TS
AI4CE
169
79
0
14 Jul 2021
All That's 'Human' Is Not Gold: Evaluating Human Evaluation of Generated Text
Annual Meeting of the Association for Computational Linguistics (ACL), 2021
Elizabeth Clark
Tal August
Sofia Serrano
Nikita Haduong
Suchin Gururangan
Noah A. Smith
DeLMO
515
480
0
30 Jun 2021
Learning Stable Classifiers by Transferring Unstable Features
International Conference on Machine Learning (ICML), 2021
Yujia Bao
Shiyu Chang
Regina Barzilay
OOD
309
8
0
15 Jun 2021
Improving Paraphrase Detection with the Adversarial Paraphrasing Task
Annual Meeting of the Association for Computational Linguistics (ACL), 2021
Animesh Nighojkar
John Licato
149
40
0
14 Jun 2021
ImaginE: An Imagination-Based Automatic Evaluation Metric for Natural Language Generation
Findings (Findings), 2021
Wanrong Zhu
Xinze Wang
An Yan
Miguel P. Eckstein
Wenjie Wang
147
7
0
10 Jun 2021
Bayesian Attention Belief Networks
International Conference on Machine Learning (ICML), 2021
Shujian Zhang
Xinjie Fan
Bo Chen
Mingyuan Zhou
BDL
210
34
0
09 Jun 2021
PROST: Physical Reasoning of Objects through Space and Time
Findings (Findings), 2021
Stéphane Aroca-Ouellette
Cory Paik
Alessandro Roncone
Katharina Kann
LRM
140
53
0
07 Jun 2021
MedNLI Is Not Immune: Natural Language Inference Artifacts in the Clinical Domain
Annual Meeting of the Association for Computational Linguistics (ACL), 2021
Christine Herlihy
Rachel Rudinger
120
28
0
02 Jun 2021
Previous
1
2
3
...
42
43
44
45
Next