2205.01068
Cited By
v4 (latest)
OPT: Open Pre-trained Transformer Language Models
2 May 2022
Susan Zhang
Stephen Roller
Naman Goyal
Mikel Artetxe
Moya Chen
Shuohui Chen
Christopher Dewan
Mona T. Diab
Xian Li
Xi Lin
Todor Mihaylov
Myle Ott
Sam Shleifer
Kurt Shuster
Daniel Simig
Punit Singh Koura
Anjali Sridhar
Tianlu Wang
Luke Zettlemoyer
VLM
OSLM
AI4CE
Papers citing
"OPT: Open Pre-trained Transformer Language Models"
50 / 2,924 papers shown
Augmentation with Projection: Towards an Effective and Efficient Data Augmentation Paradigm for Distillation
International Conference on Learning Representations (ICLR), 2022
Ziqi Wang
Yuexin Wu
Frederick Liu
Daogao Liu
Le Hou
Hongkun Yu
Jing Li
Heng Ji
262
6
0
21 Oct 2022
SMaLL-100: Introducing Shallow Multilingual Machine Translation Model for Low-Resource Languages
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Alireza Mohammadshahi
Vassilina Nikoulina
Alexandre Berard
Caroline Brun
James Henderson
Laurent Besacier
VLM
MoE
LRM
246
24
0
20 Oct 2022
lo-fi: distributed fine-tuning without communication
Mitchell Wortsman
Suchin Gururangan
Shen Li
Ali Farhadi
Ludwig Schmidt
Michael G. Rabbat
Ari S. Morcos
349
24
0
19 Oct 2022
Attribution and Obfuscation of Neural Text Authorship: A Data Mining Perspective
SIGKDD Explorations (SIGKDD Explor.), 2022
Adaku Uchendu
Thai Le
Dongwon Lee
DeLMO
316
51
0
19 Oct 2022
Prompting GPT-3 To Be Reliable
International Conference on Learning Representations (ICLR), 2022
Chenglei Si
Zhe Gan
Zhengyuan Yang
Shuohang Wang
Jianfeng Wang
Jordan L. Boyd-Graber
Lijuan Wang
KELM
LRM
414
343
0
17 Oct 2022
Language Generation Models Can Cause Harm: So What Can We Do About It? An Actionable Survey
Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2022
Sachin Kumar
Vidhisha Balachandran
Lucille Njoo
Antonios Anastasopoulos
Yulia Tsvetkov
ELM
452
106
0
14 Oct 2022
Enabling Classifiers to Make Judgements Explicitly Aligned with Human Values
Yejin Bang
Tiezheng Yu
Andrea Madotto
Mohammad Kachuee
Mona T. Diab
Pascale Fung
178
14
0
14 Oct 2022
Machine Generated Text: A Comprehensive Survey of Threat Models and Detection Methods
IEEE Access (IEEE Access), 2022
Evan Crothers
Nathalie Japkowicz
H. Viktor
DeLMO
386
158
0
13 Oct 2022
Bootstrapping Multilingual Semantic Parsers using Large Language Models
Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2022
Abhijeet Awasthi
Nitish Gupta
Bidisha Samanta
Shachi Dave
Sunita Sarawagi
Partha P. Talukdar
274
7
0
13 Oct 2022
Visual Classification via Description from Large Language Models
International Conference on Learning Representations (ICLR), 2022
Sachit Menon
Carl Vondrick
VLM
385
374
0
13 Oct 2022
MAPL: Parameter-Efficient Adaptation of Unimodal Pre-Trained Models for Vision-Language Few-Shot Prompting
Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2022
Oscar Manas
Pau Rodríguez López
Saba Ahmadi
Aida Nematzadeh
Yash Goyal
Aishwarya Agrawal
VLM
VPVLM
264
58
0
13 Oct 2022
On Divergence Measures for Bayesian Pseudocoresets
Neural Information Processing Systems (NeurIPS), 2022
Balhae Kim
J. Choi
Seanie Lee
Yoonho Lee
Jung-Woo Ha
Juho Lee
DD
188
13
0
12 Oct 2022
Generating Executable Action Plans with Environmentally-Aware Language Models
IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2022
Maitrey Gramopadhye
D. Szafir
LM&Ro
LLMAG
325
38
0
10 Oct 2022
Controllable Dialogue Simulation with In-Context Learning
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Zekun Li
Wenhu Chen
Shiyang Li
Hong Wang
Jingu Qian
Xi Yan
475
57
0
09 Oct 2022
AlphaTuning: Quantization-Aware Parameter-Efficient Adaptation of Large-Scale Pre-Trained Language Models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
S. Kwon
Jeonghoon Kim
Jeongin Bae
Kang Min Yoo
Jin-Hwa Kim
Baeseong Park
Byeongwook Kim
Jung-Woo Ha
Nako Sung
Dongsoo Lee
MQ
283
33
0
08 Oct 2022
Large Language Models can Implement Policy Iteration
Neural Information Processing Systems (NeurIPS), 2022
Ethan A. Brooks
Logan Walls
Richard L. Lewis
Satinder Singh
LM&Ro
OffRL
387
25
0
07 Oct 2022
LLMEffiChecker: Understanding and Testing Efficiency Degradation of Large Language Models
ACM Transactions on Software Engineering and Methodology (TOSEM), 2022
Simin Chen
Cong Liu
Mirazul Haque
Wei Yang
262
33
0
07 Oct 2022
Few-Shot Anaphora Resolution in Scientific Protocols via Mixtures of In-Context Experts
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Nghia T. Le
Fan Bai
Alan Ritter
380
13
0
07 Oct 2022
State-of-the-art generalisation research in NLP: A taxonomy and review
Nature Machine Intelligence (Nat. Mach. Intell.), 2022
Dieuwke Hupkes
Mario Giulianelli
Verna Dankers
Mikel Artetxe
Yanai Elazar
...
Leila Khalatbari
Maria Ryskina
Rita Frieske
Robert Bamler
Zhijing Jin
631
132
0
06 Oct 2022
Guess the Instruction! Flipped Learning Makes Language Models Stronger Zero-Shot Learners
International Conference on Learning Representations (ICLR), 2022
Seonghyeon Ye
Doyoung Kim
Joel Jang
Joongbo Shin
Minjoon Seo
FedML
VLM
UQCV
LRM
451
25
0
06 Oct 2022
A Distributional Lens for Multi-Aspect Controllable Text Generation
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Yuxuan Gu
Xiaocheng Feng
Sicheng Ma
Lingyuan Zhang
Heng Gong
Bing Qin
344
46
0
06 Oct 2022
Large Language Models are Pretty Good Zero-Shot Video Game Bug Detectors
Mohammad Reza Taesiri
Finlay Macklon
Yihe Wang
Hengshuo Shen
Cor-Paul Bezemer
ELM
LLMAG
MLLM
178
21
0
05 Oct 2022
Ask Me Anything: A simple strategy for prompting language models
International Conference on Learning Representations (ICLR), 2022
Simran Arora
A. Narayan
Mayee F. Chen
Laurel J. Orr
Neel Guha
Kush S. Bhatia
Ines Chami
Frederic Sala
Christopher Ré
ReLM
LRM
650
256
0
05 Oct 2022
GLM-130B: An Open Bilingual Pre-trained Model
International Conference on Learning Representations (ICLR), 2022
Aohan Zeng
Xiao Liu
Zhengxiao Du
Zihan Wang
Hanyu Lai
...
Jidong Zhai
Wenguang Chen
Peng Zhang
Yuxiao Dong
Jie Tang
BDL
LRM
805
1,221
0
05 Oct 2022
Explaining Patterns in Data with Language Models via Interpretable Autoprompting
Chandan Singh
John X. Morris
J. Aneja
Alexander M. Rush
Jianfeng Gao
LRM
189
0
0
04 Oct 2022
Text Characterization Toolkit
Daniel Simig
Tianlu Wang
Verna Dankers
Peter Henderson
Khuyagbaatar Batsuren
Dieuwke Hupkes
Mona T. Diab
171
0
0
04 Oct 2022
Knowledge Unlearning for Mitigating Privacy Risks in Language Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Joel Jang
Dongkeun Yoon
Sohee Yang
Sungmin Cha
Moontae Lee
Lajanugen Logeswaran
Minjoon Seo
KELM
PILM
MU
507
365
0
04 Oct 2022
Recitation-Augmented Language Models
International Conference on Learning Representations (ICLR), 2022
Zhiqing Sun
Xuezhi Wang
Yi Tay
Yiming Yang
Denny Zhou
RALM
868
76
0
04 Oct 2022
Robot Task Planning and Situation Handling in Open Worlds
Yan Ding
Xiaohan Zhang
S. Amiri
Nieqing Cao
Hao Yang
Chad Esselink
Shiqi Zhang
LM&Ro
166
23
0
04 Oct 2022
FRMT: A Benchmark for Few-Shot Region-Aware Machine Translation
Transactions of the Association for Computational Linguistics (TACL), 2022
Parker Riley
Timothy Dozat
Jan A. Botha
Xavier Garcia
Dan Garrette
Jason Riesa
Orhan Firat
Noah Constant
325
23
0
01 Oct 2022
Zemi: Learning Zero-Shot Semi-Parametric Language Models from Multiple Tasks
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Zhenhailong Wang
Xiaoman Pan
Dian Yu
Dong Yu
Jianshu Chen
Heng Ji
VLM
271
11
0
01 Oct 2022
AudioGen: Textually Guided Audio Generation
International Conference on Learning Representations (ICLR), 2022
Felix Kreuk
Gabriel Synnaeve
Adam Polyak
Uriel Singer
Alexandre Défossez
Jade Copet
Devi Parikh
Yaniv Taigman
Yossi Adi
DiffM
433
394
0
30 Sep 2022
SmallCap: Lightweight Image Captioning Prompted with Retrieval Augmentation
Computer Vision and Pattern Recognition (CVPR), 2022
R. Ramos
Bruno Martins
Desmond Elliott
Yova Kementchedjhieva
VLM
206
121
0
30 Sep 2022
Unpacking Large Language Models with Conceptual Consistency
Pritish Sahu
Michael Cogswell
Yunye Gong
Ajay Divakaran
LRM
217
20
0
29 Sep 2022
Bidirectional Language Models Are Also Few-shot Learners
International Conference on Learning Representations (ICLR), 2022
Ajay Patel
Bryan Li
Mohammad Sadegh Rasooli
Noah Constant
Colin Raffel
Chris Callison-Burch
LRM
229
70
0
29 Sep 2022
EditEval: An Instruction-Based Benchmark for Text Improvements
Conference on Computational Natural Language Learning (CoNLL), 2022
Jane Dwivedi-Yu
Timo Schick
Zhengbao Jiang
Maria Lomeli
Patrick Lewis
Gautier Izacard
Edouard Grave
Sebastian Riedel
Fabio Petroni
199
31
0
27 Sep 2022
Deep Generative Multimedia Children's Literature
Matthew Lyle Olson
96
0
0
27 Sep 2022
Can Large Language Models Truly Understand Prompts? A Case Study with Negated Prompts
Joel Jang
Seonghyeon Ye
Minjoon Seo
ELM
LRM
255
78
0
26 Sep 2022
Learning to Drop Out: An Adversarial Approach to Training Sequence VAEs
Neural Information Processing Systems (NeurIPS), 2022
Đorđe Miladinović
Kumar Shridhar
Kushal Kumar Jain
Max B. Paulus
J. M. Buhmann
Mrinmaya Sachan
Carl Allen
DRL
324
5
0
26 Sep 2022
Moral Mimicry: Large Language Models Produce Moral Rationalizations Tailored to Political Identity
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Gabriel Simmons
371
87
0
24 Sep 2022
Variational Open-Domain Question Answering
International Conference on Machine Learning (ICML), 2022
Valentin Liévin
Andreas Geert Motzfeldt
Ida Riis Jensen
Ole Winther
OOD
BDL
210
11
0
23 Sep 2022
A Case Report On The "A.I. Locked-In Problem": social concerns with modern NLP
Yoshija Walter
LLMAG
129
3
0
22 Sep 2022
WeLM: A Well-Read Pre-trained Language Model for Chinese
Hui Su
Xiao Zhou
Houjin Yu
Xiaoyu Shen
Yuwen Chen
Zilin Zhu
Yang Yu
Jie Zhou
269
23
0
21 Sep 2022
Generate rather than Retrieve: Large Language Models are Strong Context Generators
International Conference on Learning Representations (ICLR), 2022
Wenhao Yu
Dan Iter
Shuohang Wang
Yichong Xu
Mingxuan Ju
Soumya Sanyal
Chenguang Zhu
Michael Zeng
Meng Jiang
RALM
AIMat
1.2K
398
0
21 Sep 2022
Extremely Simple Activation Shaping for Out-of-Distribution Detection
International Conference on Learning Representations (ICLR), 2022
Andrija Djurisic
Nebojsa Bozanic
Arjun Ashok
Rosanne Liu
OODD
445
206
0
20 Sep 2022
Training Recipe for N:M Structured Sparsity with Decaying Pruning Mask
Sheng-Chun Kao
Amir Yazdanbakhsh
Suvinay Subramanian
Shivani Agrawal
Utku Evci
T. Krishna
329
15
0
15 Sep 2022
FP8 Formats for Deep Learning
Paulius Micikevicius
Dusan Stosic
N. Burgess
Marius Cornea
Pradeep Dubey
...
Naveen Mellempudi
S. Oberman
Mohammad Shoeybi
Michael Siu
Hao Wu
BDL
VLM
MQ
802
202
0
12 Sep 2022
Open-Domain Dialog Evaluation using Follow-Ups Likelihood
International Conference on Computational Linguistics (COLING), 2022
Maxime De Bruyn
Ehsan Lotfi
Jeska Buhmann
Walter Daelemans
206
9
0
12 Sep 2022
Chain of Explanation: New Prompting Method to Generate Higher Quality Natural Language Explanation for Implicit Hate Speech
The Web Conference (WWW), 2022
Fan Huang
Haewoon Kwak
Jisun An
LRM
210
33
0
11 Sep 2022
Analyzing Transformers in Embedding Space
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Guy Dar
Mor Geva
Ankit Gupta
Jonathan Berant
332
124
0
06 Sep 2022