Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2205.01068
Cited By
v1
v2
v3
v4 (latest)
OPT: Open Pre-trained Transformer Language Models
2 May 2022
Susan Zhang
Stephen Roller
Naman Goyal
Mikel Artetxe
Moya Chen
Shuohui Chen
Christopher Dewan
Mona T. Diab
Xian Li
Xi Lin
Todor Mihaylov
Myle Ott
Sam Shleifer
Kurt Shuster
Daniel Simig
Punit Singh Koura
Anjali Sridhar
Tianlu Wang
Luke Zettlemoyer
VLM
OSLM
AI4CE
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (2 upvotes)
Papers citing
"OPT: Open Pre-trained Transformer Language Models"
50 / 2,924 papers shown
Retrieval-Augmented Multimodal Language Modeling
International Conference on Machine Learning (ICML), 2023
Michihiro Yasunaga
Armen Aghajanyan
Weijia Shi
Rich James
J. Leskovec
Abigail Z. Jacobs
M. Lewis
Luke Zettlemoyer
Anuj Kumar
RALM
270
133
0
22 Nov 2022
Multitask Vision-Language Prompt Tuning
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022
Sheng Shen
Shijia Yang
Tianjun Zhang
Bohan Zhai
Joseph E. Gonzalez
Kurt Keutzer
Trevor Darrell
VLM
VPVLM
291
78
0
21 Nov 2022
Deanthropomorphising NLP: Can a Language Model Be Conscious?
PLoS ONE (PLoS ONE), 2022
Matthew Shardlow
Piotr Przybyła
250
10
0
21 Nov 2022
Unsupervised Explanation Generation via Correct Instantiations
AAAI Conference on Artificial Intelligence (AAAI), 2022
Sijie Cheng
Zhiyong Wu
Jiangjie Chen
Zhixing Li
Yang Liu
Lingpeng Kong
ReLM
LRM
174
5
0
21 Nov 2022
The Stack: 3 TB of permissively licensed source code
Denis Kocetkov
Raymond Li
Loubna Ben Allal
Jia Li
Chenghao Mou
...
Sean M. Hughes
Thomas Wolf
Dzmitry Bahdanau
Leandro von Werra
H. D. Vries
245
410
0
20 Nov 2022
Artificial Interrogation for Attributing Language Models
Farhan Dhanani
Muhammad Rafi
64
1
0
20 Nov 2022
An Empirical Study On Contrastive Search And Contrastive Decoding For Open-ended Text Generation
Yixuan Su
Jialu Xu
114
15
0
19 Nov 2022
SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models
International Conference on Machine Learning (ICML), 2022
Guangxuan Xiao
Ji Lin
Mickael Seznec
Hao Wu
Julien Demouth
Song Han
MQ
803
1,219
0
18 Nov 2022
Is the Elephant Flying? Resolving Ambiguities in Text-to-Image Generative Models
Ninareh Mehrabi
Palash Goyal
Apurv Verma
Jwala Dhamala
Varun Kumar
Qian Hu
Kai-Wei Chang
R. Zemel
Aram Galstyan
Rahul Gupta
204
8
0
17 Nov 2022
Ignore Previous Prompt: Attack Techniques For Language Models
Fábio Perez
Ian Ribeiro
SILM
436
634
0
17 Nov 2022
Galactica: A Large Language Model for Science
Ross Taylor
Marcin Kardas
Guillem Cucurull
Thomas Scialom
Anthony Hartshorn
Elvis Saravia
Andrew Poulton
Viktor Kerkez
Robert Stojnic
ELM
ReLM
396
937
0
16 Nov 2022
GAMMT: Generative Ambiguity Modeling Using Multiple Transformers
Xingcheng Xu
173
0
0
16 Nov 2022
On the Compositional Generalization Gap of In-Context Learning
BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP (BlackboxNLP), 2022
Arian Hosseini
Ankit Vani
Dzmitry Bahdanau
Alessandro Sordoni
Rameswar Panda
201
30
0
15 Nov 2022
Mind Your Bias: A Critical Review of Bias Detection Methods for Contextual Language Models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Silke Husse
Andreas Spitz
233
8
0
15 Nov 2022
Evaluating the Factual Consistency of Large Language Models Through News Summarization
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Derek Tam
Anisha Mascarenhas
Shiyue Zhang
Sarah Kwan
Joey Tianyi Zhou
Colin Raffel
HILM
282
132
0
15 Nov 2022
FolkScope: Intention Knowledge Graph Construction for E-commerce Commonsense Discovery
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Changlong Yu
Weiqi Wang
Xin Liu
Jiaxin Bai
Yangqiu Song
Zheng Li
Yifan Gao
Tianyu Cao
Bing Yin
209
29
0
15 Nov 2022
Prompting Language Models for Linguistic Structure
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Terra Blevins
Hila Gonen
Luke Zettlemoyer
LRM
249
52
0
15 Nov 2022
Breadth-First Pipeline Parallelism
J. Lamy-Poirier
GNN
MoE
AI4CE
125
1
0
11 Nov 2022
Measuring Reliability of Large Language Models through Semantic Consistency
Harsh Raj
Domenic Rosati
Subhabrata Majumdar
HILM
273
39
0
10 Nov 2022
Nano: Nested Human-in-the-Loop Reward Learning for Few-shot Language Model Control
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Xiang Fan
Yiwei Lyu
Paul Pu Liang
Ruslan Salakhutdinov
Louis-Philippe Morency
BDL
300
8
0
10 Nov 2022
Collateral facilitation in humans and language models
Conference on Computational Natural Language Learning (CoNLL), 2022
J. Michaelov
Benjamin Bergen
217
14
0
09 Nov 2022
Grammatical Error Correction: A Survey of the State of the Art
Computational Linguistics (CL), 2022
Christopher Bryant
Zheng Yuan
Muhammad Reza Qorib
Hannan Cao
Hwee Tou Ng
Ted Briscoe
3DV
252
115
0
09 Nov 2022
BLOOM: A 176B-Parameter Open-Access Multilingual Language Model
BigScience Workshop
:
Teven Le Scao
Angela Fan
Christopher Akiki
...
Zhongli Xie
Zifan Ye
M. Bras
Younes Belkada
Thomas Wolf
VLM
841
2,772
0
09 Nov 2022
Creative Writing with an AI-Powered Writing Assistant: Perspectives from Professional Writers
Daphne Ippolito
Ann Yuan
Andy Coenen
Sehmon Burnam
231
124
0
09 Nov 2022
Active Example Selection for In-Context Learning
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Yiming Zhang
Shi Feng
Chenhao Tan
SILM
LRM
329
253
0
08 Nov 2022
Intriguing Properties of Compression on Multilingual Models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Kelechi Ogueji
Orevaoghene Ahia
Gbemileke Onilude
Sebastian Gehrmann
Sara Hooker
Julia Kreutzer
302
15
0
04 Nov 2022
MolE: a molecular foundation model for drug discovery
Oscar Méndez-Lucio
C. Nicolaou
Berton Earnshaw
161
14
0
03 Nov 2022
LMentry: A Language Model Benchmark of Elementary Language Tasks
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Avia Efrat
Or Honovich
Omer Levy
239
31
0
03 Nov 2022
Estimating the Carbon Footprint of BLOOM, a 176B Parameter Language Model
Journal of machine learning research (JMLR), 2022
A. Luccioni
S. Viguier
Anne-Laure Ligozat
607
427
0
03 Nov 2022
Large Language Models Are Human-Level Prompt Engineers
International Conference on Learning Representations (ICLR), 2022
Yongchao Zhou
Andrei Ioan Muresanu
Ziwen Han
Keiran Paster
Silviu Pitis
Harris Chan
Jimmy Ba
ALM
LLMAG
508
1,181
0
03 Nov 2022
Preventing Verbatim Memorization in Language Models Gives a False Sense of Privacy
International Conference on Natural Language Generation (INLG), 2022
Daphne Ippolito
Florian Tramèr
Milad Nasr
Chiyuan Zhang
Matthew Jagielski
Katherine Lee
Christopher A. Choquette-Choo
Nicholas Carlini
PILM
MU
385
98
0
31 Oct 2022
SSD-LM: Semi-autoregressive Simplex-based Diffusion Language Model for Text Generation and Modular Control
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Xiaochuang Han
Sachin Kumar
Yulia Tsvetkov
325
147
0
31 Oct 2022
GPTQ: Accurate Post-Training Quantization for Generative Pre-trained Transformers
Elias Frantar
Saleh Ashkboos
Torsten Hoefler
Dan Alistarh
MQ
535
1,573
0
31 Oct 2022
A Solvable Model of Neural Scaling Laws
A. Maloney
Daniel A. Roberts
J. Sully
262
78
0
30 Oct 2022
Knowledge-in-Context: Towards Knowledgeable Semi-Parametric Language Models
International Conference on Learning Representations (ICLR), 2022
Xiaoman Pan
Wenlin Yao
Hongming Zhang
Dian Yu
Dong Yu
Jianshu Chen
KELM
569
28
0
28 Oct 2022
Class Based Thresholding in Early Exit Semantic Segmentation Networks
IEEE Signal Processing Letters (SPL), 2022
Alperen Görmez
Erdem Koyuncu
152
6
0
27 Oct 2022
What Language Model to Train if You Have One Million GPU Hours?
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Teven Le Scao
Thomas Wang
Daniel Hesslow
Lucile Saulnier
Stas Bekman
...
Lintang Sutawika
Jaesung Tae
Zheng-Xin Yong
Julien Launay
Iz Beltagy
MoE
AI4CE
576
120
0
27 Oct 2022
TRScore: A Novel GPT-based Readability Scorer for ASR Segmentation and Punctuation model evaluation and selection
Piyush Behre
S.S. Tan
A. Shah
Harini Kesavamoorthy
Shuangyu Chang
Fei Zuo
C. Basoglu
Sayan D. Pathak
184
1
0
27 Oct 2022
Personalized Dialogue Generation with Persona-Adaptive Attention
AAAI Conference on Artificial Intelligence (AAAI), 2022
Qiushi Huang
Yu Zhang
Tom Ko
Xubo Liu
Boyong Wu
Wenwu Wang
Lilian H. Y. Tang
304
38
0
27 Oct 2022
Multi-lingual Evaluation of Code Generation Models
International Conference on Learning Representations (ICLR), 2022
Ben Athiwaratkun
Sanjay Krishna Gouda
Zijian Wang
Xiaopeng Li
Yuchen Tian
...
Baishakhi Ray
Parminder Bhatia
Sudipta Sengupta
Dan Roth
Bing Xiang
ELM
765
217
0
26 Oct 2022
Scaling Laws Beyond Backpropagation
Matthew J. Filipovich
Alessandro Cappelli
Daniel Hesslow
Julien Launay
209
4
0
26 Oct 2022
RoMQA: A Benchmark for Robust, Multi-evidence, Multi-answer Question Answering
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Victor Zhong
Weijia Shi
Anuj Kumar
Luke Zettlemoyer
229
29
0
25 Oct 2022
Same Pre-training Loss, Better Downstream: Implicit Bias Matters for Language Models
International Conference on Machine Learning (ICML), 2022
Hong Liu
Sang Michael Xie
Zhiyuan Li
Tengyu Ma
AI4CE
326
68
0
25 Oct 2022
Weakly Supervised Data Augmentation Through Prompting for Dialogue Understanding
Maximillian Chen
Alexandros Papangelis
Chenyang Tao
Andrew Rosenbaum
Seokhwan Kim
Yang Liu
Zhou Yu
Dilek Z. Hakkani-Tür
229
38
0
25 Oct 2022
Contrastive Search Is What You Need For Neural Text Generation
Yixuan Su
Nigel Collier
245
67
0
25 Oct 2022
Towards Better Few-Shot and Finetuning Performance with Forgetful Causal Language Models
Hao Liu
Xinyang Geng
Lisa Lee
Igor Mordatch
Sergey Levine
Sharan Narang
Pieter Abbeel
KELM
CLL
255
3
0
24 Oct 2022
Neural Theory-of-Mind? On the Limits of Social Intelligence in Large LMs
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Maarten Sap
Ronan Le Bras
Daniel Fried
Yejin Choi
397
272
0
24 Oct 2022
The Curious Case of Absolute Position Embeddings
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Koustuv Sinha
Amirhossein Kazemnejad
Siva Reddy
J. Pineau
Dieuwke Hupkes
Adina Williams
226
19
0
23 Oct 2022
Exploring The Landscape of Distributional Robustness for Question Answering Models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Anas Awadalla
Mitchell Wortsman
Gabriel Ilharco
Sewon Min
Ian H. Magnusson
Hannaneh Hajishirzi
Ludwig Schmidt
ELM
OOD
KELM
229
23
0
22 Oct 2022
Z-LaVI: Zero-Shot Language Solver Fueled by Visual Imagination
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Yue Yang
Wenlin Yao
Hongming Zhang
Xiaoyang Wang
Dong Yu
Jianshu Chen
VLM
224
24
0
21 Oct 2022
Previous
1
2
3
...
55
56
57
58
59
Next
Page 56 of 59
Page
of 59
Go