2205.01068
Cited By
OPT: Open Pre-trained Transformer Language Models
2 May 2022
Susan Zhang
Stephen Roller
Naman Goyal
Mikel Artetxe
Moya Chen
Shuohui Chen
Christopher Dewan
Mona T. Diab
Xian Li
Xi Lin
Todor Mihaylov
Myle Ott
Sam Shleifer
Kurt Shuster
Daniel Simig
Punit Singh Koura
Anjali Sridhar
Tianlu Wang
Luke Zettlemoyer
VLM
OSLM
AI4CE
Papers citing
"OPT: Open Pre-trained Transformer Language Models"
50 / 2,924 papers shown
ChatGPT Asks, BLIP-2 Answers: Automatic Questioning Towards Enriched Visual Descriptions
Deyao Zhu
Jun Chen
Kilichbek Haydarov
Xiaoqian Shen
Wenxuan Zhang
Mohamed Elhoseiny
MLLM
241
126
0
12 Mar 2023
Task and Motion Planning with Large Language Models for Object Rearrangement
IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2023
Yan Ding
Xiaohan Zhang
Chris Paxton
Shiqi Zhang
LM&Ro
LRM
570
227
0
10 Mar 2023
ICL-D3IE: In-Context Learning with Diverse Demonstrations Updating for Document Information Extraction
IEEE International Conference on Computer Vision (ICCV), 2023
Jiabang He
Lei Wang
Yingpeng Hu
Ning Liu
Hui-juan Liu
Xingdong Xu
Hengtao Shen
MLLM
281
56
0
09 Mar 2023
Stealing the Decoding Algorithms of Language Models
Conference on Computer and Communications Security (CCS), 2023
A. Naseh
Kalpesh Krishna
Mohit Iyyer
Amir Houmansadr
MLAU
313
29
0
08 Mar 2023
A Comprehensive Survey of AI-Generated Content (AIGC): A History of Generative AI from GAN to ChatGPT
Yihan Cao
Siyu Li
Yixin Liu
Zhiling Yan
Yutong Dai
Philip S. Yu
Lichao Sun
277
727
0
07 Mar 2023
SemEval-2023 Task 10: Explainable Detection of Online Sexism
International Workshop on Semantic Evaluation (SemEval), 2023
Hannah Rose Kirk
Wenjie Yin
Bertie Vidgen
Paul Röttger
299
144
0
07 Mar 2023
The BigScience ROOTS Corpus: A 1.6TB Composite Multilingual Dataset
Neural Information Processing Systems (NeurIPS), 2023
Hugo Laurençon
Lucile Saulnier
Thomas Wang
Christopher Akiki
Albert Villanova del Moral
...
Violette Lepercq
Suzana Ilić
Margaret Mitchell
Sasha Luccioni
Yacine Jernite
AI4CE
AILaw
214
199
0
07 Mar 2023
LIDA: A Tool for Automatic Generation of Grammar-Agnostic Visualizations and Infographics using Large Language Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Victor C. Dibia
VLM
330
132
0
06 Mar 2023
OpenICL: An Open-Source Framework for In-context Learning
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Zhenyu Wu
Yaoxiang Wang
Jiacheng Ye
Jiangtao Feng
Jingjing Xu
Yu Qiao
Zhiyong Wu
196
59
0
06 Mar 2023
Data Portraits: Recording Foundation Model Training Data
Neural Information Processing Systems (NeurIPS), 2023
Marc Marone
Benjamin Van Durme
523
37
0
06 Mar 2023
Prismer: A Vision-Language Model with Multi-Task Experts
Shikun Liu
Linxi Fan
Edward Johns
Zhiding Yu
Chaowei Xiao
Anima Anandkumar
VLM
MLLM
325
33
0
04 Mar 2023
Investigating the Translation Performance of a Large Multilingual Language Model: the Case of BLOOM
European Association for Machine Translation Conferences/Workshops (EAMT), 2023
Rachel Bawden
François Yvon
VLM
LRM
296
89
0
03 Mar 2023
Competence-Based Analysis of Language Models
Adam Davies
Jize Jiang
Chengxiang Zhai
ELM
368
7
0
01 Mar 2023
How Robust is GPT-3.5 to Predecessors? A Comprehensive Study on Language Understanding Tasks
Xuanting Chen
Junjie Ye
Can Zu
Nuo Xu
Rui Zheng
Minlong Peng
Jie Zhou
Tao Gui
Xuanjing Huang
AI4MH
ELM
182
100
0
01 Mar 2023
EvoPrompting: Language Models for Code-Level Neural Architecture Search
Neural Information Processing Systems (NeurIPS), 2023
Angelica Chen
David Dohan
David R. So
VLM
LRM
473
126
0
28 Feb 2023
Investigating the Effectiveness of Task-Agnostic Prefix Prompt for Instruction Following
AAAI Conference on Artificial Intelligence (AAAI), 2023
Seonghyeon Ye
Hyeonbin Hwang
Sohee Yang
Hyeongu Yun
Yireun Kim
Minjoon Seo
LRM
251
46
0
28 Feb 2023
HugNLP: A Unified and Comprehensive Library for Natural Language Processing
International Conference on Information and Knowledge Management (CIKM), 2023
Jiadong Wang
Polydoros Giannouris
Qiushi Sun
Wenkang Huang
Chengyu Wang
Ming Gao
204
6
0
28 Feb 2023
LLaMA: Open and Efficient Foundation Language Models
Hugo Touvron
Thibaut Lavril
Gautier Izacard
Xavier Martinet
Marie-Anne Lachaux
...
Faisal Azhar
Aurelien Rodriguez
Armand Joulin
Edouard Grave
Guillaume Lample
ALM
PILM
8.4K
18,046
0
27 Feb 2023
Finding Support Examples for In-Context Learning
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Xiaonan Li
Xipeng Qiu
336
121
0
27 Feb 2023
Fast Attention Requires Bounded Entries
Neural Information Processing Systems (NeurIPS), 2023
Josh Alman
Zhao Song
313
102
0
26 Feb 2023
Does a Neural Network Really Encode Symbolic Concepts?
International Conference on Machine Learning (ICML), 2023
Mingjie Li
Quanshi Zhang
307
35
0
25 Feb 2023
AugGPT: Leveraging ChatGPT for Text Data Augmentation
IEEE Transactions on Big Data (IEEE Trans. Big Data), 2023
Haixing Dai
Zheng Liu
Wenxiong Liao
Xiaoke Huang
Yihan Cao
...
Lichao Sun
Shijie Zhao
Hongtu Zhu
Tianming Liu
Xiang Li
311
243
0
25 Feb 2023
Semantic Mechanical Search with Large Vision and Language Models
Conference on Robot Learning (CoRL), 2023
Satvik Sharma
Huang Huang
K. Shivakumar
A. Imran
Ryan Hoque
Brian Ichter
Ken Goldberg
LM&Ro
VLM
289
10
0
24 Feb 2023
Check Your Facts and Try Again: Improving Large Language Models with External Knowledge and Automated Feedback
Baolin Peng
Michel Galley
Pengcheng He
Hao Cheng
Yujia Xie
...
Qiuyuan Huang
Lars Liden
Zhou Yu
Weizhu Chen
Jianfeng Gao
KELM
HILM
LRM
407
481
0
24 Feb 2023
In What Languages are Generative Language Models the Most Formal? Analyzing Formality Distribution across Languages
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Asim Ersoy
Gerson Vizcarra
T. Mayeesha
Benjamin Muller
231
4
0
23 Feb 2023
Active Prompting with Chain-of-Thought for Large Language Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Shizhe Diao
Pengcheng Wang
Yong Lin
Tong Zhang
ReLM
KELM
LLMAG
LRM
485
186
0
23 Feb 2023
AlpaServe: Statistical Multiplexing with Model Parallelism for Deep Learning Serving
Zhuohan Li
Lianmin Zheng
Yinmin Zhong
Vincent Liu
Ying Sheng
...
Yanping Huang
Zhifeng Chen
Hao Zhang
Joseph E. Gonzalez
Ion Stoica
MoE
325
152
0
22 Feb 2023
On the Robustness of ChatGPT: An Adversarial and Out-of-distribution Perspective
IEEE Data Engineering Bulletin (IEEE Data Eng. Bull.), 2023
Yongfeng Zhang
Xixu Hu
Wenxin Hou
Hao Chen
Runkai Zheng
...
Weirong Ye
Xiubo Geng
Binxing Jiao
Yue Zhang
Xingxu Xie
AI4MH
520
290
0
22 Feb 2023
In-context Example Selection with Influences
Nguyen Tai
Eric Wong
360
69
0
21 Feb 2023
kNN-Adapter: Efficient Domain Adaptation for Black-Box Language Models
Yangsibo Huang
Daogao Liu
Zexuan Zhong
Weijia Shi
Y. Lee
RALM
ALM
179
20
0
21 Feb 2023
Fantastic Rewards and How to Tame Them: A Case Study on Reward Learning for Task-oriented Dialogue Systems
International Conference on Learning Representations (ICLR), 2023
Yihao Feng
Shentao Yang
Shujian Zhang
Jianguo Zhang
Caiming Xiong
Mi Zhou
Haiquan Wang
OffRL
217
26
0
20 Feb 2023
Semantic Uncertainty: Linguistic Invariances for Uncertainty Estimation in Natural Language Generation
International Conference on Learning Representations (ICLR), 2023
Lorenz Kuhn
Y. Gal
Sebastian Farquhar
UQLM
666
484
0
19 Feb 2023
Can ChatGPT Understand Too? A Comparative Study on ChatGPT and Fine-tuned BERT
Qihuang Zhong
Liang Ding
Juhua Liu
Bo Du
Dacheng Tao
AI4MH
301
293
0
19 Feb 2023
Complex QA and language models hybrid architectures, Survey
Xavier Daull
P. Bellot
Emmanuel Bruno
Vincent Martin
Elisabeth Murisasco
ELM
731
17
0
17 Feb 2023
Auditing large language models: a three-layered approach
AI and Ethics (AE), 2023
Jakob Mökander
Jonas Schuett
Hannah Rose Kirk
Luciano Floridi
AILaw
MLAU
492
277
0
16 Feb 2023
Counting Carbon: A Survey of Factors Influencing the Emissions of Machine Learning
A. Luccioni
Alex Hernandez-Garcia
238
71
0
16 Feb 2023
Slapo: A Schedule Language for Progressive Optimization of Large Deep Learning Model Training
International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), 2023
Hongzheng Chen
Cody Hao Yu
Shuai Zheng
Zhen Zhang
Zhiru Zhang
Yida Wang
358
13
0
16 Feb 2023
Commonsense Reasoning for Conversational AI: A Survey of the State of the Art
Christopher Richardson
Larry Heck
LRM
207
10
0
15 Feb 2023
Speculative Decoding with Big Little Decoder
Neural Information Processing Systems (NeurIPS), 2023
Sehoon Kim
K. Mangalam
Suhong Moon
Jitendra Malik
Michael W. Mahoney
A. Gholami
Kurt Keutzer
MoE
455
163
0
15 Feb 2023
Dictionary-based Phrase-level Prompting of Large Language Models for Machine Translation
Marjan Ghazvininejad
Hila Gonen
Luke Zettlemoyer
227
94
0
15 Feb 2023
Measuring the Instability of Fine-Tuning
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Yupei Du
D. Nguyen
256
7
0
15 Feb 2023
On the Planning Abilities of Large Language Models (A Critical Investigation with a Proposed Benchmark)
Kaya Stechly
S. Sreedharan
Matthew Marquez
Alberto Olmo Hernandez
Subbarao Kambhampati
LLMAG
LRM
148
102
0
13 Feb 2023
Do Vision and Language Models Share Concepts? A Vector Space Alignment Study
Transactions of the Association for Computational Linguistics (TACL), 2023
Jiaang Li
Yova Kementchedjhieva
Constanza Fierro
Anders Søgaard
VLM
259
19
0
13 Feb 2023
A Unified View of Long-Sequence Models towards Modeling Million-Scale Dependencies
Hongyu Hè
Marko Kabić
276
2
0
13 Feb 2023
Transformer models: an introduction and catalog
X. Amatriain
Ananth Sankar
Jie Bing
Praveen Kumar Bodigutla
Timothy J. Hazen
Michaeel Kazi
500
73
0
12 Feb 2023
A Reparameterized Discrete Diffusion Model for Text Generation
Lin Zheng
Jianbo Yuan
Lei Yu
Lingpeng Kong
DiffM
289
119
0
11 Feb 2023
Distillation of encoder-decoder transformers for sequence labelling
Findings of the Association for Computational Linguistics: EACL (Findings), 2023
M. Farina
D. Pappadopulo
Anant Gupta
Leslie Huang
Ozan Irsoy
Thamar Solorio
VLM
337
3
0
10 Feb 2023
Selective In-Context Data Augmentation for Intent Detection using Pointwise V-Information
Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2023
Yen-Ting Lin
Alexandros Papangelis
Seokhwan Kim
Sungjin Lee
Devamanyu Hazarika
Mahdi Namazifar
Di Jin
Yang Liu
Dilek Z. Hakkani-Tür
173
43
0
10 Feb 2023
In-Context Learning with Many Demonstration Examples
Mukai Li
Shansan Gong
Jiangtao Feng
Yiheng Xu
Jinchao Zhang
Zhiyong Wu
Lingpeng Kong
269
42
0
09 Feb 2023
Offsite-Tuning: Transfer Learning without Full Model
Guangxuan Xiao
Ji Lin
Song Han
205
99
0
09 Feb 2023