Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2205.01068
Cited By
v1
v2
v3
v4 (latest)
OPT: Open Pre-trained Transformer Language Models
2 May 2022
Susan Zhang
Stephen Roller
Naman Goyal
Mikel Artetxe
Moya Chen
Shuohui Chen
Christopher Dewan
Mona T. Diab
Xian Li
Xi Lin
Todor Mihaylov
Myle Ott
Sam Shleifer
Kurt Shuster
Daniel Simig
Punit Singh Koura
Anjali Sridhar
Tianlu Wang
Luke Zettlemoyer
VLM
OSLM
AI4CE
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (2 upvotes)
Papers citing
"OPT: Open Pre-trained Transformer Language Models"
50 / 2,924 papers shown
Recommendation as Instruction Following: A Large Language Model Empowered Recommendation Approach
Junjie Zhang
Ruobing Xie
Yupeng Hou
Wayne Xin Zhao
Leyu Lin
Ji-Rong Wen
325
308
0
11 May 2023
Self-Chained Image-Language Model for Video Localization and Question Answering
Neural Information Processing Systems (NeurIPS), 2023
Shoubin Yu
Jaemin Cho
Prateek Yadav
Joey Tianyi Zhou
401
201
0
11 May 2023
Evaluating Open-Domain Question Answering in the Era of Large Language Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Ehsan Kamalloo
Nouha Dziri
C. Clarke
Davood Rafiei
ELM
493
148
0
11 May 2023
Active Retrieval Augmented Generation
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Zhengbao Jiang
Frank F. Xu
Luyu Gao
Zhiqing Sun
Qian Liu
Jane Dwivedi-Yu
Yiming Yang
Jamie Callan
Graham Neubig
RALM
405
494
0
11 May 2023
INGENIOUS: Using Informative Data Subsets for Efficient Pre-Training of Language Models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
H. S. V. N. S. K. Renduchintala
Krishnateja Killamsetty
S. Bhatia
Milan Aggarwal
Ganesh Ramakrishnan
Rishabh K. Iyer
Balaji Krishnamurthy
AIFin
132
4
0
11 May 2023
Chain-of-Dictionary Prompting Elicits Translation in Large Language Models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Hongyuan Lu
Haoran Yang
Haoyang Huang
Dongdong Zhang
Wai Lam
Furu Wei
LRM
AI4CE
334
25
0
11 May 2023
Do LLMs Understand User Preferences? Evaluating LLMs On User Rating Prediction
Wang-Cheng Kang
Jianmo Ni
Nikhil Mehta
M. Sathiamoorthy
Lichan Hong
Ed H. Chi
D. Cheng
210
160
0
10 May 2023
VideoChat: Chat-Centric Video Understanding
Kunchang Li
Yinan He
Yi Wang
Yizhuo Li
Wen Wang
Ping Luo
Yali Wang
Limin Wang
Yu Qiao
MLLM
414
799
0
10 May 2023
Fast Distributed Inference Serving for Large Language Models
Bingyang Wu
Yinmin Zhong
Zili Zhang
Gang Huang
Xuanzhe Liu
Xin Jin
229
146
0
10 May 2023
InternGPT: Solving Vision-Centric Tasks by Interacting with ChatGPT Beyond Language
Zhaoyang Liu
Yinan He
Wenhai Wang
Weiyun Wang
Yi Wang
...
Yali Wang
Limin Wang
Ping Luo
Jifeng Dai
Yu Qiao
LRM
MLLM
406
107
0
09 May 2023
Large Language Model Programs
Imanol Schlag
Sainbayar Sukhbaatar
Asli Celikyilmaz
Anuj Kumar
Jason Weston
Jürgen Schmidhuber
Xian Li
LRM
215
16
0
09 May 2023
StarCoder: may the source be with you!
Raymond Li
Loubna Ben Allal
Yangtian Zi
Niklas Muennighoff
Denis Kocetkov
...
Sean M. Hughes
Thomas Wolf
Arjun Guha
Leandro von Werra
H. D. Vries
515
1,058
0
09 May 2023
SUR-adapter: Enhancing Text-to-Image Pre-trained Diffusion Models with Large Language Models
ACM Multimedia (ACM MM), 2023
Shan Zhong
Zhongzhan Huang
Wushao Wen
Jinghui Qin
Guanbin Li
376
52
0
09 May 2023
MoT: Memory-of-Thought Enables ChatGPT to Self-Improve
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Xiaonan Li
Xipeng Qiu
ReLM
KELM
LRM
AI4MH
336
51
0
09 May 2023
Read, Diagnose and Chat: Towards Explainable and Interactive LLMs-Augmented Depression Detection in Social Media
Wei Qin
Zetong Chen
Lei Wang
Yunshi Lan
Wei Ren
Richang Hong
AI4MH
233
25
0
09 May 2023
Explanation-based Finetuning Makes Models More Robust to Spurious Cues
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Josh Magnus Ludan
Yixuan Meng
Nguyen Tai
Saurabh Shah
Qing Lyu
Marianna Apidianaki
Chris Callison-Burch
AAML
LRM
348
24
0
08 May 2023
The Current State of Summarization
Fabian Retkowski
282
10
0
08 May 2023
How Do In-Context Examples Affect Compositional Generalization?
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Shengnan An
Zeqi Lin
Qiang Fu
B. Chen
Nanning Zheng
Jian-Guang Lou
Dongmei Zhang
408
70
0
08 May 2023
Augmented Large Language Models with Parametric Knowledge Guiding
Ziyang Luo
Can Xu
Lu Wang
Xiubo Geng
Chongyang Tao
Jing Ma
Qingwei Lin
Daxin Jiang
KELM
RALM
320
57
0
08 May 2023
Prompted LLMs as Chatbot Modules for Long Open-domain Conversation
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Gibbeum Lee
Volker Hartmann
Jongho Park
Dimitris Papailiopoulos
Kangwook Lee
177
80
0
08 May 2023
Residual Prompt Tuning: Improving Prompt Tuning with Residual Reparameterization
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Anastasia Razdaibiedina
Yuning Mao
Rui Hou
Madian Khabsa
M. Lewis
Jimmy Ba
Amjad Almahairi
VLM
172
72
0
06 May 2023
Structure-CLIP: Towards Scene Graph Knowledge to Enhance Multi-modal Structured Representations
AAAI Conference on Artificial Intelligence (AAAI), 2023
Yufen Huang
Jiji Tang
Zhuo Chen
Rongsheng Zhang
Xinfeng Zhang
...
Zeng Zhao
Zhou Zhao
Tangjie Lv
Zhipeng Hu
Wen Zhang
VLM
311
49
0
06 May 2023
LMEye: An Interactive Perception Network for Large Language Models
IEEE transactions on multimedia (IEEE TMM), 2023
Yunxin Li
Baotian Hu
Xinyu Chen
Lin Ma
Yong-mei Xu
Hao Fei
MLLM
VLM
290
41
0
05 May 2023
DAMO-NLP at SemEval-2023 Task 2: A Unified Retrieval-augmented System for Multilingual Named Entity Recognition
International Workshop on Semantic Evaluation (SemEval), 2023
Zeqi Tan
Shen Huang
Zixia Jia
Jiong Cai
Hai-Tao Zheng
...
Yueting Zhuang
Kewei Tu
Pengjun Xie
Fei Huang
Yong Jiang
178
13
0
05 May 2023
LMs stand their Ground: Investigating the Effect of Embodiment in Figurative Language Interpretation by Language Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Philipp Wicke
270
7
0
05 May 2023
VicunaNER: Zero/Few-shot Named Entity Recognition using Vicuna
Shezheng Song
136
15
0
05 May 2023
Otter: A Multi-Modal Model with In-Context Instruction Tuning
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Yue Liu
Yuanhan Zhang
Liangyu Chen
Jinghao Wang
Fanyi Pu
Joshua Adrian Cahyono
Jingkang Yang
Yu Qiao
MLLM
531
627
0
05 May 2023
Can LLM Already Serve as A Database Interface? A BIg Bench for Large-Scale Database Grounded Text-to-SQLs
Neural Information Processing Systems (NeurIPS), 2023
Jinyang Li
Binyuan Hui
Ge Qu
Jiaxi Yang
Binhua Li
...
Guoliang Li
Kevin C. C. Chang
Fei Huang
Reynold Cheng
Yongbin Li
LMTD
407
717
0
04 May 2023
Sentence Embedding Leaks More Information than You Expect: Generative Embedding Inversion Attack to Recover the Whole Sentence
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Haoran Li
Mingshi Xu
Yangqiu Song
293
81
0
04 May 2023
Caption Anything: Interactive Image Description with Diverse Multimodal Controls
Teng Wang
Jinrui Zhang
Junjie Fei
Hao Zheng
Yunlong Tang
Zhe Li
Mingqi Gao
Shanshan Zhao
MLLM
483
125
0
04 May 2023
Conformal Nucleus Sampling
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Shauli Ravfogel
Carlos Wert Carvajal
M.F. Eggl
UQLM
306
31
0
04 May 2023
"Oops, Did I Just Say That?" Testing and Repairing Unethical Suggestions of Large Language Models with Suggest-Critique-Reflect Process
Anna Glazkova
Zongjie Li
Michael Kadantsev
Maksim Glazkov
KELM
200
15
0
04 May 2023
Should ChatGPT and Bard Share Revenue with Their Data Providers? A New Business Model for the AI Era
Advances in Artificial Intelligence and Machine Learning (AAIML), 2023
Dong Zhang
143
5
0
04 May 2023
Cuttlefish: Low-Rank Model Training without All the Tuning
Conference on Machine Learning and Systems (MLSys), 2023
Hongyi Wang
Saurabh Agarwal
Pongsakorn U-chupala
Yoshiki Tanaka
Eric P. Xing
Dimitris Papailiopoulos
OffRL
301
26
0
04 May 2023
Personalized Abstractive Summarization by Tri-agent Generation Pipeline
Findings (Findings), 2023
Md Aminul Haque Palash
Sourav Saha
Faria Afrin
Pengcheng He
305
6
0
04 May 2023
Distilling Step-by-Step! Outperforming Larger Language Models with Less Training Data and Smaller Model Sizes
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Lokesh Nagalapatti
Chun-Liang Li
Chih-Kuan Yeh
Hootan Nakhost
Yasuhisa Fujii
Alexander Ratner
Ranjay Krishna
Chen-Yu Lee
Tomas Pfister
ALM
762
755
0
03 May 2023
A Systematic Study of Knowledge Distillation for Natural Language Generation with Pseudo-Target Training
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Nitay Calderon
Subhabrata Mukherjee
Roi Reichart
Amir Kantor
300
22
0
03 May 2023
Where We Have Arrived in Proving the Emergence of Sparse Symbolic Concepts in AI Models
Qihan Ren
Maximilian Brunner
Wen Shen
S. Mintchev
245
14
0
03 May 2023
Improving Contrastive Learning of Sentence Embeddings from AI Feedback
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
M. Abouheaf
W. Gueaieb
Md. Suruz Miah
D. Spinello
Xipeng Qiu
305
46
0
03 May 2023
How to Unleash the Power of Large Language Models for Few-shot Relation Extraction?
Xin Xu
Yuqi Zhu
Xiaohan Wang
Ningyu Zhang
KELM
LRM
271
72
0
02 May 2023
Summarizing Multiple Documents with Conversational Structure for Meta-Review Generation
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Miao Li
Eduard H. Hovy
Jey Han Lau
405
24
0
02 May 2023
VPGTrans: Transfer Visual Prompt Generator across LLMs
Neural Information Processing Systems (NeurIPS), 2023
Ao Zhang
Hao Fei
Yuan Yao
Wei Ji
Li Li
Zhiyuan Liu
Tat-Seng Chua
MLLM
VLM
211
101
0
02 May 2023
S2abEL: A Dataset for Entity Linking from Scientific Tables
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Yuze Lou
Bailey Kuehl
Erin Bransom
Sergey Feldman
Aakanksha Naik
Doug Downey
252
5
0
30 Apr 2023
Speak, Memory: An Archaeology of Books Known to ChatGPT/GPT-4
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Kent K. Chang
Mackenzie Cramer
Sandeep Soni
David Bamman
RALM
592
164
0
28 Apr 2023
Discourse over Discourse: The Need for an Expanded Pragmatic Focus in Conversational AI
S. M. Seals
V. Shalin
230
4
0
27 Apr 2023
ChatVideo: A Tracklet-centric Multimodal and Versatile Video Understanding System
Junke Wang
Dongdong Chen
Chong Luo
Xiyang Dai
Lu Yuan
Zuxuan Wu
Yu-Gang Jiang
384
69
0
27 Apr 2023
Controlled Text Generation with Natural Language Instructions
International Conference on Machine Learning (ICML), 2023
Wangchunshu Zhou
Yuchen Eleanor Jiang
Ethan Gotlieb Wilcox
Robert Bamler
Mrinmaya Sachan
451
115
0
27 Apr 2023
mPLUG-Owl: Modularization Empowers Large Language Models with Multimodality
Qinghao Ye
Haiyang Xu
Guohai Xu
Jiabo Ye
Ming Yan
...
Junfeng Tian
Qiang Qi
Ji Zhang
Feiyan Huang
Jingren Zhou
VLM
MLLM
1.1K
1,170
0
27 Apr 2023
Multi-Party Chat: Conversational Agents in Group Settings with Humans and Models
Jimmy Wei
Kurt Shuster
Arthur Szlam
Jason Weston
Jack Urbanek
M. Komeili
LLMAG
174
52
0
26 Apr 2023
Harnessing the Power of LLMs in Practice: A Survey on ChatGPT and Beyond
ACM Transactions on Knowledge Discovery from Data (TKDD), 2023
Jingfeng Yang
Hongye Jin
Ruixiang Tang
Xiaotian Han
Qizhang Feng
Haoming Jiang
Bing Yin
Helen Zhou
LM&MA
433
940
0
26 Apr 2023
Previous
1
2
3
...
49
50
51
...
57
58
59
Next
Page 50 of 59
Page
of 59
Go