OPT: Open Pre-trained Transformer Language Models

2 May 2022
Susan Zhang
Stephen Roller
Naman Goyal
Mikel Artetxe
Moya Chen
Shuohui Chen
Christopher Dewan
Mona T. Diab
Xian Li
Xi Lin
Todor Mihaylov
Myle Ott
Sam Shleifer
Kurt Shuster
Daniel Simig
Punit Singh Koura
Anjali Sridhar
Tianlu Wang
Luke Zettlemoyer
VLM, OSLM, AI4CE

Papers citing "OPT: Open Pre-trained Transformer Language Models"

50 / 2,924 papers shown
Recommendation as Instruction Following: A Large Language Model Empowered Recommendation Approach
Junjie Zhang
Ruobing Xie
Yupeng Hou
Wayne Xin Zhao
Leyu Lin
Ji-Rong Wen
325
308
0
11 May 2023
Self-Chained Image-Language Model for Video Localization and Question Answering
Neural Information Processing Systems (NeurIPS), 2023
Shoubin Yu
Jaemin Cho
Prateek Yadav
Joey Tianyi Zhou
401
201
0
11 May 2023
Evaluating Open-Domain Question Answering in the Era of Large Language Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Ehsan Kamalloo
Nouha Dziri
C. Clarke
Davood Rafiei
ELM
493
148
0
11 May 2023
Active Retrieval Augmented Generation
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Zhengbao Jiang
Frank F. Xu
Luyu Gao
Zhiqing Sun
Qian Liu
Jane Dwivedi-Yu
Yiming Yang
Jamie Callan
Graham Neubig
RALM
405
494
0
11 May 2023
INGENIOUS: Using Informative Data Subsets for Efficient Pre-Training of Language Models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
H. S. V. N. S. K. Renduchintala
Krishnateja Killamsetty
S. Bhatia
Milan Aggarwal
Ganesh Ramakrishnan
Rishabh K. Iyer
Balaji Krishnamurthy
AIFin
132
4
0
11 May 2023
Chain-of-Dictionary Prompting Elicits Translation in Large Language Models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Hongyuan Lu
Haoran Yang
Haoyang Huang
Dongdong Zhang
Wai Lam
Furu Wei
LRM, AI4CE
334
25
0
11 May 2023
Do LLMs Understand User Preferences? Evaluating LLMs On User Rating Prediction
Wang-Cheng Kang
Jianmo Ni
Nikhil Mehta
M. Sathiamoorthy
Lichan Hong
Ed H. Chi
D. Cheng
210
160
0
10 May 2023
VideoChat: Chat-Centric Video Understanding
Kunchang Li
Yinan He
Yi Wang
Yizhuo Li
Wen Wang
Ping Luo
Yali Wang
Limin Wang
Yu Qiao
MLLM
414
799
0
10 May 2023
Fast Distributed Inference Serving for Large Language Models
Bingyang Wu
Yinmin Zhong
Zili Zhang
Gang Huang
Xuanzhe Liu
Xin Jin
229
146
0
10 May 2023
InternGPT: Solving Vision-Centric Tasks by Interacting with ChatGPT Beyond Language
Zhaoyang Liu
Yinan He
Wenhai Wang
Weiyun Wang
Yi Wang
...
Yali Wang
Limin Wang
Ping Luo
Jifeng Dai
Yu Qiao
LRM, MLLM
406
107
0
09 May 2023
Large Language Model Programs
Imanol Schlag
Sainbayar Sukhbaatar
Asli Celikyilmaz
Anuj Kumar
Jason Weston
Jürgen Schmidhuber
Xian Li
LRM
215
16
0
09 May 2023
StarCoder: may the source be with you!
Raymond Li
Loubna Ben Allal
Yangtian Zi
Niklas Muennighoff
Denis Kocetkov
...
Sean M. Hughes
Thomas Wolf
Arjun Guha
Leandro von Werra
H. D. Vries
515
1,058
0
09 May 2023
SUR-adapter: Enhancing Text-to-Image Pre-trained Diffusion Models with Large Language Models
ACM Multimedia (ACM MM), 2023
Shan Zhong
Zhongzhan Huang
Wushao Wen
Jinghui Qin
Guanbin Li
376
52
0
09 May 2023
MoT: Memory-of-Thought Enables ChatGPT to Self-Improve
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Xiaonan Li
Xipeng Qiu
ReLM, KELM, LRM, AI4MH
336
51
0
09 May 2023
Read, Diagnose and Chat: Towards Explainable and Interactive LLMs-Augmented Depression Detection in Social Media
Wei Qin
Zetong Chen
Lei Wang
Yunshi Lan
Wei Ren
Richang Hong
AI4MH
233
25
0
09 May 2023
Explanation-based Finetuning Makes Models More Robust to Spurious Cues
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Josh Magnus Ludan
Yixuan Meng
Nguyen Tai
Saurabh Shah
Qing Lyu
Marianna Apidianaki
Chris Callison-Burch
AAML, LRM
348
24
0
08 May 2023
The Current State of Summarization
Fabian Retkowski
282
10
0
08 May 2023
How Do In-Context Examples Affect Compositional Generalization?
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Shengnan An
Zeqi Lin
Qiang Fu
B. Chen
Nanning Zheng
Jian-Guang Lou
Dongmei Zhang
408
70
0
08 May 2023
Augmented Large Language Models with Parametric Knowledge Guiding
Ziyang Luo
Can Xu
Lu Wang
Xiubo Geng
Chongyang Tao
Jing Ma
Qingwei Lin
Daxin Jiang
KELM, RALM
320
57
0
08 May 2023
Prompted LLMs as Chatbot Modules for Long Open-domain Conversation
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Gibbeum Lee
Volker Hartmann
Jongho Park
Dimitris Papailiopoulos
Kangwook Lee
177
80
0
08 May 2023
Residual Prompt Tuning: Improving Prompt Tuning with Residual Reparameterization
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Anastasia Razdaibiedina
Yuning Mao
Rui Hou
Madian Khabsa
M. Lewis
Jimmy Ba
Amjad Almahairi
VLM
172
72
0
06 May 2023
Structure-CLIP: Towards Scene Graph Knowledge to Enhance Multi-modal Structured Representations
AAAI Conference on Artificial Intelligence (AAAI), 2023
Yufen Huang
Jiji Tang
Zhuo Chen
Rongsheng Zhang
Xinfeng Zhang
...
Zeng Zhao
Zhou Zhao
Tangjie Lv
Zhipeng Hu
Wen Zhang
VLM
311
49
0
06 May 2023
LMEye: An Interactive Perception Network for Large Language Models
IEEE Transactions on Multimedia (IEEE TMM), 2023
Yunxin Li
Baotian Hu
Xinyu Chen
Lin Ma
Yong-mei Xu
Hao Fei
MLLM, VLM
290
41
0
05 May 2023
DAMO-NLP at SemEval-2023 Task 2: A Unified Retrieval-augmented System for Multilingual Named Entity Recognition
International Workshop on Semantic Evaluation (SemEval), 2023
Zeqi Tan
Shen Huang
Zixia Jia
Jiong Cai
Hai-Tao Zheng
...
Yueting Zhuang
Kewei Tu
Pengjun Xie
Fei Huang
Yong Jiang
178
13
0
05 May 2023
LMs stand their Ground: Investigating the Effect of Embodiment in Figurative Language Interpretation by Language Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Philipp Wicke
270
7
0
05 May 2023
VicunaNER: Zero/Few-shot Named Entity Recognition using Vicuna
Shezheng Song
136
15
0
05 May 2023
Otter: A Multi-Modal Model with In-Context Instruction Tuning
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Yue Liu
Yuanhan Zhang
Liangyu Chen
Jinghao Wang
Fanyi Pu
Joshua Adrian Cahyono
Jingkang Yang
Yu Qiao
MLLM
531
627
0
05 May 2023
Can LLM Already Serve as A Database Interface? A BIg Bench for Large-Scale Database Grounded Text-to-SQLs
Neural Information Processing Systems (NeurIPS), 2023
Jinyang Li
Binyuan Hui
Ge Qu
Jiaxi Yang
Binhua Li
...
Guoliang Li
Kevin C. C. Chang
Fei Huang
Reynold Cheng
Yongbin Li
LMTD
407
717
0
04 May 2023
Sentence Embedding Leaks More Information than You Expect: Generative Embedding Inversion Attack to Recover the Whole Sentence
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Haoran Li
Mingshi Xu
Yangqiu Song
293
81
0
04 May 2023
Caption Anything: Interactive Image Description with Diverse Multimodal Controls
Teng Wang
Jinrui Zhang
Junjie Fei
Hao Zheng
Yunlong Tang
Zhe Li
Mingqi Gao
Shanshan Zhao
MLLM
483
125
0
04 May 2023
Conformal Nucleus Sampling
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Shauli Ravfogel
Carlos Wert Carvajal
M.F. Eggl
UQLM
306
31
0
04 May 2023
"Oops, Did I Just Say That?" Testing and Repairing Unethical Suggestions of Large Language Models with Suggest-Critique-Reflect Process
Anna Glazkova
Zongjie Li
Michael Kadantsev
Maksim Glazkov
KELM
200
15
0
04 May 2023
Should ChatGPT and Bard Share Revenue with Their Data Providers? A New Business Model for the AI Era
Advances in Artificial Intelligence and Machine Learning (AAIML), 2023
Dong Zhang
143
5
0
04 May 2023
Cuttlefish: Low-Rank Model Training without All the Tuning
Conference on Machine Learning and Systems (MLSys), 2023
Hongyi Wang
Saurabh Agarwal
Pongsakorn U-chupala
Yoshiki Tanaka
Eric P. Xing
Dimitris Papailiopoulos
OffRL
301
26
0
04 May 2023
Personalized Abstractive Summarization by Tri-agent Generation Pipeline
Findings (Findings), 2023
Md Aminul Haque Palash
Sourav Saha
Faria Afrin
Pengcheng He
305
6
0
04 May 2023
Distilling Step-by-Step! Outperforming Larger Language Models with Less Training Data and Smaller Model Sizes
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Lokesh Nagalapatti
Chun-Liang Li
Chih-Kuan Yeh
Hootan Nakhost
Yasuhisa Fujii
Alexander Ratner
Ranjay Krishna
Chen-Yu Lee
Tomas Pfister
ALM
762
755
0
03 May 2023
A Systematic Study of Knowledge Distillation for Natural Language Generation with Pseudo-Target Training
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Nitay Calderon
Subhabrata Mukherjee
Roi Reichart
Amir Kantor
300
22
0
03 May 2023
Where We Have Arrived in Proving the Emergence of Sparse Symbolic Concepts in AI Models
Qihan Ren
Maximilian Brunner
Wen Shen
S. Mintchev
245
14
0
03 May 2023
Improving Contrastive Learning of Sentence Embeddings from AI Feedback
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
M. Abouheaf
W. Gueaieb
Md. Suruz Miah
D. Spinello
Xipeng Qiu
305
46
0
03 May 2023
How to Unleash the Power of Large Language Models for Few-shot Relation Extraction?
Xin Xu
Yuqi Zhu
Xiaohan Wang
Ningyu Zhang
KELM, LRM
271
72
0
02 May 2023
Summarizing Multiple Documents with Conversational Structure for Meta-Review Generation
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Miao Li
Eduard H. Hovy
Jey Han Lau
405
24
0
02 May 2023
VPGTrans: Transfer Visual Prompt Generator across LLMs
Neural Information Processing Systems (NeurIPS), 2023
Ao Zhang
Hao Fei
Yuan Yao
Wei Ji
Li Li
Zhiyuan Liu
Tat-Seng Chua
MLLM, VLM
211
101
0
02 May 2023
S2abEL: A Dataset for Entity Linking from Scientific Tables
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Yuze Lou
Bailey Kuehl
Erin Bransom
Sergey Feldman
Aakanksha Naik
Doug Downey
252
5
0
30 Apr 2023
Speak, Memory: An Archaeology of Books Known to ChatGPT/GPT-4
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Kent K. Chang
Mackenzie Cramer
Sandeep Soni
David Bamman
RALM
592
164
0
28 Apr 2023
Discourse over Discourse: The Need for an Expanded Pragmatic Focus in Conversational AI
S. M. Seals
V. Shalin
230
4
0
27 Apr 2023
ChatVideo: A Tracklet-centric Multimodal and Versatile Video Understanding System
Junke Wang
Dongdong Chen
Chong Luo
Xiyang Dai
Lu Yuan
Zuxuan Wu
Yu-Gang Jiang
384
69
0
27 Apr 2023
Controlled Text Generation with Natural Language Instructions
International Conference on Machine Learning (ICML), 2023
Wangchunshu Zhou
Yuchen Eleanor Jiang
Ethan Gotlieb Wilcox
Robert Bamler
Mrinmaya Sachan
451
115
0
27 Apr 2023
mPLUG-Owl: Modularization Empowers Large Language Models with Multimodality
Qinghao Ye
Haiyang Xu
Guohai Xu
Jiabo Ye
Ming Yan
...
Junfeng Tian
Qiang Qi
Ji Zhang
Feiyan Huang
Jingren Zhou
VLM, MLLM
1.1K
1,170
0
27 Apr 2023
Multi-Party Chat: Conversational Agents in Group Settings with Humans and Models
Jimmy Wei
Kurt Shuster
Arthur Szlam
Jason Weston
Jack Urbanek
M. Komeili
LLMAG
174
52
0
26 Apr 2023
Harnessing the Power of LLMs in Practice: A Survey on ChatGPT and Beyond
ACM Transactions on Knowledge Discovery from Data (TKDD), 2023
Jingfeng Yang
Hongye Jin
Ruixiang Tang
Xiaotian Han
Qizhang Feng
Haoming Jiang
Bing Yin
Helen Zhou
LM&MA
433
940
0
26 Apr 2023