v1v2v3v4 (latest)

OPT: Open Pre-trained Transformer Language Models

2 May 2022

Xian Li

Luke Zettlemoyer

ArXiv (abs)PDF HTML HuggingFace (2 upvotes)

Papers citing "OPT: Open Pre-trained Transformer Language Models"

50 / 2,924 papers shown

Recommendation as Instruction Following: A Large Language Model Empowered Recommendation Approach

325

308

11 May 2023

Self-Chained Image-Language Model for Video Localization and Question AnsweringNeural Information Processing Systems (NeurIPS), 2023

401

201

11 May 2023

Evaluating Open-Domain Question Answering in the Era of Large Language ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2023

493

148

11 May 2023

Active Retrieval Augmented GenerationConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Graham Neubig

405

494

11 May 2023

INGENIOUS: Using Informative Data Subsets for Efficient Pre-Training of Language ModelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

H. S. V. N. S. K. Renduchintala

Krishnateja Killamsetty

Ganesh Ramakrishnan

132

11 May 2023

Chain-of-Dictionary Prompting Elicits Translation in Large Language ModelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Haoran Yang

334

11 May 2023

Do LLMs Understand User Preferences? Evaluating LLMs On User Rating Prediction

210

160

10 May 2023

VideoChat: Chat-Centric Video Understanding

Yi Wang

Ping Luo

Yu Qiao

414

799

10 May 2023

Fast Distributed Inference Serving for Large Language Models

Xin Jin

229

146

10 May 2023

InternGPT: Solving Vision-Centric Tasks by Interacting with ChatGPT Beyond Language

Yi Wang

...

Ping Luo

Yu Qiao

406

107

09 May 2023

Large Language Model Programs

Jason Weston

Xian Li

215

09 May 2023

StarCoder: may the source be with you!

Niklas Muennighoff

...

515

1,058

09 May 2023

SUR-adapter: Enhancing Text-to-Image Pre-trained Diffusion Models with Large Language ModelsACM Multimedia (ACM MM), 2023

376

09 May 2023

MoT: Memory-of-Thought Enables ChatGPT to Self-ImproveConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Xiaonan Li

Xipeng Qiu

ReLM KELM LRM AI4MH

336

09 May 2023

Read, Diagnose and Chat: Towards Explainable and Interactive LLMs-Augmented Depression Detection in Social Media

Lei Wang

Richang Hong

233

09 May 2023

Explanation-based Finetuning Makes Models More Robust to Spurious CuesAnnual Meeting of the Association for Computational Linguistics (ACL), 2023

Marianna Apidianaki

348

08 May 2023

The Current State of Summarization

Fabian Retkowski

282

08 May 2023

How Do In-Context Examples Affect Compositional Generalization?Annual Meeting of the Association for Computational Linguistics (ACL), 2023

408

08 May 2023

Augmented Large Language Models with Parametric Knowledge Guiding

320

08 May 2023

Prompted LLMs as Chatbot Modules for Long Open-domain ConversationAnnual Meeting of the Association for Computational Linguistics (ACL), 2023

Gibbeum Lee

Volker Hartmann

Jongho Park

Dimitris Papailiopoulos

Kangwook Lee

177

08 May 2023

Residual Prompt Tuning: Improving Prompt Tuning with Residual ReparameterizationAnnual Meeting of the Association for Computational Linguistics (ACL), 2023

Anastasia Razdaibiedina

Madian Khabsa

Jimmy Ba

172

06 May 2023

Structure-CLIP: Towards Scene Graph Knowledge to Enhance Multi-modal Structured RepresentationsAAAI Conference on Artificial Intelligence (AAAI), 2023

...

Zeng Zhao

311

06 May 2023

LMEye: An Interactive Perception Network for Large Language ModelsIEEE transactions on multimedia (IEEE TMM), 2023

Baotian Hu

Lin Ma

290

05 May 2023

DAMO-NLP at SemEval-2023 Task 2: A Unified Retrieval-augmented System for Multilingual Named Entity RecognitionInternational Workshop on Semantic Evaluation (SemEval), 2023

...

Fei Huang

178

05 May 2023

LMs stand their Ground: Investigating the Effect of Embodiment in Figurative Language Interpretation by Language ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2023

Philipp Wicke

270

05 May 2023

VicunaNER: Zero/Few-shot Named Entity Recognition using Vicuna

Shezheng Song

136

05 May 2023

Otter: A Multi-Modal Model with In-Context Instruction TuningIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023

Joshua Adrian Cahyono

Jingkang Yang

Yu Qiao

MLLM

531

627

05 May 2023

Can LLM Already Serve as A Database Interface? A BIg Bench for Large-Scale Database Grounded Text-to-SQLsNeural Information Processing Systems (NeurIPS), 2023

...

Kevin C. C. Chang

Fei Huang

Reynold Cheng

Yongbin Li

LMTD

407

717

04 May 2023

Sentence Embedding Leaks More Information than You Expect: Generative Embedding Inversion Attack to Recover the Whole SentenceAnnual Meeting of the Association for Computational Linguistics (ACL), 2023

Haoran Li

Mingshi Xu

Yangqiu Song

293

04 May 2023

Caption Anything: Interactive Image Description with Diverse Multimodal Controls

483

125

04 May 2023

Conformal Nucleus SamplingAnnual Meeting of the Association for Computational Linguistics (ACL), 2023

306

04 May 2023

"Oops, Did I Just Say That?" Testing and Repairing Unethical Suggestions of Large Language Models with Suggest-Critique-Reflect Process

200

04 May 2023

Should ChatGPT and Bard Share Revenue with Their Data Providers? A New Business Model for the AI EraAdvances in Artificial Intelligence and Machine Learning (AAIML), 2023

Dong Zhang

143

04 May 2023

Cuttlefish: Low-Rank Model Training without All the TuningConference on Machine Learning and Systems (MLSys), 2023

Hongyi Wang

Saurabh Agarwal

Pongsakorn U-chupala

Yoshiki Tanaka

Eric P. Xing

Dimitris Papailiopoulos

OffRL

301

04 May 2023

Personalized Abstractive Summarization by Tri-agent Generation PipelineFindings (Findings), 2023

Md Aminul Haque Palash

Sourav Saha

Faria Afrin

Pengcheng He

305

04 May 2023

Distilling Step-by-Step! Outperforming Larger Language Models with Less Training Data and Smaller Model SizesAnnual Meeting of the Association for Computational Linguistics (ACL), 2023

Lokesh Nagalapatti

Chun-Liang Li

Chih-Kuan Yeh

Hootan Nakhost

Yasuhisa Fujii

Alexander Ratner

Ranjay Krishna

Chen-Yu Lee

Tomas Pfister

ALM

762

755

03 May 2023

A Systematic Study of Knowledge Distillation for Natural Language Generation with Pseudo-Target TrainingAnnual Meeting of the Association for Computational Linguistics (ACL), 2023

300

03 May 2023

Where We Have Arrived in Proving the Emergence of Sparse Symbolic Concepts in AI Models

245

03 May 2023

Improving Contrastive Learning of Sentence Embeddings from AI FeedbackAnnual Meeting of the Association for Computational Linguistics (ACL), 2023

Xipeng Qiu

305

03 May 2023

How to Unleash the Power of Large Language Models for Few-shot Relation Extraction?

Xin Xu

Yuqi Zhu

Xiaohan Wang

Ningyu Zhang

KELM LRM

271

02 May 2023

Summarizing Multiple Documents with Conversational Structure for Meta-Review GenerationConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Miao Li

Eduard H. Hovy

Jey Han Lau

405

02 May 2023

VPGTrans: Transfer Visual Prompt Generator across LLMsNeural Information Processing Systems (NeurIPS), 2023

Ao Zhang

Hao Fei

Yuan Yao

Wei Ji

Li Li

Zhiyuan Liu

Tat-Seng Chua

MLLM VLM

211

101

02 May 2023

S2abEL: A Dataset for Entity Linking from Scientific TablesConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

252

30 Apr 2023

Speak, Memory: An Archaeology of Books Known to ChatGPT/GPT-4Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

592

164

28 Apr 2023

Discourse over Discourse: The Need for an Expanded Pragmatic Focus in Conversational AI

S. M. Seals

V. Shalin

230

27 Apr 2023

ChatVideo: A Tracklet-centric Multimodal and Versatile Video Understanding System

Lu Yuan

Zuxuan Wu

Yu-Gang Jiang

384

27 Apr 2023

Controlled Text Generation with Natural Language InstructionsInternational Conference on Machine Learning (ICML), 2023

451

115

27 Apr 2023

mPLUG-Owl: Modularization Empowers Large Language Models with Multimodality

Jiabo Ye

...

Ji Zhang

Jingren Zhou

1.1K

1,170

27 Apr 2023

Multi-Party Chat: Conversational Agents in Group Settings with Humans and Models

Jason Weston

174

26 Apr 2023

Harnessing the Power of LLMs in Practice: A Survey on ChatGPT and BeyondACM Transactions on Knowledge Discovery from Data (TKDD), 2023

433

940

26 Apr 2023