OPT: Open Pre-trained Transformer Language Models

2 May 2022

Xian Li

Luke Zettlemoyer

Papers citing "OPT: Open Pre-trained Transformer Language Models"

50 / 2,924 papers shown

ChatGPT Asks, BLIP-2 Answers: Automatic Questioning Towards Enriched Visual Descriptions

241

126

12 Mar 2023

Task and Motion Planning with Large Language Models for Object Rearrangement

IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2023

570

227

10 Mar 2023

ICL-D3IE: In-Context Learning with Diverse Demonstrations Updating for Document Information Extraction

IEEE International Conference on Computer Vision (ICCV), 2023

Lei Wang

281

09 Mar 2023

Stealing the Decoding Algorithms of Language Models

Conference on Computer and Communications Security (CCS), 2023

313

08 Mar 2023

A Comprehensive Survey of AI-Generated Content (AIGC): A History of Generative AI from GAN to ChatGPT

Siyu Li

Philip S. Yu

Lichao Sun

277

727

07 Mar 2023

SemEval-2023 Task 10: Explainable Detection of Online Sexism

International Workshop on Semantic Evaluation (SemEval), 2023

Hannah Rose Kirk

Wenjie Yin

Bertie Vidgen

Paul Röttger

299

144

07 Mar 2023

The BigScience ROOTS Corpus: A 1.6TB Composite Multilingual Dataset

Neural Information Processing Systems (NeurIPS), 2023

Albert Villanova del Moral

...

214

199

07 Mar 2023

LIDA: A Tool for Automatic Generation of Grammar-Agnostic Visualizations and Infographics using Large Language Models

Annual Meeting of the Association for Computational Linguistics (ACL), 2023

Victor C. Dibia

VLM

330

132

06 Mar 2023

OpenICL: An Open-Source Framework for In-context Learning

Annual Meeting of the Association for Computational Linguistics (ACL), 2023

Zhiyong Wu

196

06 Mar 2023

Data Portraits: Recording Foundation Model Training Data

Neural Information Processing Systems (NeurIPS), 2023

Marc Marone

Benjamin Van Durme

523

06 Mar 2023

Prismer: A Vision-Language Model with Multi-Task Experts

Linxi Fan

325

04 Mar 2023

Investigating the Translation Performance of a Large Multilingual Language Model: the Case of BLOOM

European Association for Machine Translation Conferences/Workshops (EAMT), 2023

Rachel Bawden

François Yvon

VLM LRM

296

03 Mar 2023

Competence-Based Analysis of Language Models

368

01 Mar 2023

How Robust is GPT-3.5 to Predecessors? A Comprehensive Study on Language Understanding Tasks

Jie Zhou

Xuanjing Huang

182

100

01 Mar 2023

EvoPrompting: Language Models for Code-Level Neural Architecture Search

Neural Information Processing Systems (NeurIPS), 2023

473

126

28 Feb 2023

Investigating the Effectiveness of Task-Agnostic Prefix Prompt for Instruction Following

AAAI Conference on Artificial Intelligence (AAAI), 2023

251

28 Feb 2023

HugNLP: A Unified and Comprehensive Library for Natural Language Processing

International Conference on Information and Knowledge Management (CIKM), 2023

Chengyu Wang

204

28 Feb 2023

LLaMA: Open and Efficient Foundation Language Models

...

8.4K

18,046

27 Feb 2023

Finding Support Examples for In-Context Learning

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Xiaonan Li

Xipeng Qiu

336

121

27 Feb 2023

Fast Attention Requires Bounded Entries

Neural Information Processing Systems (NeurIPS), 2023

Josh Alman

Zhao Song

313

102

26 Feb 2023

Does a Neural Network Really Encode Symbolic Concepts?

International Conference on Machine Learning (ICML), 2023

Mingjie Li

Quanshi Zhang

307

25 Feb 2023

AugGPT: Leveraging ChatGPT for Text Data Augmentation

IEEE Transactions on Big Data (IEEE Trans. Big Data), 2023

...

Lichao Sun

Shijie Zhao

Hongtu Zhu

Tianming Liu

Xiang Li

311

243

25 Feb 2023

Semantic Mechanical Search with Large Vision and Language Models

Conference on Robot Learning (CoRL), 2023

289

24 Feb 2023

Check Your Facts and Try Again: Improving Large Language Models with External Knowledge and Automated Feedback

...

407

481

24 Feb 2023

In What Languages are Generative Language Models the Most Formal? Analyzing Formality Distribution across Languages

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

231

23 Feb 2023

Active Prompting with Chain-of-Thought for Large Language Models

Annual Meeting of the Association for Computational Linguistics (ACL), 2023

Shizhe Diao

Pengcheng Wang

Yong Lin

Tong Zhang

ReLM KELM LLMAG LRM

485

186

23 Feb 2023

AlpaServe: Statistical Multiplexing with Model Parallelism for Deep Learning Serving

Zhuohan Li

Lianmin Zheng

Yinmin Zhong

Vincent Liu

Ying Sheng

...

Yanping Huang

Zhifeng Chen

Hao Zhang

Joseph E. Gonzalez

Ion Stoica

MoE

325

152

22 Feb 2023

On the Robustness of ChatGPT: An Adversarial and Out-of-distribution Perspective

IEEE Data Engineering Bulletin (IEEE Data Eng. Bull.), 2023

Hao Chen

...

Yue Zhang

520

290

22 Feb 2023

In-context Example Selection with Influences

Nguyen Tai

Eric Wong

360

21 Feb 2023

kNN-Adapter: Efficient Domain Adaptation for Black-Box Language Models

Weijia Shi

179

21 Feb 2023

Fantastic Rewards and How to Tame Them: A Case Study on Reward Learning for Task-oriented Dialogue Systems

International Conference on Learning Representations (ICLR), 2023

217

20 Feb 2023

Semantic Uncertainty: Linguistic Invariances for Uncertainty Estimation in Natural Language Generation

International Conference on Learning Representations (ICLR), 2023

666

484

19 Feb 2023

Can ChatGPT Understand Too? A Comparative Study on ChatGPT and Fine-tuned BERT

Liang Ding

Bo Du

301

293

19 Feb 2023

Complex QA and language models hybrid architectures, Survey

731

17 Feb 2023

Auditing large language models: a three-layered approach

AI and Ethics (AE), 2023

492

277

16 Feb 2023

Counting Carbon: A Survey of Factors Influencing the Emissions of Machine Learning

A. Luccioni

Alex Hernandez-Garcia

238

16 Feb 2023

Slapo: A Schedule Language for Progressive Optimization of Large Deep Learning Model Training

International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), 2023

358

16 Feb 2023

Commonsense Reasoning for Conversational AI: A Survey of the State of the Art

Christopher Richardson

Larry Heck

LRM

207

15 Feb 2023

Speculative Decoding with Big Little Decoder

Neural Information Processing Systems (NeurIPS), 2023

Sehoon Kim

Suhong Moon

455

163

15 Feb 2023

Dictionary-based Phrase-level Prompting of Large Language Models for Machine Translation

Marjan Ghazvininejad

Hila Gonen

Luke Zettlemoyer

227

15 Feb 2023

Measuring the Instability of Fine-Tuning

Annual Meeting of the Association for Computational Linguistics (ACL), 2023

Yupei Du

D. Nguyen

256

15 Feb 2023

On the Planning Abilities of Large Language Models (A Critical Investigation with a Proposed Benchmark)

Kaya Stechly

S. Sreedharan

Matthew Marquez

Alberto Olmo Hernandez

Subbarao Kambhampati

LLMAG LRM

148

102

13 Feb 2023

Do Vision and Language Models Share Concepts? A Vector Space Alignment Study

Transactions of the Association for Computational Linguistics (TACL), 2023

Jiaang Li

Yova Kementchedjhieva

Constanza Fierro

Anders Søgaard

VLM

259

13 Feb 2023

A Unified View of Long-Sequence Models towards Modeling Million-Scale Dependencies

Hongyu Hè

Marko Kabić

276

13 Feb 2023

Transformer models: an introduction and catalog

X. Amatriain

Ananth Sankar

Jie Bing

Praveen Kumar Bodigutla

Timothy J. Hazen

Michaeel Kazi

500

12 Feb 2023

A Reparameterized Discrete Diffusion Model for Text Generation

Lin Zheng

Jianbo Yuan

Lei Yu

Lingpeng Kong

DiffM

289

119

11 Feb 2023

Distillation of encoder-decoder transformers for sequence labelling

Findings (Findings), 2023

337

10 Feb 2023

Selective In-Context Data Augmentation for Intent Detection using Pointwise V-Information

Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2023

Yen-Ting Lin

Alexandros Papangelis

Yang Liu

173

10 Feb 2023

In-Context Learning with Many Demonstration Examples

Zhiyong Wu

Lingpeng Kong

269

09 Feb 2023

Offsite-Tuning: Transfer Learning without Full Model

Guangxuan Xiao

Ji Lin

Song Han

205

09 Feb 2023