ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2205.01068
  4. Cited By
OPT: Open Pre-trained Transformer Language Models
v1v2v3v4 (latest)

OPT: Open Pre-trained Transformer Language Models

2 May 2022
Susan Zhang
Stephen Roller
Naman Goyal
Mikel Artetxe
Moya Chen
Shuohui Chen
Christopher Dewan
Mona T. Diab
Xian Li
Xi Lin
Todor Mihaylov
Myle Ott
Sam Shleifer
Kurt Shuster
Daniel Simig
Punit Singh Koura
Anjali Sridhar
Tianlu Wang
Luke Zettlemoyer
    VLMOSLMAI4CE
ArXiv (abs)PDFHTMLHuggingFace (2 upvotes)

Papers citing "OPT: Open Pre-trained Transformer Language Models"

50 / 2,924 papers shown
ChatGPT Asks, BLIP-2 Answers: Automatic Questioning Towards Enriched
  Visual Descriptions
ChatGPT Asks, BLIP-2 Answers: Automatic Questioning Towards Enriched Visual Descriptions
Deyao Zhu
Jun Chen
Kilichbek Haydarov
Xiaoqian Shen
Wenxuan Zhang
Mohamed Elhoseiny
MLLM
241
126
0
12 Mar 2023
Task and Motion Planning with Large Language Models for Object
  Rearrangement
Task and Motion Planning with Large Language Models for Object RearrangementIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2023
Yan Ding
Xiaohan Zhang
Chris Paxton
Shiqi Zhang
LM&RoLRM
570
227
0
10 Mar 2023
ICL-D3IE: In-Context Learning with Diverse Demonstrations Updating for
  Document Information Extraction
ICL-D3IE: In-Context Learning with Diverse Demonstrations Updating for Document Information ExtractionIEEE International Conference on Computer Vision (ICCV), 2023
Jiabang He
Lei Wang
Yingpeng Hu
Ning Liu
Hui-juan Liu
Xingdong Xu
Hengtao Shen
MLLM
281
56
0
09 Mar 2023
Stealing the Decoding Algorithms of Language Models
Stealing the Decoding Algorithms of Language ModelsConference on Computer and Communications Security (CCS), 2023
A. Naseh
Kalpesh Krishna
Mohit Iyyer
Amir Houmansadr
MLAU
313
29
0
08 Mar 2023
A Comprehensive Survey of AI-Generated Content (AIGC): A History of
  Generative AI from GAN to ChatGPT
A Comprehensive Survey of AI-Generated Content (AIGC): A History of Generative AI from GAN to ChatGPT
Yihan Cao
Siyu Li
Yixin Liu
Zhiling Yan
Yutong Dai
Philip S. Yu
Lichao Sun
277
727
0
07 Mar 2023
SemEval-2023 Task 10: Explainable Detection of Online Sexism
SemEval-2023 Task 10: Explainable Detection of Online SexismInternational Workshop on Semantic Evaluation (SemEval), 2023
Hannah Rose Kirk
Wenjie Yin
Bertie Vidgen
Paul Röttger
299
144
0
07 Mar 2023
The BigScience ROOTS Corpus: A 1.6TB Composite Multilingual Dataset
The BigScience ROOTS Corpus: A 1.6TB Composite Multilingual DatasetNeural Information Processing Systems (NeurIPS), 2023
Hugo Laurenccon
Lucile Saulnier
Thomas Wang
Christopher Akiki
Albert Villanova del Moral
...
Violette Lepercq
Suzana Ilić
Margaret Mitchell
Sasha Luccioni
Yacine Jernite
AI4CEAILaw
214
199
0
07 Mar 2023
LIDA: A Tool for Automatic Generation of Grammar-Agnostic Visualizations
  and Infographics using Large Language Models
LIDA: A Tool for Automatic Generation of Grammar-Agnostic Visualizations and Infographics using Large Language ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2023
Victor C. Dibia
VLM
330
132
0
06 Mar 2023
OpenICL: An Open-Source Framework for In-context Learning
OpenICL: An Open-Source Framework for In-context LearningAnnual Meeting of the Association for Computational Linguistics (ACL), 2023
Zhenyu Wu
Yaoxiang Wang
Jiacheng Ye
Jiangtao Feng
Jingjing Xu
Yu Qiao
Zhiyong Wu
196
59
0
06 Mar 2023
Data Portraits: Recording Foundation Model Training Data
Data Portraits: Recording Foundation Model Training DataNeural Information Processing Systems (NeurIPS), 2023
Marc Marone
Benjamin Van Durme
523
37
0
06 Mar 2023
Prismer: A Vision-Language Model with Multi-Task Experts
Prismer: A Vision-Language Model with Multi-Task Experts
Shikun Liu
Linxi Fan
Edward Johns
Zhiding Yu
Chaowei Xiao
Anima Anandkumar
VLMMLLM
325
33
0
04 Mar 2023
Investigating the Translation Performance of a Large Multilingual
  Language Model: the Case of BLOOM
Investigating the Translation Performance of a Large Multilingual Language Model: the Case of BLOOMEuropean Association for Machine Translation Conferences/Workshops (EAMT), 2023
Rachel Bawden
François Yvon
VLMLRM
296
89
0
03 Mar 2023
Competence-Based Analysis of Language Models
Competence-Based Analysis of Language Models
Adam Davies
Jize Jiang
Chengxiang Zhai
ELM
368
7
0
01 Mar 2023
How Robust is GPT-3.5 to Predecessors? A Comprehensive Study on Language
  Understanding Tasks
How Robust is GPT-3.5 to Predecessors? A Comprehensive Study on Language Understanding Tasks
Xuanting Chen
Junjie Ye
Can Zu
Nuo Xu
Rui Zheng
Minlong Peng
Jie Zhou
Tao Gui
Tao Gui
Xuanjing Huang
AI4MHELM
182
100
0
01 Mar 2023
EvoPrompting: Language Models for Code-Level Neural Architecture Search
EvoPrompting: Language Models for Code-Level Neural Architecture SearchNeural Information Processing Systems (NeurIPS), 2023
Angelica Chen
David Dohan
David R. So
VLMLRM
473
126
0
28 Feb 2023
Investigating the Effectiveness of Task-Agnostic Prefix Prompt for
  Instruction Following
Investigating the Effectiveness of Task-Agnostic Prefix Prompt for Instruction FollowingAAAI Conference on Artificial Intelligence (AAAI), 2023
Seonghyeon Ye
Hyeonbin Hwang
Sohee Yang
Hyeongu Yun
Yireun Kim
Minjoon Seo
LRM
251
46
0
28 Feb 2023
HugNLP: A Unified and Comprehensive Library for Natural Language
  Processing
HugNLP: A Unified and Comprehensive Library for Natural Language ProcessingInternational Conference on Information and Knowledge Management (CIKM), 2023
Jiadong Wang
Polydoros Giannouris
Qiushi Sun
Wenkang Huang
Chengyu Wang
Ming Gao
204
6
0
28 Feb 2023
LLaMA: Open and Efficient Foundation Language Models
LLaMA: Open and Efficient Foundation Language Models
Hugo Touvron
Thibaut Lavril
Gautier Izacard
Xavier Martinet
Marie-Anne Lachaux
...
Faisal Azhar
Aurelien Rodriguez
Armand Joulin
Edouard Grave
Guillaume Lample
ALMPILM
8.4K
18,046
0
27 Feb 2023
Finding Support Examples for In-Context Learning
Finding Support Examples for In-Context LearningConference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Xiaonan Li
Xipeng Qiu
336
121
0
27 Feb 2023
Fast Attention Requires Bounded Entries
Fast Attention Requires Bounded EntriesNeural Information Processing Systems (NeurIPS), 2023
Josh Alman
Zhao Song
313
102
0
26 Feb 2023
Does a Neural Network Really Encode Symbolic Concepts?
Does a Neural Network Really Encode Symbolic Concepts?International Conference on Machine Learning (ICML), 2023
Mingjie Li
Quanshi Zhang
307
35
0
25 Feb 2023
AugGPT: Leveraging ChatGPT for Text Data Augmentation
AugGPT: Leveraging ChatGPT for Text Data AugmentationIEEE Transactions on Big Data (IEEE Trans. Big Data), 2023
Haixing Dai
Zheng Liu
Wenxiong Liao
Xiaoke Huang
Yihan Cao
...
Lichao Sun
Shijie Zhao
Hongtu Zhu
Tianming Liu
Xiang Li
311
243
0
25 Feb 2023
Semantic Mechanical Search with Large Vision and Language Models
Semantic Mechanical Search with Large Vision and Language ModelsConference on Robot Learning (CoRL), 2023
Satvik Sharma
Huang Huang
K. Shivakumar
A. Imran
Ryan Hoque
Brian Ichter
Ken Goldberg
LM&RoVLM
289
10
0
24 Feb 2023
Check Your Facts and Try Again: Improving Large Language Models with
  External Knowledge and Automated Feedback
Check Your Facts and Try Again: Improving Large Language Models with External Knowledge and Automated Feedback
Baolin Peng
Michel Galley
Pengcheng He
Hao Cheng
Yujia Xie
...
Qiuyuan Huang
Lars Liden
Zhou Yu
Weizhu Chen
Jianfeng Gao
KELMHILMLRM
407
481
0
24 Feb 2023
In What Languages are Generative Language Models the Most Formal?
  Analyzing Formality Distribution across Languages
In What Languages are Generative Language Models the Most Formal? Analyzing Formality Distribution across LanguagesConference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Asim Ersoy
Gerson Vizcarra
T. Mayeesha
Benjamin Muller
231
4
0
23 Feb 2023
Active Prompting with Chain-of-Thought for Large Language Models
Active Prompting with Chain-of-Thought for Large Language ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2023
Shizhe Diao
Pengcheng Wang
Yong Lin
Tong Zhang
ReLMKELMLLMAGLRM
485
186
0
23 Feb 2023
AlpaServe: Statistical Multiplexing with Model Parallelism for Deep
  Learning Serving
AlpaServe: Statistical Multiplexing with Model Parallelism for Deep Learning Serving
Zhuohan Li
Lianmin Zheng
Yinmin Zhong
Vincent Liu
Ying Sheng
...
Yanping Huang
Zhifeng Chen
Hao Zhang
Joseph E. Gonzalez
Ion Stoica
MoE
325
152
0
22 Feb 2023
On the Robustness of ChatGPT: An Adversarial and Out-of-distribution
  Perspective
On the Robustness of ChatGPT: An Adversarial and Out-of-distribution PerspectiveIEEE Data Engineering Bulletin (IEEE Data Eng. Bull.), 2023
Yongfeng Zhang
Xixu Hu
Wenxin Hou
Hao Chen
Runkai Zheng
...
Weirong Ye
Xiubo Geng
Binxing Jiao
Yue Zhang
Xingxu Xie
AI4MH
520
290
0
22 Feb 2023
In-context Example Selection with Influences
In-context Example Selection with Influences
Nguyen Tai
Eric Wong
360
69
0
21 Feb 2023
$k$NN-Adapter: Efficient Domain Adaptation for Black-Box Language Models
kkkNN-Adapter: Efficient Domain Adaptation for Black-Box Language Models
Yangsibo Huang
Daogao Liu
Zexuan Zhong
Weijia Shi
Y. Lee
RALMALM
179
20
0
21 Feb 2023
Fantastic Rewards and How to Tame Them: A Case Study on Reward Learning
  for Task-oriented Dialogue Systems
Fantastic Rewards and How to Tame Them: A Case Study on Reward Learning for Task-oriented Dialogue SystemsInternational Conference on Learning Representations (ICLR), 2023
Yihao Feng
Shentao Yang
Shujian Zhang
Jianguo Zhang
Caiming Xiong
Mi Zhou
Haiquan Wang
OffRL
217
26
0
20 Feb 2023
Semantic Uncertainty: Linguistic Invariances for Uncertainty Estimation
  in Natural Language Generation
Semantic Uncertainty: Linguistic Invariances for Uncertainty Estimation in Natural Language GenerationInternational Conference on Learning Representations (ICLR), 2023
Lorenz Kuhn
Y. Gal
Sebastian Farquhar
UQLM
666
484
0
19 Feb 2023
Can ChatGPT Understand Too? A Comparative Study on ChatGPT and
  Fine-tuned BERT
Can ChatGPT Understand Too? A Comparative Study on ChatGPT and Fine-tuned BERT
Qihuang Zhong
Liang Ding
Juhua Liu
Bo Du
Dacheng Tao
AI4MH
301
293
0
19 Feb 2023
Complex QA and language models hybrid architectures, Survey
Complex QA and language models hybrid architectures, Survey
Xavier Daull
P. Bellot
Emmanuel Bruno
Vincent Martin
Elisabeth Murisasco
ELM
731
17
0
17 Feb 2023
Auditing large language models: a three-layered approach
Auditing large language models: a three-layered approachAI and Ethics (AE), 2023
Jakob Mokander
Jonas Schuett
Hannah Rose Kirk
Luciano Floridi
AILawMLAU
492
277
0
16 Feb 2023
Counting Carbon: A Survey of Factors Influencing the Emissions of
  Machine Learning
Counting Carbon: A Survey of Factors Influencing the Emissions of Machine Learning
A. Luccioni
Alex Hernandez-Garcia
238
71
0
16 Feb 2023
Slapo: A Schedule Language for Progressive Optimization of Large Deep
  Learning Model Training
Slapo: A Schedule Language for Progressive Optimization of Large Deep Learning Model TrainingInternational Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), 2023
Hongzheng Chen
Cody Hao Yu
Shuai Zheng
Zhen Zhang
Zhiru Zhang
Yida Wang
358
13
0
16 Feb 2023
Commonsense Reasoning for Conversational AI: A Survey of the State of
  the Art
Commonsense Reasoning for Conversational AI: A Survey of the State of the Art
Christopher Richardson
Larry Heck
LRM
207
10
0
15 Feb 2023
Speculative Decoding with Big Little Decoder
Speculative Decoding with Big Little DecoderNeural Information Processing Systems (NeurIPS), 2023
Sehoon Kim
K. Mangalam
Suhong Moon
Jitendra Malik
Michael W. Mahoney
A. Gholami
Kurt Keutzer
MoE
455
163
0
15 Feb 2023
Dictionary-based Phrase-level Prompting of Large Language Models for
  Machine Translation
Dictionary-based Phrase-level Prompting of Large Language Models for Machine Translation
Marjan Ghazvininejad
Hila Gonen
Luke Zettlemoyer
227
94
0
15 Feb 2023
Measuring the Instability of Fine-Tuning
Measuring the Instability of Fine-TuningAnnual Meeting of the Association for Computational Linguistics (ACL), 2023
Yupei Du
D. Nguyen
256
7
0
15 Feb 2023
On the Planning Abilities of Large Language Models (A Critical
  Investigation with a Proposed Benchmark)
On the Planning Abilities of Large Language Models (A Critical Investigation with a Proposed Benchmark)
Kaya Stechly
S. Sreedharan
Matthew Marquez
Alberto Olmo Hernandez
Subbarao Kambhampati
LLMAGLRM
148
102
0
13 Feb 2023
Do Vision and Language Models Share Concepts? A Vector Space Alignment
  Study
Do Vision and Language Models Share Concepts? A Vector Space Alignment StudyTransactions of the Association for Computational Linguistics (TACL), 2023
Jiaang Li
Yova Kementchedjhieva
Constanza Fierro
Anders Søgaard
VLM
259
19
0
13 Feb 2023
A Unified View of Long-Sequence Models towards Modeling Million-Scale
  Dependencies
A Unified View of Long-Sequence Models towards Modeling Million-Scale Dependencies
Hongyu Hè
Marko Kabić
276
2
0
13 Feb 2023
Transformer models: an introduction and catalog
Transformer models: an introduction and catalog
X. Amatriain
Ananth Sankar
Jie Bing
Praveen Kumar Bodigutla
Timothy J. Hazen
Michaeel Kazi
500
73
0
12 Feb 2023
A Reparameterized Discrete Diffusion Model for Text Generation
A Reparameterized Discrete Diffusion Model for Text Generation
Lin Zheng
Jianbo Yuan
Lei Yu
Lingpeng Kong
DiffM
289
119
0
11 Feb 2023
Distillation of encoder-decoder transformers for sequence labelling
Distillation of encoder-decoder transformers for sequence labellingFindings (Findings), 2023
M. Farina
D. Pappadopulo
Anant Gupta
Leslie Huang
Ozan Irsoy
Thamar Solorio
VLM
337
3
0
10 Feb 2023
Selective In-Context Data Augmentation for Intent Detection using
  Pointwise V-Information
Selective In-Context Data Augmentation for Intent Detection using Pointwise V-InformationConference of the European Chapter of the Association for Computational Linguistics (EACL), 2023
Yen-Ting Lin
Alexandros Papangelis
Seokhwan Kim
Sungjin Lee
Devamanyu Hazarika
Mahdi Namazifar
Di Jin
Yang Liu
Dilek Z. Hakkani-Tür
173
43
0
10 Feb 2023
In-Context Learning with Many Demonstration Examples
In-Context Learning with Many Demonstration Examples
Mukai Li
Shansan Gong
Jiangtao Feng
Yiheng Xu
Jinchao Zhang
Zhiyong Wu
Lingpeng Kong
269
42
0
09 Feb 2023
Offsite-Tuning: Transfer Learning without Full Model
Offsite-Tuning: Transfer Learning without Full Model
Guangxuan Xiao
Ji Lin
Song Han
205
99
0
09 Feb 2023
Previous
123...525354...575859
Next
Page 53 of 59
Pageof 59