OPT: Open Pre-trained Transformer Language Models
2 May 2022
Susan Zhang
Stephen Roller
Naman Goyal
Mikel Artetxe
Moya Chen
Shuohui Chen
Christopher Dewan
Mona T. Diab
Xian Li
Xi Lin
Todor Mihaylov
Myle Ott
Sam Shleifer
Kurt Shuster
Daniel Simig
Punit Singh Koura
Anjali Sridhar
Tianlu Wang
Luke Zettlemoyer
VLM, OSLM, AI4CE

Papers citing "OPT: Open Pre-trained Transformer Language Models"

50 / 2,924 papers shown
Retrieval-Augmented Multimodal Language Modeling
International Conference on Machine Learning (ICML), 2023
Michihiro Yasunaga
Armen Aghajanyan
Weijia Shi
Rich James
J. Leskovec
Abigail Z. Jacobs
M. Lewis
Luke Zettlemoyer
Anuj Kumar
RALM
270
133
0
22 Nov 2022
Multitask Vision-Language Prompt Tuning
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022
Sheng Shen
Shijia Yang
Tianjun Zhang
Bohan Zhai
Joseph E. Gonzalez
Kurt Keutzer
Trevor Darrell
VLM, VPVLM
291
78
0
21 Nov 2022
Deanthropomorphising NLP: Can a Language Model Be Conscious?
PLoS ONE (PLoS ONE), 2022
Matthew Shardlow
Piotr Przybyła
250
10
0
21 Nov 2022
Unsupervised Explanation Generation via Correct Instantiations
AAAI Conference on Artificial Intelligence (AAAI), 2022
Sijie Cheng
Zhiyong Wu
Jiangjie Chen
Zhixing Li
Yang Liu
Lingpeng Kong
ReLM, LRM
174
5
0
21 Nov 2022
The Stack: 3 TB of permissively licensed source code
Denis Kocetkov
Raymond Li
Loubna Ben Allal
Jia Li
Chenghao Mou
...
Sean M. Hughes
Thomas Wolf
Dzmitry Bahdanau
Leandro von Werra
H. D. Vries
245
410
0
20 Nov 2022
Artificial Interrogation for Attributing Language Models
Farhan Dhanani
Muhammad Rafi
64
1
0
20 Nov 2022
An Empirical Study On Contrastive Search And Contrastive Decoding For Open-ended Text Generation
Yixuan Su
Jialu Xu
114
15
0
19 Nov 2022
SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models
International Conference on Machine Learning (ICML), 2022
Guangxuan Xiao
Ji Lin
Mickael Seznec
Hao Wu
Julien Demouth
Song Han
MQ
803
1,219
0
18 Nov 2022
Is the Elephant Flying? Resolving Ambiguities in Text-to-Image Generative Models
Ninareh Mehrabi
Palash Goyal
Apurv Verma
Jwala Dhamala
Varun Kumar
Qian Hu
Kai-Wei Chang
R. Zemel
Aram Galstyan
Rahul Gupta
204
8
0
17 Nov 2022
Ignore Previous Prompt: Attack Techniques For Language Models
Fábio Perez
Ian Ribeiro
SILM
436
634
0
17 Nov 2022
Galactica: A Large Language Model for Science
Ross Taylor
Marcin Kardas
Guillem Cucurull
Thomas Scialom
Anthony Hartshorn
Elvis Saravia
Andrew Poulton
Viktor Kerkez
Robert Stojnic
ELM, ReLM
396
937
0
16 Nov 2022
GAMMT: Generative Ambiguity Modeling Using Multiple Transformers
Xingcheng Xu
173
0
0
16 Nov 2022
On the Compositional Generalization Gap of In-Context Learning
BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP (BlackboxNLP), 2022
Arian Hosseini
Ankit Vani
Dzmitry Bahdanau
Alessandro Sordoni
Rameswar Panda
201
30
0
15 Nov 2022
Mind Your Bias: A Critical Review of Bias Detection Methods for Contextual Language Models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Silke Husse
Andreas Spitz
233
8
0
15 Nov 2022
Evaluating the Factual Consistency of Large Language Models Through News Summarization
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Derek Tam
Anisha Mascarenhas
Shiyue Zhang
Sarah Kwan
Joey Tianyi Zhou
Colin Raffel
HILM
282
132
0
15 Nov 2022
FolkScope: Intention Knowledge Graph Construction for E-commerce Commonsense Discovery
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Changlong Yu
Weiqi Wang
Xin Liu
Jiaxin Bai
Yangqiu Song
Zheng Li
Yifan Gao
Tianyu Cao
Bing Yin
209
29
0
15 Nov 2022
Prompting Language Models for Linguistic Structure
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Terra Blevins
Hila Gonen
Luke Zettlemoyer
LRM
249
52
0
15 Nov 2022
Breadth-First Pipeline Parallelism
J. Lamy-Poirier
GNN, MoE, AI4CE
125
1
0
11 Nov 2022
Measuring Reliability of Large Language Models through Semantic Consistency
Harsh Raj
Domenic Rosati
Subhabrata Majumdar
HILM
273
39
0
10 Nov 2022
Nano: Nested Human-in-the-Loop Reward Learning for Few-shot Language Model Control
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Xiang Fan
Yiwei Lyu
Paul Pu Liang
Ruslan Salakhutdinov
Louis-Philippe Morency
BDL
300
8
0
10 Nov 2022
Collateral facilitation in humans and language models
Conference on Computational Natural Language Learning (CoNLL), 2022
J. Michaelov
Benjamin Bergen
217
14
0
09 Nov 2022
Grammatical Error Correction: A Survey of the State of the Art
Computational Linguistics (CL), 2022
Christopher Bryant
Zheng Yuan
Muhammad Reza Qorib
Hannan Cao
Hwee Tou Ng
Ted Briscoe
3DV
252
115
0
09 Nov 2022
BLOOM: A 176B-Parameter Open-Access Multilingual Language Model
BigScience Workshop
Teven Le Scao
Angela Fan
Christopher Akiki
...
Zhongli Xie
Zifan Ye
M. Bras
Younes Belkada
Thomas Wolf
VLM
841
2,772
0
09 Nov 2022
Creative Writing with an AI-Powered Writing Assistant: Perspectives from Professional Writers
Daphne Ippolito
Ann Yuan
Andy Coenen
Sehmon Burnam
231
124
0
09 Nov 2022
Active Example Selection for In-Context Learning
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Yiming Zhang
Shi Feng
Chenhao Tan
SILM, LRM
329
253
0
08 Nov 2022
Intriguing Properties of Compression on Multilingual Models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Kelechi Ogueji
Orevaoghene Ahia
Gbemileke Onilude
Sebastian Gehrmann
Sara Hooker
Julia Kreutzer
302
15
0
04 Nov 2022
MolE: a molecular foundation model for drug discovery
Oscar Méndez-Lucio
C. Nicolaou
Berton Earnshaw
161
14
0
03 Nov 2022
LMentry: A Language Model Benchmark of Elementary Language Tasks
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Avia Efrat
Or Honovich
Omer Levy
239
31
0
03 Nov 2022
Estimating the Carbon Footprint of BLOOM, a 176B Parameter Language Model
Journal of Machine Learning Research (JMLR), 2022
A. Luccioni
S. Viguier
Anne-Laure Ligozat
607
427
0
03 Nov 2022
Large Language Models Are Human-Level Prompt Engineers
International Conference on Learning Representations (ICLR), 2022
Yongchao Zhou
Andrei Ioan Muresanu
Ziwen Han
Keiran Paster
Silviu Pitis
Harris Chan
Jimmy Ba
ALM, LLMAG
508
1,181
0
03 Nov 2022
Preventing Verbatim Memorization in Language Models Gives a False Sense of Privacy
International Conference on Natural Language Generation (INLG), 2022
Daphne Ippolito
Florian Tramèr
Milad Nasr
Chiyuan Zhang
Matthew Jagielski
Katherine Lee
Christopher A. Choquette-Choo
Nicholas Carlini
PILM, MU
385
98
0
31 Oct 2022
SSD-LM: Semi-autoregressive Simplex-based Diffusion Language Model for Text Generation and Modular Control
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Xiaochuang Han
Sachin Kumar
Yulia Tsvetkov
325
147
0
31 Oct 2022
GPTQ: Accurate Post-Training Quantization for Generative Pre-trained Transformers
Elias Frantar
Saleh Ashkboos
Torsten Hoefler
Dan Alistarh
MQ
535
1,573
0
31 Oct 2022
A Solvable Model of Neural Scaling Laws
A. Maloney
Daniel A. Roberts
J. Sully
262
78
0
30 Oct 2022
Knowledge-in-Context: Towards Knowledgeable Semi-Parametric Language Models
International Conference on Learning Representations (ICLR), 2022
Xiaoman Pan
Wenlin Yao
Hongming Zhang
Dian Yu
Dong Yu
Jianshu Chen
KELM
569
28
0
28 Oct 2022
Class Based Thresholding in Early Exit Semantic Segmentation Networks
IEEE Signal Processing Letters (SPL), 2022
Alperen Görmez
Erdem Koyuncu
152
6
0
27 Oct 2022
What Language Model to Train if You Have One Million GPU Hours?
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Teven Le Scao
Thomas Wang
Daniel Hesslow
Lucile Saulnier
Stas Bekman
...
Lintang Sutawika
Jaesung Tae
Zheng-Xin Yong
Julien Launay
Iz Beltagy
MoE, AI4CE
576
120
0
27 Oct 2022
TRScore: A Novel GPT-based Readability Scorer for ASR Segmentation and Punctuation model evaluation and selection
Piyush Behre
S.S. Tan
A. Shah
Harini Kesavamoorthy
Shuangyu Chang
Fei Zuo
C. Basoglu
Sayan D. Pathak
184
1
0
27 Oct 2022
Personalized Dialogue Generation with Persona-Adaptive Attention
AAAI Conference on Artificial Intelligence (AAAI), 2022
Qiushi Huang
Yu Zhang
Tom Ko
Xubo Liu
Boyong Wu
Wenwu Wang
Lilian H. Y. Tang
304
38
0
27 Oct 2022
Multi-lingual Evaluation of Code Generation Models
International Conference on Learning Representations (ICLR), 2022
Ben Athiwaratkun
Sanjay Krishna Gouda
Zijian Wang
Xiaopeng Li
Yuchen Tian
...
Baishakhi Ray
Parminder Bhatia
Sudipta Sengupta
Dan Roth
Bing Xiang
ELM
765
217
0
26 Oct 2022
Scaling Laws Beyond Backpropagation
Matthew J. Filipovich
Alessandro Cappelli
Daniel Hesslow
Julien Launay
209
4
0
26 Oct 2022
RoMQA: A Benchmark for Robust, Multi-evidence, Multi-answer Question Answering
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Victor Zhong
Weijia Shi
Anuj Kumar
Luke Zettlemoyer
229
29
0
25 Oct 2022
Same Pre-training Loss, Better Downstream: Implicit Bias Matters for Language Models
International Conference on Machine Learning (ICML), 2022
Hong Liu
Sang Michael Xie
Zhiyuan Li
Tengyu Ma
AI4CE
326
68
0
25 Oct 2022
Weakly Supervised Data Augmentation Through Prompting for Dialogue Understanding
Maximillian Chen
Alexandros Papangelis
Chenyang Tao
Andrew Rosenbaum
Seokhwan Kim
Yang Liu
Zhou Yu
Dilek Z. Hakkani-Tür
229
38
0
25 Oct 2022
Contrastive Search Is What You Need For Neural Text Generation
Yixuan Su
Nigel Collier
245
67
0
25 Oct 2022
Towards Better Few-Shot and Finetuning Performance with Forgetful Causal Language Models
Hao Liu
Xinyang Geng
Lisa Lee
Igor Mordatch
Sergey Levine
Sharan Narang
Pieter Abbeel
KELM, CLL
255
3
0
24 Oct 2022
Neural Theory-of-Mind? On the Limits of Social Intelligence in Large LMs
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Maarten Sap
Ronan Le Bras
Daniel Fried
Yejin Choi
397
272
0
24 Oct 2022
The Curious Case of Absolute Position Embeddings
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Koustuv Sinha
Amirhossein Kazemnejad
Siva Reddy
J. Pineau
Dieuwke Hupkes
Adina Williams
226
19
0
23 Oct 2022
Exploring The Landscape of Distributional Robustness for Question Answering Models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Anas Awadalla
Mitchell Wortsman
Gabriel Ilharco
Sewon Min
Ian H. Magnusson
Hannaneh Hajishirzi
Ludwig Schmidt
ELM, OOD, KELM
229
23
0
22 Oct 2022
Z-LaVI: Zero-Shot Language Solver Fueled by Visual Imagination
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Yue Yang
Wenlin Yao
Hongming Zhang
Xiaoyang Wang
Dong Yu
Jianshu Chen
VLM
224
24
0
21 Oct 2022