Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2205.01068
Cited By
v1
v2
v3
v4 (latest)
OPT: Open Pre-trained Transformer Language Models
2 May 2022
Susan Zhang
Stephen Roller
Naman Goyal
Mikel Artetxe
Moya Chen
Shuohui Chen
Christopher Dewan
Mona T. Diab
Xian Li
Xi Lin
Todor Mihaylov
Myle Ott
Sam Shleifer
Kurt Shuster
Daniel Simig
Punit Singh Koura
Anjali Sridhar
Tianlu Wang
Luke Zettlemoyer
VLM
OSLM
AI4CE
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (2 upvotes)
Papers citing
"OPT: Open Pre-trained Transformer Language Models"
50 / 2,924 papers shown
Exploring the Use of Large Language Models for Reference-Free Text Quality Evaluation: An Empirical Study
International Joint Conference on Natural Language Processing (IJCNLP), 2023
Yi Chen
Rui Wang
Haiyun Jiang
Shuming Shi
Ruifeng Xu
LM&MA
423
116
0
03 Apr 2023
LLMMaps -- A Visual Metaphor for Stratified Evaluation of Large Language Models
Patrik Puchert
Poonam Poonam
Christian van Onzenoodt
Timo Ropinski
153
11
0
02 Apr 2023
Evaluating Large Language Models on a Highly-specialized Topic, Radiation Oncology Physics
Frontiers in Oncology (Front Oncol), 2023
J. Holmes
Zheng Liu
Hua Zhou
Yuzhen Ding
Terence T. Sio
...
Jonathan B. Ashman
Xiang Li
Tianming Liu
Jiajian Shen
Wen Liu
LM&MA
AI4CE
ELM
234
144
0
01 Apr 2023
Evaluating GPT-4 and ChatGPT on Japanese Medical Licensing Examinations
Jungo Kasai
Y. Kasai
Keisuke Sakaguchi
Yutaro Yamada
Dragomir R. Radev
LM&MA
ELM
178
123
0
31 Mar 2023
CAMEL: Communicative Agents for "Mind" Exploration of Large Language Model Society
Neural Information Processing Systems (NeurIPS), 2023
Ge Li
Hasan Hammoud
Hani Itani
Dmitrii Khizbullin
Guohao Li
SyDa
ALM
580
977
0
31 Mar 2023
HuggingGPT: Solving AI Tasks with ChatGPT and its Friends in Hugging Face
Neural Information Processing Systems (NeurIPS), 2023
Yongliang Shen
Kaitao Song
Xu Tan
Dongsheng Li
Weiming Lu
Yueting Zhuang
MLLM
1.1K
1,240
0
30 Mar 2023
CodeGeeX: A Pre-Trained Model for Code Generation with Multilingual Benchmarking on HumanEval-X
Knowledge Discovery and Data Mining (KDD), 2023
Qinkai Zheng
Xiao Xia
Xu Zou
Yuxiao Dong
Shanshan Wang
...
Andi Wang
Yang Li
Teng Su
Zhilin Yang
Jie Tang
ELM
ALM
SyDa
400
474
0
30 Mar 2023
BloombergGPT: A Large Language Model for Finance
Shijie Wu
Ozan Irsoy
Steven Lu
Vadim Dabravolski
Mark Dredze
Sebastian Gehrmann
P. Kambadur
David S. Rosenberg
Gideon Mann
AIFin
686
1,157
0
30 Mar 2023
Mask-free OVIS: Open-Vocabulary Instance Segmentation without Manual Mask Annotations
Computer Vision and Pattern Recognition (CVPR), 2023
VS Vibashan
Ning Yu
Chen Xing
Can Qin
M. Gao
Juan Carlos Niebles
Vishal M. Patel
Ran Xu
VLM
ISeg
252
19
0
29 Mar 2023
An Over-parameterized Exponential Regression
Yeqi Gao
Sridhar Mahadevan
Zhao Song
270
43
0
29 Mar 2023
Larger Probes Tell a Different Story: Extending Psycholinguistic Datasets Via In-Context Learning
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Namrata Shivagunde
Vladislav Lialin
Anna Rumshisky
364
4
0
29 Mar 2023
InceptionNeXt: When Inception Meets ConvNeXt
Computer Vision and Pattern Recognition (CVPR), 2023
Weihao Yu
Pan Zhou
Shuicheng Yan
Xinchao Wang
544
268
0
29 Mar 2023
LLaMA-Adapter: Efficient Fine-tuning of Language Models with Zero-init Attention
Renrui Zhang
Jiaming Han
Chris Liu
Shiyang Feng
Aojun Zhou
Xiangfei Hu
Shilin Yan
Pan Lu
Jiaming Song
Yu Qiao
MLLM
590
943
0
28 Mar 2023
Training Language Models with Language Feedback at Scale
Jérémy Scheurer
Jon Ander Campos
Tomasz Korbak
Jun Shern Chan
Angelica Chen
Dong Wang
Ethan Perez
ALM
359
123
0
28 Mar 2023
Hallucinations in Large Multilingual Translation Models
Transactions of the Association for Computational Linguistics (TACL), 2023
Nuno M. Guerreiro
Duarte M. Alves
Jonas Waldendorf
Barry Haddow
Alexandra Birch
Pierre Colombo
André F.T. Martins
VLM
HILM
LRM
416
203
0
28 Mar 2023
Unmasked Teacher: Towards Training-Efficient Video Foundation Models
IEEE International Conference on Computer Vision (ICCV), 2023
Kunchang Li
Yali Wang
Yizhuo Li
Yi Wang
Yinan He
Limin Wang
Yu Qiao
VGen
536
238
0
28 Mar 2023
Solving Regularized Exp, Cosh and Sinh Regression Problems
Zhihang Li
Zhao Song
Wanrong Zhu
211
41
0
28 Mar 2023
Foundation Models and Fair Use
Journal of machine learning research (JMLR), 2023
Peter Henderson
Xuechen Li
Dan Jurafsky
Tatsunori Hashimoto
Christopher De Sa
Abigail Z. Jacobs
187
161
0
28 Mar 2023
Scaling Down to Scale Up: A Guide to Parameter-Efficient Fine-Tuning
Vladislav Lialin
Vijeta Deshpande
Anna Rumshisky
325
238
0
28 Mar 2023
Unlocking the Potential of ChatGPT: A Comprehensive Exploration of its Applications, Advantages, Limitations, and Future Directions in Natural Language Processing
Walid Hariri
AI4MH
LM&MA
909
120
0
27 Mar 2023
Unified Text Structuralization with Instruction-tuned Language Models
Xuanfan Ni
Piji Li
Huayang Li
240
15
0
27 Mar 2023
Koala: An Index for Quantifying Overlaps with Pre-training Corpora
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Thuy-Trang Vu
Xuanli He
Gholamreza Haffari
Ehsan Shareghi
CLL
177
23
0
26 Mar 2023
Exploring the Impact of Instruction Data Scaling on Large Language Models: An Empirical Study on Real-World Use Cases
Yunjie Ji
Yong Deng
Yan Gong
Yiping Peng
Qiang Niu
Guang Dai
Baochang Ma
Xiangang Li
ALM
179
116
0
26 Mar 2023
No more Reviewer #2: Subverting Automatic Paper-Reviewer Assignment using Adversarial Learning
USENIX Security Symposium (USENIX Security), 2023
Thorsten Eisenhofer
Erwin Quiring
Jonas Moller
Doreen Riepel
Thorsten Holz
Konrad Rieck
AAML
244
9
0
25 Mar 2023
Scaling Expert Language Models with Unsupervised Domain Discovery
Suchin Gururangan
Margaret Li
M. Lewis
Weijia Shi
Tim Althoff
Noah A. Smith
Luke Zettlemoyer
MoE
274
58
0
24 Mar 2023
k
k
k
NN Prompting: Beyond-Context Learning with Calibration-Free Nearest Neighbor Inference
International Conference on Learning Representations (ICLR), 2023
Benfeng Xu
Quan Wang
Zhendong Mao
Yajuan Lyu
Qiaoqiao She
Yongdong Zhang
308
66
0
24 Mar 2023
Paraphrasing evades detectors of AI-generated text, but retrieval is an effective defense
Neural Information Processing Systems (NeurIPS), 2023
Kalpesh Krishna
Yixiao Song
Marzena Karpinska
John Wieting
Mohit Iyyer
DeLMO
357
439
0
23 Mar 2023
Fundamentals of Generative Large Language Models and Perspectives in Cyber-Defense
Andrei Kucharavy
Z. Schillaci
Loic Maréchal
Maxime Wursch
Ljiljana Dolamic
Remi Sabonnadiere
Dimitri Percia David
Alain Mermoud
Vincent Lenders
ELM
AI4CE
171
37
0
21 Mar 2023
CompoDiff: Versatile Composed Image Retrieval With Latent Diffusion
Geonmo Gu
Sanghyuk Chun
Wonjae Kim
HeeJae Jun
Yoohoon Kang
Sangdoo Yun
DiffM
555
84
0
21 Mar 2023
Multi-modal Prompting for Low-Shot Temporal Action Localization
Chen Ju
Zeqian Li
Peisen Zhao
Ya Zhang
Xiaopeng Zhang
Qi Tian
Yanfeng Wang
Weidi Xie
201
25
0
21 Mar 2023
Large AI Models in Health Informatics: Applications, Challenges, and the Future
IEEE journal of biomedical and health informatics (IEEE JBHI), 2023
Jianing Qiu
Lin Li
Jiankai Sun
Jiachuan Peng
Peilun Shi
...
Bo Xiao
Wu Yuan
Ningli Wang
Dong Xu
Benny Lo
AI4MH
LM&MA
285
185
0
21 Mar 2023
Language Model Behavior: A Comprehensive Survey
International Conference on Computational Logic (ICCL), 2023
Tyler A. Chang
Benjamin Bergen
VLM
LRM
LM&MA
381
141
0
20 Mar 2023
eP-ALM: Efficient Perceptual Augmentation of Language Models
IEEE International Conference on Computer Vision (ICCV), 2023
Mustafa Shukor
Corentin Dancette
Matthieu Cord
MLLM
VLM
424
34
0
20 Mar 2023
CLIP goes 3D: Leveraging Prompt Tuning for Language Grounded 3D Recognition
Deepti Hegde
Jeya Maria Jose Valanarasu
Vishal M. Patel
CLIP
427
98
0
20 Mar 2023
PanGu-Σ: Towards Trillion Parameter Language Model with Sparse Heterogeneous Computing
Xiaozhe Ren
Pingyi Zhou
Xinfan Meng
Xinjing Huang
Yadao Wang
...
Jiansheng Wei
Xin Jiang
Teng Su
Qun Liu
Jun Yao
ALM
MoE
242
81
0
20 Mar 2023
DeID-GPT: Zero-shot Medical Text De-Identification by GPT-4
Zheng-Long Liu
Yue Huang
Xiao-Xing Yu
Lu Zhang
Zihao Wu
...
Hongtu Zhu
Shijie Zhao
Tianming Liu
D. Zhu
Xiang Li
MedIm
LM&MA
297
211
0
20 Mar 2023
Large Language Model Instruction Following: A Survey of Progresses and Challenges
Computational Linguistics (CL), 2023
Renze Lou
Kai Zhang
Wenpeng Yin
ALM
LRM
856
38
0
18 Mar 2023
A Comprehensive Capability Analysis of GPT-3 and GPT-3.5 Series Models
Junjie Ye
Xuanting Chen
Nuo Xu
Can Zu
Zekai Shao
...
Jie Zhou
Siming Chen
Tao Gui
Tao Gui
Xuanjing Huang
ELM
313
444
0
18 Mar 2023
Instance-Conditioned GAN Data Augmentation for Representation Learning
Pietro Astolfi
Arantxa Casanova
Jakob Verbeek
Pascal Vincent
Adriana Romero Soriano
M. Drozdzal
220
10
0
16 Mar 2023
SemDeDup: Data-efficient learning at web-scale through semantic deduplication
Amro Abbas
Kushal Tirumala
Daniel Simig
Surya Ganguli
Ari S. Morcos
305
243
0
16 Mar 2023
DeltaScore: Fine-Grained Story Evaluation with Perturbations
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Zhuohan Xie
Miao Li
Trevor Cohn
Jey Han Lau
391
11
0
15 Mar 2023
SelfCheckGPT: Zero-Resource Black-Box Hallucination Detection for Generative Large Language Models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Potsawee Manakul
Adian Liusie
Mark Gales
HILM
LRM
424
692
0
15 Mar 2023
UPRISE: Universal Prompt Retrieval for Improving Zero-Shot Evaluation
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Daixuan Cheng
Shaohan Huang
Junyu Bi
Yu-Wei Zhan
Jianfeng Liu
Yujing Wang
Hao Sun
Furu Wei
Denvy Deng
Tao Gui
RALM
LRM
238
95
0
15 Mar 2023
MCR-DL: Mix-and-Match Communication Runtime for Deep Learning
IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2023
Quentin G. Anthony
A. A. Awan
Jeff Rasley
Yuxiong He
Hari Subramoni
Mustafa Abduljabbar
Hari Subramoni
D. Panda
MoE
130
8
0
15 Mar 2023
ZeroQuant-V2: Exploring Post-training Quantization in LLMs from Comprehensive Study to Low Rank Compensation
Z. Yao
Xiaoxia Wu
Cheng-rong Li
Stephen Youn
Yuxiong He
MQ
388
71
0
15 Mar 2023
The Life Cycle of Knowledge in Big Language Models: A Survey
Machine Intelligence Research (MIR), 2023
Boxi Cao
Hongyu Lin
Xianpei Han
Le Sun
KELM
266
30
0
14 Mar 2023
Exploring ChatGPT's Ability to Rank Content: A Preliminary Study on Consistency with Human Preferences
Yunjie Ji
Yan Gong
Yiping Peng
Chao Ni
Peiyan Sun
Dongyu Pan
Baochang Ma
Xiangang Li
ELM
ALM
AI4MH
126
40
0
14 Mar 2023
Eliciting Latent Predictions from Transformers with the Tuned Lens
Nora Belrose
Zach Furman
Logan Smith
Danny Halawi
Igor V. Ostrovsky
Lev McKinney
Stella Biderman
Jacob Steinhardt
669
320
0
14 Mar 2023
Transformer Models for Acute Brain Dysfunction Prediction
B. Silva
Miguel Contreras
T. Ozrazgat-Baslanti
Yuanfang Ren
Ziyuan Guan
Kia Khezeli
A. Bihorac
Parisa Rashidi
132
0
0
13 Mar 2023
FlexGen: High-Throughput Generative Inference of Large Language Models with a Single GPU
International Conference on Machine Learning (ICML), 2023
Ying Sheng
Lianmin Zheng
Binhang Yuan
Zhuohan Li
Max Ryabinin
...
Joseph E. Gonzalez
Abigail Z. Jacobs
Christopher Ré
Ion Stoica
Ce Zhang
454
585
0
13 Mar 2023
Previous
1
2
3
...
51
52
53
...
57
58
59
Next
Page 52 of 59
Page
of 59
Go