Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2205.01068
Cited By
v1
v2
v3
v4 (latest)
OPT: Open Pre-trained Transformer Language Models
2 May 2022
Susan Zhang
Stephen Roller
Naman Goyal
Mikel Artetxe
Moya Chen
Shuohui Chen
Christopher Dewan
Mona T. Diab
Xian Li
Xi Lin
Todor Mihaylov
Myle Ott
Sam Shleifer
Kurt Shuster
Daniel Simig
Punit Singh Koura
Anjali Sridhar
Tianlu Wang
Luke Zettlemoyer
VLM
OSLM
AI4CE
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (2 upvotes)
Papers citing
"OPT: Open Pre-trained Transformer Language Models"
50 / 2,924 papers shown
Towards Sampling Data Structures for Tensor Products in Turnstile Streams
Zhao Song
Shenghao Xie
Samson Zhou
147
0
0
04 Oct 2025
On the Empirical Power of Goodness-of-Fit Tests in Watermark Detection
Weiqing He
Xiang Li
Tianqi Shang
Li Shen
Weijie J. Su
Q. Long
WaLM
260
0
0
04 Oct 2025
Brain-Language Model Alignment: Insights into the Platonic Hypothesis and Intermediate-Layer Advantage
Angela Lopez-Cardona
Sebastian Idesis
Mireia Masias Bruns
Sergi Abadal
Ioannis Arapakis
120
0
0
03 Oct 2025
AgenticRAG: Tool-Augmented Foundation Models for Zero-Shot Explainable Recommender Systems
Bo Ma
Hang Li
ZeHua Hu
XiaoFan Gui
LuYao Liu
Simon Liu
LRM
129
0
0
03 Oct 2025
Don't Just Chase "Highlighted Tokens" in MLLMs: Revisiting Visual Holistic Context Retention
Xin Zou
Di Lu
Yizhou Wang
Yibo Yan
Yuanhuiyi Lyu
Xu Zheng
Linfeng Zhang
Xuming Hu
VLM
292
7
0
03 Oct 2025
Neural Correlates of Language Models Are Specific to Human Language
Iñigo Parra
144
0
0
03 Oct 2025
Detecting Post-generation Edits to Watermarked LLM Outputs via Combinatorial Watermarking
Liyan Xie
Muhammad Siddeek
Mohamed Seif
Andrea J. Goldsmith
Mengdi Wang
135
1
0
02 Oct 2025
The Unseen Frontier: Pushing the Limits of LLM Sparsity with Surrogate-Free ADMM
Kwanhee Lee
Hyeondo Jang
Dongyeop Lee
Dan Alistarh
Namhoon Lee
101
1
0
02 Oct 2025
Bridging Collaborative Filtering and Large Language Models with Dynamic Alignment, Multimodal Fusion and Evidence-grounded Explanations
Bo Ma
LuYao Liu
Simon Lau
Chandler Yuan
and XueY Cui
Rosie Zhang
62
0
0
02 Oct 2025
Learning a Zeroth-Order Optimizer for Fine-Tuning LLMs
Kairun Zhang
Xue Yang
Yanjun Zhao
Yifan Sun
Huan Zhang
182
0
0
01 Oct 2025
CAST: Continuous and Differentiable Semi-Structured Sparsity-Aware Training for Large Language Models
Weiyu Huang
Yuezhou Hu
Jun Zhu
Jianfei Chen
CLL
112
0
0
30 Sep 2025
Revealing the Power of Post-Training for Small Language Models via Knowledge Distillation
Miao Rang
Zhenni Bi
Hang Zhou
Hanting Chen
An Xiao
Tianyu Guo
Kai Han
Xinghao Chen
Yunhe Wang
162
1
0
30 Sep 2025
Understanding the Mixture-of-Experts with Nadaraya-Watson Kernel
Chuanyang Zheng
Jiankai Sun
Yihang Gao
Enze Xie
Yuehao Wang
...
Kashif Rasul
Mac Schwager
Anderson Schneider
Zinan Lin
Yuriy Nevmyvaka
MoE
231
2
0
30 Sep 2025
Scaling Spoken Language Models with Syllabic Speech Tokenization
Nicholas Lee
Cheol Jun Cho
Alan W. Black
Gopala K. Anumanchipalli
131
0
0
30 Sep 2025
UniPruning: Unifying Local Metric and Global Feedback for Scalable Sparse LLMs
Yizhuo Ding
Wanying Qu
Jiawei Geng
Wenqi Shao
Yanwei Fu
181
0
0
29 Sep 2025
Negative Pre-activations Differentiate Syntax
Linghao Kong
Angelina Ning
Micah Adler
Nir Shavit
124
0
0
29 Sep 2025
OIG-Bench: A Multi-Agent Annotated Benchmark for Multimodal One-Image Guides Understanding
Jiancong Xie
Wenjin Wang
Zhuomeng Zhang
Zihan Liu
Qi Liu
Ke Feng
Zixun Sun
Yuedong Yang
VLM
90
0
0
29 Sep 2025
Tequila: Trapping-free Ternary Quantization for Large Language Models
Hong Huang
Decheng Wu
Rui Cen
Guanghua Yu
Z. Li
Kai Liu
Jianchen Zhu
Peng Chen
Xue Liu
Dapeng Wu
MQ
271
3
0
28 Sep 2025
Knowledge distillation through geometry-aware representational alignment
Prajjwal Bhattarai
Mohammad Amjad
Dmytro Zhylko
Tuka Alhanai
177
0
0
27 Sep 2025
PT
2
^2
2
-LLM: Post-Training Ternarization for Large Language Models
Xianglong Yan
Chengzhu Bao
Zhiteng Li
Tianao Zhang
Kaicheng Yang
Haotong Qin
Ruobing Xie
Xingwu Sun
Yulun Zhang
MQ
227
0
0
27 Sep 2025
GeoBS: Information-Theoretic Quantification of Geographic Bias in AI Models
Zhangyu Wang
Nemin Wu
Zhongliang Zhou
Jiangnan Xia
Zeping Liu
...
A. Nambi
T. Ganu
Ni Lao
Ninghao Liu
Gengchen Mai
133
1
0
27 Sep 2025
SDQ-LLM: Sigma-Delta Quantization for 1-bit LLMs of any size
Junhao Xia
Ming Zhao
Limin Xiao
Xiujun Zhang
MQ
107
0
0
27 Sep 2025
LLM Watermark Evasion via Bias Inversion
Jeongyeon Hwang
Sangdon Park
Jungseul Ok
WaLM
345
0
0
27 Sep 2025
Black-Box Hallucination Detection via Consistency Under the Uncertain Expression
Seongho Joo
Kyungmin Min
Jahyun Koo
Kyomin Jung
HILM
118
2
0
26 Sep 2025
SuperOffload: Unleashing the Power of Large-Scale LLM Training on Superchips
Xinyu Lian
Masahiro Tanaka
Olatunji Ruwase
Minjia Zhang
121
2
0
25 Sep 2025
SCRA-VQA: Summarized Caption-Rerank for Augmented Large Language Models in Visual Question Answering
Yan Zhang
Jiaqing Lin
Miao Zhang
Kui Xiao
Xiaoju Hou
Yue Zhao
Ruoyao Xiao
111
0
0
25 Sep 2025
PMark: Towards Robust and Distortion-free Semantic-level Watermarking with Channel Constraints
Jiahao Huo
Shuliang Liu
Bin Wang
Junyan Zhang
Yibo Yan
Aiwei Liu
Xuming Hu
Mingxun Zhou
189
4
0
25 Sep 2025
GEP: A GCG-Based method for extracting personally identifiable information from chatbots built on small language models
Jieli Zhu
Vi Ngoc-Nha Tran
230
0
0
25 Sep 2025
Detoxifying Large Language Models via Autoregressive Reward Guided Representation Editing
Yisong Xiao
Aishan Liu
Siyuan Liang
Zonghao Ying
Xianglong Liu
Dacheng Tao
KELM
154
2
0
24 Sep 2025
Confidence-Aware Routing for Large Language Model Reliability Enhancement: A Multi-Signal Approach to Pre-Generation Hallucination Mitigation
Nandakishor M
HILM
129
0
0
23 Sep 2025
When Long Helps Short: How Context Length in Supervised Fine-tuning Affects Behavior of Large Language Models
Yingming Zheng
Hanqi Li
Kai Yu
Lu Chen
252
0
0
23 Sep 2025
Are We Scaling the Right Thing? A System Perspective on Test-Time Scaling
Youpeng Zhao
Jinpeng LV
Di Wu
Jun Wang
Christopher Gooley
LRM
108
0
0
23 Sep 2025
On-the-Fly Adaptation to Quantization: Configuration-Aware LoRA for Efficient Fine-Tuning of Quantized LLMs
Rongguang Ye
Ming Tang
Edith C. H. Ngai
MQ
100
0
0
22 Sep 2025
LIMI: Less is More for Agency
Yang Xiao
Mohan Jiang
Jie Sun
Keyu Li
Jifan Lin
...
Y. Cheng
Wenjie Li
Xiang Wang
Dequan Wang
Pengfei Liu
VLM
216
5
0
22 Sep 2025
BEFT: Bias-Efficient Fine-Tuning of Language Models
Baichuan Huang
Ananth Balashankar
Amir Aminifar
134
0
0
19 Sep 2025
Fair-GPTQ: Bias-Aware Quantization for Large Language Models
Irina Proskurina
Guillaume Metzler
Julien Velcin
MQ
138
0
0
18 Sep 2025
Do LLMs Align Human Values Regarding Social Biases? Judging and Explaining Social Biases with LLMs
Yang Liu
Chenhui Chu
167
0
0
17 Sep 2025
Prompt Stability in Code LLMs: Measuring Sensitivity across Emotion- and Personality-Driven Variations
Wei Ma
Y. Yang
Jingquan Ge
Xiaofei Xie
Lingxiao Jiang
152
0
0
17 Sep 2025
A Framework for Generating Artificial Datasets to Validate Absolute and Relative Position Concepts
George Correa de Araujo
H. Maia
Hélio Pedrini
144
0
0
17 Sep 2025
EvoEmpirBench: Dynamic Spatial Reasoning with Agent-ExpVer
Pukun Zhao
Longxiang Wang
Miaowei Wang
Chen Chen
Fanqing Zhou
Haojian Huang
209
0
0
16 Sep 2025
Character-Level Perturbations Disrupt LLM Watermarks
Zhaoxi Zhang
Xiaomei Zhang
Y. Zhang
He Zhang
Shirui Pan
B. Liu
Asif Q. Gill
Leo Yu Zhang
AAML
WaLM
410
1
0
11 Sep 2025
Collaborate, Deliberate, Evaluate: How LLM Alignment Affects Coordinated Multi-Agent Outcomes
Abhijnan Nath
Carine Graff
Nikhil Krishnaswamy
LLMAG
182
3
0
07 Sep 2025
SMooGPT: Stylized Motion Generation using Large Language Models
Lei Zhong
Yi Yang
Changjian Li
115
1
0
04 Sep 2025
RecBase: Generative Foundation Model Pretraining for Zero-Shot Recommendation
Sashuai Zhou
Weinan Gan
Qijiong Liu
Ke Lei
Jieming Zhu
Hai Huang
Yan Xia
Ruiming Tang
Zhenhua Dong
Zhou Zhao
126
4
0
03 Sep 2025
Behavioral Fingerprinting of Large Language Models
Zehua Pei
Hui-Ling Zhen
Ying Zhang
Zhiyuan Yang
Xing Li
Xianzhi Yu
Mingxuan Yuan
Bei Yu
90
2
0
02 Sep 2025
Evaluating Recabilities of Foundation Models: A Multi-Domain, Multi-Dataset Benchmark
Qijiong Liu
Jieming Zhu
Yingxin Lai
Xiaoyu Dong
Lu Fan
Zhipeng Bian
Zhenhua Dong
Xiao-Ming Wu
113
2
0
29 Aug 2025
MM-SeR: Multimodal Self-Refinement for Lightweight Image Captioning
Junha Song
Yongsik Jo
So Yeon Min
Quanting Xie
Taehwan Kim
Yonatan Bisk
Jaegul Choo
VLM
226
0
0
29 Aug 2025
VeriLoRA: Fine-Tuning Large Language Models with Verifiable Security via Zero-Knowledge Proofs
Guofu Liao
Taotao Wang
Shengli Zhang
Jiqun Zhang
Shi Long
Dacheng Tao
ALM
236
0
0
29 Aug 2025
PDTrim: Targeted Pruning for Prefill-Decode Disaggregation in Inference
Hao Zhang
Mengsi Lyu
Zhuo Chen
Xingrun Xing
Yulong Ao
Yonghua Lin
484
1
0
29 Aug 2025
GUARD: Glocal Uncertainty-Aware Robust Decoding for Effective and Efficient Open-Ended Text Generation
Yuanhao Ding
Esteban Garces Arias
Meimingwei Li
Julian Rodemann
Matthias Aßenmacher
Danlu Chen
Gaojuan Fan
C. Heumann
Chongsheng Zhang
165
3
0
28 Aug 2025
Previous
1
2
3
4
5
6
...
57
58
59
Next
Page 3 of 59
Page
of 59
Go