Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2505.16990
Cited By
v1
v2 (latest)
Dimple: Discrete Diffusion Multimodal Large Language Model with Parallel Decoding
22 May 2025
Runpeng Yu
Xinyin Ma
Xinchao Wang
MLLM
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (21 upvotes)
Papers citing
"Dimple: Discrete Diffusion Multimodal Large Language Model with Parallel Decoding"
50 / 54 papers shown
dVLM-AD: Enhance Diffusion Vision-Language-Model for Driving via Controllable Reasoning
Yingzi Ma
Yulong Cao
Wenhao Ding
Shuibai Zhang
Yan Wang
Boris Ivanovic
Ming Jiang
Marco Pavone
Chaowei Xiao
VLM
LRM
207
2
0
04 Dec 2025
Principled RL for Diffusion LLMs Emerges from a Sequence-Level Perspective
Jingyang Ou
Jiaqi Han
Minkai Xu
Shaoxuan Xu
Jianwen Xie
Stefano Ermon
Yi Wu
Chongxuan Li
DiffM
135
1
0
03 Dec 2025
Beyond Confidence: Adaptive and Coherent Decoding for Diffusion Language Models
Kecheng Chen
Ziru Liu
Xijia Tao
Hui Liu
Xinyu Fu
Suiyun Zhang
Dandan Tu
Lingpeng Kong
Rui Liu
Haoliang Li
DiffM
339
1
0
26 Nov 2025
From Bits to Rounds: Parallel Decoding with Exploration for Diffusion Language Models
Hengyu Fu
Baihe Huang
Virginia Adams
Charles Wang
Venkat Srinivasan
Jiantao Jiao
237
3
0
26 Nov 2025
Masked Diffusion Models are Secretly Learned-Order Autoregressive Models
Prateek Garg
Bhavya Kohli
Sunita Sarawagi
DiffM
222
1
0
24 Nov 2025
Bringing Stability to Diffusion: Decomposing and Reducing Variance of Training Masked Diffusion Models
Mengni Jia
Mengyu Zhou
Yihao Liu
Xiaoxi Jiang
Guanjun Jiang
106
0
0
22 Nov 2025
A Comprehensive Study on Visual Token Redundancy for Discrete Diffusion-based Multimodal Large Language Models
Duo Li
Zuhao Yang
Xiaoqin Zhang
Ling Shao
Shijian Lu
VLM
157
1
0
19 Nov 2025
Reasoning in Diffusion Large Language Models is Concentrated in Dynamic Confusion Zones
Ranfei Chen
Ming Chen
Kaifei Wang
DiffM
AI4CE
LRM
207
0
0
19 Nov 2025
D
3
^{3}
3
ToM: Decider-Guided Dynamic Token Merging for Accelerating Diffusion MLLMs
Shuochen Chang
Xiaofeng Zhang
Qingyang Liu
Li Niu
86
0
0
15 Nov 2025
KLASS: KL-Guided Fast Inference in Masked Diffusion Models
S. Kim
S. Hong
Hojung Jung
Youngrok Park
Se-Young Yun
DiffM
VLM
139
0
0
07 Nov 2025
From Denoising to Refining: A Corrective Framework for Vision-Language Diffusion Model
Yatai Ji
Teng Wang
Yuying Ge
Zhiheng Liu
Sidi Yang
Y. Shan
Ping Luo
DiffM
VLM
175
1
0
22 Oct 2025
Saber: An Efficient Sampling with Adaptive Acceleration and Backtracking Enhanced Remasking for Diffusion Language Model
Yihong Dong
Zhaoyu Ma
Xue Jiang
Zhiyuan Fan
Jiaru Qian
...
Rongyu Cao
B. Li
Fei Huang
Yongbin Li
Ge Li
142
4
0
20 Oct 2025
Attention Is All You Need for KV Cache in Diffusion LLMs
Quan Nguyen-Tri
Mukul Ranjan
Zhiqiang Shen
161
5
0
16 Oct 2025
Latent Refinement Decoding: Enhancing Diffusion-Based Language Models by Refining Belief States
Qinglin Zhu
Yizhen Yao
Runcong Zhao
Yanzheng Xiang
Amrutha Saseendran
Chen Jin
Philip Teare
Bin Liang
Yulan He
Lin Gui
DiffM
181
1
0
13 Oct 2025
Unlocking the Potential of Diffusion Language Models through Template Infilling
Junhoo Lee
Seungyeon Kim
Nojun Kwak
DiffM
AI4CE
71
0
0
13 Oct 2025
The False Promise of Zero-Shot Super-Resolution in Machine-Learned Operators
Mansi Sakarvadia
Kareem Hegazy
A. Totounferoush
Kyle Chard
Yaoqing Yang
Ian Foster
Michael W. Mahoney
SupR
294
26
0
08 Oct 2025
CreditDecoding: Accelerating Parallel Decoding in Diffusion Large Language Models with Trace Credits
Kangyu Wang
Zhiyun Jiang
Haibo Feng
Weijia Zhao
Lin Liu
Jianguo Li
Zhenzhong Lan
Weiyao Lin
124
5
0
07 Oct 2025
LaDiR: Latent Diffusion Enhances LLMs for Text Reasoning
Haoqiang Kang
Y. Zhang
Nikki Lijing Kuang
Nicklas Majamaki
Navdeep Jaitly
Yi-An Ma
Lianhui Qin
LRM
619
3
0
06 Oct 2025
Finish First, Perfect Later: Test-Time Token-Level Cross-Validation for Diffusion Large Language Models
Runchu Tian
Junxia Cui
Xueqiang Xu
Feng Yao
Jingbo Shang
164
1
0
06 Oct 2025
Free Draft-and-Verification: Toward Lossless Parallel Decoding for Diffusion Large Language Models
Shutong Wu
Jiawei Zhang
DiffM
322
2
0
30 Sep 2025
Fast-dLLM v2: Efficient Block-Diffusion LLM
Chengyue Wu
Hao Zhang
Shuchen Xue
Shizhe Diao
Y. Fu
Zhijian Liu
Pavlo Molchanov
Ping Luo
Song Han
Enze Xie
AI4CE
217
27
0
30 Sep 2025
AdaBlock-dLLM: Semantic-Aware Diffusion LLM Inference via Adaptive Block Size
Guanxi Lu
Hao Mark Chen
Yuto Karashima
Zhican Wang
Daichi Fujiki
Hongxiang Fan
AI4CE
152
4
0
30 Sep 2025
dParallel: Learnable Parallel Decoding for dLLMs
Zigeng Chen
Gongfan Fang
Xinyin Ma
Ruonan Yu
Xinchao Wang
126
12
0
30 Sep 2025
LLaDA-MoE: A Sparse MoE Diffusion Language Model
Fengqi Zhu
Zebin You
Yipeng Xing
Zenan Huang
Lin Liu
...
Junbo Zhao
Da Zheng
Chongxuan Li
Jianguo Li
J. Wen
MoE
267
15
0
29 Sep 2025
RFG: Test-Time Scaling for Diffusion Large Language Model Reasoning with Reward-Free Guidance
Tianlang Chen
Minkai Xu
Jure Leskovec
Stefano Ermon
LRM
AI4CE
151
2
0
29 Sep 2025
RIV: Recursive Introspection Mask Diffusion Vision Language Model
YuQian Li
Limeng Qiao
Lin Ma
VLM
86
1
0
28 Sep 2025
A2D: Any-Order, Any-Step Safety Alignment for Diffusion Language Models
Wonje Jeung
Sangyeon Yoon
Yoonjun Cho
Dongjae Jeon
Sangwoo Shin
Hyesoo Hong
Albert No
DiffM
168
0
0
27 Sep 2025
Soft-Di[M]O: Improving One-Step Discrete Image Generation with Soft Embeddings
Yuanzhi Zhu
Xi Wang
Stéphane Lathuilière
Vicky Kalogeiton
145
2
0
26 Sep 2025
From Text to Talk: Audio-Language Model Needs Non-Autoregressive Joint Training
Tianqiao Liu
Xueyi Li
Hao Wang
Haoxuan Li
Zhichao Chen
Weiqi Luo
Zitao Liu
AuLLM
148
0
0
24 Sep 2025
Set Block Decoding is a Language Model Inference Accelerator
Itai Gat
Heli Ben-Hamu
Marton Havasi
Daniel Haziza
Jeremy Reizenstein
Gabriel Synnaeve
David Lopez-Paz
Brian Karrer
Y. Lipman
162
7
0
04 Sep 2025
A Survey on Diffusion Language Models
Tianyi Li
Mingda Chen
Bowei Guo
Zhiqiang Shen
326
37
0
14 Aug 2025
Beyond Fixed: Training-Free Variable-Length Denoising for Diffusion Large Language Models
Jinsong Li
Xiaoyi Dong
Yuhang Zang
Yuhang Cao
Jiaqi Wang
Dahua Lin
DiffM
181
13
0
01 Aug 2025
LongLLaDA: Unlocking Long Context Capabilities in Diffusion LLMs
Xiaoran Liu
Zhigeng Liu
Zengfeng Huang
Zengfeng Huang
Qipeng Guo
Ziwei He
Xipeng Qiu
442
21
0
17 Jun 2025
Discrete Diffusion in Large Language and Multimodal Models: A Survey
Runpeng Yu
Qi Li
Xinchao Wang
DiffM
AI4CE
540
26
0
16 Jun 2025
Joint Vision-Language Social Bias Removal for CLIP
Computer Vision and Pattern Recognition (CVPR), 2024
Haoyu Zhang
Yangyang Guo
Mohan S. Kankanhalli
VLM
429
9
0
19 Nov 2024
Scaling Diffusion Language Models via Adaptation from Autoregressive Models
International Conference on Learning Representations (ICLR), 2024
Shansan Gong
Shivam Agarwal
Yizhe Zhang
Jiacheng Ye
Lin Zheng
...
Peilin Zhao
W. Bi
Jiawei Han
Yuan Yao
Dianbo Sui
AI4CE
423
145
0
23 Oct 2024
Beyond Autoregression: Discrete Diffusion for Complex Reasoning and Planning
International Conference on Learning Representations (ICLR), 2024
Jiacheng Ye
Lei Li
Shansan Gong
Lin Zheng
Xin Jiang
Zhiyu Li
Dianbo Sui
DiffM
LRM
647
74
0
18 Oct 2024
How Far Are We to GPT-4V? Closing the Gap to Commercial Multimodal Models with Open-Source Suites
Zhe Chen
Weiyun Wang
Hao Tian
Shenglong Ye
Zhangwei Gao
...
Tong Lu
Dahua Lin
Yu Qiao
Jifeng Dai
Wenhai Wang
MLLM
VLM
534
1,004
0
25 Apr 2024
MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI
Computer Vision and Pattern Recognition (CVPR), 2023
Xiang Yue
Yuansheng Ni
Kai Zhang
Tianyu Zheng
Ruoqi Liu
...
Yibo Liu
Wenhao Huang
Huan Sun
Yu-Chuan Su
Wenhu Chen
OSLM
ELM
VLM
869
1,649
0
27 Nov 2023
Discrete Diffusion Modeling by Estimating the Ratios of the Data Distribution
International Conference on Machine Learning (ICML), 2023
Aaron Lou
Chenlin Meng
Stefano Ermon
DiffM
445
335
0
25 Oct 2023
Improved Baselines with Visual Instruction Tuning
Computer Vision and Pattern Recognition (CVPR), 2023
Haotian Liu
Chunyuan Li
Yuheng Li
Yong Jae Lee
VLM
MLLM
616
4,263
0
05 Oct 2023
MathVista: Evaluating Mathematical Reasoning of Foundation Models in Visual Contexts
International Conference on Learning Representations (ICLR), 2023
Pan Lu
Hritik Bansal
Tony Xia
Hamish Ivison
Chun-yue Li
Hannaneh Hajishirzi
Hao Cheng
Kai-Wei Chang
Michel Galley
Jianfeng Gao
LRM
MLLM
577
1,198
0
03 Oct 2023
Qwen-VL: A Versatile Vision-Language Model for Understanding, Localization, Text Reading, and Beyond
Jinze Bai
Shuai Bai
Shusheng Yang
Shijie Wang
Sinan Tan
Peng Wang
Junyang Lin
Chang Zhou
Jingren Zhou
MLLM
VLM
ObjD
547
1,632
0
24 Aug 2023
MMBench: Is Your Multi-modal Model an All-around Player?
European Conference on Computer Vision (ECCV), 2023
Yuanzhan Liu
Haodong Duan
Yuanhan Zhang
Yue Liu
Songyang Zhang
...
Yuan Liu
Conghui He
Ziwei Liu
Kai-xiang Chen
Dahua Lin
762
1,685
0
12 Jul 2023
MME: A Comprehensive Evaluation Benchmark for Multimodal Large Language Models
Chaoyou Fu
Peixian Chen
Chunjiang Ge
Yulei Qin
Mengdan Zhang
...
Xing Sun
Zhenyu Qiu
Rongrong Ji
Caifeng Shan
Ran He
ELM
MLLM
815
1,252
0
23 Jun 2023
Evaluating Object Hallucination in Large Vision-Language Models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Yifan Li
Yifan Du
Kun Zhou
Jinpeng Wang
Wayne Xin Zhao
Ji-Rong Wen
MLLM
LRM
691
1,287
0
17 May 2023
Visual Instruction Tuning
Neural Information Processing Systems (NeurIPS), 2023
Haotian Liu
Chunyuan Li
Qingyang Wu
Yong Jae Lee
SyDa
VLM
MLLM
1.2K
7,615
0
17 Apr 2023
A Reparameterized Discrete Diffusion Model for Text Generation
Lin Zheng
Jianbo Yuan
Lei Yu
Lingpeng Kong
DiffM
289
119
0
11 Feb 2023
SeqDiffuSeq: Text Diffusion with Encoder-Decoder Transformers
Hongyi Yuan
Zheng Yuan
Chuanqi Tan
Fei Huang
Songfang Huang
DiffM
252
83
0
20 Dec 2022
CLIP-Diffusion-LM: Apply Diffusion Model on Image Captioning
Shi-You Xu
VLM
DiffM
204
18
0
10 Oct 2022
1
2
Next
Page 1 of 2