Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2504.14666
Cited By
Generative Multimodal Pretraining with Discrete Diffusion Timestep Tokens
Computer Vision and Pattern Recognition (CVPR), 2025
20 April 2025
Kaihang Pan
Wang Lin
Zhongqi Yue
Tenglong Ao
Liyu Jia
Wei Zhao
Juncheng Billy Li
Siliang Tang
Hanwang Zhang
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Generative Multimodal Pretraining with Discrete Diffusion Timestep Tokens"
17 / 17 papers shown
Title
REAR: Rethinking Visual Autoregressive Models via Generator-Tokenizer Consistency Regularization
Qiyuan He
Y. Li
Haotian Ye
Jinghao Wang
Xinyao Liao
Pheng-Ann Heng
Stefano Ermon
James Zou
Angela Yao
DiffM
VGen
204
1
0
06 Oct 2025
Reconstruction Alignment Improves Unified Multimodal Models
Ji Xie
Trevor Darrell
Luke Zettlemoyer
Xudong Wang
174
13
0
08 Sep 2025
Towards Meta-Cognitive Knowledge Editing for Multimodal LLMs
Zhaoyu Fan
Kaihang Pan
Mingze Zhou
Bosheng Qin
Juncheng Billy Li
Shengyu Zhang
Wenqiao Zhang
Siliang Tang
Fei Wu
Yueting Zhuang
KELM
120
0
0
06 Sep 2025
TAP: Parameter-efficient Task-Aware Prompting for Adverse Weather Removal
Hanting Wang
Shengpeng Ji
Shulei Wang
Hai Huang
Xiao Jin
Qifei Zhang
Tao Jin
68
0
0
11 Aug 2025
Discrete Tokenization for Multimodal LLMs: A Comprehensive Survey
Jindong Li
Yali Fu
Jiahong Liu
Linxiao Cao
Wei Ji
Menglin Yang
Irwin King
Ming-Hsuan Yang
OffRL
134
2
0
21 Jul 2025
What Limits Virtual Agent Application? OmniBench: A Scalable Multi-Dimensional Benchmark for Essential Virtual Agent Capabilities
Wendong Bu
Yang Wu
Qifan Yu
Minghe Gao
Bingchen Miao
...
Mengze Li
Wei Ji
Juncheng Billy Li
Siliang Tang
Yueting Zhuang
ELM
145
2
0
10 Jun 2025
FocusDiff: Advancing Fine-Grained Text-Image Alignment for Autoregressive Visual Generation through RL
Kaihang Pan
Wendong Bu
Y. Wu
Yang Wu
Kai Shen
Yunfei Li
Hang Zhao
Juncheng Billy Li
Siliang Tang
Yueting Zhuang
198
9
0
05 Jun 2025
FUDOKI: Discrete Flow-based Unified Understanding and Generation via Kinetic-Optimal Velocities
Jin Wang
Yao Lai
Aoxue Li
Shifeng Zhang
Jiacheng Sun
Ning Kang
Chengyue Wu
Zhenguo Li
Ping Luo
342
17
0
26 May 2025
KRIS-Bench: Benchmarking Next-Level Intelligent Image Editing Models
Yongliang Wu
Zonghui Li
Xinting Hu
Xinyu Ye
Xianfang Zeng
Gang Yu
Wenbo Zhu
Bernt Schiele
Ming-Hsuan Yang
Xu Yang
VLM
252
21
0
22 May 2025
Reasoning Physical Video Generation with Diffusion Timestep Tokens via Reinforcement Learning
Wang Lin
Liyu Jia
Wentao Hu
Kaihang Pan
Zhongqi Yue
Wei Zhao
Jingyuan Chen
Fei Wu
Hanwang Zhang
VGen
238
8
0
22 Apr 2025
Boosting Virtual Agent Learning and Reasoning: A Step-Wise, Multi-Dimensional, and Generalist Reward Model with Benchmark
Bingchen Miao
Y. Wu
Minghe Gao
Qifan Yu
Wendong Bu
Wenqiao Zhang
Yunfei Li
Siliang Tang
Tat-Seng Chua
Juncheng Billy Li
LLMAG
LRM
360
3
0
24 Mar 2025
Janus-Pro: Unified Multimodal Understanding and Generation with Data and Model Scaling
Xiaokang Chen
Zhiyu Wu
Xingchao Liu
Zizheng Pan
Wen Liu
Zhenda Xie
X. Yu
Chong Ruan
AI4TS
510
434
0
29 Jan 2025
VARGPT: Unified Understanding and Generation in a Visual Autoregressive Multimodal Large Language Model
Xianwei Zhuang
Yuxin Xie
Yufan Deng
Liming Liang
Jinghan Ru
Yuguo Yin
Yuexian Zou
MLLM
VLM
LRM
285
27
0
21 Jan 2025
AnyEdit: Mastering Unified High-Quality Image Editing for Any Idea
Computer Vision and Pattern Recognition (CVPR), 2024
Qifan Yu
Wei Chow
Zhongqi Yue
Kaihang Pan
Yang Wu
Xiaoyang Wan
Juncheng Billy Li
Siliang Tang
Hao Zhang
Yueting Zhuang
DiffM
437
107
0
24 Nov 2024
GraphCLIP: Enhancing Transferability in Graph Foundation Models for Text-Attributed Graphs
The Web Conference (WWW), 2024
Yun Zhu
Haizhou Shi
Xiaotang Wang
Yongchao Liu
Yaoke Wang
Boci Peng
Chuntao Hong
Siliang Tang
VLM
519
33
0
14 Oct 2024
Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining
Dongyang Liu
Shitian Zhao
Le Zhuo
Weifeng Lin
Ping Luo
Xinyue Li
Qi Qin
Yu Qiao
Hongsheng Li
Peng Gao
MLLM
361
106
0
05 Aug 2024
Chameleon: Mixed-Modal Early-Fusion Foundation Models
Chameleon Team
MLLM
488
601
0
16 May 2024
1