ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2411.16318
  4. Cited By
One Diffusion to Generate Them All
v1v2 (latest)

One Diffusion to Generate Them All

Computer Vision and Pattern Recognition (CVPR), 2024
25 November 2024
Duong H. Le
Tuan Pham
Sangho Lee
Christopher Clark
Aniruddha Kembhavi
Stephan Mandt
Ranjay Krishna
Jiasen Lu
    VLM
ArXiv (abs)PDFHTMLHuggingFace (31 upvotes)Github (624★)

Papers citing "One Diffusion to Generate Them All"

23 / 23 papers shown
Title
Visual Bridge: Universal Visual Perception Representations Generating
Visual Bridge: Universal Visual Perception Representations Generating
Yilin Gao
Shuguang Dou
Junzhou Li
Zhiheng Yu
Yin Li
Dongsheng Jiang
Shugong Xu
DiffMVOS
238
0
0
11 Nov 2025
The False Promise of Zero-Shot Super-Resolution in Machine-Learned Operators
The False Promise of Zero-Shot Super-Resolution in Machine-Learned Operators
Mansi Sakarvadia
Kareem Hegazy
A. Totounferoush
Kyle Chard
Yaoqing Yang
Ian Foster
Michael W. Mahoney
SupR
208
12
0
08 Oct 2025
Zoom-In to Sort AI-Generated Images Out
Zoom-In to Sort AI-Generated Images Out
Yikun Ji
Y. Hong
Bowen Deng
Jun Lan
Huijia Zhu
Weiqiang Wang
Liqing Zhang
Jianfu Zhang
104
0
0
05 Oct 2025
Universal Multi-Domain Translation via Diffusion Routers
Universal Multi-Domain Translation via Diffusion Routers
Duc Kieu
Kien Do
Tuan Hoang
T. Le
Tung Kieu
D. Nguyen
T. Nguyen
84
0
0
26 Sep 2025
MultiCrafter: High-Fidelity Multi-Subject Generation via Disentangled Attention and Identity-Aware Preference Alignment
MultiCrafter: High-Fidelity Multi-Subject Generation via Disentangled Attention and Identity-Aware Preference Alignment
Tao Wu
Yibo Jiang
Yehao Lu
Zhizhong Wang
Longxiang Zhang
Zequn Qin
Xi Li
96
1
0
26 Sep 2025
Video models are zero-shot learners and reasoners
Video models are zero-shot learners and reasoners
Thaddäus Wiedemer
Yuxuan Li
Paul Vicol
Shixiang Shane Gu
Nick Matarese
Kevin Swersky
Been Kim
P. Jaini
Robert Geirhos
VLMLRM
176
37
0
24 Sep 2025
ShaLa: Multimodal Shared Latent Space Modelling
ShaLa: Multimodal Shared Latent Space Modelling
Jiali Cui
Yan-Ying Chen
Yanxia Zhang
M. Klenk
92
0
0
24 Aug 2025
Ouroboros: Single-step Diffusion Models for Cycle-consistent Forward and Inverse Rendering
Ouroboros: Single-step Diffusion Models for Cycle-consistent Forward and Inverse Rendering
Shanlin Sun
Yifan Wang
Hanwen Zhang
Yifeng Xiong
Qin Ren
Ruogu Fang
Xiaohui Xie
Chenyu You
126
2
0
20 Aug 2025
OmniTry: Virtual Try-On Anything without Masks
OmniTry: Virtual Try-On Anything without Masks
Yutong Feng
Linlin Zhang
H. Cao
Yiming Chen
Xiaoduan Feng
Jian Cao
Yuxiong Wu
Bin Wang
72
1
0
19 Aug 2025
LaVieID: Local Autoregressive Diffusion Transformers for Identity-Preserving Video Creation
LaVieID: Local Autoregressive Diffusion Transformers for Identity-Preserving Video Creation
Wenhui Song
Hanhui Li
Jiehui Huang
Panwen Hu
Yuhao Cheng
Long Chen
Yiqiang Yan
Xiaodan Liang
DiffMVGen
101
2
0
11 Aug 2025
Trade-offs in Image Generation: How Do Different Dimensions Interact?
Trade-offs in Image Generation: How Do Different Dimensions Interact?
Sicheng Zhang
Binzhu Xie
Zhonghao Yan
Yuli Zhang
Donghao Zhou
Xiaofei Chen
Shi Qiu
Jiaqi Liu
Guoyang Xie
Zhichao Lu
115
2
0
29 Jul 2025
Lumina-mGPT 2.0: Stand-Alone AutoRegressive Image Modeling
Lumina-mGPT 2.0: Stand-Alone AutoRegressive Image Modeling
Yi Xin
Juncheng Yan
Qi Qin
Ge Wang
Dongyang Liu
...
Jiaming Song
Guangtao Zhai
Xiaohong Liu
Botian Shi
Peng Gao
150
16
0
23 Jul 2025
RichControl: Structure- and Appearance-Rich Training-Free Spatial Control for Text-to-Image Generation
RichControl: Structure- and Appearance-Rich Training-Free Spatial Control for Text-to-Image Generation
Liheng Zhang
Lexi Pang
Hang Ye
Xiaoxuan Ma
Yizhou Wang
DiffM
147
0
0
03 Jul 2025
MultiHuman-Testbench: Benchmarking Image Generation for Multiple Humans
MultiHuman-Testbench: Benchmarking Image Generation for Multiple Humans
Shubhankar Borse
Seokeon Choi
S. Park
J. Kim
Shreya Kadambi
Risheek Garrepalli
Sungrack Yun
Munawar Hayat
Fatih Porikli
EGVMVLM
207
2
0
25 Jun 2025
StableMTL: Repurposing Latent Diffusion Models for Multi-Task Learning from Partially Annotated Synthetic Datasets
StableMTL: Repurposing Latent Diffusion Models for Multi-Task Learning from Partially Annotated Synthetic Datasets
Anh-Quan Cao
Ivan Lopes
Raoul de Charette
171
1
0
09 Jun 2025
Jodi: Unification of Visual Generation and Understanding via Joint Modeling
Jodi: Unification of Visual Generation and Understanding via Joint Modeling
Yifeng Xu
Zhenliang He
Meina Kan
Shiguang Shan
Xilin Chen
VLM
274
1
0
25 May 2025
CreatiDesign: A Unified Multi-Conditional Diffusion Transformer for Creative Graphic Design
CreatiDesign: A Unified Multi-Conditional Diffusion Transformer for Creative Graphic Design
H. Zhang
Dexiang Hong
Maoke Yang
Yutao Chen
Zhao Zhang
Jie Shao
Xinglong Wu
Zuxuan Wu
Yu Jiang
DiffMAI4CE
423
11
0
25 May 2025
MUSAR: Exploring Multi-Subject Customization from Single-Subject Dataset via Attention Routing
MUSAR: Exploring Multi-Subject Customization from Single-Subject Dataset via Attention Routing
Zinan Guo
Pengze Zhang
Yanze Wu
Chong Mou
Mingcong Liu
Qian He
183
4
0
05 May 2025
OmniVDiff: Omni Controllable Video Diffusion for Generation and Understanding
OmniVDiff: Omni Controllable Video Diffusion for Generation and Understanding
Dianbing Xi
Jiadong Wang
Yuanzhi Liang
Xi Qiu
Yuchi Huo
Ruiqi Wang
Fangqiu Yi
Xuzhao Li
DiffMVGen
464
10
0
15 Apr 2025
Lumina-OmniLV: A Unified Multimodal Framework for General Low-Level Vision
Lumina-OmniLV: A Unified Multimodal Framework for General Low-Level Vision
Yuandong Pu
Le Zhuo
Kaiwen Zhu
Liangbin Xie
Wenlong Zhang
Xiangyu Chen
Peng Gao
Botian Shi
Chao Dong
Yihao Liu
MLLM
252
9
0
07 Apr 2025
From Fragment to One Piece: A Survey on AI-Driven Graphic Design
From Fragment to One Piece: A Survey on AI-Driven Graphic Design
Xingxing Zou
Wen Zhang
Nanxuan Zhao
286
2
0
24 Mar 2025
UniVG: A Generalist Diffusion Model for Unified Image Generation and Editing
UniVG: A Generalist Diffusion Model for Unified Image Generation and Editing
Tsu-Jui Fu
Yusu Qian
Chen Chen
Wenze Hu
Zhe Gan
Yue Yang
497
8
0
16 Mar 2025
Chameleon: Mixed-Modal Early-Fusion Foundation Models
Chameleon: Mixed-Modal Early-Fusion Foundation Models
Chameleon Team
MLLM
440
581
0
16 May 2024
1