Versatile Diffusion: Text, Images and Variations All in One Diffusion Model

15 November 2022
Xingqian Xu
Zhangyang Wang
Eric Zhang
Kai Wang
Humphrey Shi
    DiffM
arXiv: 2211.08332

Papers citing "Versatile Diffusion: Text, Images and Variations All in One Diffusion Model"

50 / 139 papers shown
Adaptive Diffusion Policy Optimization for Robotic Manipulation
Huiyun Jiang
Zhuang Yang
9
0
0
13 May 2025
Any-to-Any Vision-Language Model for Multimodal X-ray Imaging and Radiological Report Generation
Daniele Molino
Francesco Di Feola
Linlin Shen
Paolo Soda
V. Guarrasi
MedIm
LM&MA
57
0
0
02 May 2025
The Dual Power of Interpretable Token Embeddings: Jailbreaking Attacks and Defenses for Diffusion Model Unlearning
Siyi Chen
Yimeng Zhang
Sijia Liu
Q. Qu
AAML
61
0
0
30 Apr 2025
Mind the Trojan Horse: Image Prompt Adapter Enabling Scalable and Deceptive Jailbreaking
Junxi Chen
Junhao Dong
Xiaohua Xie
33
0
0
08 Apr 2025
QIRL: Boosting Visual Question Answering via Optimized Question-Image Relation Learning
Quanxing Xu
Ling Zhou
X. Zhong
Feifei Zhang
Rubing Huang
Chia-Wen Lin
34
0
0
04 Apr 2025
Unconditional Priors Matter! Improving Conditional Generation of Fine-Tuned Diffusion Models
Prin Phunyaphibarn
Phillip Y. Lee
Jaihoon Kim
Minhyuk Sung
DiffM
81
0
0
26 Mar 2025
LaPIG: Cross-Modal Generation of Paired Thermal and Visible Facial Images
Leyang Wang
Joice Lin
DiffM
63
0
0
20 Mar 2025
A Survey on fMRI-based Brain Decoding for Reconstructing Multimodal Stimuli
Pengyu Liu
Guohua Dong
D. Guo
Kun Li
Fengling Li
Xun Yang
Meng Wang
Xiaomin Ying
AI4CE
41
0
0
20 Mar 2025
UniVG: A Generalist Diffusion Model for Unified Image Generation and Editing
Tsu-jui Fu
Yusu Qian
Chen Chen
Wenze Hu
Zhe Gan
Y. Yang
85
1
0
16 Mar 2025
Make Optimization Once and for All with Fine-grained Guidance
Mingjia Shi
Ruihan Lin
Xuxi Chen
Yuhao Zhou
Zezhen Ding
...
Tong Wang
Kai Wang
Zhangyang Wang
J. Zhang
Tianlong Chen
48
1
0
14 Mar 2025
Towards Generalization of Tactile Image Generation: Reference-Free Evaluation in a Leakage-Free Setting
Cagri Gungor
Derek Eppinger
Adriana Kovashka
56
0
0
10 Mar 2025
SEED: Towards More Accurate Semantic Evaluation for Visual Brain Decoding
Juhyeon Park
P. Y. Kim
Jiook Cha
Shinjae Yoo
Taesup Moon
48
0
0
09 Mar 2025
Fine-Tuning Florence2 for Enhanced Object Detection in Un-constructed Environments: Vision-Language Model Approach
Soumyadeep Ro
Sanapala Satwika
Pamarthi Yasoda Gayathri
Mohmmad Ghaith Balsha
Aysegul Ucar
VLM
ObjD
54
0
0
06 Mar 2025
Language-Guided Visual Perception Disentanglement for Image Quality Assessment and Conditional Image Generation
Zhichao Yang
Leida Li
Pengfei Chen
Jinjian Wu
Giuseppe Valenzise
64
0
0
04 Mar 2025
MindSimulator: Exploring Brain Concept Localization via Synthetic FMRI
Guangyin Bao
Qi Zhang
Z. Gong
Zhuojia Wu
Duoqian Miao
34
0
0
04 Mar 2025
Hummingbird: High Fidelity Image Generation via Multimodal Context Alignment
Minh-Quan Le
Gaurav Mittal
Tianjian Meng
A S M Iftekhar
Vishwas Suryanarayanan
Barun Patra
Dimitris Samaras
Mei Chen
DiffM
53
0
0
07 Feb 2025
BrainGuard: Privacy-Preserving Multisubject Image Reconstructions from Brain Activities
Zhibo Tian
Ruijie Quan
Fan Ma
Kun Zhan
Yi Yang
29
1
0
24 Jan 2025
PIXELS: Progressive Image Xemplar-based Editing with Latent Surgery
Shristi Das Biswas
Matthew Shreve
Xuelu Li
Prateek Singhal
Kaushik Roy
DiffM
36
1
0
20 Jan 2025
MedCoDi-M: A Multi-Prompt Foundation Model for Multimodal Medical Data Generation
Daniele Molino
Francesco Di Feola
E. Faiella
Deborah Fazzini
D. Santucci
Linlin Shen
V. Guarrasi
Paolo Soda
SyDa
MedIm
39
0
0
10 Jan 2025
AdaDiff: Adaptive Step Selection for Fast Diffusion Models
Hui Zhang
Zuxuan Wu
Zhen Xing
Jie Shao
Yu-Gang Jiang
32
9
0
31 Dec 2024
D-Judge: How Far Are We? Evaluating the Discrepancies Between AI-synthesized Images and Natural Images through Multimodal Guidance
Renyang Liu
Ziyu Lyu
Wei Zhou
See-Kiong Ng
EGVM
28
0
0
23 Dec 2024
Optimized two-stage AI-based Neural Decoding for Enhanced Visual Stimulus Reconstruction from fMRI Data
Lorenzo Veronese
Andrea Moglia
Luca Mainardi
Pietro Cerveri
DiffM
59
0
0
17 Dec 2024
UIBDiffusion: Universal Imperceptible Backdoor Attack for Diffusion Models
Yuning Han
Bingyin Zhao
Rui Chu
Feng Luo
Biplab Sikdar
Yingjie Lao
DiffM
AAML
67
1
0
16 Dec 2024
COBRA: A Continual Learning Approach to Vision-Brain Understanding
Xuan-Bac Nguyen
Arabinda Kumar Choudhary
Pawan Sinha
Xin Li
Khoa Luu
CLL
66
0
0
25 Nov 2024
One Diffusion to Generate Them All
Duong H. Le
Tuan Pham
Sangho Lee
Christopher Clark
Aniruddha Kembhavi
Stephan Mandt
Ranjay Krishna
Jiasen Lu
VLM
59
5
0
25 Nov 2024
AnyEdit: Mastering Unified High-Quality Image Editing for Any Idea
Qifan Yu
Wei Chow
Zhongqi Yue
Kaihang Pan
Yang Wu
Xiaoyang Wan
Juncheng Billy Li
Siliang Tang
H. Zhang
Yueting Zhuang
DiffM
95
15
0
24 Nov 2024
Quantum-Brain: Quantum-Inspired Neural Network Approach to Vision-Brain Understanding
Hoang-Quan Nguyen
Xuan-Bac Nguyen
Hugh Churchill
Arabinda Kumar Choudhary
Pawan Sinha
S. Khan
Khoa Luu
56
1
0
20 Nov 2024
Decoding Visual Experience and Mapping Semantics through Whole-Brain Analysis Using fMRI Foundation Models
Yanchen Wang
Adam Turnbull
Tiange Xiang
Yunlong Xu
Sa Zhou
Adnan Masoud
Shekoofeh Azizi
F. Lin
Ehsan Adeli
24
0
0
11 Nov 2024
BrainBits: How Much of the Brain are Generative Reconstruction Methods Using?
David Mayo
Christopher Wang
Asa Harbin
Abdulrahman Alabdulkareem
Albert Eaton Shaw
Boris Katz
Andrei Barbu
DiffM
36
0
0
05 Nov 2024
MoLE: Enhancing Human-centric Text-to-image Diffusion via Mixture of Low-rank Experts
Jie Zhu
Y. Chen
Mingyu Ding
Ping Luo
Leye Wang
Jingdong Wang
DiffM
24
2
0
30 Oct 2024
Diff-Instruct*: Towards Human-Preferred One-step Text-to-image Generative Models
Weijian Luo
C. Zhang
Debing Zhang
Zhengyang Geng
21
3
0
28 Oct 2024
Diff-Instruct++: Training One-step Text-to-image Generator Model to Align with Human Preferences
Weijian Luo
EGVM
36
6
0
24 Oct 2024
Group Diffusion Transformers are Unsupervised Multitask Learners
Lianghua Huang
Wei Wang
Zhi-Fan Wu
Huanzhang Dou
Yupeng Shi
Yutong Feng
C. Liang
Yu Liu
Jingren Zhou
VLM
31
11
0
19 Oct 2024
Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis
Jinbin Bai
Tian-Chun Ye
Wei Chow
Enxin Song
Qing-Guo Chen
Xiangtai Li
Zhen Dong
Lei Zhu
50
13
0
10 Oct 2024
Brain-Streams: fMRI-to-Image Reconstruction with Multi-modal Guidance
Jaehoon Joo
Taejin Jeong
Seongjae Hwang
DiffM
16
0
0
18 Sep 2024
Latent Diffusion Models for Controllable RNA Sequence Generation
Kaixuan Huang
Yukang Yang
Kaidi Fu
Yanyi Chu
Le Cong
Mengdi Wang
39
1
0
15 Sep 2024
Spiking Diffusion Models
Jiahang Cao
Hanzhong Guo
Ziqing Wang
Deming Zhou
Hao Cheng
Qiang Zhang
Renjing Xu
DiffM
27
1
0
29 Aug 2024
Connecting Dreams with Visual Brainstorming Instruction
Yasheng Sun
Bohan Li
Mingchen Zhuge
Deng-Ping Fan
Salman Khan
F. Khan
Hideki Koike
DiffM
27
0
0
14 Aug 2024
ViMo: Generating Motions from Casual Videos
Liangdong Qiu
Chengxing Yu
Yanran Li
Zhao Wang
Haibin Huang
Chongyang Ma
Di Zhang
Pengfei Wan
Xiaoguang Han
VGen
24
2
0
13 Aug 2024
Fine-gained Zero-shot Video Sampling
Dengsheng Chen
Jie Hu
Javier Segovia-Aguas
Enhua Wu
VGen
DiffM
16
0
0
31 Jul 2024
Diffusion Models for Multi-Task Generative Modeling
Changyou Chen
Han Ding
Bunyamin Sisman
Yi Tian Xu
Ouye Xie
Benjamin Z. Yao
Son Dinh Tran
Belinda Zeng
DiffM
32
4
0
24 Jul 2024
IMAGDressing-v1: Customizable Virtual Dressing
Fei Shen
Xin Jiang
Xin He
Hu Ye
Cong Wang
Xiaoyu Du
Zechao Li
Jinghui Tang
DiffM
45
28
0
17 Jul 2024
E2VIDiff: Perceptual Events-to-Video Reconstruction using Diffusion Priors
Jinxiu Liang
Bohan Yu
Yixin Yang
Yiming Han
Boxin Shi
VGen
DiffM
MDE
14
0
0
11 Jul 2024
Mixing Natural and Synthetic Images for Robust Self-Supervised Representations
Reza Akbarian Bafghi
Nidhin Harilal
C. Monteleoni
M. Raissi
DiffM
23
0
0
18 Jun 2024
ControlVAR: Exploring Controllable Visual Autoregressive Modeling
Xiang Li
Kai Qiu
Hao Chen
Jason Kuen
Zhe-nan Lin
Rita Singh
Bhiksha Raj
DiffM
35
21
0
14 Jun 2024
Everything to the Synthetic: Diffusion-driven Test-time Adaptation via Synthetic-Domain Alignment
Jiayi Guo
Junhao Zhao
Chunjiang Ge
Chaoqun Du
Zanlin Ni
Shiji Song
Humphrey Shi
Gao Huang
TTA
DiffM
27
5
0
06 Jun 2024
Inspired by AI? A Novel Generative AI System To Assist Conceptual Automotive Design
Ye Wang
Nicole B. Damen
Thomas Gale
Voho Seo
Hooman Shayani
AI4CE
18
2
0
06 Jun 2024
Uni-ISP: Unifying the Learning of ISPs from Multiple Cameras
Lingen Li
Mingde Yao
Xingyu Meng
Muquan Yu
Tianfan Xue
Jinwei Gu
34
0
0
03 Jun 2024
MindFormer: A Transformer Architecture for Multi-Subject Brain Decoding via fMRI
Inhwa Han
Jaayeon Lee
Jong Chul Ye
MedIm
AI4CE
18
0
0
28 May 2024
User-Friendly Customized Generation with Multi-Modal Prompts
Linhao Zhong
Yan Hong
Wentao Chen
Binglin Zhou
Yiyi Zhang
Jianfu Zhang
Liqing Zhang
DiffM
35
0
0
26 May 2024