Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2208.13753
Cited By
Frido: Feature Pyramid Diffusion for Complex Scene Image Synthesis
29 August 2022
Wanshu Fan
Yen-Chun Chen
Dongdong Chen
Yu Cheng
Lu Yuan
Yu-Chiang Frank Wang
DiffM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Frido: Feature Pyramid Diffusion for Complex Scene Image Synthesis"
50 / 66 papers shown
Title
Open-set Anomaly Segmentation in Complex Scenarios
Song Xia
Yi Yu
Henghui Ding
Wenhan Yang
S. Liu
Alex C. Kot
Xudong Jiang
DiffM
50
0
0
28 Apr 2025
DIVE: Inverting Conditional Diffusion Models for Discriminative Tasks
Yinqi Li
Hong Chang
Ruibing Hou
Shiguang Shan
Xilin Chen
DiffM
50
0
0
24 Apr 2025
DenseFormer: Learning Dense Depth Map from Sparse Depth and Image via Conditional Diffusion Model
Ming Yuan
Sichao Wang
Chuang Zhang
Lei He
Qing Xu
Jianqiang Wang
DiffM
MDE
47
0
0
31 Mar 2025
VVRec: Reconstruction Attacks on DL-based Volumetric Video Upstreaming via Latent Diffusion Model with Gamma Distribution
Rui Lu
B. Zhang
Dan Wang
VGen
150
0
0
25 Feb 2025
Grounding Text-to-Image Diffusion Models for Controlled High-Quality Image Generation
Ahmad Süleyman
Göksel Biricik
41
2
0
15 Jan 2025
Leapfrog Latent Consistency Model (LLCM) for Medical Images Generation
Lakshmikar R. Polamreddy
Kalyan Roy
Sheng-Han Yueh
Deepshikha Mahato
Shilpa Kuppili
Jialu Li
Youshan Zhang
MedIm
75
1
0
22 Nov 2024
ENAT: Rethinking Spatial-temporal Interactions in Token-based Image Synthesis
Zanlin Ni
Yulin Wang
Renping Zhou
Yizeng Han
Jiayi Guo
Zhiyuan Liu
Yuan Yao
Gao Huang
48
4
0
11 Nov 2024
Scalable, Tokenization-Free Diffusion Model Architectures with Efficient Initial Convolution and Fixed-Size Reusable Structures for On-Device Image Generation
Sanchar Palit
Sathya Veera Reddy Dendi
Mallikarjuna Talluri
Raj Narayana Gadde
26
0
0
09 Nov 2024
Synergistic Dual Spatial-aware Generation of Image-to-Text and Text-to-Image
Yu Zhao
Hao Fei
Xiangtai Li
L. Qin
Jiayi Ji
Hongyuan Zhu
Meishan Zhang
M. Zhang
Jianguo Wei
DiffM
26
1
0
20 Oct 2024
Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think
Sihyun Yu
Sangkyung Kwak
Huiwon Jang
Jongheon Jeong
Jonathan Huang
Jinwoo Shin
Saining Xie
OCL
65
59
0
09 Oct 2024
AdaNAT: Exploring Adaptive Policy for Token-Based Image Generation
Zanlin Ni
Yulin Wang
Renping Zhou
Rui Lu
Jiayi Guo
Jinyi Hu
Zhiyuan Liu
Yuan Yao
Gao Huang
25
7
0
31 Aug 2024
Enriching Information and Preserving Semantic Consistency in Expanding Curvilinear Object Segmentation Datasets
Qin Lei
Jiang Zhong
Qizhu Dai
DiffM
24
2
0
11 Jul 2024
Revisiting Non-Autoregressive Transformers for Efficient Image Synthesis
Zanlin Ni
Yulin Wang
Renping Zhou
Jiayi Guo
Jinyi Hu
Zhiyuan Liu
Shiji Song
Yuan Yao
Gao Huang
25
14
0
08 Jun 2024
CHAMP: Conformalized 3D Human Multi-Hypothesis Pose Estimators
Harry Zhang
Luca Carlone
3DH
64
1
0
27 May 2024
DetDiffusion: Synergizing Generative and Perceptive Models for Enhanced Data Generation and Perception
Yibo Wang
Ruiyuan Gao
Kai Chen
Kaiqiang Zhou
Yingjie Cai
...
Zhenguo Li
Lihui Jiang
Dit-Yan Yeung
Qiang Xu
Kai Zhang
DiffM
113
21
0
20 Mar 2024
Exploring Pre-trained Text-to-Video Diffusion Models for Referring Video Object Segmentation
Zixin Zhu
Xuelu Feng
Dongdong Chen
Junsong Yuan
Chunming Qiao
Gang Hua
DiffM
23
7
0
18 Mar 2024
BlindDiff: Empowering Degradation Modelling in Diffusion Models for Blind Image Super-Resolution
Feng Li
Yixuan Wu
Zichao Liang
Runmin Cong
H. Bai
Yao-Min Zhao
Meng Wang
DiffM
35
1
0
15 Mar 2024
ST-LDM: A Universal Framework for Text-Grounded Object Generation in Real Images
Xiangtian Xue
Jiasong Wu
Youyong Kong
L. Senhadji
Huazhong Shu
DiffM
28
1
0
15 Mar 2024
Desigen: A Pipeline for Controllable Design Template Generation
Haohan Weng
Danqing Huang
Yu Qiao
Zheng Hu
Chin-Yew Lin
Tong Zhang
C. L. P. Chen
DiffM
19
14
0
14 Mar 2024
Disentangled Diffusion-Based 3D Human Pose Estimation with Hierarchical Spatial and Temporal Denoiser
Qingyuan Cai
Xuecai Hu
Saihui Hou
Li Yao
Yongzhen Huang
DiffM
21
0
0
07 Mar 2024
Discriminative Probing and Tuning for Text-to-Image Generation
Leigang Qu
Wenjie Wang
Yongqi Li
Hanwang Zhang
Liqiang Nie
Tat-Seng Chua
31
7
0
07 Mar 2024
Contextualized Diffusion Models for Text-Guided Image and Video Generation
Ling Yang
Zhilong Zhang
Zhaochen Yu
Jingwei Liu
Minkai Xu
Stefano Ermon
Bin Cui
23
4
0
26 Feb 2024
RealCompo: Balancing Realism and Compositionality Improves Text-to-Image Diffusion Models
Xinchen Zhang
Ling Yang
Yaqi Cai
Zhaochen Yu
Kai-Ni Wang
...
Ye Tian
Minkai Xu
Yong Tang
Yujiu Yang
Bin Cui
DiffM
27
5
0
20 Feb 2024
Bring Metric Functions into Diffusion Models
Jie An
Zhengyuan Yang
Jianfeng Wang
Linjie Li
Zicheng Liu
Lijuan Wang
Jiebo Luo
DiffM
19
4
0
04 Jan 2024
Improving Diffusion-Based Image Synthesis with Context Prediction
Ling Yang
Jingwei Liu
Shenda Hong
Zhilong Zhang
Zhilin Huang
Zheming Cai
Wentao Zhang
Bin Cui
DiffM
38
33
0
04 Jan 2024
iFusion: Inverting Diffusion for Pose-Free Reconstruction from Sparse Views
Chin-Hsuan Wu
Yen-Chun Chen
Bolivar Solarte
Lu Yuan
Min Sun
16
9
0
28 Dec 2023
LLMGA: Multimodal Large Language Model based Generation Assistant
Bin Xia
Shiyin Wang
Yingfan Tao
Yitong Wang
Jiaya Jia
MLLM
19
12
0
27 Nov 2023
Exploring Iterative Refinement with Diffusion Models for Video Grounding
Xiao Liang
Tao Shi
Yaoyuan Liang
Te Tao
Shao-Lun Huang
DiffM
22
1
0
26 Oct 2023
ScaleLong: Towards More Stable Training of Diffusion Model via Scaling Network Long Skip Connection
Zhongzhan Huang
Pan Zhou
Shuicheng Yan
Liang Lin
8
26
0
20 Oct 2023
LLM Blueprint: Enabling Text-to-Image Generation with Complex and Detailed Prompts
Hanan Gani
Shariq Farooq Bhat
Muzammal Naseer
Salman Khan
Peter Wonka
DiffM
34
37
0
16 Oct 2023
NExT-GPT: Any-to-Any Multimodal LLM
Shengqiong Wu
Hao Fei
Leigang Qu
Wei Ji
Tat-Seng Chua
MLLM
40
448
0
11 Sep 2023
Reuse and Diffuse: Iterative Denoising for Text-to-Video Generation
Jiaxi Gu
Shicong Wang
Haoyu Zhao
Tianyi Lu
Xing Zhang
Zuxuan Wu
Songcen Xu
Wei Zhang
Yu-Gang Jiang
Hang Xu
DiffM
VGen
28
43
0
07 Sep 2023
StyleDiffusion: Controllable Disentangled Style Transfer via Diffusion Models
Zhizhong Wang
Lei Zhao
Wei Xing
DiffM
19
118
0
15 Aug 2023
LAW-Diffusion: Complex Scene Generation by Diffusion with Layouts
Binbin Yang
Yinzheng Luo
Ziliang Chen
Guangrun Wang
Xiaodan Liang
Liang Lin
DiffM
11
12
0
13 Aug 2023
DiffPose: SpatioTemporal Diffusion Model for Video-Based Human Pose Estimation
Runyang Feng
Yixing Gao
Tze Ho Elden Tse
Xu Ma
H. Chang
DiffM
26
23
0
31 Jul 2023
Uni-ControlNet: All-in-One Control to Text-to-Image Diffusion Models
Shihao Zhao
Dongdong Chen
Yen-Chun Chen
Jianmin Bao
Shaozhe Hao
Lu Yuan
Kwan-Yee Kenneth Wong
14
234
0
25 May 2023
Vision + Language Applications: A Survey
Yutong Zhou
N. Shimada
VLM
18
5
0
24 May 2023
Pyramid Diffusion Models For Low-light Image Enhancement
Dewei Zhou
Zongxin Yang
Yi Yang
DiffM
32
77
0
17 May 2023
SUR-adapter: Enhancing Text-to-Image Pre-trained Diffusion Models with Large Language Models
Shan Zhong
Zhongzhan Huang
Wushao Wen
Jinghui Qin
Liang Lin
17
40
0
09 May 2023
Multimodal-driven Talking Face Generation via a Unified Diffusion-based Generator
Chao Xu
Shaoting Zhu
Junwei Zhu
Alexander I. Rudnicky
Jiangning Zhang
Ying Tai
Yong Liu
DiffM
45
14
0
04 May 2023
Diagnostic Benchmark and Iterative Inpainting for Layout-Guided Image Generation
Jaemin Cho
Linjie Li
Zhengyuan Yang
Zhe Gan
Lijuan Wang
Mohit Bansal
EGVM
6
5
0
13 Apr 2023
Harnessing the Spatial-Temporal Attention of Diffusion Models for High-Fidelity Text-to-Image Synthesis
Qiucheng Wu
Yujian Liu
Handong Zhao
T. Bui
Zhe-nan Lin
Yang Zhang
Shiyu Chang
DiffM
29
43
0
07 Apr 2023
Training-Free Layout Control with Cross-Attention Guidance
Minghao Chen
Iro Laina
Andrea Vedaldi
DiffM
124
217
0
06 Apr 2023
Text-Conditioned Sampling Framework for Text-to-Image Generation with Masked Generative Models
Jaewoong Lee
Sang-Sub Jang
Jaehyeong Jo
Jaehong Yoon
Yunji Kim
Jin-Hwa Kim
Jung-Woo Ha
Sung Ju Hwang
DiffM
11
4
0
04 Apr 2023
One-shot Unsupervised Domain Adaptation with Personalized Diffusion Models
Yasser Benigmim
Subhankar Roy
S. Essid
Vicky Kalogeiton
Stéphane Lathuilière
DiffM
42
26
0
31 Mar 2023
DiffTAD: Temporal Action Detection with Proposal Denoising Diffusion
Sauradip Nag
Xiatian Zhu
Jiankang Deng
Yi-Zhe Song
Tao Xiang
DiffM
VGen
25
21
0
27 Mar 2023
Freestyle Layout-to-Image Synthesis
Han Xue
Z. Huang
Qianru Sun
Li-Na Song
Wenjun Zhang
DiffM
13
62
0
25 Mar 2023
Diffusion-Based 3D Human Pose Estimation with Multi-Hypothesis Aggregation
Wenkang Shan
Zhenhua Liu
Xinfeng Zhang
Zhao Wang
Kai Han
Shanshe Wang
Siwei Ma
Wen Gao
DiffM
47
81
0
21 Mar 2023
Transformer-based Image Generation from Scene Graphs
Renato Sortino
S. Palazzo
C. Spampinato
ViT
33
15
0
08 Mar 2023
OmniForce: On Human-Centered, Large Model Empowered and Cloud-Edge Collaborative AutoML System
Chao Xue
W. Liu
Shunxing Xie
Zhenfang Wang
Jiaxing Li
...
Shi-Yong Chen
Yibing Zhan
Jing Zhang
Chaoyue Wang
Dacheng Tao
24
1
0
01 Mar 2023
1
2
Next