2204.06125
Hierarchical Text-Conditional Image Generation with CLIP Latents
13 April 2022
Aditya A. Ramesh
Prafulla Dhariwal
Alex Nichol
Casey Chu
Mark Chen
VLM
DiffM
Papers citing
"Hierarchical Text-Conditional Image Generation with CLIP Latents"
50 / 4,735 papers shown
Accelerating db-A* for Kinodynamic Motion Planning Using Diffusion
Julius Franke
A. Moldagalieva
Pia Hanfeld
Wolfgang Hönig
DiffM
72
0
0
07 Mar 2025
Frequency Autoregressive Image Generation with Continuous Tokens
Hu Yu
Hao Luo
Hangjie Yuan
Yu Rong
Feng Zhao
VGen
37
2
0
07 Mar 2025
AIM-Fair: Advancing Algorithmic Fairness via Selectively Fine-Tuning Biased Models with Contextual Synthetic Data
Zengqun Zhao
Ziquan Liu
Yu Cao
Shaogang Gong
Ioannis Patras
38
0
0
07 Mar 2025
scDD: Latent Codes Based scRNA-seq Dataset Distillation with Foundation Model Knowledge
Zhen Yu
Jianan Han
Yang Liu
Qingchao Chen
62
0
0
06 Mar 2025
ControlFill: Spatially Adjustable Image Inpainting from Prompt Learning
Boseong Jeon
52
0
0
06 Mar 2025
FuseChat-3.0: Preference Optimization Meets Heterogeneous Model Fusion
Ziyi Yang
Fanqi Wan
Longguang Zhong
Canbin Huang
Guosheng Liang
Xiaojun Quan
MoMe
90
0
0
06 Mar 2025
Energy-Guided Optimization for Personalized Image Editing with Pretrained Text-to-Image Diffusion Models
Rui Jiang
Xinghe Fu
Guangcong Zheng
Teng Li
Taiping Yao
Xi Li
DiffM
60
0
0
06 Mar 2025
WarmFed: Federated Learning with Warm-Start for Globalization and Personalization Via Personalized Diffusion Models
Tao Feng
Jie Zhang
Xiangjian Li
Rong Huang
Huashan Liu
Zhijie Wang
FedML
52
0
0
05 Mar 2025
DoraCycle: Domain-Oriented Adaptation of Unified Generative Model in Multimodal Cycles
Rui Zhao
Weijia Mao
Mike Zheng Shou
64
0
0
05 Mar 2025
LangGas: Introducing Language in Selective Zero-Shot Background Subtraction for Semi-Transparent Gas Leak Detection with a New Dataset
Wenqi Guo
Yiyang Du
Shan Du
67
1
0
04 Mar 2025
MindSimulator: Exploring Brain Concept Localization via Synthetic FMRI
Guangyin Bao
Qi Zhang
Z. Gong
Zhuojia Wu
Duoqian Miao
34
0
0
04 Mar 2025
Fine-Grained Controllable Apparel Showcase Image Generation via Garment-Centric Outpainting
Rong Zhang
J. Wang
Zhiwen Zuo
Jianfeng Dong
W. Li
Chi-Yin Wang
W. Xu
Xun Wang
DiffM
69
0
0
03 Mar 2025
MINT: Multi-modal Chain of Thought in Unified Generative Models for Enhanced Image Generation
Yi Wang
Mushui Liu
Wanggui He
Longxiang Zhang
Z. Huang
...
H. Li
Weilong Dai
Mingli Song
Jie Song
Hao Jiang
MLLM
MoE
LRM
75
1
0
03 Mar 2025
CacheQuant: Comprehensively Accelerated Diffusion Models
Xuewen Liu
Zhikai Li
Qingyi Gu
DiffM
30
0
0
03 Mar 2025
One-shot In-context Part Segmentation
Zhenqi Dai
Ting Liu
X. Zhang
Y. X. Wei
Yanning Zhang
VLM
71
1
0
03 Mar 2025
Kiss3DGen: Repurposing Image Diffusion Models for 3D Asset Generation
Jiantao Lin
Xin Yang
Meixi Chen
Yingjie Xu
D. Yan
Leyi Wu
Xinli Xu
Lie Xu
Shunsi Zhang
Ying-Cong Chen
55
1
0
03 Mar 2025
Interactive Gadolinium-Free MRI Synthesis: A Transformer with Localization Prompt Learning
Linhao Li
Changhui Su
Yu Guo
Huimao Zhang
Dong Liang
K. Shang
MedIm
48
0
0
03 Mar 2025
WeGen: A Unified Model for Interactive Multimodal Generation as We Chat
Zhipeng Huang
Shaobin Zhuang
Canmiao Fu
Binxin Yang
Ying Zhang
Chong Sun
Zhizheng Zhang
Yali Wang
Chen Li
Zheng-Jun Zha
DiffM
69
1
0
03 Mar 2025
FaceShot: Bring Any Character into Life
Junyao Gao
Yanan Sun
Fei Shen
Xin Jiang
Zhening Xing
Kai-xiang Chen
Cairong Zhao
CVBM
3DH
40
1
0
02 Mar 2025
A Simple and Effective Reinforcement Learning Method for Text-to-Image Diffusion Fine-tuning
Shashank Gupta
Chaitanya Ahuja
Tsung-Yu Lin
Sreya Dutta Roy
Harrie Oosterhuis
Maarten de Rijke
Satya Narayan Shukla
46
1
0
02 Mar 2025
Extrapolating and Decoupling Image-to-Video Generation Models: Motion Modeling is Easier Than You Think
Jie Tian
Xiaoye Qu
Zhenyi Lu
Wei Wei
Sichen Liu
Yu-Xi Cheng
DiffM
VGen
44
0
0
02 Mar 2025
Zero-Shot Head Swapping in Real-World Scenarios
S. Jeong
Taewoong Kang
Hyojin Jang
Jaegul Choo
34
0
0
02 Mar 2025
MedUnifier: Unifying Vision-and-Language Pre-training on Medical Data with Vision Generation Task using Discrete Visual Representations
Ziyang Zhang
Yang Yu
Yucheng Chen
Xulei Yang
S. Yeo
MedIm
45
1
0
02 Mar 2025
Periodic Materials Generation using Text-Guided Joint Diffusion Model
Kishalay Das
Subhojyoti Khastagir
Pawan Goyal
Seung-Cheol Lee
S. Bhattacharjee
Niloy Ganguly
DiffM
29
0
0
01 Mar 2025
Advancing AI-Powered Medical Image Synthesis: Insights from MedVQA-GI Challenge Using CLIP, Fine-Tuned Stable Diffusion, and Dream-Booth + LoRA
Ojonugwa Oluwafemi Ejiga Peter
Md Mahmudur Rahman
Fahmi Khalifa
DiffM
MedIm
31
1
0
28 Feb 2025
Tight Inversion: Image-Conditioned Inversion for Real Image Editing
Edo Kadosh
Nir Goren
Or Patashnik
Daniel Garibi
Daniel Cohen-Or
DiffM
59
0
0
27 Feb 2025
QPM: Discrete Optimization for Globally Interpretable Image Classification
Thomas Norrenbrock
T. Kaiser
Sovan Biswas
R. Manuvinakurike
Bodo Rosenhahn
45
0
0
27 Feb 2025
MFSR: Multi-fractal Feature for Super-resolution Reconstruction with Fine Details Recovery
Lianping Yang
Peng Jiao
Jinshan Pan
Hegui Zhu
Su Guo
31
0
0
27 Feb 2025
Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think
L. Chen
S. Bai
Wenhao Chai
Weichu Xie
Haozhe Zhao
Leon Vinci
Junyang Lin
Baobao Chang
DiffM
82
4
0
27 Feb 2025
BEVDiffuser: Plug-and-Play Diffusion Model for BEV Denoising with Ground-Truth Guidance
Xin Ye
Burhaneddin Yaman
Sheng Cheng
Feng Tao
Abhirup Mallik
Liu Ren
DiffM
63
1
0
27 Feb 2025
Ready-to-React: Online Reaction Policy for Two-Character Interaction Generation
Zhi Cen
Huaijin Pi
Sida Peng
Qing Shuai
Yujun Shen
Hujun Bao
Xiaowei Zhou
Ruizhen Hu
VGen
OffRL
59
1
0
27 Feb 2025
Optimal Stochastic Trace Estimation in Generative Modeling
Xinyang Liu
Hengrong Du
Wei Deng
Ruqi Zhang
AI4TS
39
0
0
26 Feb 2025
Intent Tagging: Exploring Micro-Prompting Interactions for Supporting Granular Human-GenAI Co-Creation Workflows
Frederic Gmeiner
Nicolai Marquardt
Michael Bentley
Hugo Romat
M. Pahud
...
Asta Roseway
Nikolas Martelaro
Kenneth Holstein
K. Hinckley
N. Riche
45
0
0
26 Feb 2025
Diffusion-based Planning with Learned Viability Filters
Nicholas Ioannidis
Daniele Reda
S. Cohan
M. van de Panne
64
0
0
26 Feb 2025
On the Interpolation Effect of Score Smoothing
Zhengdao Chen
DiffM
71
0
0
26 Feb 2025
Improved YOLOv12 with LLM-Generated Synthetic Data for Enhanced Apple Detection and Benchmarking Against YOLOv11 and YOLOv10
Ranjan Sapkota
Manoj Karkee
38
4
0
26 Feb 2025
CLIPure: Purification in Latent Space via CLIP for Adversarially Robust Zero-Shot Classification
Mingkun Zhang
Keping Bi
Wei Chen
J. Guo
Xueqi Cheng
BDL
VLM
50
1
0
25 Feb 2025
GCDance: Genre-Controlled 3D Full Body Dance Generation Driven By Music
Xinran Liu
Xu Dong
Diptesh Kanojia
Wenwu Wang
Zhenhua Feng
DiffM
60
0
0
25 Feb 2025
HRR: Hierarchical Retrospection Refinement for Generated Image Detection
Peipei Yuan
Zijing Xie
Shuo Ye
Hong Chen
Yulong Wang
DiffM
137
0
0
25 Feb 2025
FairGen: Controlling Sensitive Attributes for Fair Generations in Diffusion Models via Adaptive Latent Guidance
Mintong Kang
Vinayshekhar Bannihatti Kumar
Shamik Roy
Abhishek Kumar
Sopan Khosla
Balakrishnan Narayanaswamy
Rashmi Gangadharaiah
37
0
0
25 Feb 2025
Synthesizing Consistent Novel Views via 3D Epipolar Attention without Re-Training
Botao Ye
Sifei Liu
Xueting Li
Marc Pollefeys
Ming Yang
64
0
0
25 Feb 2025
Multi-Perspective Data Augmentation for Few-shot Object Detection
Anh-Khoa Nguyen Vu
Quoc-Truong Truong
Vinh-Tiep Nguyen
T. Ngo
Thanh-Toan Do
Tam V. Nguyen
69
1
0
25 Feb 2025
Bayesian Optimization for Controlled Image Editing via LLMs
Chengkun Cai
Haoliang Liu
Xu Zhao
Zhongyu Jiang
Tianfang Zhang
Zongkai Wu
Jenq-Neng Hwang
Serge Belongie
Lei Li
BDL
OffRL
85
2
0
25 Feb 2025
Improved Diffusion-based Generative Model with Better Adversarial Robustness
Zekun Wang
Mingyang Yi
Shuchen Xue
Z. Li
Ming Liu
Bing Qin
Zhi-Ming Ma
DiffM
37
0
0
24 Feb 2025
Posterior Inference with Diffusion Models for High-dimensional Black-box Optimization
Taeyoung Yun
Kiyoung Om
Jaewoo Lee
Sujin Yun
Jinkyoo Park
48
1
0
24 Feb 2025
Mitigating Hallucinations in Diffusion Models through Adaptive Attention Modulation
Trevine Oorloff
Yaser Yacoob
Abhinav Shrivastava
46
0
0
24 Feb 2025
Distributional Vision-Language Alignment by Cauchy-Schwarz Divergence
Wenzhe Yin
Zehao Xiao
Pan Zhou
Shujian Yu
Jiayi Shen
J. Sonke
E. Gavves
34
0
0
24 Feb 2025
Culture-TRIP: Culturally-Aware Text-to-Image Generation with Iterative Prompt Refinment
Suchae Jeong
Inseong Choi
Youngsik Yun
Jihie Kim
DiffM
33
2
0
24 Feb 2025
Methods and Trends in Detecting Generated Images: A Comprehensive Review
Arpan Mahara
N. Rishe
AAML
75
0
0
24 Feb 2025
PuzzleFusion++: Auto-agglomerative 3D Fracture Assembly by Denoise and Verify
Zhengqing Wang
Jiacheng Chen
Yasutaka Furukawa
62
5
0
24 Feb 2025