Cocktail: Mixing Multi-Modality Controls for Text-Conditional Image Generation

1 June 2023

Papers citing "Cocktail: Mixing Multi-Modality Controls for Text-Conditional Image Generation"

| Title | Authors | Topics | | | | Date |
|---|---|---|---|---|---|---|
| Group Diffusion Transformers are Unsupervised Multitask Learners | Lianghua Huang, Wei Wang, Zhi-Fan Wu, Huanzhang Dou, Yupeng Shi, Yutong Feng, C. Liang, Yu Liu, Jingren Zhou | VLM | 36 | 11 | 0 | 19 Oct 2024 |
| ControlNet++: Improving Conditional Controls with Efficient Consistency Feedback | Ming Li, Taojiannan Yang, Huafeng Kuang, Jie Wu, Zhaoning Wang, Xuefeng Xiao, C. L. P. Chen | | 35 | 62 | 0 | 11 Apr 2024 |
| SmartControl: Enhancing ControlNet for Handling Rough Visual Conditions | Xiaoyu Liu, Yuxiang Wei, Ming-Yu Liu, Xianhui Lin, Peiran Ren, Xuansong Xie, Wangmeng Zuo | DiffM | 22 | 5 | 0 | 09 Apr 2024 |
| Controllable Generation with Text-to-Image Diffusion Models: A Survey | Pu Cao, Feng Zhou, Qing-Huang Song, Lu Yang | | 67 | 35 | 0 | 07 Mar 2024 |
| When ControlNet Meets Inexplicit Masks: A Case Study of ControlNet on its Contour-following Ability | Wenjie Xuan, Yufei Xu, Shanshan Zhao, Chaoyue Wang, Juhua Liu, Bo Du, Dacheng Tao | | 26 | 2 | 0 | 01 Mar 2024 |
| BootPIG: Bootstrapping Zero-shot Personalized Image Generation Capabilities in Pretrained Diffusion Models | Senthil Purushwalkam, Akash Gokul, Shafiq R. Joty, Nikhil Naik | DiffM | 29 | 16 | 0 | 25 Jan 2024 |
| ControlNet-XS: Designing an Efficient and Effective Architecture for Controlling Text-to-Image Diffusion Models | Denis Zavadski, Johann-Friedrich Feiden, Carsten Rother | DiffM | 44 | 5 | 0 | 11 Dec 2023 |
| One More Step: A Versatile Plug-and-Play Module for Rectifying Diffusion Schedule Flaws and Enhancing Low-Frequency Controls | Minghui Hu, Jianbin Zheng, Chuanxia Zheng, Chaoyue Wang, Dacheng Tao, Tat-Jen Cham | DiffM | 13 | 3 | 0 | 27 Nov 2023 |
| Training-Free Layout Control with Cross-Attention Guidance | Minghao Chen, Iro Laina, Andrea Vedaldi | DiffM | 124 | 221 | 0 | 06 Apr 2023 |
| MoVQ: Modulating Quantized Vectors for High-Fidelity Image Generation | Chuanxia Zheng, L. Vuong, Jianfei Cai, Dinh Q. Phung | MQ | 58 | 72 | 0 | 19 Sep 2022 |
| Pretraining is All You Need for Image-to-Image Translation | Tengfei Wang, Ting Zhang, Bo Zhang, Hao Ouyang, Dong Chen, Qifeng Chen, Fang Wen | DiffM | 184 | 177 | 0 | 25 May 2022 |
| BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation | Junnan Li, Dongxu Li, Caiming Xiong, S. Hoi | MLLM, BDL, VLM, CLIP | 388 | 4,110 | 0 | 28 Jan 2022 |
| A Learned Representation For Artistic Style | Vincent Dumoulin, Jonathon Shlens, M. Kudlur | GAN | 210 | 1,153 | 0 | 24 Oct 2016 |