ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2306.00964
  4. Cited By
Cocktail: Mixing Multi-Modality Controls for Text-Conditional Image
  Generation

Cocktail: Mixing Multi-Modality Controls for Text-Conditional Image Generation

1 June 2023
Minghui Hu
Jianbin Zheng
Daqing Liu
Chuanxia Zheng
Chaoyue Wang
Dacheng Tao
Tat-Jen Cham
    DiffM
ArXivPDFHTML

Papers citing "Cocktail: Mixing Multi-Modality Controls for Text-Conditional Image Generation"

13 / 13 papers shown
Title
Group Diffusion Transformers are Unsupervised Multitask Learners
Group Diffusion Transformers are Unsupervised Multitask Learners
Lianghua Huang
Wei Wang
Zhi-Fan Wu
Huanzhang Dou
Yupeng Shi
Yutong Feng
C. Liang
Yu Liu
Jingren Zhou
VLM
36
11
0
19 Oct 2024
ControlNet++: Improving Conditional Controls with Efficient Consistency
  Feedback
ControlNet++: Improving Conditional Controls with Efficient Consistency Feedback
Ming Li
Taojiannan Yang
Huafeng Kuang
Jie Wu
Zhaoning Wang
Xuefeng Xiao
C. L. P. Chen
35
62
0
11 Apr 2024
SmartControl: Enhancing ControlNet for Handling Rough Visual Conditions
SmartControl: Enhancing ControlNet for Handling Rough Visual Conditions
Xiaoyu Liu
Yuxiang Wei
Ming-Yu Liu
Xianhui Lin
Peiran Ren
Xuansong Xie
Wangmeng Zuo
DiffM
22
5
0
09 Apr 2024
Controllable Generation with Text-to-Image Diffusion Models: A Survey
Controllable Generation with Text-to-Image Diffusion Models: A Survey
Pu Cao
Feng Zhou
Qing-Huang Song
Lu Yang
67
35
0
07 Mar 2024
When ControlNet Meets Inexplicit Masks: A Case Study of ControlNet on
  its Contour-following Ability
When ControlNet Meets Inexplicit Masks: A Case Study of ControlNet on its Contour-following Ability
Wenjie Xuan
Yufei Xu
Shanshan Zhao
Chaoyue Wang
Juhua Liu
Bo Du
Dacheng Tao
26
2
0
01 Mar 2024
BootPIG: Bootstrapping Zero-shot Personalized Image Generation
  Capabilities in Pretrained Diffusion Models
BootPIG: Bootstrapping Zero-shot Personalized Image Generation Capabilities in Pretrained Diffusion Models
Senthil Purushwalkam
Akash Gokul
Shafiq R. Joty
Nikhil Naik
DiffM
29
16
0
25 Jan 2024
ControlNet-XS: Designing an Efficient and Effective Architecture for
  Controlling Text-to-Image Diffusion Models
ControlNet-XS: Designing an Efficient and Effective Architecture for Controlling Text-to-Image Diffusion Models
Denis Zavadski
Johann-Friedrich Feiden
Carsten Rother
DiffM
44
5
0
11 Dec 2023
One More Step: A Versatile Plug-and-Play Module for Rectifying Diffusion
  Schedule Flaws and Enhancing Low-Frequency Controls
One More Step: A Versatile Plug-and-Play Module for Rectifying Diffusion Schedule Flaws and Enhancing Low-Frequency Controls
Minghui Hu
Jianbin Zheng
Chuanxia Zheng
Chaoyue Wang
Dacheng Tao
Tat-Jen Cham
DiffM
13
3
0
27 Nov 2023
Training-Free Layout Control with Cross-Attention Guidance
Training-Free Layout Control with Cross-Attention Guidance
Minghao Chen
Iro Laina
Andrea Vedaldi
DiffM
124
221
0
06 Apr 2023
MoVQ: Modulating Quantized Vectors for High-Fidelity Image Generation
MoVQ: Modulating Quantized Vectors for High-Fidelity Image Generation
Chuanxia Zheng
L. Vuong
Jianfei Cai
Dinh Q. Phung
MQ
58
72
0
19 Sep 2022
Pretraining is All You Need for Image-to-Image Translation
Pretraining is All You Need for Image-to-Image Translation
Tengfei Wang
Ting Zhang
Bo Zhang
Hao Ouyang
Dong Chen
Qifeng Chen
Fang Wen
DiffM
184
177
0
25 May 2022
BLIP: Bootstrapping Language-Image Pre-training for Unified
  Vision-Language Understanding and Generation
BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
Junnan Li
Dongxu Li
Caiming Xiong
S. Hoi
MLLM
BDL
VLM
CLIP
388
4,110
0
28 Jan 2022
A Learned Representation For Artistic Style
A Learned Representation For Artistic Style
Vincent Dumoulin
Jonathon Shlens
M. Kudlur
GAN
210
1,153
0
24 Oct 2016
1