Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
All Papers
0 / 0 papers shown
Home
Papers
1904.01310
Cited By
DM-GAN: Dynamic Memory Generative Adversarial Networks for Text-to-Image Synthesis
2 April 2019
Minfeng Zhu
Pingbo Pan
Wei Chen
Yi Yang
GAN
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"DM-GAN: Dynamic Memory Generative Adversarial Networks for Text-to-Image Synthesis"
50 / 323 papers shown
HRS-Bench: Holistic, Reliable and Scalable Benchmark for Text-to-Image Models
IEEE International Conference on Computer Vision (ICCV), 2023
Eslam Mohamed Bakr
Pengzhan Sun
Xiaoqian Shen
Faizan Farooq Khan
Li Erran Li
Mohamed Elhoseiny
VLM
304
104
0
11 Apr 2023
3D GANs and Latent Space: A comprehensive survey
S. Tata
Subhankar Mishra
235
3
0
08 Apr 2023
Multimodal Garment Designer: Human-Centric Latent Diffusion Models for Fashion Image Editing
IEEE International Conference on Computer Vision (ICCV), 2023
Alberto Baldrati
Davide Morelli
Giuseppe Cartella
Marcella Cornia
Marco Bertini
Rita Cucchiara
DiffM
196
89
0
04 Apr 2023
Toward Verifiable and Reproducible Human Evaluation for Text-to-Image Generation
Computer Vision and Pattern Recognition (CVPR), 2023
Mayu Otani
Riku Togashi
Yu Sawai
Ryosuke Ishigami
Yuta Nakashima
Esa Rahtu
J. Heikkilä
Shiníchi Satoh
217
77
0
04 Apr 2023
Text-Conditioned Sampling Framework for Text-to-Image Generation with Masked Generative Models
IEEE International Conference on Computer Vision (ICCV), 2023
Jaewoong Lee
Sang-Sub Jang
Jaehyeong Jo
Jaehong Yoon
Yunji Kim
Jin-Hwa Kim
Jung-Woo Ha
Sung Ju Hwang
DiffM
239
7
0
04 Apr 2023
Variational Distribution Learning for Unsupervised Text-to-Image Generation
Computer Vision and Pattern Recognition (CVPR), 2023
Minsoo Kang
Doyup Lee
Jiseob Kim
Saehoon Kim
Bohyung Han
DRL
OOD
178
4
0
28 Mar 2023
Factor Decomposed Generative Adversarial Networks for Text-to-Image Synthesis
Jiguo Li
Xiaobin Liu
Lirong Zheng
DRL
115
1
0
24 Mar 2023
Ablating Concepts in Text-to-Image Diffusion Models
IEEE International Conference on Computer Vision (ICCV), 2023
Nupur Kumari
Bin Zhang
Sheng-Yu Wang
Eli Shechtman
Richard Y. Zhang
Jun-Yan Zhu
VLM
478
277
0
23 Mar 2023
MagicFusion: Boosting Text-to-Image Generation Performance by Fusing Diffusion Models
IEEE International Conference on Computer Vision (ICCV), 2023
Jing Zhao
Heliang Zheng
Chaoyue Wang
L. Lan
Wenjing Yang
VLM
327
24
0
23 Mar 2023
LD-ZNet: A Latent Diffusion Approach for Text-Based Image Segmentation
IEEE International Conference on Computer Vision (ICCV), 2023
K. Pnvr
Bharat Singh
P. Ghosh
Behjat Siddiquie
David Jacobs
DiffM
301
34
0
22 Mar 2023
GlueGen: Plug and Play Multi-modal Encoders for X-to-image Generation
IEEE International Conference on Computer Vision (ICCV), 2023
Can Qin
Ning Yu
Chen Xing
Shu Zhen Zhang
Zeyuan Chen
Stefano Ermon
Yun Fu
Caiming Xiong
Ran Xu
DiffM
376
27
0
17 Mar 2023
Unified Multi-Modal Latent Diffusion for Joint Subject and Text Conditional Image Generation
Yi Ma
Huan Yang
Wenjing Wang
Jianlong Fu
Jiaying Liu
133
71
0
16 Mar 2023
Scaling up GANs for Text-to-Image Synthesis
Computer Vision and Pattern Recognition (CVPR), 2023
Minguk Kang
Jun-Yan Zhu
Richard Y. Zhang
Jaesik Park
Eli Shechtman
Sylvain Paris
Taesung Park
319
596
0
09 Mar 2023
Lformer: Text-to-Image Generation with L-shape Block Parallel Decoding
Jiacheng Li
Longhui Wei
Zongyuan Zhan
Xinfu He
Siliang Tang
Qi Tian
Yueting Zhuang
142
5
0
07 Mar 2023
Counterfactual Edits for Generative Evaluation
Maria Lymperaiou
Giorgos Filandrianos
Konstantinos Thomas
Giorgos Stamou
EGVM
233
0
0
02 Mar 2023
Spatial-temporal Transformer-guided Diffusion based Data Augmentation for Efficient Skeleton-based Action Recognition
Yifan Jiang
Han Chen
Hanseok Ko
DiffM
361
5
0
26 Feb 2023
Encoder-based Domain Tuning for Fast Personalization of Text-to-Image Models
ACM Transactions on Graphics (TOG), 2023
Rinon Gal
Moab Arar
Yuval Atzmon
Amit H. Bermano
Gal Chechik
Daniel Cohen-Or
DiffM
448
236
0
23 Feb 2023
Oriented Object Detection in Optical Remote Sensing Images using Deep Learning: A Survey
Artificial Intelligence Review (AIR), 2023
Kunlin Wang
Zi Wang
Zhang Li
Ang Su
Xichao Teng
Minhao Liu
Qifeng Yu
Qifeng Yu
ObjD
669
27
0
21 Feb 2023
Composer: Creative and Controllable Image Synthesis with Composable Conditions
International Conference on Machine Learning (ICML), 2023
Lianghua Huang
Di Chen
Yu Liu
Yujun Shen
Deli Zhao
Jingren Zhou
DiffM
423
354
0
20 Feb 2023
Fine-grained Cross-modal Fusion based Refinement for Text-to-Image Synthesis
Chinese journal of electronics (CJE), 2023
Haoran Sun
Yang Wang
Haipeng Liu
Biao Qian
259
13
0
17 Feb 2023
Multi-modal Machine Learning in Engineering Design: A Review and Future Directions
Journal of Computing and Information Science in Engineering (JCISE), 2023
Binyang Song
Ruilin Zhou
Faez Ahmed
AI4CE
356
63
0
14 Feb 2023
Glaze: Protecting Artists from Style Mimicry by Text-to-Image Models
USENIX Security Symposium (USENIX Security), 2023
Shawn Shan
Jenna Cryan
Emily Wenger
Haitao Zheng
Rana Hanocka
Ben Y. Zhao
WIGM
505
244
0
08 Feb 2023
Design Booster: A Text-Guided Diffusion Model for Image Translation with Spatial Layout Preservation
Shiqi Sun
Shancheng Fang
Qian He
Wei Liu
DiffM
137
3
0
05 Feb 2023
Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image Diffusion Models
ACM Transactions on Graphics (TOG), 2023
Hila Chefer
Yuval Alaluf
Yael Vinker
Lior Wolf
Daniel Cohen-Or
DiffM
559
665
0
31 Jan 2023
GALIP: Generative Adversarial CLIPs for Text-to-Image Synthesis
Computer Vision and Pattern Recognition (CVPR), 2023
Ming Tao
Bingkun Bao
Hao Tang
Changsheng Xu
DiffM
VLM
231
137
0
30 Jan 2023
An Impartial Transformer for Story Visualization
N. Tsakas
Maria Lymperaiou
Giorgos Filandrianos
Giorgos Stamou
ViT
234
3
0
09 Jan 2023
ANNA: Abstractive Text-to-Image Synthesis with Filtered News Captions
Aashish Anantha Ramakrishnan
Sharon X. Huang
Dongwon Lee
262
6
0
05 Jan 2023
Attribute-Centric Compositional Text-to-Image Generation
International Journal of Computer Vision (IJCV), 2023
Yuren Cong
Martin Renqiang Min
Erran L. Li
Bodo Rosenhahn
M. Yang
236
18
0
04 Jan 2023
Muse: Text-To-Image Generation via Masked Generative Transformers
International Conference on Machine Learning (ICML), 2023
Huiwen Chang
Han Zhang
Jarred Barber
AJ Maschinot
José Lezama
...
Kevin Patrick Murphy
William T. Freeman
Michael Rubinstein
Yuanzhen Li
Dilip Krishnan
DiffM
479
695
0
02 Jan 2023
Training-Free Structured Diffusion Guidance for Compositional Text-to-Image Synthesis
International Conference on Learning Representations (ICLR), 2022
Weixi Feng
Xuehai He
Tsu-Jui Fu
Varun Jampani
Arjun Reddy Akula
P. Narayana
Sugato Basu
Xinze Wang
William Yang Wang
CoGe
581
382
0
09 Dec 2022
Multi-Concept Customization of Text-to-Image Diffusion
Computer Vision and Pattern Recognition (CVPR), 2022
Nupur Kumari
Bin Zhang
Richard Y. Zhang
Eli Shechtman
Jun-Yan Zhu
693
1,162
0
08 Dec 2022
SINE: SINgle Image Editing with Text-to-Image Diffusion Models
Computer Vision and Pattern Recognition (CVPR), 2022
Zhixing Zhang
Ligong Han
Arna Ghosh
Dimitris N. Metaxas
Jian Ren
DiffM
441
180
0
08 Dec 2022
3D-TOGO: Towards Text-Guided Cross-Category 3D Object Generation
AAAI Conference on Artificial Intelligence (AAAI), 2022
Zutao Jiang
Guangsong Lu
Xiaodan Liang
Jihua Zhu
Wei Zhang
Xiaojun Chang
Hang Xu
DiffM
175
13
0
02 Dec 2022
CLIP2GAN: Towards Bridging Text with the Latent Space of GANs
Yixuan Wang
Wen-gang Zhou
Jianmin Bao
Weilun Wang
Li Li
Houqiang Li
GAN
CLIP
140
9
0
28 Nov 2022
Unified Discrete Diffusion for Simultaneous Vision-Language Generation
International Conference on Learning Representations (ICLR), 2022
Minghui Hu
Chuanxia Zheng
Heliang Zheng
Tat-Jen Cham
Chaoyue Wang
Zuopeng Yang
Dacheng Tao
Ponnuthurai Nagaratnam Suganthan
DiffM
262
36
0
27 Nov 2022
Interactive Image Manipulation with Complex Text Instructions
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022
Ryugo Morita
Zhiqiang Zhang
Man M. Ho
Jinjia Zhou
DiffM
175
4
0
25 Nov 2022
Make-A-Story: Visual Memory Conditioned Consistent Story Generation
Computer Vision and Pattern Recognition (CVPR), 2022
Tanzila Rahman
Hsin-Ying Lee
Jian Ren
Sergey Tulyakov
Shweta Mahajan
Leonid Sigal
DiffM
365
91
0
23 Nov 2022
Learning to Model Multimodal Semantic Alignment for Story Visualization
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Bowen Li
Thomas Lukasiewicz
DiffM
246
3
0
14 Nov 2022
Large-Scale Bidirectional Training for Zero-Shot Image Captioning
Taehoon Kim
Mark A Marsden
Pyunghwan Ahn
Sangyun Kim
Sihaeng Lee
Alessandra Sala
S. Kim
VLM
210
5
0
13 Nov 2022
HumanDiffusion: a Coarse-to-Fine Alignment Diffusion Framework for Controllable Text-Driven Person Image Generation
Kai Zhang
Muyi Sun
Jianxin Sun
Binghao Zhao
Kunbo Zhang
Zhenan Sun
Tieniu Tan
DiffM
146
14
0
11 Nov 2022
Disentangling Content and Motion for Text-Based Neural Video Manipulation
British Machine Vision Conference (BMVC), 2022
Levent Karacan
Tolga Kerimouglu
.Ismail .Inan
Tolga Birdal
Erkut Erdem
Aykut Erdem
270
1
0
05 Nov 2022
SyncTalkFace: Talking Face Generation with Precise Lip-Syncing via Audio-Lip Memory
AAAI Conference on Artificial Intelligence (AAAI), 2022
Se Jin Park
Minsu Kim
Joanna Hong
J. Choi
Y. Ro
CVBM
259
103
0
02 Nov 2022
UPainting: Unified Text-to-Image Diffusion Generation with Cross-modal Guidance
Wei Li
Xue Xu
Xinyan Xiao
Jiacheng Liu
Hu Yang
...
Zhanpeng Wang
Zhifan Feng
Qiaoqiao She
Yajuan Lyu
Hua Wu
493
31
0
28 Oct 2022
SSD: Towards Better Text-Image Consistency Metric in Text-to-Image Generation
Social Science Research Network (SSRN), 2022
Zhaorui Tan
Xi Yang
Zihan Ye
Qiufeng Wang
Yuyao Yan
Anh Nguyen
Kaizhu Huang
EGVM
191
3
0
27 Oct 2022
Lafite2: Few-shot Text-to-Image Generation
Jiuxiang Gu
Chunyuan Li
Changyou Chen
Jianfeng Gao
Jinhui Xu
DiffM
176
14
0
25 Oct 2022
Swinv2-Imagen: Hierarchical Vision Transformer Diffusion Models for Text-to-Image Generation
Rui Li
Weihua Li
Yi Yang
Hanyu Wei
Jianhua Jiang
Quan-wei Bai
DiffM
368
17
0
18 Oct 2022
Markup-to-Image Diffusion Models with Scheduled Sampling
International Conference on Learning Representations (ICLR), 2022
Yuntian Deng
Noriyuki Kojima
Alexander M. Rush
DiffM
185
6
0
11 Oct 2022
Vision+X: A Survey on Multimodal Learning in the Light of Data
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Ye Zhu
Yuehua Wu
Andrii Zadaianchuk
Yan Yan
354
38
0
05 Oct 2022
ManiCLIP: Multi-Attribute Face Manipulation from Text
International Journal of Computer Vision (IJCV), 2022
Hao Wang
Guosheng Lin
A. Molino
Anran Wang
Jiashi Feng
Zehuan Yuan
CVBM
205
13
0
02 Oct 2022
Adma-GAN: Attribute-Driven Memory Augmented GANs for Text-to-Image Generation
ACM Multimedia (ACM MM), 2022
Xintian Wu
Hanbin Zhao
Liangli Zheng
Shouhong Ding
Xi Li
172
16
0
28 Sep 2022
Previous
1
2
3
4
5
6
7
Next