Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1802.09178
Cited By
v1
v2 (latest)
Photographic Text-to-Image Synthesis with a Hierarchically-nested Adversarial Network
26 February 2018
Zizhao Zhang
Yuanpu Xie
Ling Yang
EGVM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Photographic Text-to-Image Synthesis with a Hierarchically-nested Adversarial Network"
50 / 129 papers shown
Z-SASLM: Zero-Shot Style-Aligned SLI Blending Latent Manipulation
Alessio Borgi
Luca Maiano
Irene Amerini
256
0
0
29 Mar 2025
End-to-end Training for Text-to-Image Synthesis using Dual-Text Embeddings
Yeruru Asrar Ahmed
Anurag Mittal
DiffM
346
0
0
03 Feb 2025
DiT4Edit: Diffusion Transformer for Image Editing
AAAI Conference on Artificial Intelligence (AAAI), 2024
Kunyu Feng
Yi Ma
Bingyuan Wang
Zhiheng Liu
Haozhe Chen
Qifeng Chen
Zeyu Wang
347
88
0
05 Nov 2024
TAGE: Trustworthy Attribute Group Editing for Stable Few-shot Image Generation
International Conference on Signal Processing Systems (ICSPS), 2024
Ruicheng Zhang
Guoheng Huang
Yejing Huo
Xiaochen Yuan
Zhizhen Zhou
Xuhang Chen
Guo Zhong
377
1
0
23 Oct 2024
ArtiFade: Learning to Generate High-quality Subject from Blemished Images
Computer Vision and Pattern Recognition (CVPR), 2024
Shuya Yang
Shaozhe Hao
Yukang Cao
Kwan-Yee K. Wong
DiffM
132
1
0
05 Sep 2024
Detection-Driven Object Count Optimization for Text-to-Image Diffusion Models
Oz Zafar
Yuval Cohen
Lior Wolf
Idan Schwartz
VLM
355
4
0
21 Aug 2024
Deep Multi-Task Learning for Malware Image Classification
Journal of Information Security and Applications (JISA), 2022
A. Bensaoud
Jugal Kalita
283
42
0
09 May 2024
A Survey on Deep Learning and State-of-the-art Applications
Mohd Halim Mohd Noor
A. O. Ige
AILaw
MLAU
271
0
0
26 Mar 2024
The Right Losses for the Right Gains: Improving the Semantic Consistency of Deep Text-to-Image Generation with Distribution-Sensitive Losses
Mahmoud Ahmed
Omer Moussa
Ismail Shaheen
Mohamed S. Abdelfattah
Amr Abdalla
Marwan Eid
Hesham M. Eraqi
Mohamed Moustafa
331
1
0
18 Dec 2023
CogCartoon: Towards Practical Story Visualization
Zhongyang Zhu
Jie Tang
DiffM
310
7
0
17 Dec 2023
Object-aware Inversion and Reassembly for Image Editing
International Conference on Learning Representations (ICLR), 2023
Zhen Yang
Dinggang Gui
Wen Wang
Hao Chen
Bohan Zhuang
Chunhua Shen
DiffM
364
31
0
18 Oct 2023
Breaking Barriers to Creative Expression: Co-Designing and Implementing an Accessible Text-to-Image Interface
Atieh Taheri
Mohammad Izadi
Gururaj Shriram
Negar Rostamzadeh
Shaun Kane
DiffM
230
4
0
05 Sep 2023
RenAIssance: A Survey into AI Text-to-Image Generation in the Era of Large Model
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Fengxiang Bie
Jianlong Wu
Zhongzhu Zhou
Adam Ghanem
Minjia Zhang
...
Pareesa Ameneh Golnari
David A. Clifton
Yuxiong He
Dacheng Tao
Shuaiwen Leon Song
EGVM
294
67
0
02 Sep 2023
Iterative Multi-granular Image Editing using Diffusion Models
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
K. J. Joseph
Prateksha Udhayanan
Tripti Shukla
Aishwarya Agarwal
Srikrishna Karanam
Koustava Goswami
Balaji Vasan Srinivasan
DiffM
351
27
0
01 Sep 2023
DreamIdentity: Improved Editability for Efficient Face-identity Preserved Image Generation
Zhuowei Chen
Shancheng Fang
Wei Liu
Qian He
Mengqi Huang
Yongdong Zhang
Zhendong Mao
DiffM
307
31
0
01 Jul 2023
DomainStudio: Fine-Tuning Diffusion Models for Domain-Driven Image Generation using Limited Data
International Journal of Computer Vision (IJCV), 2023
Jin Zhu
Huimin Ma
Jiansheng Chen
Jian Yuan
DiffM
584
20
0
25 Jun 2023
Learning Profitable NFT Image Diffusions via Multiple Visual-Policy Guided Reinforcement Learning
ACM Multimedia (ACM MM), 2023
Huiguo He
Tianfu Wang
Huan Yang
Jianlong Fu
N. Yuan
Jian Yin
Hongyang Chao
Tao Gui
EGVM
378
14
0
20 Jun 2023
The Big Data Myth: Using Diffusion Models for Dataset Generation to Train Deep Detection Models
Roy Voetman
Maya Aghaei
K. Dijkstra
DiffM
299
16
0
16 Jun 2023
Vision + Language Applications: A Survey
Yutong Zhou
N. Shimada
VLM
341
16
0
24 May 2023
SceneGenie: Scene Graph Guided Diffusion Models for Image Synthesis
Azade Farshad
Yousef Yeganeh
Yucong Chi
Cheng-nan Shen
Bjorn Ommer
Nassir Navab
DiffM
280
48
0
28 Apr 2023
IconShop: Text-Guided Vector Icon Synthesis with Autoregressive Transformers
ACM Transactions on Graphics (TOG), 2023
Rong Wu
Wanchao Su
Kede Ma
Jing Liao
581
80
0
27 Apr 2023
ALR-GAN: Adaptive Layout Refinement for Text-to-Image Synthesis
IEEE transactions on multimedia (IEEE TMM), 2023
Hongchen Tan
Baocai Yin
Kun Wei
Xiuping Liu
Xin Li
199
28
0
13 Apr 2023
Gradient-Free Textual Inversion
ACM Multimedia (ACM MM), 2023
Zhengcong Fei
Mingyuan Fan
Junshi Huang
DiffM
300
40
0
12 Apr 2023
Toward Verifiable and Reproducible Human Evaluation for Text-to-Image Generation
Computer Vision and Pattern Recognition (CVPR), 2023
Mayu Otani
Riku Togashi
Yu Sawai
Ryosuke Ishigami
Yuta Nakashima
Esa Rahtu
J. Heikkilä
Shiníchi Satoh
255
86
0
04 Apr 2023
Discriminative Class Tokens for Text-to-Image Diffusion Models
IEEE International Conference on Computer Vision (ICCV), 2023
Idan Schwartz
Vésteinn Snaebjarnarson
Hila Chefer
Robert Bamler
Serge Belongie
Lior Wolf
Sagie Benaim
498
13
0
30 Mar 2023
Factor Decomposed Generative Adversarial Networks for Text-to-Image Synthesis
Jiguo Li
Xiaobin Liu
Lirong Zheng
DRL
171
1
0
24 Mar 2023
MagicFusion: Boosting Text-to-Image Generation Performance by Fusing Diffusion Models
IEEE International Conference on Computer Vision (ICCV), 2023
Jing Zhao
Heliang Zheng
Chaoyue Wang
L. Lan
Wenjing Yang
VLM
386
25
0
23 Mar 2023
Unified Multi-Modal Latent Diffusion for Joint Subject and Text Conditional Image Generation
Yi Ma
Huan Yang
Wenjing Wang
Jianlong Fu
Jiaying Liu
171
74
0
16 Mar 2023
Highly Personalized Text Embedding for Image Manipulation by Stable Diffusion
Inhwa Han
Serin Yang
Taesung Kwon
Jong Chul Ye
DiffM
357
43
0
15 Mar 2023
Fine-grained Cross-modal Fusion based Refinement for Text-to-Image Synthesis
Chinese journal of electronics (CJE), 2023
Haoran Sun
Yang Wang
Haipeng Liu
Biao Qian
338
13
0
17 Feb 2023
Multi-modal Machine Learning in Engineering Design: A Review and Future Directions
Journal of Computing and Information Science in Engineering (JCISE), 2023
Binyang Song
Ruilin Zhou
Faez Ahmed
AI4CE
429
71
0
14 Feb 2023
Attribute-Centric Compositional Text-to-Image Generation
International Journal of Computer Vision (IJCV), 2023
Yuren Cong
Martin Renqiang Min
Erran L. Li
Bodo Rosenhahn
M. Yang
361
19
0
04 Jan 2023
One Model to Edit Them All: Free-Form Text-Driven Image Manipulation with Semantic Modulations
Neural Information Processing Systems (NeurIPS), 2022
Yi-Chun Zhu
Hongyu Liu
Yibing Song
Ziyang Yuan
Xintong Han
Chun Yuan
Qifeng Chen
Jue Wang
VLM
DiffM
393
41
0
14 Oct 2022
Adma-GAN: Attribute-Driven Memory Augmented GANs for Text-to-Image Generation
ACM Multimedia (ACM MM), 2022
Xintian Wu
Hanbin Zhao
Liangli Zheng
Shouhong Ding
Xi Li
245
16
0
28 Sep 2022
AI Illustrator: Translating Raw Descriptions into Images by Prompt-based Cross-Modal Generation
ACM Multimedia (ACM MM), 2022
Yi Ma
Huan Yang
Bei Liu
Jianlong Fu
Jiaying Liu
DiffM
MLLM
296
12
0
07 Sep 2022
DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation
Computer Vision and Pattern Recognition (CVPR), 2022
Nataniel Ruiz
Yuanzhen Li
Varun Jampani
Yael Pritch
Michael Rubinstein
Kfir Aberman
1.5K
4,101
0
25 Aug 2022
Vision-Language Matching for Text-to-Image Synthesis via Generative Adversarial Networks
IEEE transactions on multimedia (IEEE TMM), 2022
Qingrong Cheng
Keyu Wen
X. Gu
VLM
EGVM
188
20
0
20 Aug 2022
T-Person-GAN: Text-to-Person Image Generation with Identity-Consistency and Manifold Mix-Up
Expert systems with applications (ESWA), 2022
Deyin Liu
Yang Wang
Q. Tian
Zongyuan Ge
DiffM
337
9
0
18 Aug 2022
ARMANI: Part-level Garment-Text Alignment for Unified Cross-Modal Fashion Design
ACM Multimedia (ACM MM), 2022
Xujie Zhang
Yuyang Sha
Michael C. Kampffmeyer
Zhenyu Xie
Zequn Jie
Chengwen Huang
Jianqing Peng
Xiaodan Liang
227
31
0
11 Aug 2022
Prompt-to-Prompt Image Editing with Cross Attention Control
International Conference on Learning Representations (ICLR), 2022
Amir Hertz
Ron Mokady
J. Tenenbaum
Kfir Aberman
Yael Pritch
Daniel Cohen-Or
DiffM
976
2,554
0
02 Aug 2022
Diffsound: Discrete Diffusion Model for Text-to-sound Generation
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022
Dongchao Yang
Jianwei Yu
Helin Wang
Wen Wang
Chao Weng
Yuexian Zou
Dong Yu
DiffM
360
403
0
20 Jul 2022
Transforming Image Generation from Scene Graphs
International Conference on Pattern Recognition (ICPR), 2022
Renato Sortino
S. Palazzo
C. Spampinato
ViT
246
2
0
01 Jul 2022
Avocodo: Generative Adversarial Network for Artifact-free Vocoder
AAAI Conference on Artificial Intelligence (AAAI), 2022
Taejun Bak
Junmo Lee
Hanbin Bae
Jinhyeok Yang
Jaesung Bae
Young-Sun Joo
319
44
0
27 Jun 2022
Improved Vector Quantized Diffusion Models
Zhicong Tang
Shuyang Gu
Jianmin Bao
Dong Chen
Fang Wen
DiffM
576
74
0
31 May 2022
Text-to-Face Generation with StyleGAN2
D. M. A. Ayanthi
Sarasi Munasinghe
CVBM
165
10
0
25 May 2022
Synthetic Data -- what, why and how?
James Jordon
Lukasz Szpruch
F. Houssiau
M. Bottarelli
Giovanni Cherubin
Carsten Maple
Samuel N. Cohen
Adrian Weller
376
180
0
06 May 2022
DR-GAN: Distribution Regularization for Text-to-Image Generation
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2022
Hongchen Tan
Xiuping Liu
Baocai Yin
Xin Li
GAN
213
53
0
17 Apr 2022
PixelFolder: An Efficient Progressive Pixel Synthesis Network for Image Generation
European Conference on Computer Vision (ECCV), 2022
Jing He
Weihao Ye
Tao Gui
Jun Peng
Chunjiang Ge
Xiaoshuai Sun
Chao Chen
Rongrong Ji
368
6
0
02 Apr 2022
One-shot Ultra-high-Resolution Generative Adversarial Network That Synthesizes 16K Images On A Single GPU
Image and Vision Computing (IVC), 2022
Junseok Oh
Donghwee Yoon
Injung Kim
410
2
0
28 Feb 2022
Multimodal Image Synthesis and Editing: The Generative AI Era
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021
Fangneng Zhan
Yingchen Yu
Rongliang Wu
Jiahui Zhang
Shijian Lu
Lingjie Liu
Adam Kortylewski
Christian Theobalt
Eric Xing
EGVM
702
96
0
27 Dec 2021
1
2
3
Next
Page 1 of 3