ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1802.09178
  4. Cited By
Photographic Text-to-Image Synthesis with a Hierarchically-nested
  Adversarial Network
v1v2 (latest)

Photographic Text-to-Image Synthesis with a Hierarchically-nested Adversarial Network

26 February 2018
Zizhao Zhang
Yuanpu Xie
Ling Yang
    EGVM
ArXiv (abs)PDFHTML

Papers citing "Photographic Text-to-Image Synthesis with a Hierarchically-nested Adversarial Network"

50 / 129 papers shown
Z-SASLM: Zero-Shot Style-Aligned SLI Blending Latent Manipulation
Z-SASLM: Zero-Shot Style-Aligned SLI Blending Latent Manipulation
Alessio Borgi
Luca Maiano
Irene Amerini
256
0
0
29 Mar 2025
End-to-end Training for Text-to-Image Synthesis using Dual-Text Embeddings
End-to-end Training for Text-to-Image Synthesis using Dual-Text Embeddings
Yeruru Asrar Ahmed
Anurag Mittal
DiffM
346
0
0
03 Feb 2025
DiT4Edit: Diffusion Transformer for Image Editing
DiT4Edit: Diffusion Transformer for Image EditingAAAI Conference on Artificial Intelligence (AAAI), 2024
Kunyu Feng
Yi Ma
Bingyuan Wang
Zhiheng Liu
Haozhe Chen
Qifeng Chen
Zeyu Wang
347
88
0
05 Nov 2024
TAGE: Trustworthy Attribute Group Editing for Stable Few-shot Image
  Generation
TAGE: Trustworthy Attribute Group Editing for Stable Few-shot Image GenerationInternational Conference on Signal Processing Systems (ICSPS), 2024
Ruicheng Zhang
Guoheng Huang
Yejing Huo
Xiaochen Yuan
Zhizhen Zhou
Xuhang Chen
Guo Zhong
377
1
0
23 Oct 2024
ArtiFade: Learning to Generate High-quality Subject from Blemished
  Images
ArtiFade: Learning to Generate High-quality Subject from Blemished ImagesComputer Vision and Pattern Recognition (CVPR), 2024
Shuya Yang
Shaozhe Hao
Yukang Cao
Kwan-Yee K. Wong
DiffM
132
1
0
05 Sep 2024
Detection-Driven Object Count Optimization for Text-to-Image Diffusion Models
Detection-Driven Object Count Optimization for Text-to-Image Diffusion Models
Oz Zafar
Yuval Cohen
Lior Wolf
Idan Schwartz
VLM
355
4
0
21 Aug 2024
Deep Multi-Task Learning for Malware Image Classification
Deep Multi-Task Learning for Malware Image ClassificationJournal of Information Security and Applications (JISA), 2022
A. Bensaoud
Jugal Kalita
283
42
0
09 May 2024
A Survey on Deep Learning and State-of-the-art Applications
A Survey on Deep Learning and State-of-the-art Applications
Mohd Halim Mohd Noor
A. O. Ige
AILawMLAU
271
0
0
26 Mar 2024
The Right Losses for the Right Gains: Improving the Semantic Consistency
  of Deep Text-to-Image Generation with Distribution-Sensitive Losses
The Right Losses for the Right Gains: Improving the Semantic Consistency of Deep Text-to-Image Generation with Distribution-Sensitive Losses
Mahmoud Ahmed
Omer Moussa
Ismail Shaheen
Mohamed S. Abdelfattah
Amr Abdalla
Marwan Eid
Hesham M. Eraqi
Mohamed Moustafa
331
1
0
18 Dec 2023
CogCartoon: Towards Practical Story Visualization
CogCartoon: Towards Practical Story Visualization
Zhongyang Zhu
Jie Tang
DiffM
310
7
0
17 Dec 2023
Object-aware Inversion and Reassembly for Image Editing
Object-aware Inversion and Reassembly for Image EditingInternational Conference on Learning Representations (ICLR), 2023
Zhen Yang
Dinggang Gui
Wen Wang
Hao Chen
Bohan Zhuang
Chunhua Shen
DiffM
364
31
0
18 Oct 2023
Breaking Barriers to Creative Expression: Co-Designing and Implementing
  an Accessible Text-to-Image Interface
Breaking Barriers to Creative Expression: Co-Designing and Implementing an Accessible Text-to-Image Interface
Atieh Taheri
Mohammad Izadi
Gururaj Shriram
Negar Rostamzadeh
Shaun Kane
DiffM
230
4
0
05 Sep 2023
RenAIssance: A Survey into AI Text-to-Image Generation in the Era of
  Large Model
RenAIssance: A Survey into AI Text-to-Image Generation in the Era of Large ModelIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Fengxiang Bie
Jianlong Wu
Zhongzhu Zhou
Adam Ghanem
Minjia Zhang
...
Pareesa Ameneh Golnari
David A. Clifton
Yuxiong He
Dacheng Tao
Shuaiwen Leon Song
EGVM
294
67
0
02 Sep 2023
Iterative Multi-granular Image Editing using Diffusion Models
Iterative Multi-granular Image Editing using Diffusion ModelsIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
K. J. Joseph
Prateksha Udhayanan
Tripti Shukla
Aishwarya Agarwal
Srikrishna Karanam
Koustava Goswami
Balaji Vasan Srinivasan
DiffM
351
27
0
01 Sep 2023
DreamIdentity: Improved Editability for Efficient Face-identity
  Preserved Image Generation
DreamIdentity: Improved Editability for Efficient Face-identity Preserved Image Generation
Zhuowei Chen
Shancheng Fang
Wei Liu
Qian He
Mengqi Huang
Yongdong Zhang
Zhendong Mao
DiffM
307
31
0
01 Jul 2023
DomainStudio: Fine-Tuning Diffusion Models for Domain-Driven Image
  Generation using Limited Data
DomainStudio: Fine-Tuning Diffusion Models for Domain-Driven Image Generation using Limited DataInternational Journal of Computer Vision (IJCV), 2023
Jin Zhu
Huimin Ma
Jiansheng Chen
Jian Yuan
DiffM
584
20
0
25 Jun 2023
Learning Profitable NFT Image Diffusions via Multiple Visual-Policy
  Guided Reinforcement Learning
Learning Profitable NFT Image Diffusions via Multiple Visual-Policy Guided Reinforcement LearningACM Multimedia (ACM MM), 2023
Huiguo He
Tianfu Wang
Huan Yang
Jianlong Fu
N. Yuan
Jian Yin
Hongyang Chao
Tao Gui
EGVM
378
14
0
20 Jun 2023
The Big Data Myth: Using Diffusion Models for Dataset Generation to
  Train Deep Detection Models
The Big Data Myth: Using Diffusion Models for Dataset Generation to Train Deep Detection Models
Roy Voetman
Maya Aghaei
K. Dijkstra
DiffM
299
16
0
16 Jun 2023
Vision + Language Applications: A Survey
Vision + Language Applications: A Survey
Yutong Zhou
N. Shimada
VLM
341
16
0
24 May 2023
SceneGenie: Scene Graph Guided Diffusion Models for Image Synthesis
SceneGenie: Scene Graph Guided Diffusion Models for Image Synthesis
Azade Farshad
Yousef Yeganeh
Yucong Chi
Cheng-nan Shen
Bjorn Ommer
Nassir Navab
DiffM
280
48
0
28 Apr 2023
IconShop: Text-Guided Vector Icon Synthesis with Autoregressive
  Transformers
IconShop: Text-Guided Vector Icon Synthesis with Autoregressive TransformersACM Transactions on Graphics (TOG), 2023
Rong Wu
Wanchao Su
Kede Ma
Jing Liao
581
80
0
27 Apr 2023
ALR-GAN: Adaptive Layout Refinement for Text-to-Image Synthesis
ALR-GAN: Adaptive Layout Refinement for Text-to-Image SynthesisIEEE transactions on multimedia (IEEE TMM), 2023
Hongchen Tan
Baocai Yin
Kun Wei
Xiuping Liu
Xin Li
199
28
0
13 Apr 2023
Gradient-Free Textual Inversion
Gradient-Free Textual InversionACM Multimedia (ACM MM), 2023
Zhengcong Fei
Mingyuan Fan
Junshi Huang
DiffM
300
40
0
12 Apr 2023
Toward Verifiable and Reproducible Human Evaluation for Text-to-Image
  Generation
Toward Verifiable and Reproducible Human Evaluation for Text-to-Image GenerationComputer Vision and Pattern Recognition (CVPR), 2023
Mayu Otani
Riku Togashi
Yu Sawai
Ryosuke Ishigami
Yuta Nakashima
Esa Rahtu
J. Heikkilä
Shiníchi Satoh
255
86
0
04 Apr 2023
Discriminative Class Tokens for Text-to-Image Diffusion Models
Discriminative Class Tokens for Text-to-Image Diffusion ModelsIEEE International Conference on Computer Vision (ICCV), 2023
Idan Schwartz
Vésteinn Snaebjarnarson
Hila Chefer
Robert Bamler
Serge Belongie
Lior Wolf
Sagie Benaim
498
13
0
30 Mar 2023
Factor Decomposed Generative Adversarial Networks for Text-to-Image
  Synthesis
Factor Decomposed Generative Adversarial Networks for Text-to-Image Synthesis
Jiguo Li
Xiaobin Liu
Lirong Zheng
DRL
171
1
0
24 Mar 2023
MagicFusion: Boosting Text-to-Image Generation Performance by Fusing
  Diffusion Models
MagicFusion: Boosting Text-to-Image Generation Performance by Fusing Diffusion ModelsIEEE International Conference on Computer Vision (ICCV), 2023
Jing Zhao
Heliang Zheng
Chaoyue Wang
L. Lan
Wenjing Yang
VLM
386
25
0
23 Mar 2023
Unified Multi-Modal Latent Diffusion for Joint Subject and Text
  Conditional Image Generation
Unified Multi-Modal Latent Diffusion for Joint Subject and Text Conditional Image Generation
Yi Ma
Huan Yang
Wenjing Wang
Jianlong Fu
Jiaying Liu
171
74
0
16 Mar 2023
Highly Personalized Text Embedding for Image Manipulation by Stable
  Diffusion
Highly Personalized Text Embedding for Image Manipulation by Stable Diffusion
Inhwa Han
Serin Yang
Taesung Kwon
Jong Chul Ye
DiffM
357
43
0
15 Mar 2023
Fine-grained Cross-modal Fusion based Refinement for Text-to-Image
  Synthesis
Fine-grained Cross-modal Fusion based Refinement for Text-to-Image SynthesisChinese journal of electronics (CJE), 2023
Haoran Sun
Yang Wang
Haipeng Liu
Biao Qian
338
13
0
17 Feb 2023
Multi-modal Machine Learning in Engineering Design: A Review and Future
  Directions
Multi-modal Machine Learning in Engineering Design: A Review and Future DirectionsJournal of Computing and Information Science in Engineering (JCISE), 2023
Binyang Song
Ruilin Zhou
Faez Ahmed
AI4CE
429
71
0
14 Feb 2023
Attribute-Centric Compositional Text-to-Image Generation
Attribute-Centric Compositional Text-to-Image GenerationInternational Journal of Computer Vision (IJCV), 2023
Yuren Cong
Martin Renqiang Min
Erran L. Li
Bodo Rosenhahn
M. Yang
361
19
0
04 Jan 2023
One Model to Edit Them All: Free-Form Text-Driven Image Manipulation
  with Semantic Modulations
One Model to Edit Them All: Free-Form Text-Driven Image Manipulation with Semantic ModulationsNeural Information Processing Systems (NeurIPS), 2022
Yi-Chun Zhu
Hongyu Liu
Yibing Song
Ziyang Yuan
Xintong Han
Chun Yuan
Qifeng Chen
Jue Wang
VLMDiffM
393
41
0
14 Oct 2022
Adma-GAN: Attribute-Driven Memory Augmented GANs for Text-to-Image
  Generation
Adma-GAN: Attribute-Driven Memory Augmented GANs for Text-to-Image GenerationACM Multimedia (ACM MM), 2022
Xintian Wu
Hanbin Zhao
Liangli Zheng
Shouhong Ding
Xi Li
245
16
0
28 Sep 2022
AI Illustrator: Translating Raw Descriptions into Images by Prompt-based
  Cross-Modal Generation
AI Illustrator: Translating Raw Descriptions into Images by Prompt-based Cross-Modal GenerationACM Multimedia (ACM MM), 2022
Yi Ma
Huan Yang
Bei Liu
Jianlong Fu
Jiaying Liu
DiffMMLLM
296
12
0
07 Sep 2022
DreamBooth: Fine Tuning Text-to-Image Diffusion Models for
  Subject-Driven Generation
DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven GenerationComputer Vision and Pattern Recognition (CVPR), 2022
Nataniel Ruiz
Yuanzhen Li
Varun Jampani
Yael Pritch
Michael Rubinstein
Kfir Aberman
1.5K
4,101
0
25 Aug 2022
Vision-Language Matching for Text-to-Image Synthesis via Generative
  Adversarial Networks
Vision-Language Matching for Text-to-Image Synthesis via Generative Adversarial NetworksIEEE transactions on multimedia (IEEE TMM), 2022
Qingrong Cheng
Keyu Wen
X. Gu
VLMEGVM
188
20
0
20 Aug 2022
T-Person-GAN: Text-to-Person Image Generation with Identity-Consistency
  and Manifold Mix-Up
T-Person-GAN: Text-to-Person Image Generation with Identity-Consistency and Manifold Mix-UpExpert systems with applications (ESWA), 2022
Deyin Liu
Yang Wang
Q. Tian
Zongyuan Ge
DiffM
337
9
0
18 Aug 2022
ARMANI: Part-level Garment-Text Alignment for Unified Cross-Modal
  Fashion Design
ARMANI: Part-level Garment-Text Alignment for Unified Cross-Modal Fashion DesignACM Multimedia (ACM MM), 2022
Xujie Zhang
Yuyang Sha
Michael C. Kampffmeyer
Zhenyu Xie
Zequn Jie
Chengwen Huang
Jianqing Peng
Xiaodan Liang
227
31
0
11 Aug 2022
Prompt-to-Prompt Image Editing with Cross Attention Control
Prompt-to-Prompt Image Editing with Cross Attention ControlInternational Conference on Learning Representations (ICLR), 2022
Amir Hertz
Ron Mokady
J. Tenenbaum
Kfir Aberman
Yael Pritch
Daniel Cohen-Or
DiffM
976
2,554
0
02 Aug 2022
Diffsound: Discrete Diffusion Model for Text-to-sound Generation
Diffsound: Discrete Diffusion Model for Text-to-sound GenerationIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022
Dongchao Yang
Jianwei Yu
Helin Wang
Wen Wang
Chao Weng
Yuexian Zou
Dong Yu
DiffM
360
403
0
20 Jul 2022
Transforming Image Generation from Scene Graphs
Transforming Image Generation from Scene GraphsInternational Conference on Pattern Recognition (ICPR), 2022
Renato Sortino
S. Palazzo
C. Spampinato
ViT
246
2
0
01 Jul 2022
Avocodo: Generative Adversarial Network for Artifact-free Vocoder
Avocodo: Generative Adversarial Network for Artifact-free VocoderAAAI Conference on Artificial Intelligence (AAAI), 2022
Taejun Bak
Junmo Lee
Hanbin Bae
Jinhyeok Yang
Jaesung Bae
Young-Sun Joo
319
44
0
27 Jun 2022
Improved Vector Quantized Diffusion Models
Improved Vector Quantized Diffusion Models
Zhicong Tang
Shuyang Gu
Jianmin Bao
Dong Chen
Fang Wen
DiffM
576
74
0
31 May 2022
Text-to-Face Generation with StyleGAN2
Text-to-Face Generation with StyleGAN2
D. M. A. Ayanthi
Sarasi Munasinghe
CVBM
165
10
0
25 May 2022
Synthetic Data -- what, why and how?
Synthetic Data -- what, why and how?
James Jordon
Lukasz Szpruch
F. Houssiau
M. Bottarelli
Giovanni Cherubin
Carsten Maple
Samuel N. Cohen
Adrian Weller
376
180
0
06 May 2022
DR-GAN: Distribution Regularization for Text-to-Image Generation
DR-GAN: Distribution Regularization for Text-to-Image GenerationIEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2022
Hongchen Tan
Xiuping Liu
Baocai Yin
Xin Li
GAN
213
53
0
17 Apr 2022
PixelFolder: An Efficient Progressive Pixel Synthesis Network for Image
  Generation
PixelFolder: An Efficient Progressive Pixel Synthesis Network for Image GenerationEuropean Conference on Computer Vision (ECCV), 2022
Jing He
Weihao Ye
Tao Gui
Jun Peng
Chunjiang Ge
Xiaoshuai Sun
Chao Chen
Rongrong Ji
368
6
0
02 Apr 2022
One-shot Ultra-high-Resolution Generative Adversarial Network That
  Synthesizes 16K Images On A Single GPU
One-shot Ultra-high-Resolution Generative Adversarial Network That Synthesizes 16K Images On A Single GPUImage and Vision Computing (IVC), 2022
Junseok Oh
Donghwee Yoon
Injung Kim
410
2
0
28 Feb 2022
Multimodal Image Synthesis and Editing: The Generative AI Era
Multimodal Image Synthesis and Editing: The Generative AI EraIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021
Fangneng Zhan
Yingchen Yu
Rongliang Wu
Jiahui Zhang
Shijian Lu
Lingjie Liu
Adam Kortylewski
Christian Theobalt
Eric Xing
EGVM
702
96
0
27 Dec 2021
123
Next
Page 1 of 3