ResearchTrend.AI
arXiv: 1612.03242
StackGAN: Text to Photo-realistic Image Synthesis with Stacked Generative Adversarial Networks

10 December 2016
Han Zhang
Tao Xu
Hongsheng Li
Shaoting Zhang
Xiaogang Wang
Xiaolei Huang
Dimitris N. Metaxas
    GAN

Papers citing "StackGAN: Text to Photo-realistic Image Synthesis with Stacked Generative Adversarial Networks"

50 / 290 papers shown
PRISM: A Unified Framework for Photorealistic Reconstruction and Intrinsic Scene Modeling
Alara Dirik
Tuanfeng Y. Wang
Duygu Ceylan
Stefanos Zafeiriou
Anna Frühstück
DiffM
40
0
0
19 Apr 2025
A Review on Generative AI For Text-To-Image and Image-To-Image Generation and Implications To Scientific Images
Zineb Sordo
Eric Chagnon
Daniela Ushizima
EGVM
MedIm
61
1
0
28 Feb 2025
Texture Image Synthesis Using Spatial GAN Based on Vision Transformers
Elahe Salari
Zohreh Azimifar
ViT
50
0
0
03 Feb 2025
INFELM: In-depth Fairness Evaluation of Large Text-To-Image Models
Di Jin
Xing Liu
Yu Liu
Jia Qing Yap
Andrea Wong
Adriana Crespo
Qi Lin
Zhiyuan Yin
Qiang Yan
Ryan Ye
EGVM
VLM
116
0
0
10 Jan 2025
CLIP-SR: Collaborative Linguistic and Image Processing for Super-Resolution
Bingwen Hu
Heng Liu
Zhedong Zheng
Ping Liu
SupR
81
0
0
16 Dec 2024
TweedieMix: Improving Multi-Concept Fusion for Diffusion-based Image/Video Generation
Gihyun Kwon
Jong Chul Ye
DiffM
61
3
0
08 Oct 2024
Denoising with a Joint-Embedding Predictive Architecture
Dengsheng Chen
Jie Hu
Xiaoming Wei
Enhua Wu
DiffM
50
2
0
02 Oct 2024
Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Prediction
Jing He
Haodong Li
Wei Yin
Yixun Liang
Leheng Li
Kaiqiang Zhou
Hongbo Zhang
Bingbing Liu
Ying-Cong Chen
DiffM
VLM
44
40
0
26 Sep 2024
Jailbreaking Text-to-Image Models with LLM-Based Agents
Yingkai Dong
Zheng Li
Xiangtao Meng
Ning Yu
Shanqing Guo
LLMAG
36
13
0
01 Aug 2024
Theoretical Insights into CycleGAN: Analyzing Approximation and Estimation Errors in Unpaired Data Generation
Luwei Sun
Dongrui Shen
Han Feng
29
2
0
16 Jul 2024
Surgical Text-to-Image Generation
C. Nwoye
Rupak Bose
K. Elgohary
Lorenzo Arboit
Giorgio Carlino
Joël L. Lavanchy
Pietro Mascagni
N. Padoy
MedIm
55
3
0
12 Jul 2024
User-Friendly Customized Generation with Multi-Modal Prompts
Linhao Zhong
Yan Hong
Wentao Chen
Binglin Zhou
Yiyi Zhang
Jianfu Zhang
Liqing Zhang
DiffM
37
0
0
26 May 2024
KiNETGAN: Enabling Distributed Network Intrusion Detection through Knowledge-Infused Synthetic Data Generation
Anantaa Kotal
Brandon Luton
Anupam Joshi
18
1
0
26 May 2024
Evolving Storytelling: Benchmarks and Methods for New Character Customization with Diffusion Models
Xiyu Wang
Yufei Wang
Satoshi Tsutsui
Weisi Lin
Bihan Wen
Alex C. Kot
35
4
0
20 May 2024
SignAvatar: Sign Language 3D Motion Reconstruction and Generation
Lu Dong
Lipisha Chaudhary
Fei Xu
Xiao Wang
Mason Lary
Ifeoma Nwogu
SLR
32
3
0
13 May 2024
TextGaze: Gaze-Controllable Face Generation with Natural Language
Hengfei Wang
Zhongqun Zhang
Yihua Cheng
Hyung Jin Chang
DiffM
33
2
0
26 Apr 2024
Iteratively Prompting Multimodal LLMs to Reproduce Natural and AI-Generated Images
Ali Naseh
Katherine Thai
Mohit Iyyer
Amir Houmansadr
33
5
0
21 Apr 2024
LASER: Tuning-Free LLM-Driven Attention Control for Efficient Text-conditioned Image-to-Animation
Haoyu Zheng
Wenqiao Zhang
Yaoke Wang
Hao Zhou
Jiang Liu
Juncheng Li
Zheqi Lv
Siliang Tang
Yueting Zhuang
32
1
0
21 Apr 2024
DialogGen: Multi-modal Interactive Dialogue System for Multi-turn Text-to-Image Generation
Minbin Huang
Yanxin Long
Xinchi Deng
Ruihang Chu
Jiangfeng Xiong
Xiaodan Liang
Hong Cheng
Qinglin Lu
Wei Liu
MLLM
EGVM
59
8
0
13 Mar 2024
Improving deep learning with prior knowledge and cognitive models: A survey on enhancing explainability, adversarial robustness and zero-shot learning
F. Mumuni
A. Mumuni
AAML
31
5
0
11 Mar 2024
Wavelet-Guided Acceleration of Text Inversion in Diffusion-Based Image Editing
Gwanhyeong Koo
Sunjae Yoon
Changdong Yoo
DiffM
19
7
0
18 Jan 2024
IMPRESS: Evaluating the Resilience of Imperceptible Perturbations Against Unauthorized Data Usage in Diffusion-Based Generative AI
Bochuan Cao
Changjiang Li
Ting Wang
Jinyuan Jia
Bo Li
Jinghui Chen
DiffM
17
21
0
30 Oct 2023
A Distributed Approach to Meteorological Predictions: Addressing Data Imbalance in Precipitation Prediction Models through Federated Learning and GANs
Elaheh Jafarigol
Theodore Trafalis
13
7
0
19 Oct 2023
Improving Compositional Text-to-image Generation with Large Vision-Language Models
Song Wen
Guian Fang
Renrui Zhang
Peng Gao
Hao Dong
Dimitris N. Metaxas
21
17
0
10 Oct 2023
Towards High-Fidelity Text-Guided 3D Face Generation and Manipulation Using only Images
Cuican Yu
Guansong Lu
Yihan Zeng
Jian-jun Sun
Xiaodan Liang
Huibin Li
Zongben Xu
Songcen Xu
Wei Zhang
Hang Xu
33
14
0
31 Aug 2023
CoDeF: Content Deformation Fields for Temporally Consistent Video Processing
Ouyang Hao
Qiuyu Wang
Yuxi Xiao
Qingyan Bai
Juntao Zhang
Kecheng Zheng
Xiaowei Zhou
Qifeng Chen
Yujun Shen
DiffM
VGen
41
81
0
15 Aug 2023
Interleaving GANs with knowledge graphs to support design creativity for book covers
Alexandru Motogna
Adrian Groza
GAN
6
0
0
03 Aug 2023
Stylized Projected GAN: A Novel Architecture for Fast and Realistic Image Generation
Md Nurul Muttakin
Malik Shahid Sultan
R. Hoehndorf
H. Ombao
GAN
24
0
0
30 Jul 2023
Synaptic Plasticity Models and Bio-Inspired Unsupervised Deep Learning: A Survey
Gabriele Lagani
Fabrizio Falchi
Claudio Gennaro
Giuseppe Amato
AAML
33
6
0
30 Jul 2023
Semantic Image Completion and Enhancement using GANs
Priyansh Saxena
Raahat Gupta
Akshat Maheshwari
Saumil Maheshwari
VLM
21
1
0
27 Jul 2023
Spatial-Frequency U-Net for Denoising Diffusion Probabilistic Models
Xin Yuan
Linjie Li
Jianfeng Wang
Zhengyuan Yang
Kevin Qinghong Lin
Zicheng Liu
Lijuan Wang
DiffM
51
6
0
27 Jul 2023
Image Captions are Natural Prompts for Text-to-Image Models
Shiye Lei
Hao Chen
Senyang Zhang
Bo-Lu Zhao
Dacheng Tao
VLM
24
19
0
17 Jul 2023
Counting Guidance for High Fidelity Text-to-Image Synthesis
Wonjune Kang
Kevin Galim
H. Koo
Nam Ik Cho
DiffM
32
8
0
30 Jun 2023
Differential Diffusion: Giving Each Pixel Its Strength
E. Levin
Ohad Fried
DiffM
37
20
0
01 Jun 2023
PanoGen: Text-Conditioned Panoramic Environment Generation for Vision-and-Language Navigation
Jialu Li
Mohit Bansal
DiffM
27
49
0
30 May 2023
Text-to-image Editing by Image Information Removal
Zhongping Zhang
Jian Zheng
Jacob Zhiyuan Fang
Bryan A. Plummer
DiffM
16
12
0
27 May 2023
Uni-ControlNet: All-in-One Control to Text-to-Image Diffusion Models
Shihao Zhao
Dongdong Chen
Yen-Chun Chen
Jianmin Bao
Shaozhe Hao
Lu Yuan
Kwan-Yee Kenneth Wong
25
234
0
25 May 2023
Latent-Shift: Latent Diffusion with Temporal Shift for Efficient Text-to-Video Generation
Jie An
Songyang Zhang
Harry Yang
Sonal Gupta
Jia-Bin Huang
Jiebo Luo
Xiaoyue Yin
DiffM
VGen
27
106
0
17 Apr 2023
ALR-GAN: Adaptive Layout Refinement for Text-to-Image Synthesis
Hongchen Tan
Baocai Yin
Kun Wei
Xiuping Liu
Xin Li
13
16
0
13 Apr 2023
HRS-Bench: Holistic, Reliable and Scalable Benchmark for Text-to-Image Models
Eslam Mohamed Bakr
Pengzhan Sun
Xiaoqian Shen
Faizan Farooq Khan
Li Erran Li
Mohamed Elhoseiny
VLM
13
76
0
11 Apr 2023
Toward Verifiable and Reproducible Human Evaluation for Text-to-Image Generation
Mayu Otani
Riku Togashi
Yu Sawai
Ryosuke Ishigami
Yuta Nakashima
Esa Rahtu
J. Heikkilä
Shin'ichi Satoh
31
62
0
04 Apr 2023
Spatial Latent Representations in Generative Adversarial Networks for Image Generation
Maciej Sypetkowski
GAN
26
1
0
25 Mar 2023
Freestyle Layout-to-Image Synthesis
Han Xue
Z. Huang
Qianru Sun
Li-Na Song
Wenjun Zhang
DiffM
15
62
0
25 Mar 2023
Factor Decomposed Generative Adversarial Networks for Text-to-Image Synthesis
Jiguo Li
Xiaobin Liu
Lirong Zheng
DRL
19
1
0
24 Mar 2023
TAPS3D: Text-Guided 3D Textured Shape Generation from Pseudo Supervision
Jiacheng Wei
Hao Wang
Jiashi Feng
Guosheng Lin
Kim-Hui Yap
22
30
0
23 Mar 2023
GlueGen: Plug and Play Multi-modal Encoders for X-to-image Generation
Can Qin
Ning Yu
Chen Xing
Shu Zhen Zhang
Zeyuan Chen
Stefano Ermon
Yun Fu
Caiming Xiong
Ran Xu
DiffM
30
19
0
17 Mar 2023
Unsupervised Traffic Scene Generation with Synthetic 3D Scene Graphs
Artem Savkin
Rachid Ellouze
Nassir Navab
F. Tombari
11
10
0
15 Mar 2023
Graph Transformer GANs for Graph-Constrained House Generation
H. Tang
Zhenyu Zhang
Humphrey Shi
Bo-wen Li
Lin Shao
N. Sebe
Radu Timofte
Luc Van Gool
34
19
0
14 Mar 2023
A Comprehensive Survey of AI-Generated Content (AIGC): A History of Generative AI from GAN to ChatGPT
Yihan Cao
Siyu Li
Yixin Liu
Zhiling Yan
Yutong Dai
Philip S. Yu
Lichao Sun
24
501
0
07 Mar 2023
Testing the Channels of Convolutional Neural Networks
Kang Choi
Donghyun Son
Younghoon Kim
Jiwon Seo
15
1
0
06 Mar 2023