ResearchTrend.AI
AttnGAN: Fine-Grained Text to Image Generation with Attentional Generative Adversarial Networks

28 November 2017
Tao Xu
Pengchuan Zhang
Qiuyuan Huang
Han Zhang
Zhe Gan
Xiaolei Huang
Xiaodong He
    GAN
    ViT

Papers citing "AttnGAN: Fine-Grained Text to Image Generation with Attentional Generative Adversarial Networks"

50 / 210 papers shown
PRISM: A Unified Framework for Photorealistic Reconstruction and Intrinsic Scene Modeling
Alara Dirik
Tuanfeng Y. Wang
Duygu Ceylan
Stefanos Zafeiriou
Anna Frühstück
DiffM
40
0
0
19 Apr 2025
Generative Data Imputation for Sparse Learner Performance Data Using Generative Adversarial Imputation Networks
Liang Zhang
Jionghao Lin
John Sabatini
Diego Zapata-Rivera
Carol Forsyth
Yang Jiang
John Hollander
Xiangen Hu
Arthur C. Graesser
44
0
0
23 Mar 2025
PanoGen++: Domain-Adapted Text-Guided Panoramic Environment Generation for Vision-and-Language Navigation
Sen Wang
Dongliang Zhou
Liang Xie
Chao Xu
Ye Yan
Erwei Yin
DiffM
70
2
0
13 Mar 2025
Fine-Grained Alignment and Noise Refinement for Compositional Text-to-Image Generation
Amir Mohammad Izadi
Seyed Mohsen Hosseini
Soroush Vafaie Tabar
Ali Abdollahi
Armin Saghafian
M. Baghshah
EGVM
40
0
0
09 Mar 2025
A Review on Generative AI For Text-To-Image and Image-To-Image Generation and Implications To Scientific Images
Zineb Sordo
Eric Chagnon
Daniela Ushizima
EGVM
MedIm
61
1
0
28 Feb 2025
SOEDiff: Efficient Distillation for Small Object Editing
Yiming Wu
Qihe Pan
Zhen Zhao
Zicheng Wang
Sifan Long
Ronghua Liang
DiffM
60
0
0
03 Jan 2025
TexAVi: Generating Stereoscopic VR Video Clips from Text Descriptions
Vriksha Srihari
R. Bhavya
Shruti Jayaraman
V. Mary Anita Rajam
DiffM
VGen
28
0
0
02 Jan 2025
CLIP-SR: Collaborative Linguistic and Image Processing for Super-Resolution
Bingwen Hu
Heng Liu
Zhedong Zheng
Ping Liu
SupR
81
0
0
16 Dec 2024
SVGBuilder: Component-Based Colored SVG Generation with Text-Guided Autoregressive Transformers
Zehao Chen
Rong Pan
87
1
0
13 Dec 2024
Any-Resolution AI-Generated Image Detection by Spectral Learning
Dimitrios Karageorgiou
Symeon Papadopoulos
I. Kompatsiaris
Efstratios Gavves
101
0
0
28 Nov 2024
Self-Cross Diffusion Guidance for Text-to-Image Synthesis of Similar Subjects
Weimin Qiu
Jieke Wang
Meng Tang
DiffM
79
0
0
28 Nov 2024
FactorizePhys: Matrix Factorization for Multidimensional Attention in Remote Physiological Sensing
Jitesh Joshi
Sos S. Agaian
Youngjun Cho
AI4TS
39
1
0
03 Nov 2024
An Online Learning Approach to Prompt-based Selection of Generative Models
Xiaoyan Hu
Ho-fung Leung
Farzan Farnia
33
2
0
17 Oct 2024
Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think
Sihyun Yu
Sangkyung Kwak
Huiwon Jang
Jongheon Jeong
Jonathan Huang
Jinwoo Shin
Saining Xie
OCL
68
62
0
09 Oct 2024
Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Prediction
Jing He
Haodong Li
Wei Yin
Yixun Liang
Leheng Li
Kaiqiang Zhou
Hongbo Zhang
Bingbing Liu
Ying-Cong Chen
DiffM
VLM
44
40
0
26 Sep 2024
Face Mask Removal with Region-attentive Face Inpainting
Minmin Yang
CVBM
44
0
0
10 Sep 2024
Perception-guided Jailbreak against Text-to-Image Models
Yihao Huang
Le Liang
Tianlin Li
Xiaojun Jia
Run Wang
Weikai Miao
G. Pu
Yang Liu
39
7
0
20 Aug 2024
Surgical Text-to-Image Generation
C. Nwoye
Rupak Bose
K. Elgohary
Lorenzo Arboit
Giorgio Carlino
Joël L. Lavanchy
Pietro Mascagni
N. Padoy
MedIm
55
3
0
12 Jul 2024
MIGC++: Advanced Multi-Instance Generation Controller for Image Synthesis
Dewei Zhou
Y. Li
Fan Ma
Zongxin Yang
Y. Yang
91
11
0
02 Jul 2024
Adaptive Image Quality Assessment via Teaching Large Multimodal Model to Compare
Hanwei Zhu
Haoning Wu
Yixuan Li
Zicheng Zhang
Baoliang Chen
Lingyu Zhu
Yuming Fang
Guangtao Zhai
Weisi Lin
Shiqi Wang
38
18
0
29 May 2024
Ensembling Diffusion Models via Adaptive Feature Aggregation
Cong Wang
Kuan Tian
Yonghang Guan
Jun Zhang
Zhiwei Jiang
Fei Shen
Xiao Han
34
5
0
27 May 2024
Training-free Subject-Enhanced Attention Guidance for Compositional Text-to-image Generation
Shengyuan Liu
Bo Wang
Ye Ma
Te Yang
Xipeng Cao
Quan Chen
Han Li
Di Dong
Peng Jiang
EGVM
36
2
0
11 May 2024
Paint by Inpaint: Learning to Add Image Objects by Removing Them First
Navve Wasserman
Noam Rotstein
Roy Ganz
Ron Kimmel
DiffM
34
15
0
28 Apr 2024
MultiBooth: Towards Generating All Your Concepts in an Image from Text
Chenyang Zhu
Kai Li
Yue Ma
Chunming He
Li Xiu
DiffM
104
22
0
22 Apr 2024
Iteratively Prompting Multimodal LLMs to Reproduce Natural and AI-Generated Images
Ali Naseh
Katherine Thai
Mohit Iyyer
Amir Houmansadr
33
5
0
21 Apr 2024
AvatarMMC: 3D Head Avatar Generation and Editing with Multi-Modal Conditioning
W. Para
Abdelrahman Eldesokey
Zhenyu Li
Pradyumna Reddy
Jiankang Deng
Peter Wonka
DiffM
30
0
0
08 Feb 2024
Spatial-Aware Latent Initialization for Controllable Image Generation
Wenqiang Sun
Tengtao Li
Zehong Lin
Jun Zhang
31
10
0
29 Jan 2024
Inflation with Diffusion: Efficient Temporal Adaptation for Text-to-Video Super-Resolution
Xin Yuan
Jinoo Baek
Keyang Xu
Omer Tov
Hongliang Fei
VGen
32
3
0
18 Jan 2024
Tuning-Free Inversion-Enhanced Control for Consistent Image Editing
Xiaoyue Duan
Shuhao Cui
Guoliang Kang
Baochang Zhang
Zhengcong Fei
Mingyuan Fan
Junshi Huang
DiffM
31
8
0
22 Dec 2023
A Survey of Generative AI for Intelligent Transportation Systems
Huan Yan
Yong Li
21
8
0
13 Dec 2023
4D-fy: Text-to-4D Generation Using Hybrid Score Distillation Sampling
Sherwin Bahmani
Ivan Skorokhodov
Victor Rong
Gordon Wetzstein
Leonidas J. Guibas
Peter Wonka
Sergey Tulyakov
Jeong Joon Park
Andrea Tagliasacchi
David B. Lindell
DiffM
41
103
0
29 Nov 2023
Material Palette: Extraction of Materials from a Single Image
Ivan Lopes
Fabio Pizzati
Raoul de Charette
DiffM
21
12
0
28 Nov 2023
Ranni: Taming Text-to-Image Diffusion for Accurate Instruction Following
Yutong Feng
Biao Gong
Di Chen
Yujun Shen
Yu Liu
Jingren Zhou
DiffM
26
43
0
28 Nov 2023
Text-Driven Image Editing via Learnable Regions
Yuanze Lin
Yi-Wen Chen
Yi-Hsuan Tsai
Lu Jiang
Ming-Hsuan Yang
DiffM
21
16
0
28 Nov 2023
Tell2Design: A Dataset for Language-Guided Floor Plan Generation
Sicong Leng
Yangqiaoyu Zhou
Mohammed Haroon Dupty
W. Lee
Sam Joyce
Wei Lu
3DV
27
10
0
27 Nov 2023
Soulstyler: Using Large Language Model to Guide Image Style Transfer for Target Object
Junhao Chen
Peng Rong
Jingbo Sun
Chao Li
Xiang Li
Hongwu Lv
VLM
21
2
0
22 Nov 2023
The Challenges of Image Generation Models in Generating Multi-Component Images
Tham Yik Foong
Shashank Kotyan
Poyuan Mao
Danilo Vasconcellos Vargas
EGVM
39
1
0
22 Nov 2023
Steal My Artworks for Fine-tuning? A Watermarking Framework for Detecting Art Theft Mimicry in Text-to-Image Models
Ge Luo
Junqiang Huang
Manman Zhang
Zhenxing Qian
Sheng Li
Xinpeng Zhang
WIGM
17
9
0
22 Nov 2023
A Chronological Survey of Theoretical Advancements in Generative Adversarial Networks for Computer Vision
Hrishikesh Sharma
AI4CE
EGVM
13
1
0
02 Nov 2023
Improving Compositional Text-to-image Generation with Large Vision-Language Models
Song Wen
Guian Fang
Renrui Zhang
Peng Gao
Hao Dong
Dimitris N. Metaxas
21
17
0
10 Oct 2023
KV Inversion: KV Embeddings Learning for Text-Conditioned Real Image Action Editing
Jiarui Yao
Yifan Liu
Simon S. Du
Shifeng Chen
DiffM
16
24
0
28 Sep 2023
Text-to-Image Generation for Abstract Concepts
Jiayi Liao
Xu Chen
Qiang Fu
Lun Du
Xiangnan He
Xiang Wang
Shi Han
Dongmei Zhang
32
14
0
26 Sep 2023
Reuse and Diffuse: Iterative Denoising for Text-to-Video Generation
Jiaxi Gu
Shicong Wang
Haoyu Zhao
Tianyi Lu
Xing Zhang
Zuxuan Wu
Songcen Xu
Wei Zhang
Yu-Gang Jiang
Hang Xu
DiffM
VGen
34
43
0
07 Sep 2023
Towards High-Fidelity Text-Guided 3D Face Generation and Manipulation Using only Images
Cuican Yu
Guansong Lu
Yihan Zeng
Jian-jun Sun
Xiaodan Liang
Huibin Li
Zongben Xu
Songcen Xu
Wei Zhang
Hang Xu
33
14
0
31 Aug 2023
CoDeF: Content Deformation Fields for Temporally Consistent Video Processing
Ouyang Hao
Qiuyu Wang
Yuxi Xiao
Qingyan Bai
Juntao Zhang
Kecheng Zheng
Xiaowei Zhou
Qifeng Chen
Yujun Shen
DiffM
VGen
41
81
0
15 Aug 2023
Interleaving GANs with knowledge graphs to support design creativity for book covers
Alexandru Motogna
Adrian Groza
GAN
6
0
0
03 Aug 2023
Towards General Visual-Linguistic Face Forgery Detection
Ke Sun
Shen Chen
Taiping Yao
Haozhe Yang
Xiaoshuai Sun
Shouhong Ding
R. Ji
24
12
0
31 Jul 2023
UniBriVL: Robust Universal Representation and Generation of Audio Driven Diffusion Models
Sen Fang
Bowen Gao
Yangjian Wu
T. Teoh
DiffM
18
1
0
29 Jul 2023
Spatial-Frequency U-Net for Denoising Diffusion Probabilistic Models
Xin Yuan
Linjie Li
Jianfeng Wang
Zhengyuan Yang
Kevin Qinghong Lin
Zicheng Liu
Lijuan Wang
DiffM
51
6
0
27 Jul 2023
FaceCLIPNeRF: Text-driven 3D Face Manipulation using Deformable Neural Radiance Fields
S. Hwang
J. Hyung
Daejin Kim
Minjeong Kim
Jaegul Choo
3DH
CLIP
CVBM
46
11
0
21 Jul 2023