2104.00567
Cited By
Text to Image Generation with Semantic-Spatial Aware GAN
Computer Vision and Pattern Recognition (CVPR), 2021
1 April 2021
Kaiqin Hu
Wentong Liao
M. Yang
Bodo Rosenhahn
Papers citing "Text to Image Generation with Semantic-Spatial Aware GAN"
45 / 45 papers shown
Coffee: Controllable Diffusion Fine-tuning
Ziyao Zeng
Jingcheng Ni
Ruyi Liu
Alex Wong
DiffM
231
1
0
18 Nov 2025
Reliable Cross-modal Alignment via Prototype Iterative Construction
Xiang Ma
Litian Xu
Lexin Fang
Caiming Zhang
Lizhen Cui
129
2
0
13 Oct 2025
An LLM-LVLM Driven Agent for Iterative and Fine-Grained Image Editing
Zihan Liang
Jiahao Sun
Haoran Ma
DiffM
156
1
0
24 Aug 2025
T2UE: Generating Unlearnable Examples from Text Descriptions
Xingjun Ma
Hanxun Huang
Tianwei Song
Ye Sun
Yifeng Gao
Yu-Gang Jiang
191
1
0
05 Aug 2025
UniEmo: Unifying Emotional Understanding and Generation with Learnable Expert Queries
Yijie Zhu
Lingsen Zhang
Zitong Yu
Rui Shao
Tao Tan
Liqiang Nie
255
5
0
31 Jul 2025
α-GAN by Rényi Cross Entropy
Ni Ding
Miao Qiao
Jiaxing Xu
Yiping Ke
Xiaoyu Zhang
GAN
288
0
0
20 May 2025
Hadamard product in deep learning: Introduction, Advances and Challenges
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2025
Grigorios G. Chrysos
Yongtao Wu
Razvan Pascanu
Philip Torr
Volkan Cevher
AAML
393
21
0
17 Apr 2025
PartStickers: Generating Parts of Objects for Rapid Prototyping
Mo Zhou
Josh Myers-Dean
Danna Gurari
289
0
0
07 Apr 2025
End-to-end Training for Text-to-Image Synthesis using Dual-Text Embeddings
Yeruru Asrar Ahmed
Anurag Mittal
DiffM
339
0
0
03 Feb 2025
A Machine Learning Framework for Handling Unreliable Absence Label and Class Imbalance for Marine Stinger Beaching Prediction
Amuche Ibenegbu
Amandine Schaeffer
Pierre Lafaye de Micheaux
Rohitash Chandra
207
1
0
20 Jan 2025
Facial Expression Analysis and Its Potentials in IoT Systems: A Contemporary Survey
ACM Computing Surveys (ACM CSUR), 2024
Zixuan Shangguan
Yanjie Dong
Song Guo
Victor C. M. Leung
M. Jamal Deen
Yan Wang
538
15
0
23 Dec 2024
LaMI-GO: Latent Mixture Integration for Goal-Oriented Communications Achieving High Spectrum Efficiency
Achintha Wijesinghe
Suchinthaka Wanninayaka
Weiwei Wang
Yu-Chieh Chao
Songyang Zhang
Zhi Ding
346
1
0
18 Dec 2024
Sketch-Guided Stylized Landscape Cinemagraph Synthesis
H. Jin
Hengyuan Chang
Xiaoxuan Xie
Zhengyang Wang
Xusheng Du
Shaojun Hu
H. Xie
DiffM
VGen
328
1
0
01 Dec 2024
Offline Evaluation of Set-Based Text-to-Image Generation
Negar Arabzadeh
Fernando Diaz
Junfeng He
EGVM
280
1
0
22 Oct 2024
Contrasting Deepfakes Diffusion via Contrastive Learning and Global-Local Similarities
European Conference on Computer Vision (ECCV), 2024
Lorenzo Baraldi
Federico Cocchi
Marcella Cornia
Lorenzo Baraldi
Alessandro Nicolosi
Rita Cucchiara
289
33
0
29 Jul 2024
Guardians of the Quantum GAN
Archisman Ghosh
Debarshi Kundu
Avimita Chatterjee
Swaroop Ghosh
471
4
0
24 Apr 2024
Text-IF: Leveraging Semantic Text Guidance for Degradation-Aware and Interactive Image Fusion
Xunpeng Yi
Han Xu
Hao Zhang
Linfeng Tang
Jiayi Ma
320
142
0
25 Mar 2024
ARtVista: Gateway To Empower Anyone Into Artist
Trong-Vu Hoang
Quang-Binh Nguyen
Duy-Nam Ly
Khanh-Duy Le
Tam V. Nguyen
Minh-Triet Tran
Trung-Truc Huynh-Le
231
5
0
13 Mar 2024
Social Reward: Evaluating and Enhancing Generative AI through Million-User Feedback from an Online Creative Community
Arman Isajanyan
Artur Shatveryan
David Kocharyan
Zinan Lin
Humphrey Shi
EGVM
260
9
0
15 Feb 2024
EmoGen: Emotional Image Content Generation with Text-to-Image Diffusion Models
Computer Vision and Pattern Recognition (CVPR), 2024
Jingyuan Yang
Jiawei Feng
Hui Huang
VLM
310
36
0
09 Jan 2024
The Right Losses for the Right Gains: Improving the Semantic Consistency of Deep Text-to-Image Generation with Distribution-Sensitive Losses
Mahmoud Ahmed
Omer Moussa
Ismail Shaheen
Mohamed S. Abdelfattah
Amr Abdalla
Marwan Eid
Hesham M. Eraqi
Mohamed Moustafa
323
1
0
18 Dec 2023
DiffusionAtlas: High-Fidelity Consistent Diffusion Video Editing
Shao-Yu Chang
Hwann-Tzong Chen
Tyng-Luh Liu
DiffM
VGen
322
4
0
05 Dec 2023
Perceptual Image Compression with Cooperative Cross-Modal Side Information
Shiyu Qin
Bin Chen
Yujun Huang
Baoyi An
Tao Dai
Shu-Tao Xia
258
4
0
23 Nov 2023
DIFFNAT: Improving Diffusion Image Quality Using Natural Image Statistics
Aniket Roy
Maitreya Suin
Anshul B. Shah
Ketul Shah
Jiang-Long Liu
Rama Chellappa
DiffM
228
6
0
16 Nov 2023
Multimodal Machine Learning in Image-Based and Clinical Biomedicine: Survey and Prospects
International Journal of Computer Vision (IJCV), 2023
Elisa Warner
Joonsan Lee
William Hsu
Tanveer Syeda-Mahmood
Charles Kahn
Olivier Gevaert
Arvind Rao
LM&MA
725
56
0
04 Nov 2023
Understanding Generative AI in Art: An Interview Study with Artists on G-AI from an HCI Perspective
Jingyu Shi
Rahul Jain
Runlin Duan
Karthik Ramani
287
15
0
19 Oct 2023
STELLA: Continual Audio-Video Pre-training with Spatio-Temporal Localized Alignment
International Conference on Machine Learning (ICML), 2023
Jaewoo Lee
Jaehong Yoon
Wonjae Kim
Yunji Kim
Sung Ju Hwang
CLL
375
2
0
12 Oct 2023
TP2O: Creative Text Pair-to-Object Generation using Balance Swap-Sampling
European Conference on Computer Vision (ECCV), 2023
Jun Li
Zedong Zhang
Zhiqiang Wang
DiffM
292
17
0
03 Oct 2023
RenAIssance: A Survey into AI Text-to-Image Generation in the Era of Large Model
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Fengxiang Bie
Jianlong Wu
Zhongzhu Zhou
Adam Ghanem
Minjia Zhang
...
Pareesa Ameneh Golnari
David A. Clifton
Yuxiong He
Dacheng Tao
Shuaiwen Leon Song
EGVM
293
67
0
02 Sep 2023
Vision + Language Applications: A Survey
Yutong Zhou
N. Shimada
VLM
339
16
0
24 May 2023
Not All Image Regions Matter: Masked Vector Quantization for Autoregressive Image Generation
Computer Vision and Pattern Recognition (CVPR), 2023
Mengqi Huang
Zhendong Mao
Quang Wang
Yongdong Zhang
VGen
DiffM
282
32
0
23 May 2023
TextDiffuser: Diffusion Models as Text Painters
Neural Information Processing Systems (NeurIPS), 2023
Jingye Chen
Yupan Huang
Tengchao Lv
Lei Cui
Qifeng Chen
Furu Wei
693
214
0
18 May 2023
Not Only Generative Art: Stable Diffusion for Content-Style Disentanglement in Art Analysis
International Conference on Multimedia Retrieval (ICMR), 2023
Yankun Wu
Yuta Nakashima
Noa Garcia
CoGe
DiffM
321
34
0
20 Apr 2023
A review of ensemble learning and data augmentation models for class imbalanced problems: combination, implementation and evaluation
Expert Systems with Applications (ESWA), 2023
A. Khan
Omkar Chaudhari
Rohitash Chandra
681
430
0
06 Apr 2023
Taming Encoder for Zero Fine-tuning Image Customization with Text-to-Image Diffusion Models
Xuhui Jia
Yang Zhao
Kelvin C. K. Chan
Yandong Li
Han-Ying Zhang
Boqing Gong
Tingbo Hou
Jian Shu
Yu-Chuan Su
DiffM
329
130
0
05 Apr 2023
Toward Verifiable and Reproducible Human Evaluation for Text-to-Image Generation
Computer Vision and Pattern Recognition (CVPR), 2023
Mayu Otani
Riku Togashi
Yu Sawai
Ryosuke Ishigami
Yuta Nakashima
Esa Rahtu
J. Heikkilä
Shin'ichi Satoh
249
85
0
04 Apr 2023
Indonesian Text-to-Image Synthesis with Sentence-BERT and FastGAN
Made Raharja Surya Mahadi
N. P. Utama
313
4
0
25 Mar 2023
Paint it Black: Generating paintings from text descriptions
Mahnoor Shahid
Mark Koch
Niklas Schneider
301
3
0
17 Feb 2023
Shape-aware Text-driven Layered Video Editing
Computer Vision and Pattern Recognition (CVPR), 2023
Yao-Chih Lee
Ji-Ze Jang
Yi-Ting Chen
Elizabeth Qiu
Jia-Bin Huang
VGen
DiffM
389
61
0
30 Jan 2023
Attribute-Centric Compositional Text-to-Image Generation
International Journal of Computer Vision (IJCV), 2023
Yuren Cong
Martin Renqiang Min
Erran L. Li
Bodo Rosenhahn
M. Yang
348
19
0
04 Jan 2023
SceneComposer: Any-Level Semantic Image Synthesis
Computer Vision and Pattern Recognition (CVPR), 2022
Yu Zeng
Zhe Lin
Jianming Zhang
Qing Liu
John Collomosse
Jason Kuen
Vishal M. Patel
DiffM
196
55
0
21 Nov 2022
HumanDiffusion: a Coarse-to-Fine Alignment Diffusion Framework for Controllable Text-Driven Person Image Generation
Kai Zhang
Muyi Sun
Jianxin Sun
Binghao Zhao
Kunbo Zhang
Zhenan Sun
Tieniu Tan
DiffM
232
15
0
11 Nov 2022
Frido: Feature Pyramid Diffusion for Complex Scene Image Synthesis
AAAI Conference on Artificial Intelligence (AAAI), 2022
Wanshu Fan
Yen-Chun Chen
Dongdong Chen
Yu Cheng
Lu Yuan
Yu-Chiang Frank Wang
DiffM
359
118
0
29 Aug 2022
T-Person-GAN: Text-to-Person Image Generation with Identity-Consistency and Manifold Mix-Up
Expert Systems with Applications (ESWA), 2022
Deyin Liu
Yang Wang
Q. Tian
Zongyuan Ge
DiffM
331
9
0
18 Aug 2022
Recurrent Affine Transformation for Text-to-image Synthesis
IEEE Transactions on Multimedia (IEEE TMM), 2022
Senmao Ye
Fei Liu
Mingkui Tan
236
33
0
22 Apr 2022