Text to Image Generation with Semantic-Spatial Aware GAN

Computer Vision and Pattern Recognition (CVPR), 2021

1 April 2021

Papers citing "Text to Image Generation with Semantic-Spatial Aware GAN"

45 / 45 papers shown

Coffee: Controllable Diffusion Fine-tuning

231

18 Nov 2025

Reliable Cross-modal Alignment via Prototype Iterative Construction

129

13 Oct 2025

An LLM-LVLM Driven Agent for Iterative and Fine-Grained Image Editing

156

24 Aug 2025

T2UE: Generating Unlearnable Examples from Text Descriptions

191

05 Aug 2025

UniEmo: Unifying Emotional Understanding and Generation with Learnable Expert Queries

255

31 Jul 2025

α-GAN by Rényi Cross Entropy

288

20 May 2025

Hadamard product in deep learning: Introduction, Advances and Challenges

IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2025

393

17 Apr 2025

PartStickers: Generating Parts of Objects for Rapid Prototyping

Mo Zhou

Josh Myers-Dean

Danna Gurari

289

07 Apr 2025

End-to-end Training for Text-to-Image Synthesis using Dual-Text Embeddings

Yeruru Asrar Ahmed

Anurag Mittal

DiffM

339

03 Feb 2025

A Machine Learning Framework for Handling Unreliable Absence Label and Class Imbalance for Marine Stinger Beaching Prediction

Amuche Ibenegbu

Amandine Schaeffer

Pierre Lafaye de Micheaux

Rohitash Chandra

207

20 Jan 2025

Facial Expression Analysis and Its Potentials in IoT Systems: A Contemporary Survey

ACM Computing Surveys (ACM CSUR), 2024

538

23 Dec 2024

LaMI-GO: Latent Mixture Integration for Goal-Oriented Communications Achieving High Spectrum Efficiency

Achintha Wijesinghe

Suchinthaka Wanninayaka

346

18 Dec 2024

Sketch-Guided Stylized Landscape Cinemagraph Synthesis

328

01 Dec 2024

Offline Evaluation of Set-Based Text-to-Image Generation

280

22 Oct 2024

Contrasting Deepfakes Diffusion via Contrastive Learning and Global-Local Similarities

European Conference on Computer Vision (ECCV), 2024

Lorenzo Baraldi

Lorenzo Baraldi

289

29 Jul 2024

Guardians of the Quantum GAN

Archisman Ghosh

Debarshi Kundu

Avimita Chatterjee

Swaroop Ghosh

471

24 Apr 2024

Text-IF: Leveraging Semantic Text Guidance for Degradation-Aware and Interactive Image Fusion

320

142

25 Mar 2024

ARtVista: Gateway To Empower Anyone Into Artist

231

13 Mar 2024

Social Reward: Evaluating and Enhancing Generative AI through Million-User Feedback from an Online Creative Community

260

15 Feb 2024

EmoGen: Emotional Image Content Generation with Text-to-Image Diffusion Models

Computer Vision and Pattern Recognition (CVPR), 2024

310

09 Jan 2024

The Right Losses for the Right Gains: Improving the Semantic Consistency of Deep Text-to-Image Generation with Distribution-Sensitive Losses

Mahmoud Ahmed

Omer Moussa

Ismail Shaheen

Mohamed S. Abdelfattah

323

18 Dec 2023

DiffusionAtlas: High-Fidelity Consistent Diffusion Video Editing

322

05 Dec 2023

Perceptual Image Compression with Cooperative Cross-Modal Side Information

258

23 Nov 2023

DIFFNAT: Improving Diffusion Image Quality Using Natural Image Statistics

228

16 Nov 2023

Multimodal Machine Learning in Image-Based and Clinical Biomedicine: Survey and Prospects

International Journal of Computer Vision (IJCV), 2023

Elisa Warner

Joonsan Lee

William Hsu

Tanveer Syeda-Mahmood

725

04 Nov 2023

Understanding Generative AI in Art: An Interview Study with Artists on G-AI from an HCI Perspective

287

19 Oct 2023

STELLA: Continual Audio-Video Pre-training with Spatio-Temporal Localized Alignment

International Conference on Machine Learning (ICML), 2023

Yunji Kim

375

12 Oct 2023

TP2O: Creative Text Pair-to-Object Generation using Balance Swap-Sampling

European Conference on Computer Vision (ECCV), 2023

Jun Li

Zedong Zhang

Zhiqiang Wang

DiffM

292

03 Oct 2023

RenAIssance: A Survey into AI Text-to-Image Generation in the Era of Large Model

IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023

...

Pareesa Ameneh Golnari

Yuxiong He

293

02 Sep 2023

Vision + Language Applications: A Survey

Yutong Zhou

N. Shimada

VLM

339

24 May 2023

Not All Image Regions Matter: Masked Vector Quantization for Autoregressive Image Generation

Computer Vision and Pattern Recognition (CVPR), 2023

Mengqi Huang

282

23 May 2023

TextDiffuser: Diffusion Models as Text Painters

Neural Information Processing Systems (NeurIPS), 2023

693

214

18 May 2023

Not Only Generative Art: Stable Diffusion for Content-Style Disentanglement in Art Analysis

International Conference on Multimedia Retrieval (ICMR), 2023

321

20 Apr 2023

A review of ensemble learning and data augmentation models for class imbalanced problems: combination, implementation and evaluation

Expert Systems with Applications (ESWA), 2023

A. Khan

Omkar Chaudhari

Rohitash Chandra

681

430

06 Apr 2023

Taming Encoder for Zero Fine-tuning Image Customization with Text-to-Image Diffusion Models

329

130

05 Apr 2023

Toward Verifiable and Reproducible Human Evaluation for Text-to-Image Generation

Computer Vision and Pattern Recognition (CVPR), 2023

Esa Rahtu

Shin'ichi Satoh

249

04 Apr 2023

Indonesian Text-to-Image Synthesis with Sentence-BERT and FastGAN

Made Raharja Surya Mahadi

N. P. Utama

313

25 Mar 2023

Paint it Black: Generating paintings from text descriptions

Mahnoor Shahid

Mark Koch

Niklas Schneider

301

17 Feb 2023

Shape-aware Text-driven Layered Video Editing

Computer Vision and Pattern Recognition (CVPR), 2023

389

30 Jan 2023

Attribute-Centric Compositional Text-to-Image Generation

International Journal of Computer Vision (IJCV), 2023

348

04 Jan 2023

SceneComposer: Any-Level Semantic Image Synthesis

Computer Vision and Pattern Recognition (CVPR), 2022

196

21 Nov 2022

HumanDiffusion: a Coarse-to-Fine Alignment Diffusion Framework for Controllable Text-Driven Person Image Generation

232

11 Nov 2022

Frido: Feature Pyramid Diffusion for Complex Scene Image Synthesis

AAAI Conference on Artificial Intelligence (AAAI), 2022

Lu Yuan

359

118

29 Aug 2022

T-Person-GAN: Text-to-Person Image Generation with Identity-Consistency and Manifold Mix-Up

Expert Systems with Applications (ESWA), 2022

Deyin Liu

Yang Wang

Q. Tian

Zongyuan Ge

DiffM

331

18 Aug 2022

Recurrent Affine Transformation for Text-to-image Synthesis

IEEE Transactions on Multimedia (IEEE TMM), 2022

Senmao Ye

Fei Liu

Mingkui Tan

236

22 Apr 2022