Semantic Object Accuracy for Generative Text-to-Image Synthesis


29 October 2019
Tobias Hinz
Stefan Heinrich
S. Wermter
    EGVM

Papers citing "Semantic Object Accuracy for Generative Text-to-Image Synthesis"

50 / 89 papers shown
Training-free Dense-Aligned Diffusion Guidance for Modular Conditional Image Synthesis
Zixuan Wang
Duo Peng
Feng Chen
Y. Yang
Yinjie Lei
DiffM
74
0
0
02 Apr 2025
RealCustom++: Representing Images as Real-Word for Real-Time Customization
Zhendong Mao
Mengqi Huang
Fei Ding
Mingcong Liu
Qian He
Xiaojun Chang
DiffM
70
6
0
03 Jan 2025
Visual Verity in AI-Generated Imagery: Computational Metrics and Human-Centric Analysis
Memoona Aziz
Umair Rehman
Syed Ali Safi
Amir Zaib Abbasi
EGVM
19
2
0
22 Aug 2024
Analyzing Quality, Bias, and Performance in Text-to-Image Generative Models
Nila Masrourisaadat
Nazanin Sedaghatkish
Fatemeh Sarshartehrani
Edward A. Fox
37
6
0
28 Jun 2024
Consistency-diversity-realism Pareto fronts of conditional image generative models
Pietro Astolfi
Marlene Careil
Melissa Hall
Oscar Manas
Matthew Muckley
Jakob Verbeek
Adriana Romero Soriano
M. Drozdzal
45
10
0
14 Jun 2024
Open-Vocabulary Scene Text Recognition via Pseudo-Image Labeling and Margin Loss
Xuhua Ren
Hengcan Shi
Jin Li
VLM
33
0
0
12 Mar 2024
When ControlNet Meets Inexplicit Masks: A Case Study of ControlNet on its Contour-following Ability
Wenjie Xuan
Yufei Xu
Shanshan Zhao
Chaoyue Wang
Juhua Liu
Bo Du
Dacheng Tao
26
2
0
01 Mar 2024
TokenCompose: Text-to-Image Diffusion with Token-level Supervision
Zirui Wang
Zhizhou Sha
Zheng Ding
Yilin Wang
Zhuowen Tu
DiffM
27
21
0
06 Dec 2023
Mismatch Quest: Visual and Textual Feedback for Image-Text Misalignment
Brian Gordon
Yonatan Bitton
Yonatan Shafir
Roopal Garg
Xi Chen
Dani Lischinski
Daniel Cohen-Or
Idan Szpektor
42
11
0
05 Dec 2023
Davidsonian Scene Graph: Improving Reliability in Fine-grained Evaluation for Text-to-Image Generation
Jaemin Cho
Yushi Hu
Roopal Garg
Peter Anderson
Ranjay Krishna
Jason Baldridge
Mohit Bansal
Jordi Pont-Tuset
Su Wang
EGVM
27
66
0
27 Oct 2023
A Picture is Worth a Thousand Words: Principled Recaptioning Improves Image Generation
Eyal Segalis
Dani Valevski
Danny Lumen
Yossi Matias
Yaniv Leviathan
DiffM
42
22
0
25 Oct 2023
DiagrammerGPT: Generating Open-Domain, Open-Platform Diagrams via LLM Planning
Abhaysinh Zala
Han Lin
Jaemin Cho
Mohit Bansal
35
12
0
18 Oct 2023
GenEval: An Object-Focused Framework for Evaluating Text-to-Image Alignment
Dhruba Ghosh
Hanna Hajishirzi
Ludwig Schmidt
9
134
0
17 Oct 2023
Hypernymy Understanding Evaluation of Text-to-Image Models via WordNet Hierarchy
Anton Baryshnikov
Max Ryabinin
VLM
16
2
0
13 Oct 2023
AI-Generated Images as Data Source: The Dawn of Synthetic Era
Zuhao Yang
Fangneng Zhan
Kunhao Liu
Muyu Xu
Shijian Lu
EGVM
25
18
0
03 Oct 2023
Structural Adversarial Objectives for Self-Supervised Representation Learning
Xiao Zhang
Michael Maire
11
0
0
30 Sep 2023
T2IW: Joint Text to Image & Watermark Generation
Anan Liu
Guokai Zhang
Yuting Su
Ning Xu
Yongdong Zhang
Lanjun Wang
34
4
0
07 Sep 2023
Dense Text-to-Image Generation with Attention Modulation
Yunji Kim
Jiyoung Lee
Jin-Hwa Kim
Jung-Woo Ha
Jun-Yan Zhu
DiffM
36
134
0
24 Aug 2023
Likelihood-Based Text-to-Image Evaluation with Patch-Level Perceptual and Semantic Credit Assignment
Qi Chen
Chaorui Deng
Zixiong Huang
Bowen Zhang
Mingkui Tan
Qi Wu
EGVM
19
0
0
16 Aug 2023
Learning to Generate Semantic Layouts for Higher Text-Image Correspondence in Text-to-Image Synthesis
Minho Park
Jooyeol Yun
Seunghwan Choi
Jaegul Choo
DiffM
17
11
0
16 Aug 2023
Food-500 Cap: A Fine-Grained Food Caption Benchmark for Evaluating Vision-Language Models
Zheng Ma
Mianzhi Pan
Wenhan Wu
Ka Leong Cheng
Jianbing Zhang
Shujian Huang
Jiajun Chen
VLM
CoGe
18
3
0
06 Aug 2023
TIAM -- A Metric for Evaluating Alignment in Text-to-Image Generation
P. Grimal
Hervé Le Borgne
Olivier Ferret
Julien Tourille
EGVM
40
10
0
11 Jul 2023
DomainStudio: Fine-Tuning Diffusion Models for Domain-Driven Image Generation using Limited Data
Jin Zhu
Huimin Ma
Jiansheng Chen
Jian Yuan
DiffM
24
10
0
25 Jun 2023
The Big Data Myth: Using Diffusion Models for Dataset Generation to Train Deep Detection Models
Roy Voetman
Maya Aghaei
K. Dijkstra
DiffM
19
11
0
16 Jun 2023
Grounded Text-to-Image Synthesis with Attention Refocusing
Quynh Phung
Songwei Ge
Jia-Bin Huang
DiffM
23
104
0
08 Jun 2023
Are Diffusion Models Vision-And-Language Reasoners?
Benno Krojer
Elinor Poole-Dayan
Vikram S. Voleti
Christopher Pal
Siva Reddy
34
12
0
25 May 2023
Visual Programming for Text-to-Image Generation and Evaluation
Jaemin Cho
Abhaysinh Zala
Mohit Bansal
MLLM
21
50
0
24 May 2023
Vision + Language Applications: A Survey
Yutong Zhou
N. Shimada
VLM
26
5
0
24 May 2023
A Parameter-free Adaptive Resonance Theory-based Topological Clustering Algorithm Capable of Continual Learning
Naoki Masuyama
Takanori Takebayashi
Yusuke Nojima
C. K. Loo
H. Ishibuchi
S. Wermter
16
5
0
01 May 2023
SceneGenie: Scene Graph Guided Diffusion Models for Image Synthesis
Azade Farshad
Yousef Yeganeh
Yucong Chi
Cheng-nan Shen
Bjorn Ommer
Nassir Navab
DiffM
41
28
0
28 Apr 2023
Diagnostic Benchmark and Iterative Inpainting for Layout-Guided Image Generation
Jaemin Cho
Linjie Li
Zhengyuan Yang
Zhe Gan
Lijuan Wang
Mohit Bansal
EGVM
11
5
0
13 Apr 2023
ALR-GAN: Adaptive Layout Refinement for Text-to-Image Synthesis
Hongchen Tan
Baocai Yin
Kun Wei
Xiuping Liu
Xin Li
13
16
0
13 Apr 2023
Gradient-Free Textual Inversion
Zhengcong Fei
Mingyuan Fan
Junshi Huang
DiffM
28
31
0
12 Apr 2023
Text-Conditioned Sampling Framework for Text-to-Image Generation with Masked Generative Models
Jaewoong Lee
Sang-Sub Jang
Jaehyeong Jo
Jaehong Yoon
Yunji Kim
Jin-Hwa Kim
Jung-Woo Ha
Sung Ju Hwang
DiffM
24
4
0
04 Apr 2023
Discriminative Class Tokens for Text-to-Image Diffusion Models
Idan Schwartz
Vésteinn Snaebjarnarson
Hila Chefer
Ryan Cotterell
Serge J. Belongie
Lior Wolf
Sagie Benaim
19
9
0
30 Mar 2023
MagicFusion: Boosting Text-to-Image Generation Performance by Fusing Diffusion Models
Jing Zhao
Heliang Zheng
Chaoyue Wang
L. Lan
Wenjing Yang
VLM
38
17
0
23 Mar 2023
TIFA: Accurate and Interpretable Text-to-Image Faithfulness Evaluation with Question Answering
Yushi Hu
Benlin Liu
Jungo Kasai
Yizhong Wang
Mari Ostendorf
Ranjay Krishna
Noah A. Smith
EGVM
27
206
0
21 Mar 2023
Highly Personalized Text Embedding for Image Manipulation by Stable Diffusion
Inhwa Han
Serin Yang
Taesung Kwon
Jong Chul Ye
DiffM
21
35
0
15 Mar 2023
Spatial-temporal Transformer-guided Diffusion based Data Augmentation for Efficient Skeleton-based Action Recognition
Yifan Jiang
Han Chen
Hanseok Ko
DiffM
32
3
0
26 Feb 2023
Paint it Black: Generating paintings from text descriptions
Mahnoor Shahid
Mark Koch
Niklas Schneider
12
1
0
17 Feb 2023
Benchmarking Spatial Relationships in Text-to-Image Generation
Tejas Gokhale
Hamid Palangi
Besmira Nushi
Vibhav Vineet
Eric Horvitz
Ece Kamar
Chitta Baral
Yezhou Yang
EGVM
34
66
0
20 Dec 2022
SpaText: Spatio-Textual Representation for Controllable Image Generation
Omri Avrahami
Thomas Hayes
Oran Gafni
Sonal Gupta
Yaniv Taigman
Devi Parikh
Dani Lischinski
Ohad Fried
Xiaoyue Yin
DiffM
32
203
0
25 Nov 2022
Learning to Model Multimodal Semantic Alignment for Story Visualization
Bowen Li
Thomas Lukasiewicz
DiffM
23
2
0
14 Nov 2022
SSD: Towards Better Text-Image Consistency Metric in Text-to-Image Generation
Zhaorui Tan
Xi Yang
Zihan Ye
Qiufeng Wang
Yuyao Yan
Anh Nguyen
Kaizhu Huang
EGVM
11
3
0
27 Oct 2022
Lafite2: Few-shot Text-to-Image Generation
Yufan Zhou
Chunyuan Li
Changyou Chen
Jianfeng Gao
Jinhui Xu
DiffM
19
11
0
25 Oct 2022
Adma-GAN: Attribute-Driven Memory Augmented GANs for Text-to-Image Generation
Xintian Wu
Hanbin Zhao
Liangli Zheng
Shouhong Ding
Xi Li
29
13
0
28 Sep 2022
StoryDALL-E: Adapting Pretrained Text-to-Image Transformers for Story Continuation
A. Maharana
Darryl Hannan
Mohit Bansal
DiffM
11
77
0
13 Sep 2022
DSE-GAN: Dynamic Semantic Evolution Generative Adversarial Network for Text-to-Image Generation
Mengqi Huang
Zhendong Mao
Penghui Wang
Quang Wang
Yongdong Zhang
23
20
0
03 Sep 2022
Frido: Feature Pyramid Diffusion for Complex Scene Image Synthesis
Wanshu Fan
Yen-Chun Chen
Dongdong Chen
Yu Cheng
Lu Yuan
Yu-Chiang Frank Wang
DiffM
18
90
0
29 Aug 2022
DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation
Nataniel Ruiz
Yuanzhen Li
Varun Jampani
Yael Pritch
Michael Rubinstein
Kfir Aberman
24
2,701
0
25 Aug 2022