Getting it Right: Improving Spatial Consistency in Text-to-Image Models

1 April 2024
Agneet Chatterjee
Gabriela Ben-Melech Stan
Estelle Aflalo
Sayak Paul
Dhruba Ghosh
Tejas Gokhale
Ludwig Schmidt
Hanna Hajishirzi
Vasudev Lal
Chitta Baral
Yezhou Yang
EGVM, VLM
ArXiv (abs) | PDF | HTML | HuggingFace (32 upvotes)

Papers citing "Getting it Right: Improving Spatial Consistency in Text-to-Image Models"

9 / 9 papers shown
CARINOX: Inference-time Scaling with Category-Aware Reward-based Initial Noise Optimization and Exploration
S. Kasaei
Ali Aghayari
Arash Marioriyad
Niki Sepasian
Shayan Baghayi Nejad
MohammadAmin Fazli
M. Baghshah
M. Rohban
DiffM, EGVM
208
0
0
22 Sep 2025
GenTune: Toward Traceable Prompts to Improve Controllability of Image Refinement in Environment Design
ACM Symposium on User Interface Software and Technology (UIST), 2025
Wen-Fan Wang
Ting-Ying Lee
Chien-Ting Lu
Che-Wei Hsu
Nil Ponsa Campany
Yu-Mei Chen
Mike Y. Chen
Bing-Yu Chen
DiffM
123
2
0
21 Aug 2025
Towards Self-Improvement of Diffusion Models via Group Preference Optimization
Renjie Chen
Wenfeng Lin
Yichen Zhang
Jiangchuan Wei
Boyuan Liu
Chao Feng
Jiao Ran
Mingyu Guo
278
3
0
16 May 2025
Dual Caption Preference Optimization for Diffusion Models
Amir Saeidi
Yiran Luo
Agneet Chatterjee
Shamanthak Hegde
Bimsara Pathiraja
Yezhou Yang
Chitta Baral
DiffM
297
1
0
09 Feb 2025
Diffusion Beats Autoregressive: An Evaluation of Compositional Generation in Text-to-Image Models
Arash Marioriyad
Parham Rezaei
M. Baghshah
M. Rohban
CoGe
944
2
0
30 Oct 2024
Attention Overlap Is Responsible for The Entity Missing Problem in Text-to-image Diffusion Models!
Arash Marioriyad
Mohammadali Banayeeanzade
Reza Abbasi
M. Rohban
M. Baghshah
DiffM
304
6
0
28 Oct 2024
REVISION: Rendering Tools Enable Spatial Fidelity in Vision-Language Models
European Conference on Computer Vision (ECCV), 2024
Agneet Chatterjee
Yiran Luo
Tejas Gokhale
Yezhou Yang
Chitta Baral
LRM
267
9
0
05 Aug 2024
Information Theoretic Text-to-Image Alignment
Chao Wang
Giulio Franzese
A. Finamore
Massimo Gallo
Pietro Michiardi
554
3
0
31 May 2024
Paragraph-to-Image Generation with Information-Enriched Diffusion Model
International Journal of Computer Vision (IJCV), 2023
Weijia Wu
Zhuang Li
Yefei He
Mike Zheng Shou
Chunhua Shen
Lele Cheng
Yan Li
Yan Li
Chen Zhang
VLM
461
39
0
24 Nov 2023