2408.02231
Cited By
REVISION: Rendering Tools Enable Spatial Fidelity in Vision-Language Models
5 August 2024
Agneet Chatterjee
Yiran Luo
Tejas Gokhale
Yezhou Yang
Chitta Baral
LRM
Papers citing
"REVISION: Rendering Tools Enable Spatial Fidelity in Vision-Language Models"
4 / 4 papers shown
Title
Dual Caption Preference Optimization for Diffusion Models
Amir Saeidi
Yiran Luo
Agneet Chatterjee
Shamanthak Hegde
Bimsara Pathiraja
Yezhou Yang
Chitta Baral
DiffM
51
0
0
09 Feb 2025
STUPD: A Synthetic Dataset for Spatial and Temporal Relation Reasoning
Palaash Agrawal
Haidi Azaman
Cheston Tan
36
3
0
13 Sep 2023
Training-Free Layout Control with Cross-Attention Guidance
Minghao Chen
Iro Laina
Andrea Vedaldi
DiffM
124
221
0
06 Apr 2023
Zero-Shot Text-to-Image Generation
Aditya A. Ramesh
Mikhail Pavlov
Gabriel Goh
Scott Gray
Chelsea Voss
Alec Radford
Mark Chen
Ilya Sutskever
VLM
253
4,764
0
24 Feb 2021