2310.01596
Cited By
ImagenHub: Standardizing the evaluation of conditional image generation models
International Conference on Learning Representations (ICLR), 2023
2 October 2023
Max Ku
Tianle Li
Kai Zhang
Yujie Lu
Xingyu Fu
Wenwen Zhuang
Wenhu Chen
EGVM
HuggingFace (19 upvotes)
Github (33279★)
Papers citing
"ImagenHub: Standardizing the evaluation of conditional image generation models"
49 / 49 papers shown
BeyondFacial: Identity-Preserving Personalized Generation Beyond Facial Close-ups
Songsong Zhang
Chuanqi Tang
Hongguang Zhang
Guijian Tang
Minglong Li
Xueqiong Li
Shaowu Yang
Yuanxi Peng
Wenjing Yang
Jing Zhao
331
0
0
15 Nov 2025
The Intricate Dance of Prompt Complexity, Quality, Diversity, and Consistency in T2I Models
Xiaofeng Zhang
Aaron Courville
M. Drozdzal
Adriana Romero Soriano
DiffM
208
3
0
22 Oct 2025
Small is Sufficient: Reducing the World AI Energy Consumption Through Model Selection
Tiago da Silva Barros
Frédéric Giroire
Ramon Aparicio-Pardo
Joanna Moulierac
214
2
0
02 Oct 2025
EditReward: A Human-Aligned Reward Model for Instruction-Guided Image Editing
Keming Wu
Sicong Jiang
Max Ku
Ping Nie
Minghao Liu
Wenhu Chen
178
20
0
30 Sep 2025
EditScore: Unlocking Online RL for Image Editing via High-Fidelity Reward Modeling
Xin Luo
Jiahao Wang
Chenyuan Wu
Shitao Xiao
Xiyan Jiang
Defu Lian
Jiajun Zhang
Dong Liu
Zheng Liu
OffRL
280
28
0
28 Sep 2025
Human Preference-Aligned Concept Customization Benchmark via Decomposed Evaluation
Reina Ishikawa
Ryo Fujii
Hideo Saito
Ryo Hachiuma
230
1
0
03 Sep 2025
The Promise of RL for Autoregressive Image Editing
Saba Ahmadi
Rabiul Awal
Ankur Sikarwar
Amirhossein Kazemnejad
Ge Ya Luo
...
Sai Rajeswar
Siva Reddy
C. Pal
Benno Krojer
Aishwarya Agrawal
OffRL
KELM
365
3
0
01 Aug 2025
ADIEE: Automatic Dataset Creation and Scorer for Instruction-Guided Image Editing Evaluation
Sherry X. Chen
Yi Wei
Luowei Zhou
Suren Kumar
328
5
0
09 Jul 2025
Multi-Modal Language Models as Text-to-Image Model Evaluators
Jiahui Chen
Candace Ross
Reyhane Askari Hemmat
Koustuv Sinha
Melissa Hall
M. Drozdzal
Adriana Romero-Soriano
EGVM
474
1
0
01 May 2025
REED-VAE: RE-Encode Decode Training for Iterative Image Editing with Diffusion Models
Gal Almog
Ariel Shamir
Ohad Fried
DiffM
296
2
0
26 Apr 2025
RefVNLI: Towards Scalable Evaluation of Subject-driven Text-to-image Generation
Aviv Slobodkin
Hagai Taitelbaum
Yonatan Bitton
Brian Gordon
Michal Sokolik
Nitzan Bitton-Guetta
Almog Gueta
Royi Rassin
Itay Laish
Dani Lischinski
EGVM
VGen
510
2
0
24 Apr 2025
Early Timestep Zero-Shot Candidate Selection for Instruction-Guided Image Editing
Joowon Kim
Ziseok Lee
Donghyeon Cho
Sanghyun Jo
Y. Jung
Kyungsu Kim
Eunho Yang
DiffM
373
2
0
18 Apr 2025
Complex-Edit: CoT-Like Instruction Generation for Complexity-Controllable Image Editing Benchmark
S. Yang
Mude Hui
Bingchen Zhao
Yuyin Zhou
Nataniel Ruiz
Cihang Xie
CoGe
459
20
0
17 Apr 2025
A Unified Agentic Framework for Evaluating Conditional Image Generation
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Jifang Wang
Xue Yang
Longyue Wang
Zhenran Xu
Longji Xu
Yaowei Wang
Weihua Luo
Kaifu Zhang
Baotian Hu
Min Zhang
EGVM
DiffM
408
7
0
09 Apr 2025
MME-Unify: A Comprehensive Benchmark for Unified Multimodal Understanding and Generation Models
Wulin Xie
Yujiao Shi
Chaoyou Fu
Yang Shi
Bingyan Nie
Hongkai Chen
Zheng Zhang
Liang Wang
Tieniu Tan
506
13
0
04 Apr 2025
3DGen-Bench: Comprehensive Benchmark Suite for 3D Generative Models
Yujiao Shi
Mengchen Zhang
Tong Wu
Tengfei Wang
Gordon Wetzstein
Dahua Lin
Yu Qiao
ELM
739
7
0
27 Mar 2025
Single Image Iterative Subject-driven Generation and Editing
Yair Shpitzer
Gal Chechik
Idan Schwartz
346
1
0
20 Mar 2025
Visual Persona: Foundation Model for Full-Body Human Customization
Computer Vision and Pattern Recognition (CVPR), 2025
Jisu Nam
Soowon Son
Zhan Xu
Jing Shi
Difan Liu
Feng Liu
Aashish Misraa
Seungryong Kim
Yang Zhou
DiffM
384
8
0
19 Mar 2025
GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and Editing
Rongyao Fang
Chengqi Duan
Kun Wang
Linjiang Huang
Hao Li
...
Xingyu Zeng
R. Zhao
Jifeng Dai
Xihui Liu
Hongsheng Li
MLLM
ReLM
LRM
445
85
0
13 Mar 2025
VLForgery Face Triad: Detection, Localization and Attribution via Multimodal Large Language Models
Xinan He
Yue Zhou
Bing Fan
Bin Li
Guopu Zhu
Feng Ding
360
15
0
08 Mar 2025
IDEA-Bench: How Far are Generative Models from Professional Designing?
Computer Vision and Pattern Recognition (CVPR), 2024
C. Liang
Lianghua Huang
Jingwu Fang
Huanzhang Dou
Wei Wang
Zhi-Fan Wu
Yupeng Shi
Junge Zhang
Xin Zhao
Yu Liu
3DV
405
6
0
16 Dec 2024
Towards Unified Benchmark and Models for Multi-Modal Perceptual Metrics
Sara Ghazanfari
Siddharth Garg
Nicolas Flammarion
Prashanth Krishnamurthy
Farshad Khorrami
Francesco Croce
VLM
421
1
0
13 Dec 2024
T2I-FactualBench: Benchmarking the Factuality of Text-to-Image Models with Knowledge-Intensive Concepts
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Ziwei Huang
Wanggui He
Quanyu Long
Yandi Wang
Haoyuan Li
...
Fangxun Shu
Long Chen
Hao Jiang
Yaoyao Yu
Leilei Gan
EGVM
1.2K
9
0
05 Dec 2024
OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision
International Conference on Learning Representations (ICLR), 2024
Cong Wei
Zheyang Xiong
Weiming Ren
Xinrun Du
Ge Zhang
Lei Ma
602
107
0
11 Nov 2024
KITTEN: A Knowledge-Intensive Evaluation of Image Generation on Visual Entities
Hsin-Ping Huang
Xinyu Wang
Yonatan Bitton
Hagai Taitelbaum
Gaurav Singh Tomar
...
Xuhui Jia
Kelvin Chan
Hexiang Hu
Yu-Chuan Su
Ming-Hsuan Yang
EGVM
425
4
0
15 Oct 2024
PixLens: A Novel Framework for Disentangled Evaluation in Diffusion-Based Image Editing with Object Detection + SAM
Stefan Stefanache
Lluís Pastor Pérez
Julen Costa Watanabe
Ernesto Sanchez Tejedor
Thomas Hofmann
Enis Simsar
EGVM
139
0
0
08 Oct 2024
Finding the Subjective Truth: Collecting 2 Million Votes for Comprehensive Gen-AI Model Evaluation
Dimitrios Christodoulou
Mads Kuhlmann-Jørgensen
EGVM
234
9
0
18 Sep 2024
ABHINAW: A method for Automatic Evaluation of Typography within AI-Generated Images
Abhinaw Jagtap
Nachiket Tapas
R. G. Brajesh
EGVM
308
0
0
18 Sep 2024
DreamBench++: A Human-Aligned Benchmark for Personalized Image Generation
Yuang Peng
Yuxin Cui
Haomiao Tang
Zekun Qi
Runpei Dong
Jing Bai
Chunrui Han
Zheng Ge
Xiangyu Zhang
Shu-Tao Xia
EGVM
523
110
0
24 Jun 2024
Holistic Evaluation for Interleaved Text-and-Image Generation
Minqian Liu
Zhiyang Xu
Zihao Lin
Trevor Ashby
Joy Rimchala
Jiaxin Zhang
Lifu Huang
EGVM
351
26
0
20 Jun 2024
ExVideo: Extending Video Diffusion Models via Parameter-Efficient Post-Tuning
Zhongjie Duan
Wenmeng Zhou
Cen Chen
Yaliang Li
Weining Qian
VGen
DiffM
242
6
0
20 Jun 2024
Consistency-diversity-realism Pareto fronts of conditional image generative models
Pietro Astolfi
Marlene Careil
Melissa Hall
Oscar Mañas
Matthew Muckley
Jakob Verbeek
Adriana Romero Soriano
M. Drozdzal
357
38
0
14 Jun 2024
Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense?
Xingyu Fu
Muyu He
Yujie Lu
William Yang Wang
Dan Roth
EGVM
LRM
244
42
0
11 Jun 2024
GenAI Arena: An Open Evaluation Platform for Generative Models
Neural Information Processing Systems (NeurIPS), 2024
Dongfu Jiang
Max Ku
Tianle Li
Yuansheng Ni
Shizhuo Sun
Rongqi Fan
Wenhu Chen
EGVM
592
61
0
06 Jun 2024
Conditional Idempotent Generative Networks
Niccolò Ronchetti
202
0
0
05 Jun 2024
Who Evaluates the Evaluations? Objectively Scoring Text-to-Image Prompt Coherence Metrics with T2IScoreScore (TS2)
Michael Stephen Saxon
Fatima Jahara
Mahsa Khoshnoodi
Yujie Lu
Aditya Sharma
William Y. Wang
EGVM
396
13
0
05 Apr 2024
Evaluating Text-to-Visual Generation with Image-to-Text Generation
Zhiqiu Lin
Deepak Pathak
Baiqi Li
Jiayao Li
Xide Xia
Graham Neubig
Pengchuan Zhang
Deva Ramanan
EGVM
564
406
0
01 Apr 2024
AnyV2V: A Tuning-Free Framework For Any Video-to-Video Editing Tasks
Max Ku
Cong Wei
Weiming Ren
Huan Yang
Wenhu Chen
VGen
DiffM
647
98
0
21 Mar 2024
A Survey on Quality Metrics for Text-to-Image Generation
IEEE Transactions on Visualization and Computer Graphics (TVCG), 2024
Sebastian Hartwig
Dominik Engel
Leon Sick
H. Kniesel
Tristan Payer
Poonam Poonam
Michael Glöckler
Alex Bäuerle
Timo Ropinski
EGVM
387
0
0
18 Mar 2024
LightIt: Illumination Modeling and Control for Diffusion Models
Peter Kocsis
Julien Philip
Kalyan Sunkavalli
Matthias Nießner
Yannick Hold-Geoffroy
358
53
0
15 Mar 2024
Multi-LoRA Composition for Image Generation
Ming Zhong
Haoran Pan
Shuohang Wang
Yadong Lu
Yizhu Jiao
Siru Ouyang
Donghan Yu
Jiawei Han
Weizhu Chen
MoMe
293
87
0
26 Feb 2024
LSTP: Language-guided Spatial-Temporal Prompt Learning for Long-form Video-Text Understanding
Yuxuan Wang
Yueqian Wang
Pengfei Wu
Jianxin Liang
Dongyan Zhao
Zilong Zheng
VLM
301
3
0
25 Feb 2024
Instruct-Imagen: Image Generation with Multi-modal Instruction
Computer Vision and Pattern Recognition (CVPR), 2024
Hexiang Hu
Kelvin C. K. Chan
Yu-Chuan Su
Wenhu Chen
Yandong Li
...
Xue Ben
Boqing Gong
William W. Cohen
Ming-Wei Chang
Xuhui Jia
MLLM
316
85
0
03 Jan 2024
Semantic Guidance Tuning for Text-To-Image Diffusion Models
Hyun Kang
Dohae Lee
Myungjin Shin
In-Kwon Lee
304
1
0
26 Dec 2023
VIEScore: Towards Explainable Metrics for Conditional Image Synthesis Evaluation
Max Ku
Dongfu Jiang
Cong Wei
Xiang Yue
Wenhu Chen
423
141
0
22 Dec 2023
VBench: Comprehensive Benchmark Suite for Video Generative Models
Computer Vision and Pattern Recognition (CVPR), 2023
Ziqi Huang
Yinan He
Jiashuo Yu
Fan Zhang
Chenyang Si
...
Xinyuan Chen
Limin Wang
Dahua Lin
Yu Qiao
Ziwei Liu
VGen
619
1,240
0
29 Nov 2023
Shadows Don't Lie and Lines Can't Bend! Generative Models don't know Projective Geometry...for now
Computer Vision and Pattern Recognition (CVPR), 2023
Ayush Sarkar
Hanlin Mai
Amitabh Mahapatra
Svetlana Lazebnik
D. A. Forsyth
Anand Bhattad
GAN
311
59
0
28 Nov 2023
GPT-4V(ision) as a Generalist Evaluator for Vision-Language Tasks
Xinlu Zhang
Yujie Lu
Weizhi Wang
An Yan
Jun Yan
Lianke Qin
Heng Wang
Xifeng Yan
William Y. Wang
Linda R. Petzold
LM&MA
MLLM
ELM
274
131
0
02 Nov 2023
Photoswap: Personalized Subject Swapping in Images
Neural Information Processing Systems (NeurIPS), 2023
Jing Gu
Yilin Wang
Nanxuan Zhao
Tsu-Jui Fu
Wei Xiong
...
Zhifei Zhang
Chentao Song
Jianming Zhang
Hyun-Sun Jung
Xin Eric Wang
DiffM
324
55
0
29 May 2023