2204.04908
Cited By
No Token Left Behind: Explainability-Aided Image Classification and Generation
11 April 2022
Roni Paiss
Hila Chefer
Lior Wolf
VLM
Papers citing "No Token Left Behind: Explainability-Aided Image Classification and Generation"
23 / 23 papers shown
Title
Explainable Search and Discovery of Visual Cultural Heritage Collections with Multimodal Large Language Models
T. Arnold
L. Tilton
30
0
0
07 Nov 2024
FRAP: Faithful and Realistic Text-to-Image Generation with Adaptive Prompt Weighting
Liyao Jiang
Negar Hassanpour
Mohammad Salameh
Mohan Sai Singamsetti
Fengyu Sun
Wei Lu
Di Niu
DiffM
58
2
0
21 Aug 2024
MePT: Multi-Representation Guided Prompt Tuning for Vision-Language Model
Xinyang Wang
Yi Yang
Minfeng Zhu
Kecheng Zheng
Shi Liu
Wei Chen
VPVLM
MLLM
VLM
31
1
0
19 Aug 2024
Teach CLIP to Develop a Number Sense for Ordinal Regression
Yao Du
Qiang Zhai
Weihang Dai
X. Li
33
0
0
07 Aug 2024
A2SF: Accumulative Attention Scoring with Forgetting Factor for Token Pruning in Transformer Decoder
Hyun Rae Jo
Dong Kun Shin
19
4
0
30 Jul 2024
MaskInversion: Localized Embeddings via Optimization of Explainability Maps
Walid Bousselham
Sofian Chaybouti
Christian Rupprecht
Vittorio Ferrari
Hilde Kuehne
51
0
0
29 Jul 2024
Zero-Painter: Training-Free Layout Control for Text-to-Image Synthesis
Marianna Ohanyan
Hayk Manukyan
Zhangyang Wang
Shant Navasardyan
Humphrey Shi
DiffM
35
1
0
06 Jun 2024
Text-guided Explorable Image Super-resolution
Kanchana Vaishnavi Gandikota
Paramanand Chandramouli
32
7
0
02 Mar 2024
LoMOE: Localized Multi-Object Editing via Multi-Diffusion
Goirik Chakrabarty
Aditya Chandrasekar
Ramya Hebbalaguppe
AP Prathosh
DiffM
43
6
0
01 Mar 2024
SPIRE: Semantic Prompt-Driven Image Restoration
Chenyang Qi
Zhengzhong Tu
Keren Ye
M. Delbracio
P. Milanfar
Qifeng Chen
Hossein Talebi
DiffM
13
11
0
18 Dec 2023
Focus on Your Instruction: Fine-grained and Multi-instruction Image Editing by Attention Modulation
Qin Guo
Tianwei Lin
DiffM
13
28
0
15 Dec 2023
Visual Attention Prompted Prediction and Learning
Yifei Zhang
Siyi Gu
Bo Pan
Guangji Bai
Meikang Qiu
Xiaofeng Yang
Liang Zhao
LRM
VLM
14
2
0
12 Oct 2023
Composite Diffusion | whole >= Σparts
Vikram Jamwal
S. Ramaneswaran
DiffM
13
0
0
25 Jul 2023
Divide, Evaluate, and Refine: Evaluating and Improving Text-to-Image Alignment with Iterative VQA Feedback
Jaskirat Singh
Liang Zheng
13
18
0
10 Jul 2023
Word-Level Explanations for Analyzing Bias in Text-to-Image Models
Alexander Lin
Lucas Monteiro Paes
Sree Harsha Tanneru
Suraj Srinivas
Himabindu Lakkaraju
9
10
0
03 Jun 2023
Text-guided Eyeglasses Manipulation with Spatial Constraints
Jiacheng Wang
Ping Liu
Jingen Liu
Wei-ping Xu
DiffM
16
6
0
25 Apr 2023
Teaching CLIP to Count to Ten
Roni Paiss
Ariel Ephrat
Omer Tov
Shiran Zada
Inbar Mosseri
Michal Irani
Tali Dekel
VLM
CLIP
16
88
0
23 Feb 2023
Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image Diffusion Models
Hila Chefer
Yuval Alaluf
Yael Vinker
Lior Wolf
Daniel Cohen-Or
DiffM
12
498
0
31 Jan 2023
SpaText: Spatio-Textual Representation for Controllable Image Generation
Omri Avrahami
Thomas Hayes
Oran Gafni
Sonal Gupta
Yaniv Taigman
Devi Parikh
Dani Lischinski
Ohad Fried
Xiaoyue Yin
DiffM
19
149
0
25 Nov 2022
An Image is Worth One Word: Personalizing Text-to-Image Generation using Textual Inversion
Rinon Gal
Yuval Alaluf
Y. Atzmon
Or Patashnik
Amit H. Bermano
Gal Chechik
Daniel Cohen-Or
29
1,744
0
02 Aug 2022
Text2Mesh: Text-Driven Neural Stylization for Meshes
O. Michel
Roi Bar-On
Richard Liu
Sagie Benaim
Rana Hanocka
CLIP
AI4CE
175
350
0
06 Dec 2021
Mind the Gap: Domain Gap Control for Single Shot Domain Adaptation for Generative Adversarial Networks
Peihao Zhu
Rameen Abdal
John C. Femiani
Peter Wonka
GAN
132
80
0
15 Oct 2021
Learning to Prompt for Vision-Language Models
Kaiyang Zhou
Jingkang Yang
Chen Change Loy
Ziwei Liu
VPVLM
CLIP
VLM
319
2,108
0
02 Sep 2021