Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2403.09193
Cited By
v1
v2 (latest)
Can We Talk Models Into Seeing the World Differently?
International Conference on Learning Representations (ICLR), 2024
14 March 2024
Paul Gavrikov
Jovita Lukasik
Steffen Jung
Robert Geirhos
Bianca Lamm
Muhammad Jehanzeb Mirza
Margret Keuper
VLM
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (9 upvotes)
Papers citing
"Can We Talk Models Into Seeing the World Differently?"
5 / 5 papers shown
When Harmful Content Gets Camouflaged: Unveiling Perception Failure of LVLMs with CamHarmTI
Yanhui Li
Qi Zhou
Zhihong Xu
Huizhong Guo
Wenhai Wang
Dongxia Wang
VLM
83
0
0
29 Nov 2025
TTRV: Test-Time Reinforcement Learning for Vision Language Models
Akshit Singh
Shyam Marjit
Wei Lin
Paul Gavrikov
Serena Yeung-Levy
Hilde Kuehne
Rogerio Feris
Sivan Doveh
James R. Glass
Muhammad Jehanzeb Mirza
VLM
246
0
0
08 Oct 2025
Taxonomic Reasoning for Rare Arthropods: Combining Dense Image Captioning and RAG for Interpretable Classification
Nathaniel Lesperance
S. Ratnasingham
Graham W. Taylor
VLM
319
0
0
13 Mar 2025
The in-context inductive biases of vision-language models differ across modalities
Kelsey Allen
Ishita Dasgupta
Eliza Kosoy
Andrew Kyle Lampinen
398
2
0
03 Feb 2025
GLOV: Guided Large Language Models as Implicit Optimizers for Vision Language Models
Muhammad Jehanzeb Mirza
Mengjie Zhao
Zhuoyuan Mao
Sivan Doveh
Wei Lin
...
Yuki Mitsufuji
Horst Possegger
Rogerio Feris
Leonid Karlinsky
James Glass
VLM
668
3
0
08 Oct 2024
1