arXiv: 2405.19423
Evaluating Vision-Language Models on Bistable Images
29 May 2024
Artemis Panagopoulou
Coby Melkin
Chris Callison-Burch
Papers citing "Evaluating Vision-Language Models on Bistable Images"
3 / 3 papers shown
mPLUG-Owl: Modularization Empowers Large Language Models with Multimodality
Qinghao Ye, Haiyang Xu, Guohai Xu, Jiabo Ye, Ming Yan, ..., Junfeng Tian, Qiang Qi, Ji Zhang, Feiyan Huang, Jingren Zhou
VLM, MLLM
203, 883, 0
27 Apr 2023
WIT: Wikipedia-based Image Text Dataset for Multimodal Multilingual Machine Learning
Krishna Srinivasan, K. Raman, Jiecao Chen, Michael Bendersky, Marc Najork
VLM
181, 307, 0
02 Mar 2021
Conceptual 12M: Pushing Web-Scale Image-Text Pre-Training To Recognize Long-Tail Visual Concepts
Soravit Changpinyo, P. Sharma, Nan Ding, Radu Soricut
VLM
273, 845, 0
17 Feb 2021