2303.15445
IRFL: Image Recognition of Figurative Language
27 March 2023
Ron Yosef
Yonatan Bitton
Dafna Shahaf
Papers citing "IRFL: Image Recognition of Figurative Language"
14 / 14 papers shown
Title
SemEval-2025 Task 1: AdMIRe -- Advancing Multimodal Idiomaticity Representation
Thomas Pickard
Aline Villavicencio
Maggie Mi
Wei He
Dylan Phelps
Carolina Scarton
75
1
0
19 Mar 2025
Can We Predict Performance of Large Models across Vision-Language Tasks?
Qinyu Zhao
Ming Xu
Kartik Gupta
Akshay Asthana
Liang Zheng
Stephen Gould
37
0
0
14 Oct 2024
HEMM: Holistic Evaluation of Multimodal Foundation Models
Paul Pu Liang
Akshay Goindani
Talha Chafekar
Leena Mathur
Haofei Yu
Ruslan Salakhutdinov
Louis-Philippe Morency
36
10
0
03 Jul 2024
Seeing the Unseen: Visual Metaphor Captioning for Videos
Abisek Rajakumar Kalarani
Pushpak Bhattacharyya
Sumit Shekhar
VLM
29
1
0
07 Jun 2024
ViPE: Visualise Pretty-much Everything
Hassan Shahmohammadi
Adhiraj Ghosh
Hendrik P. A. Lensch
DiffM
20
1
0
16 Oct 2023
VisIT-Bench: A Benchmark for Vision-Language Instruction Following Inspired by Real-World Use
Yonatan Bitton
Hritik Bansal
Jack Hessel
Rulin Shao
Wanrong Zhu
Anas Awadalla
Josh Gardner
Rohan Taori
L. Schmidt
VLM
29
76
0
12 Aug 2023
Factorized Contrastive Learning: Going Beyond Multi-view Redundancy
Paul Pu Liang
Zihao Deng
Martin Q. Ma
James Y. Zou
Louis-Philippe Morency
Ruslan Salakhutdinov
SSL
16
49
0
08 Jun 2023
Multimodal Learning Without Labeled Multimodal Data: Guarantees and Applications
Paul Pu Liang
Chun Kai Ling
Yun Cheng
A. Obolenskiy
Yudong Liu
Rohan Pandey
Alex Wilf
Louis-Philippe Morency
Ruslan Salakhutdinov
OffRL
26
11
0
07 Jun 2023
I Spy a Metaphor: Large Language Models and Diffusion Models Co-Create Visual Metaphors
Tuhin Chakrabarty
Arkadiy Saakyan
Olivia Winn
Artemis Panagopoulou
Yue Yang
Marianna Apidianaki
Smaranda Muresan
DiffM
19
27
0
24 May 2023
Breaking Common Sense: WHOOPS! A Vision-and-Language Benchmark of Synthetic and Compositional Images
Nitzan Bitton-Guetta
Yonatan Bitton
Jack Hessel
Ludwig Schmidt
Yuval Elovici
Gabriel Stanovsky
Roy Schwartz
VLM
121
65
0
13 Mar 2023
Metaphors in Pre-Trained Language Models: Probing and Generalization Across Datasets and Languages
Ehsan Aghazadeh
Mohsen Fayyaz
Yadollah Yaghoobzadeh
31
51
0
26 Mar 2022
BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
Junnan Li
Dongxu Li
Caiming Xiong
S. Hoi
MLLM
BDL
VLM
CLIP
382
4,010
0
28 Jan 2022
How Much Can CLIP Benefit Vision-and-Language Tasks?
Sheng Shen
Liunian Harold Li
Hao Tan
Mohit Bansal
Anna Rohrbach
Kai-Wei Chang
Z. Yao
Kurt Keutzer
CLIP
VLM
MLLM
182
342
0
13 Jul 2021
Zero-Shot Text-to-Image Generation
Aditya A. Ramesh
Mikhail Pavlov
Gabriel Goh
Scott Gray
Chelsea Voss
Alec Radford
Mark Chen
Ilya Sutskever
VLM
253
4,735
0
24 Feb 2021