IRFL: Image Recognition of Figurative Language

27 March 2023

Papers citing "IRFL: Image Recognition of Figurative Language"

14 / 14 papers shown

Title
SemEval-2025 Task 1: AdMIRe -- Advancing Multimodal Idiomaticity Representation Thomas Pickard Aline Villavicencio Maggie Mi Wei He Dylan Phelps Carolina Scarton 75 1 0 19 Mar 2025
Can We Predict Performance of Large Models across Vision-Language Tasks? Qinyu Zhao Ming Xu Kartik Gupta Akshay Asthana Liang Zheng Stephen Gould 37 0 0 14 Oct 2024
HEMM: Holistic Evaluation of Multimodal Foundation Models Paul Pu Liang Akshay Goindani Talha Chafekar Leena Mathur Haofei Yu Ruslan Salakhutdinov Louis-Philippe Morency 36 10 0 03 Jul 2024
Seeing the Unseen: Visual Metaphor Captioning for Videos Abisek Rajakumar Kalarani Pushpak Bhattacharyya Sumit Shekhar VLM 29 1 0 07 Jun 2024
ViPE: Visualise Pretty-much Everything Hassan Shahmohammadi Adhiraj Ghosh Hendrik P. A. Lensch DiffM 20 1 0 16 Oct 2023
VisIT-Bench: A Benchmark for Vision-Language Instruction Following Inspired by Real-World Use Yonatan Bitton Hritik Bansal Jack Hessel Rulin Shao Wanrong Zhu Anas Awadalla Josh Gardner Rohan Taori L. Schimdt VLM 29 76 0 12 Aug 2023
Factorized Contrastive Learning: Going Beyond Multi-view Redundancy Paul Pu Liang Zihao Deng Martin Q. Ma James Y. Zou Louis-Philippe Morency Ruslan Salakhutdinov SSL 16 49 0 08 Jun 2023
Multimodal Learning Without Labeled Multimodal Data: Guarantees and Applications Paul Pu Liang Chun Kai Ling Yun Cheng A. Obolenskiy Yudong Liu Rohan Pandey Alex Wilf Louis-Philippe Morency Ruslan Salakhutdinov OffRL 26 11 0 07 Jun 2023
I Spy a Metaphor: Large Language Models and Diffusion Models Co-Create Visual Metaphors Tuhin Chakrabarty Arkadiy Saakyan Olivia Winn Artemis Panagopoulou Yue Yang Marianna Apidianaki Smaranda Muresan DiffM 19 27 0 24 May 2023
Breaking Common Sense: WHOOPS! A Vision-and-Language Benchmark of Synthetic and Compositional Images Nitzan Bitton-Guetta Yonatan Bitton Jack Hessel Ludwig Schmidt Yuval Elovici Gabriel Stanovsky Roy Schwartz VLM 121 65 0 13 Mar 2023
Metaphors in Pre-Trained Language Models: Probing and Generalization Across Datasets and Languages Ehsan Aghazadeh Mohsen Fayyaz Yadollah Yaghoobzadeh 31 51 0 26 Mar 2022
BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation Junnan Li Dongxu Li Caiming Xiong S. Hoi MLLM BDL VLM CLIP 382 4,010 0 28 Jan 2022
How Much Can CLIP Benefit Vision-and-Language Tasks? Sheng Shen Liunian Harold Li Hao Tan Mohit Bansal Anna Rohrbach Kai-Wei Chang Z. Yao Kurt Keutzer CLIP VLM MLLM 182 342 0 13 Jul 2021
Zero-Shot Text-to-Image Generation Aditya A. Ramesh Mikhail Pavlov Gabriel Goh Scott Gray Chelsea Voss Alec Radford Mark Chen Ilya Sutskever VLM 253 4,735 0 24 Feb 2021