2108.04024
Cited By
Image Retrieval on Real-life Images with Pre-trained Vision-and-Language Models
IEEE International Conference on Computer Vision (ICCV), 2021
9 August 2021
Zheyuan Liu
Cristian Rodriguez-Opazo
Damien Teney
Stephen Gould
VLM
Github (305★)
Papers citing
"Image Retrieval on Real-life Images with Pre-trained Vision-and-Language Models"
50 / 154 papers shown
Generative Editing in the Joint Vision-Language Space for Zero-Shot Composed Image Retrieval
Xin Wang
H. Zhang
Mang Li
Zhaohui Xia
Y. Chen
Yu Zhang
Chunyu Wei
DiffM
239
0
0
01 Dec 2025
UNION: A Lightweight Target Representation for Efficient Zero-Shot Image-Guided Retrieval with Optional Textual Queries
Hoang-Bao Le
Allie Tran
Binh T. Nguyen
Liting Zhou
C. Gurrin
VLM
104
0
0
27 Nov 2025
FIGROTD: A Friendly-to-Handle Dataset for Image Guided Retrieval with Optional Text
Hoang-Bao Le
Allie Tran
Binh T. Nguyen
Liting Zhou
C. Gurrin
120
0
0
27 Nov 2025
Reasoning Guided Embeddings: Leveraging MLLM Reasoning for Improved Multimodal Retrieval
Chunxu Liu
Jiyuan Yang
Ruopeng Gao
Yuhan Zhu
Feng Zhu
Rui Zhao
L. Wang
219
4
0
20 Nov 2025
MoRA: Missing Modality Low-Rank Adaptation for Visual Recognition
Shu Zhao
Nilesh A. Ahuja
Tan Yu
Tianyi Shen
V. Narayanan
VLM
208
1
0
09 Nov 2025
UME-R1: Exploring Reasoning-Driven Generative Multimodal Embeddings
Zhibin Lan
Liqiang Niu
Fandong Meng
Jie Zhou
Jinsong Su
MLLM
LRM
283
15
0
01 Nov 2025
Instance-Level Composed Image Retrieval
Bill Psomas
George Retsinas
Nikos Efthymiadis
P. Filntisis
Yannis Avrithis
Petros Maragos
Ondřej Chum
Giorgos Tolias
216
7
0
29 Oct 2025
Enhanced MLLM Black-Box Jailbreaking Attacks and Defenses
Xingwei Zhong
K. Fok
V. Thing
AAML
197
0
0
24 Oct 2025
MCA: Modality Composition Awareness for Robust Composed Multimodal Retrieval
Qiyu Wu
Shuyang Cui
Satoshi Hayakawa
Wei-Yao Wang
Hiromi Wakaki
Yuki Mitsufuji
133
0
0
17 Oct 2025
NExT-OMNI: Towards Any-to-Any Omnimodal Foundation Models with Discrete Flow Matching
Run Luo
Xiaobo Xia
Lu Wang
Longze Chen
Renke Shan
Jing Luo
Min Yang
Tat-Seng Chua
VGen
314
13
0
15 Oct 2025
MRMR: A Realistic and Expert-Level Multidisciplinary Benchmark for Reasoning-Intensive Multimodal Retrieval
Siyue Zhang
Yuan Gao
Xiao Zhou
Yilun Zhao
Tingyu Song
Arman Cohan
Anh Tuan Luu
Chen Zhao
VLM
LRM
193
1
0
10 Oct 2025
CIR-CoT: Towards Interpretable Composed Image Retrieval via End-to-End Chain-of-Thought Reasoning
Weihuang Lin
Yiwei Ma
Jinfa Huang
Xiaoshuai Sun
Rongrong Ji
LRM
206
0
0
09 Oct 2025
(Token-Level) InfoRMIA: Stronger Membership Inference and Memorization Assessment for LLMs
Jiashu Tao
Reza Shokri
165
2
0
07 Oct 2025
SQUARE: Semantic Query-Augmented Fusion and Efficient Batch Reranking for Training-free Zero-Shot Composed Image Retrieval
Ren-Di Wu
Yu-Yen Lin
Huei-Fang Yang
VLM
235
1
0
30 Sep 2025
MR²-Bench: Going Beyond Matching to Reasoning in Multimodal Retrieval
Junjie Zhou
Ze Liu
Lei Xiong
Jin-Ge Yao
Yueze Wang
...
Zhicheng Dou
Siqi Bao
Defu Lian
Yongping Xiong
Zheng Liu
VLM
LRM
174
2
0
30 Sep 2025
SETR: A Two-Stage Semantic-Enhanced Framework for Zero-Shot Composed Image Retrieval
Yuqi Xiao
Yingying Zhu
131
1
0
30 Sep 2025
GRAPE: Let GPRO Supervise Query Rewriting by Ranking for Retrieval
Zhaohua Zhang
Jianhuan Zhuo
Muxi Chen
Chenchen Zhao
Wenyu Jiang
...
Mingyang Chen
Yu Tang
Qiuyong Xiao
Jihong Zhang
Zhixun Su
VLM
175
0
0
27 Sep 2025
OmniBridge: Unified Multimodal Understanding, Generation, and Retrieval via Latent Space Alignment
Teng Xiao
Zuchao Li
Lefei Zhang
317
2
0
23 Sep 2025
Chain-of-Thought Re-ranking for Image Retrieval Tasks
Shangrong Wu
Yanghong Zhou
Yang Chen
Feng Zhang
P. Y. Mok
LRM
177
1
0
18 Sep 2025
Recurrence Meets Transformers for Universal Multimodal Retrieval
Davide Caffagni
Sara Sarto
Marcella Cornia
Lorenzo Baraldi
Rita Cucchiara
293
3
0
10 Sep 2025
EVENT-Retriever: Event-Aware Multimodal Image Retrieval for Realistic Captions
Dinh-Khoi Vo
Van-Loc Nguyen
M. Tran
T. Le
3DV
VGen
80
0
0
31 Aug 2025
Disentangling Latent Embeddings with Sparse Linear Concept Subspaces (SLiCS)
Zhi Li
Hau Phan
Matthew Emigh
Austin J. Brockmeier
CoGe
191
0
0
27 Aug 2025
Beyond Simple Edits: Composed Video Retrieval with Dense Modifications
Omkar Thawakar
Dmitry Demidov
Ritesh Thawkar
Rao Muhammad Anwer
M. Shah
Fahad Shahbaz Khan
Salman Khan
VGen
153
4
0
19 Aug 2025
Enhancing Supervised Composed Image Retrieval via Reasoning-Augmented Representation Engineering
Jun Li
Kai Li
Shaoguo Liu
Tingting Gao
LRM
239
0
0
15 Aug 2025
Composed Object Retrieval: Object-level Retrieval via Composed Expressions
Tong Wang
Guanyu Yang
Nian Liu
Zongyan Han
Jinxing Zhou
Salman Khan
Fahad Shahbaz Khan
270
0
0
06 Aug 2025
Agentic Personalized Fashion Recommendation in the Age of Generative AI: Challenges, Opportunities, and Evaluation
Yashar Deldjoo
Nima Rafiee
Mahdyar Ravanbakhsh
160
0
0
04 Aug 2025
On The Role of Pretrained Language Models in General-Purpose Text Embeddings: A Survey
Meishan Zhang
Xin Zhang
X. Zhao
Shouzheng Huang
Baotian Hu
Min Zhang
371
4
0
28 Jul 2025
U-MARVEL: Unveiling Key Factors for Universal Multimodal Retrieval via Embedding Learning with MLLMs
Xiaojie Li
Chu Li
Shi-Zhe Chen
Xi Chen
OffRL
340
5
0
20 Jul 2025
SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation
Shiqi Huang
Shuting He
Huaiyuan Qin
Bihan Wen
396
8
0
17 Jul 2025
Visual Re-Ranking with Non-Visual Side Information
Scandinavian Conference on Image Analysis (SCIA), 2025
Gustav Hanning
Gabrielle Flood
Viktor Larsson
221
0
0
01 Jul 2025
Zero Shot Composed Image Retrieval
Santhosh Kakarla
Gautama Shastry Bulusu Venkata
232
1
0
07 Jun 2025
From Play to Replay: Composed Video Retrieval for Temporally Fine-Grained Videos
Animesh Gupta
Jay Parmar
Ishan R. Dave
M. Shah
409
1
0
05 Jun 2025
SORCE: Small Object Retrieval in Complex Environments
Chunxu Liu
Chi Xie
X. Chen
Wei Li
Feng Zhu
Rui Zhao
Limin Wang
242
1
0
30 May 2025
ConText-CIR: Learning from Concepts in Text for Composed Image Retrieval
Computer Vision and Pattern Recognition (CVPR), 2025
Eric Xing
Pranavi Kolouju
Robert Pless
Abby Stylianou
Nathan Jacobs
361
4
0
27 May 2025
MLLM-Guided VLM Fine-Tuning with Joint Inference for Zero-Shot Composed Image Retrieval
Rong-Cheng Tu
Zhao Jin
Jingyi Liao
Xiao Luo
Yingjie Wang
Li Shen
Dacheng Tao
431
7
0
26 May 2025
DetailFusion: A Dual-branch Framework with Detail Enhancement for Composed Image Retrieval
Yuxin Yang
Yinan Zhou
Yuxin Chen
Ziqi Zhang
Zongyang Ma
...
Bing Li
Lin Song
Jun Gao
Peng Li
Weiming Hu
523
4
0
23 May 2025
InstructPart: Task-Oriented Part Segmentation with Instruction Reasoning
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Zifu Wan
Yaqi Xie
Ce Zhang
Zhiqiu Lin
Zihan Wang
Simon Stepputtis
Deva Ramanan
Katia Sycara
259
6
0
23 May 2025
From Mapping to Composing: A Two-Stage Framework for Zero-shot Composed Image Retrieval
Yabing Wang
Zhuotao Tian
Qingpei Guo
Zheng Qin
Sanping Zhou
Ming-Hsuan Yang
Le Wang
871
4
0
25 Apr 2025
TMCIR: Token Merge Benefits Composed Image Retrieval
Chaoyang Wang
Zeyu Zhang
Long Teng
Zijun Li
Shichao Kan
405
4
0
15 Apr 2025
MIEB: Massive Image Embedding Benchmark
Chenghao Xiao
Isaac Chung
Imene Kerboua
Jamie Stirling
Xin Zhang
Márton Kardos
Roman Solomatin
Noura Al Moubayed
Kenneth Enevoldsen
Niklas Muennighoff
VLM
593
7
0
14 Apr 2025
NCL-CIR: Noise-aware Contrastive Learning for Composed Image Retrieval
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2025
Peng Gao
Yujian Lee
Zailong Chen
Hui Zhang
Xubo Liu
Yiyang Hu
Guquang Jing
358
3
0
06 Apr 2025
Scaling Prompt Instructed Zero Shot Composed Image Retrieval with Image-Only Data
Yiqun Duan
Sameera Ramasinghe
Stephen Gould
Ajanthan Thalaiyasingam
473
3
0
01 Apr 2025
IDMR: Towards Instance-Driven Precise Visual Correspondence in Multimodal Retrieval
Bangwei Liu
Yicheng Bao
Shaohui Lin
Xuhong Wang
Xin Tan
Longji Xu
Yuan Xie
Chaochao Lu
466
6
0
01 Apr 2025
AutoComPose: Automatic Generation of Pose Transition Descriptions for Composed Pose Retrieval Using Multimodal LLMs
Yi-Ting Shen
Sungmin Eum
Doheon Lee
Rohit Shete
Chiao-Yi Wang
H. Kwon
Shuvra S. Bhattacharyya
414
0
0
28 Mar 2025
FineCIR: Explicit Parsing of Fine-Grained Modification Semantics for Composed Image Retrieval
Zixu Li
Zhiheng Fu
Yupeng Hu
Zhiwei Chen
Haokun Wen
Liqiang Nie
456
39
0
27 Mar 2025
Fine-grained Textual Inversion Network for Zero-Shot Composed Image Retrieval
Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2024
Haoqiang Lin
Haokun Wen
Xuemeng Song
Meng Liu
Yupeng Hu
Liqiang Nie
483
41
0
25 Mar 2025
good4cir: Generating Detailed Synthetic Captions for Composed Image Retrieval
Pranavi Kolouju
Eric Xing
Robert Pless
Nathan Jacobs
Abby Stylianou
3DV
240
5
0
22 Mar 2025
Missing Target-Relevant Information Prediction with World Model for Accurate Zero-Shot Composed Image Retrieval
Computer Vision and Pattern Recognition (CVPR), 2025
Yuanmin Tang
Jing Yu
Keke Gai
Jiamin Zhuang
Gang Xiong
Gaopeng Gou
Qi Wu
VGen
708
18
0
21 Mar 2025
Scale Efficient Training for Large Datasets
Computer Vision and Pattern Recognition (CVPR), 2025
Qing Zhou
Junyu Gao
Qi Wang
DD
377
7
0
17 Mar 2025
ImageScope: Unifying Language-Guided Image Retrieval via Large Multimodal Model Collective Reasoning
The Web Conference (WWW), 2025
Pengfei Luo
Jingbo Zhou
Tong Xu
Yuan Xia
Linli Xu
Tong Xu
LRM
421
14
0
13 Mar 2025