ResearchTrend.AI
  • Papers
  • Communities
  • Organizations
  • Events
  • Blog
  • Pricing
  • Feedback
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2108.04024
  4. Cited By
Image Retrieval on Real-life Images with Pre-trained Vision-and-Language
  Models

Image Retrieval on Real-life Images with Pre-trained Vision-and-Language Models

9 August 2021
Zheyuan Liu
Cristian Rodriguez-Opazo
Damien Teney
Stephen Gould
    VLM
ArXiv (abs)PDFHTML

Papers citing "Image Retrieval on Real-life Images with Pre-trained Vision-and-Language Models"

50 / 134 papers shown
Title
Recurrence Meets Transformers for Universal Multimodal Retrieval
Recurrence Meets Transformers for Universal Multimodal Retrieval
Davide Caffagni
Sara Sarto
Marcella Cornia
Lorenzo Baraldi
Rita Cucchiara
0
0
0
10 Sep 2025
EVENT-Retriever: Event-Aware Multimodal Image Retrieval for Realistic Captions
EVENT-Retriever: Event-Aware Multimodal Image Retrieval for Realistic Captions
Dinh-Khoi Vo
Van-Loc Nguyen
M. Tran
T. Le
3DVVGen
4
0
0
31 Aug 2025
Disentangling Latent Embeddings with Sparse Linear Concept Subspaces (SLiCS)
Disentangling Latent Embeddings with Sparse Linear Concept Subspaces (SLiCS)
Zhi Li
Hau Phan
Matthew Emigh
Austin J. Brockmeier
CoGe
52
0
0
27 Aug 2025
Beyond Simple Edits: Composed Video Retrieval with Dense Modifications
Beyond Simple Edits: Composed Video Retrieval with Dense Modifications
Omkar Thawakar
Dmitry Demidov
Ritesh Thawkar
Rao Muhammad Anwer
M. Shah
Fahad Shahbaz Khan
Salman Khan
VGen
12
0
0
19 Aug 2025
Enhancing Supervised Composed Image Retrieval via Reasoning-Augmented Representation Engineering
Enhancing Supervised Composed Image Retrieval via Reasoning-Augmented Representation Engineering
Jun Li
Kai Li
Shaoguo Liu
Tingting Gao
LRM
12
0
0
15 Aug 2025
Composed Object Retrieval: Object-level Retrieval via Composed Expressions
Composed Object Retrieval: Object-level Retrieval via Composed Expressions
Tong Wang
Guanyu Yang
Nian Liu
Zongyan Han
Jinxing Zhou
Salman Khan
Fahad Shahbaz Khan
32
0
0
06 Aug 2025
Agentic Personalized Fashion Recommendation in the Age of Generative AI: Challenges, Opportunities, and Evaluation
Agentic Personalized Fashion Recommendation in the Age of Generative AI: Challenges, Opportunities, and Evaluation
Yashar Deldjoo
Nima Rafiee
Mahdyar Ravanbakhsh
27
0
0
04 Aug 2025
U-MARVEL: Unveiling Key Factors for Universal Multimodal Retrieval via Embedding Learning with MLLMs
U-MARVEL: Unveiling Key Factors for Universal Multimodal Retrieval via Embedding Learning with MLLMs
Xiaojie Li
Chu Li
Shi-Zhe Chen
Xi Chen
OffRL
57
0
0
20 Jul 2025
SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation
SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation
Shiqi Huang
Shuting He
Huaiyuan Qin
Bihan Wen
68
0
0
17 Jul 2025
Visual Re-Ranking with Non-Visual Side Information
Visual Re-Ranking with Non-Visual Side Information
Gustav Hanning
Gabrielle Flood
Viktor Larsson
113
0
0
01 Jul 2025
Zero Shot Composed Image Retrieval
Zero Shot Composed Image Retrieval
Santhosh Kakarla
Gautama Shastry Bulusu Venkata
76
0
0
07 Jun 2025
From Play to Replay: Composed Video Retrieval for Temporally Fine-Grained Videos
Animesh Gupta
Jay Parmar
Ishan R. Dave
M. Shah
176
1
0
05 Jun 2025
SORCE: Small Object Retrieval in Complex Environments
SORCE: Small Object Retrieval in Complex Environments
Chunxu Liu
Chi Xie
X. Chen
Wei Li
Feng Zhu
Rui Zhao
Limin Wang
68
0
0
30 May 2025
ConText-CIR: Learning from Concepts in Text for Composed Image Retrieval
ConText-CIR: Learning from Concepts in Text for Composed Image Retrieval
Eric Xing
Pranavi Kolouju
Robert Pless
Abby Stylianou
Nathan Jacobs
100
1
0
27 May 2025
MLLM-Guided VLM Fine-Tuning with Joint Inference for Zero-Shot Composed Image Retrieval
MLLM-Guided VLM Fine-Tuning with Joint Inference for Zero-Shot Composed Image Retrieval
Rong-Cheng Tu
Zhao Jin
Jingyi Liao
Xiao Luo
Yingjie Wang
Li Shen
Dacheng Tao
168
1
0
26 May 2025
DetailFusion: A Dual-branch Framework with Detail Enhancement for Composed Image Retrieval
DetailFusion: A Dual-branch Framework with Detail Enhancement for Composed Image Retrieval
Yuxin Yang
Yinan Zhou
Yuxin Chen
Ziqi Zhang
Zongyang Ma
...
Bing Li
Lin Song
Jun Gao
Peng Li
Weiming Hu
238
0
0
23 May 2025
InstructPart: Task-Oriented Part Segmentation with Instruction Reasoning
InstructPart: Task-Oriented Part Segmentation with Instruction Reasoning
Zifu Wan
Yaqi Xie
Ce Zhang
Zhiqiu Lin
Zihan Wang
Simon Stepputtis
Deva Ramanan
Katia Sycara
100
2
0
23 May 2025
From Mapping to Composing: A Two-Stage Framework for Zero-shot Composed Image Retrieval
From Mapping to Composing: A Two-Stage Framework for Zero-shot Composed Image Retrieval
Yabing Wang
Zhuotao Tian
Qingpei Guo
Zheng Qin
Sanping Zhou
Ming-Hsuan Yang
Le Wang
517
2
0
25 Apr 2025
TMCIR: Token Merge Benefits Composed Image Retrieval
TMCIR: Token Merge Benefits Composed Image Retrieval
Chaoyang Wang
Zeyu Zhang
Long Teng
Zijun Li
Shichao Kan
158
0
0
15 Apr 2025
MIEB: Massive Image Embedding Benchmark
MIEB: Massive Image Embedding Benchmark
Chenghao Xiao
Isaac Chung
Imene Kerboua
Jamie Stirling
Xin Zhang
Márton Kardos
Roman Solomatin
Noura Al Moubayed
Kenneth Enevoldsen
Niklas Muennighoff
VLM
205
3
0
14 Apr 2025
NCL-CIR: Noise-aware Contrastive Learning for Composed Image Retrieval
NCL-CIR: Noise-aware Contrastive Learning for Composed Image Retrieval
Peng Gao
Yujian Lee
Zailong Chen
Hui Zhang
Xubo Liu
Yiyang Hu
Guquang Jing
140
0
0
06 Apr 2025
IDMR: Towards Instance-Driven Precise Visual Correspondence in Multimodal Retrieval
IDMR: Towards Instance-Driven Precise Visual Correspondence in Multimodal Retrieval
Bangwei Liu
Yicheng Bao
Shaohui Lin
Xuhong Wang
Xin Tan
Longji Xu
Yuan Xie
Chaochao Lu
241
1
0
01 Apr 2025
Scaling Prompt Instructed Zero Shot Composed Image Retrieval with Image-Only Data
Scaling Prompt Instructed Zero Shot Composed Image Retrieval with Image-Only Data
Yiqun Duan
Sameera Ramasinghe
Stephen Gould
Ajanthan Thalaiyasingam
163
2
0
01 Apr 2025
AutoComPose: Automatic Generation of Pose Transition Descriptions for Composed Pose Retrieval Using Multimodal LLMs
AutoComPose: Automatic Generation of Pose Transition Descriptions for Composed Pose Retrieval Using Multimodal LLMs
Yi-Ting Shen
Sungmin Eum
Doheon Lee
Rohit Shete
Chiao-Yi Wang
H. Kwon
Shuvra S. Bhattacharyya
130
0
0
28 Mar 2025
FineCIR: Explicit Parsing of Fine-Grained Modification Semantics for Composed Image Retrieval
FineCIR: Explicit Parsing of Fine-Grained Modification Semantics for Composed Image Retrieval
Zixu Li
Zhiheng Fu
Yupeng Hu
Zhiwei Chen
Haokun Wen
Liqiang Nie
165
12
0
27 Mar 2025
Fine-grained Textual Inversion Network for Zero-Shot Composed Image Retrieval
Fine-grained Textual Inversion Network for Zero-Shot Composed Image Retrieval
Haoqiang Lin
Haokun Wen
Xuemeng Song
Meng Liu
Yupeng Hu
Liqiang Nie
238
20
0
25 Mar 2025
good4cir: Generating Detailed Synthetic Captions for Composed Image Retrieval
good4cir: Generating Detailed Synthetic Captions for Composed Image Retrieval
Pranavi Kolouju
Eric Xing
Robert Pless
Nathan Jacobs
Abby Stylianou
3DV
104
2
0
22 Mar 2025
Missing Target-Relevant Information Prediction with World Model for Accurate Zero-Shot Composed Image Retrieval
Missing Target-Relevant Information Prediction with World Model for Accurate Zero-Shot Composed Image Retrieval
Yuanmin Tang
Jing Yu
Keke Gai
Jiamin Zhuang
Gang Xiong
Gaopeng Gou
Qi Wu
VGen
309
6
0
21 Mar 2025
Scale Efficient Training for Large Datasets
Scale Efficient Training for Large Datasets
Qing Zhou
Junyu Gao
Qi Wang
DD
172
1
0
17 Mar 2025
ImageScope: Unifying Language-Guided Image Retrieval via Large Multimodal Model Collective Reasoning
Pengfei Luo
Jingbo Zhou
Tong Xu
Yuan Xia
Linli Xu
Enhong Chen
LRM
192
2
0
13 Mar 2025
Data-Efficient Generalization for Zero-shot Composed Image Retrieval
Zining Chen
Zhicheng Zhao
Fei Su
Xiaoqin Zhang
Shijian Lu
VLM
175
1
0
07 Mar 2025
Composed Multi-modal Retrieval: A Survey of Approaches and Applications
Composed Multi-modal Retrieval: A Survey of Approaches and Applications
Kun Zhang
Jingyu Li
Zhiyu Li
Jingjing Zhang
F. Li
...
Nan Chen
Lei Zhang
Yongdong Zhang
Zhendong Mao
S.Kevin Zhou
169
0
0
03 Mar 2025
A Comprehensive Survey on Composed Image Retrieval
A Comprehensive Survey on Composed Image Retrieval
Xuemeng Song
Haoqiang Lin
Haokun Wen
Bohan Hou
Mingzhu Xu
Liqiang Nie
181
5
0
19 Feb 2025
Towards Text-Image Interleaved Retrieval
Towards Text-Image Interleaved Retrieval
Xin Zhang
Ziqi Dai
Yuchen Li
Yanzhao Zhang
Dingkun Long
Pengjun Xie
Hao Fei
Jun Yu
Wenjie Li
Min Zhang
89
0
0
18 Feb 2025
Ask in Any Modality: A Comprehensive Survey on Multimodal Retrieval-Augmented Generation
Ask in Any Modality: A Comprehensive Survey on Multimodal Retrieval-Augmented Generation
Mohammad Mahdi Abootorabi
Amirhosein Zobeiri
Mahdi Dehghani
Mohammadali Mohammadkhani
Bardia Mohammadi
Omid Ghahroodi
M. Baghshah
Ehsaneddin Asgari
RALM
456
14
0
12 Feb 2025
Triplet Synthesis For Enhancing Composed Image Retrieval via Counterfactual Image Generation
Kenta Uesugi
Naoki Saito
Keisuke Maeda
Takahiro Ogawa
Miki Haseyama
119
0
0
22 Jan 2025
SCOT: Self-Supervised Contrastive Pretraining For Zero-Shot Compositional Retrieval
Bhavin Jawade
JOÃO-BRUNO Soares
K. Thadani
D. Mohan
Amir Erfan Eshratifar
Benjamin Culpepper
Paloma de Juan
S. Setlur
V. Govindaraju
137
1
0
12 Jan 2025
VLM2Vec: Training Vision-Language Models for Massive Multimodal Embedding Tasks
VLM2Vec: Training Vision-Language Models for Massive Multimodal Embedding Tasks
Ziyan Jiang
Rui Meng
Xinyi Yang
Semih Yavuz
Yingbo Zhou
Lei Ma
MLLMVLM
280
49
0
03 Jan 2025
GME: Improving Universal Multimodal Retrieval by Multimodal LLMs
GME: Improving Universal Multimodal Retrieval by Multimodal LLMs
Xin Zhang
Yanzhao Zhang
Wen Xie
Mingxin Li
Ziqi Dai
Dingkun Long
Pengjun Xie
Meishan Zhang
Wenjie Li
Hao Fei
299
35
0
22 Dec 2024
Reason-before-Retrieve: One-Stage Reflective Chain-of-Thoughts for
  Training-Free Zero-Shot Composed Image Retrieval
Reason-before-Retrieve: One-Stage Reflective Chain-of-Thoughts for Training-Free Zero-Shot Composed Image Retrieval
Yuanmin Tang
Xiaoting Qin
Jing Zhang
Jing Yu
Gaopeng Gou
Gang Xiong
Qingwei Ling
Saravan Rajmohan
Dongmei Zhang
Qi Wu
LRM
195
4
0
15 Dec 2024
Composed Image Retrieval for Training-Free Domain Conversion
Composed Image Retrieval for Training-Free Domain Conversion
Nikos Efthymiadis
Bill Psomas
Zakaria Laskar
Konstantinos Karantzalos
Yannis Avrithis
Ondřej Chum
Giorgos Tolias
176
0
0
04 Dec 2024
LamRA: Large Multimodal Model as Your Advanced Retrieval Assistant
LamRA: Large Multimodal Model as Your Advanced Retrieval Assistant
Yikun Liu
Pingan Chen
Jiayin Cai
Xiaolong Jiang
Feng-Long Xie
Jiangchao Yao
Yanfeng Wang
Weidi Xie
RALM
137
0
0
02 Dec 2024
Imagine and Seek: Improving Composed Image Retrieval with an Imagined
  Proxy
Imagine and Seek: Improving Composed Image Retrieval with an Imagined Proxy
Yuchen Li
Fan Ma
Yi Yang
280
5
0
24 Nov 2024
AnySynth: Harnessing the Power of Image Synthetic Data Generation for
  Generalized Vision-Language Tasks
AnySynth: Harnessing the Power of Image Synthetic Data Generation for Generalized Vision-Language Tasks
Yuchen Li
Fan Ma
Yi Yang
DiffM
279
3
0
24 Nov 2024
MM-Embed: Universal Multimodal Retrieval with Multimodal LLMs
MM-Embed: Universal Multimodal Retrieval with Multimodal LLMs
Sheng-Chieh Lin
Chankyu Lee
Mohammad Shoeybi
Jimmy J. Lin
Bryan Catanzaro
Ming-Yu Liu
433
34
0
04 Nov 2024
Modality and Task Adaptation for Enhanced Zero-shot Composed Image Retrieval
Modality and Task Adaptation for Enhanced Zero-shot Composed Image Retrieval
Haiwen Li
Fei Su
Zhicheng Zhao
132
1
0
31 Oct 2024
Multi-path Exploration and Feedback Adjustment for Text-to-Image Person
  Retrieval
Multi-path Exploration and Feedback Adjustment for Text-to-Image Person Retrieval
Bin Kang
Bin Chen
Jinqiao Wang
Yong Xu
97
0
0
26 Oct 2024
ChatSearch: a Dataset and a Generative Retrieval Model for General
  Conversational Image Retrieval
ChatSearch: a Dataset and a Generative Retrieval Model for General Conversational Image Retrieval
Zijia Zhao
Longteng Guo
Tongtian Yue
Erdong Hu
Shuai Shao
Zehuan Yuan
Hua Huang
Qingbin Liu
81
3
0
24 Oct 2024
Unified Multi-Modal Interleaved Document Representation for Information
  Retrieval
Unified Multi-Modal Interleaved Document Representation for Information Retrieval
Jaewoo Lee
Joonho Ko
Jinheon Baek
Soyeong Jeong
Sung Ju Hwang
152
2
0
03 Oct 2024
EUFCC-CIR: a Composed Image Retrieval Dataset for GLAM Collections
EUFCC-CIR: a Composed Image Retrieval Dataset for GLAM Collections
Francesc Net
Lluís Gómez
102
0
0
02 Oct 2024
123
Next