ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2007.00145
  4. Cited By
Modality-Agnostic Attention Fusion for visual search with text feedback

Modality-Agnostic Attention Fusion for visual search with text feedback

30 June 2020
Eric Dodds
Jack Culpepper
Simão Herdade
Yang Zhang
K. Boakye
    EgoV
ArXivPDFHTML

Papers citing "Modality-Agnostic Attention Fusion for visual search with text feedback"

10 / 10 papers shown
Title
NCL-CIR: Noise-aware Contrastive Learning for Composed Image Retrieval
NCL-CIR: Noise-aware Contrastive Learning for Composed Image Retrieval
Peng Gao
Yujian Lee
Zailong Chen
Hui Zhang
Xubo Liu
Yiyang Hu
Guquang Jing
35
0
0
06 Apr 2025
Composite Sketch+Text Queries for Retrieving Objects with Elusive Names and Complex Interactions
Composite Sketch+Text Queries for Retrieving Objects with Elusive Names and Complex Interactions
Prajwal Gatti
Kshitij Parikh
Dhriti Prasanna Paul
Manish Gupta
Anand Mishra
107
2
0
12 Feb 2025
Sentence-level Prompts Benefit Composed Image Retrieval
Sentence-level Prompts Benefit Composed Image Retrieval
Yang Bai
Xinxing Xu
Yong-Jin Liu
Salman Khan
Fahad Khan
Wangmeng Zuo
Rick Siow Mong Goh
Chun-Mei Feng
28
26
0
09 Oct 2023
Training and challenging models for text-guided fashion image retrieval
Training and challenging models for text-guided fashion image retrieval
Eric Dodds
Jack Culpepper
Gaurav Srivastava
14
8
0
23 Apr 2022
UIGR: Unified Interactive Garment Retrieval
UIGR: Unified Interactive Garment Retrieval
Xiaoping Han
Sen He
Li Zhang
Yi-Zhe Song
Tao Xiang
16
7
0
06 Apr 2022
M5Product: Self-harmonized Contrastive Learning for E-commercial
  Multi-modal Pretraining
M5Product: Self-harmonized Contrastive Learning for E-commercial Multi-modal Pretraining
Xiao Dong
Xunlin Zhan
Yangxin Wu
Yunchao Wei
Michael C. Kampffmeyer
Xiaoyong Wei
Minlong Lu
Yaowei Wang
Xiaodan Liang
25
36
0
09 Sep 2021
Unified Vision-Language Pre-Training for Image Captioning and VQA
Unified Vision-Language Pre-Training for Image Captioning and VQA
Luowei Zhou
Hamid Palangi
Lei Zhang
Houdong Hu
Jason J. Corso
Jianfeng Gao
MLLM
VLM
250
926
0
24 Sep 2019
Neural Naturalist: Generating Fine-Grained Image Comparisons
Neural Naturalist: Generating Fine-Grained Image Comparisons
Maxwell Forbes
Christine Kaeser-Chen
Piyush Sharma
Serge J. Belongie
VLM
62
55
0
09 Sep 2019
Dialog-based Interactive Image Retrieval
Dialog-based Interactive Image Retrieval
Xiaoxiao Guo
Hui Wu
Yu Cheng
Steven J. Rennie
Gerald Tesauro
Rogerio Feris
48
204
0
01 May 2018
Multimodal Compact Bilinear Pooling for Visual Question Answering and
  Visual Grounding
Multimodal Compact Bilinear Pooling for Visual Question Answering and Visual Grounding
Akira Fukui
Dong Huk Park
Daylen Yang
Anna Rohrbach
Trevor Darrell
Marcus Rohrbach
144
1,464
0
06 Jun 2016
1