2004.01804
Cited By
Google Landmarks Dataset v2 -- A Large-Scale Benchmark for Instance-Level Recognition and Retrieval
Computer Vision and Pattern Recognition (CVPR), 2020
3 April 2020
Tobias Weyand
A. Araújo
Bingyi Cao
Jack Sim
Github (794★)
Papers citing
"Google Landmarks Dataset v2 -- A Large-Scale Benchmark for Instance-Level Recognition and Retrieval"
50 / 246 papers shown
Adaptive Blind All-in-One Image Restoration
David Serrano-Lozano
Luis Herranz
Shaolin Su
Javier Vázquez-Corral
VLM
552
4
0
27 Nov 2024
Systematic Reward Gap Optimization for Mitigating VLM Hallucinations
Lehan He
Zeren Chen
Zhelun Shi
Tianyu Yu
Jing Shao
Lu Sheng
MLLM
606
2
0
26 Nov 2024
Beyond Sight: Towards Cognitive Alignment in LVLM via Enriched Visual Knowledge
Computer Vision and Pattern Recognition (CVPR), 2024
Yaqi Zhao
Yuanyang Yin
Lin Li
Mingan Lin
Victor Shea-Jay Huang
Siwei Chen
Xin Wu
Baoqun Yin
Guosheng Dong
Wentao Zhang
280
3
0
25 Nov 2024
INQUIRE: A Natural World Text-to-Image Retrieval Benchmark
Neural Information Processing Systems (NeurIPS), 2024
Edward Vendrow
Omiros Pantazis
Alexander Shepard
Gabriel J. Brostow
Kate E. Jones
Oisin Mac Aodha
Sara Beery
Grant Van Horn
VLM
364
21
0
04 Nov 2024
ManiBox: Enhancing Embodied Spatial Generalization via Scalable Simulation Data Generations
Hengkai Tan
Xuezhou Xu
Chengyang Ying
Xinyi Mao
Songming Liu
Xingxing Zhang
Hang Su
Jun Zhu
267
9
0
04 Nov 2024
TIPS: Text-Image Pretraining with Spatial awareness
International Conference on Learning Representations (ICLR), 2024
Kevis-Kokitsi Maninis
Kaifeng Chen
Soham Ghosh
Arjun Karpur
Koert Chen
...
Jan Dlabal
Dan Gnanapragasam
Mojtaba Seyedhosseini
Howard Zhou
Andre Araujo
VLM
436
17
0
21 Oct 2024
Taking off the Rose-Tinted Glasses: A Critical Look at Adversarial ML Through the Lens of Evasion Attacks
Kevin Eykholt
Farhan Ahmed
Pratik Vaishnavi
Amir Rahmati
AAML
279
1
0
15 Oct 2024
Pair-VPR: Place-Aware Pre-training and Contrastive Pair Classification for Visual Place Recognition with Vision Transformers
IEEE Robotics and Automation Letters (RA-L), 2024
Stephen Hausler
Peyman Moghadam
SSL
ViT
403
11
0
09 Oct 2024
MARs: Multi-view Attention Regularizations for Patch-based Feature Recognition of Space Terrain
European Conference on Computer Vision (ECCV), 2024
Timothy Chase Jr
Karthik Dantu
275
2
0
07 Oct 2024
MM-R³: On (In-)Consistency of Vision-Language Models (VLMs)
Shih-Han Chou
Shivam Chandhok
James J. Little
Leonid Sigal
288
0
0
07 Oct 2024
Mining Your Own Secrets: Diffusion Classifier Scores for Continual Personalization of Text-to-Image Diffusion Models
International Conference on Learning Representations (ICLR), 2024
Saurav Jha
Shiqi Yang
Masato Ishii
Mengjie Zhao
Christian Simon
Muhammad Jehanzeb Mirza
Dong Gong
Lina Yao
Shusuke Takahashi
Yuki Mitsufuji
DiffM
497
2
0
01 Oct 2024
Efficient and Discriminative Image Feature Extraction for Universal Image Retrieval
German Conference on Pattern Recognition (DAGM), 2024
Morris Florek
David Tschirschwitz
Björn Barz
Volker Rodehorst
VLM
216
0
0
20 Sep 2024
Optimizing CLIP Models for Image Retrieval with Maintained Joint-Embedding Alignment
Similarity Search and Applications (SISAP), 2024
Konstantin Schall
Kai Uwe Barthel
Nico Hezel
Klaus Jung
VLM
287
7
0
03 Sep 2024
AMES: Asymmetric and Memory-Efficient Similarity Estimation for Instance-level Retrieval
European Conference on Computer Vision (ECCV), 2024
Pavel Suma
Giorgos Kordopatis-Zilos
Ahmet Iscen
Giorgos Tolias
VLM
385
8
0
06 Aug 2024
Autonomous Improvement of Instruction Following Skills via Foundation Models
Zhiyuan Zhou
P. Atreya
Abraham Lee
Homer Walke
Oier Mees
Sergey Levine
252
27
0
30 Jul 2024
LookupForensics: A Large-Scale Multi-Task Dataset for Multi-Phase Image-Based Fact Verification
Shuhan Cui
H. Nguyen
Trung-Nghia Le
Chun-Shien Lu
Isao Echizen
202
0
0
26 Jul 2024
EchoSight: Advancing Visual-Language Models with Wiki Knowledge
Yibin Yan
Weidi Xie
RALM
233
31
0
17 Jul 2024
Benchmarking Vision Language Models for Cultural Understanding
Shravan Nayak
Kanishk Jain
Rabiul Awal
Siva Reddy
Sjoerd van Steenkiste
Lisa Anne Hendricks
Karolina Stańczak
Aishwarya Agrawal
VLM
CoGe
294
74
0
15 Jul 2024
SK-VQA: Synthetic Knowledge Generation at Scale for Training Context-Augmented Multimodal LLMs
Xin Su
Man Luo
Kris W Pan
Tien Pei Chou
Vasudev Lal
Phillip Howard
353
5
0
28 Jun 2024
Depth Anywhere: Enhancing 360 Monocular Depth Estimation via Perspective Distillation and Unlabeled Data Augmentation
Neural Information Processing Systems (NeurIPS), 2024
Ning-Hsu Wang
Yu-Lun Liu
MDE
252
27
0
18 Jun 2024
Accurate and Fast Pixel Retrieval with Spatial and Uncertainty Aware Hypergraph Diffusion
G. An
Yuchi Huo
Sung-eui Yoon
260
0
0
17 Jun 2024
Depth Anything V2
Lihe Yang
Bingyi Kang
Zilong Huang
Zhen Zhao
Xiaohan Li
Jiashi Feng
Hengshuang Zhao
DiffM
VLM
MDE
359
1,064
0
13 Jun 2024
DenoiseReID: Denoising Model for Representation Learning of Person Re-Identification
Zhengrui Xu
Guan’an Wang
Xiaowen Huang
Jitao Sang
270
0
0
13 Jun 2024
UDON: Universal Dynamic Online distillatioN for generic image representations
Nikolaos-Antonios Ypsilantis
Kaifeng Chen
André Araujo
Ondřej Chum
172
7
0
12 Jun 2024
Accelerating Heterogeneous Federated Learning with Closed-form Classifiers
Eros Fani
Raffaello Camoriano
Barbara Caputo
Marco Ciccone
266
8
0
03 Jun 2024
Quantum Visual Feature Encoding Revisited
Xuan-Bac Nguyen
Hoang-Quan Nguyen
Hugh Churchill
Samee U. Khan
Khoa Luu
227
14
0
30 May 2024
Federated Learning under Partially Class-Disjoint Data via Manifold Reshaping
Ziqing Fan
Jiangchao Yao
Ruipeng Zhang
Lingjuan Lyu
Ya Zhang
Yanfeng Wang
FedML
252
5
0
29 May 2024
Federated Learning with Bilateral Curation for Partially Class-Disjoint Data
Ziqing Fan
Ruipeng Zhang
Jiangchao Yao
Bo Han
Ya Zhang
Yanfeng Wang
FedML
275
17
0
29 May 2024
Recent Trends in Personalized Dialogue Generation: A Review of Datasets, Methodologies, and Evaluations
Yi-Pei Chen
Noriki Nishida
Hideki Nakayama
Yuji Matsumoto
LLMAG
300
27
0
28 May 2024
AdapNet: Adaptive Noise-Based Network for Low-Quality Image Retrieval
Sihe Zhang
Qingdong He
Jinlong Peng
Yuxi Li
Zhengkai Jiang
Jiafu Wu
Mingmin Chi
Yabiao Wang
Chengjie Wang
223
0
0
28 May 2024
Prompt-Aware Adapter: Towards Learning Adaptive Visual Tokens for Multimodal Large Language Models
Yue Zhang
Hehe Fan
Yi Yang
287
6
0
24 May 2024
No Filter: Cultural and Socioeconomic Diversity in Contrastive Vision-Language Models
Angeline Pouget
Lucas Beyer
Emanuele Bugliarello
Xiao Wang
Andreas Steiner
Xiao-Qi Zhai
Ibrahim Alabdulmohsin
VLM
265
13
0
22 May 2024
General Place Recognition Survey: Towards Real-World Autonomy
Peng Yin
Jianhao Jiao
Shiqi Zhao
Lingyun Xu
Guoquan Huang
Howie Choset
Sebastian A. Scherer
Jianda Han
464
20
0
08 May 2024
Retrieval Robust to Object Motion Blur
Rong Zou
Marc Pollefeys
D. Rozumnyi
192
0
0
27 Apr 2024
Revisiting Relevance Feedback for CLIP-based Interactive Image Retrieval
Ryoya Nara
Yu-Chieh Lin
Yuji Nozawa
Youyang Ng
Goh Itoh
Osamu Torii
Yusuke Matsui
HAI
194
3
0
25 Apr 2024
Revisiting Text-to-Image Evaluation with Gecko: On Metrics, Prompts, and Human Ratings
Olivia Wiles
Chuhan Zhang
Isabela Albuquerque
Ivana Kajić
Su Wang
...
Jordi Pont-Tuset
Aida Nematzadeh
Anant Nawalgaria
EGVM
979
32
0
25 Apr 2024
Wiki-LLaVA: Hierarchical Retrieval-Augmented Generation for Multimodal LLMs
Davide Caffagni
Federico Cocchi
Nicholas Moratelli
Sara Sarto
Marcella Cornia
Lorenzo Baraldi
Rita Cucchiara
KELM
363
75
0
23 Apr 2024
On Train-Test Class Overlap and Detection for Image Retrieval
Computer Vision and Pattern Recognition (CVPR), 2024
Chull Hwan Song
Jooyoung Yoon
Taebaek Hwang
Shunghyun Choi
Yeong Hyeon Gu
Yannis Avrithis
209
4
0
01 Apr 2024
Semantic Prompting with Image-Token for Continual Learning
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024
Jisu Han
Jaemin Na
Wonjun Hwang
CLL
VLM
188
1
0
18 Mar 2024
N-QR: Natural Quick Response Codes for Multi-Robot Instance Correspondence
IEEE International Conference on Robotics and Automation (ICRA), 2024
Nathan Glaser
Rajashree Ravi
Z. Kira
173
0
0
09 Mar 2024
A Generative Approach for Wikipedia-Scale Visual Entity Recognition
Mathilde Caron
Ahmet Iscen
Alireza Fathi
Cordelia Schmid
353
7
0
04 Mar 2024
Grounding Language Models for Visual Entity Recognition
Zilin Xiao
Ming Gong
Paola Cascante-Bonilla
Xingyao Zhang
Jie Wu
Vicente Ordonez
VLM
259
13
0
28 Feb 2024
NocPlace: Nocturnal Visual Place Recognition via Generative and Inherited Knowledge Transfer
Bingxi Liu
Yiqun Wang
Huaqi Tao
Tingjun Huang
Fulin Tang
Yihong Wu
Jinqiang Cui
Hong Zhang
366
1
0
27 Feb 2024
PreFLMR: Scaling Up Fine-Grained Late-Interaction Multi-modal Retrievers
Weizhe Lin
Jingbiao Mei
Jinghong Chen
Bill Byrne
VLM
AI4Ed
305
42
0
13 Feb 2024
Good at captioning, bad at counting: Benchmarking GPT-4V on Earth observation data
Chenhui Zhang
Sherrie Wang
283
36
0
31 Jan 2024
FedRSU: Federated Learning for Scene Flow Estimation on Roadside Units
Shaoheng Fang
Rui Ye
Wenhao Wang
Zuhong Liu
Yuxiao Wang
Yafei Wang
Siheng Chen
Yanfeng Wang
333
4
0
23 Jan 2024
Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data
Lihe Yang
Bingyi Kang
Zilong Huang
Xiaohan Li
Jiashi Feng
Hengshuang Zhao
VLM
672
1,406
0
19 Jan 2024
Image Similarity using An Ensemble of Context-Sensitive Models
Knowledge Discovery and Data Mining (KDD), 2024
Zukang Liao
Min Chen
158
2
0
15 Jan 2024
Cross-modal Retrieval for Knowledge-based Visual Question Answering
European Conference on Information Retrieval (ECIR), 2024
Paul Lerner
Olivier Ferret
C. Guinaudeau
252
13
0
11 Jan 2024
Learning-To-Rank Approach for Identifying Everyday Objects Using a Physical-World Search Engine
Kanta Kaneda
Shunya Nagashima
Ryosuke Korekata
Motonari Kambara
Komei Sugiura
247
9
0
26 Dec 2023