v1v2 (latest)

Google Landmarks Dataset v2 -- A Large-Scale Benchmark for Instance-Level Recognition and Retrieval

Computer Vision and Pattern Recognition (CVPR), 2020

3 April 2020

ArXiv (abs)PDF HTML Github (794★)

Papers citing "Google Landmarks Dataset v2 -- A Large-Scale Benchmark for Instance-Level Recognition and Retrieval"

50 / 246 papers shown

Adaptive Blind All-in-One Image Restoration

David Serrano-Lozano

Luis Herranz

Shaolin Su

Javier Vázquez-Corral

VLM

552

27 Nov 2024

Systematic Reward Gap Optimization for Mitigating VLM Hallucinations

606

26 Nov 2024

Beyond Sight: Towards Cognitive Alignment in LVLM via Enriched Visual KnowledgeComputer Vision and Pattern Recognition (CVPR), 2024

Victor Shea-Jay Huang

280

25 Nov 2024

INQUIRE: A Natural World Text-to-Image Retrieval BenchmarkNeural Information Processing Systems (NeurIPS), 2024

364

04 Nov 2024

ManiBox: Enhancing Embodied Spatial Generalization via Scalable Simulation Data Generations

Hang Su

Jun Zhu

267

04 Nov 2024

TIPS: Text-Image Pretraining with Spatial awarenessInternational Conference on Learning Representations (ICLR), 2024

Kevis-Kokitsi Maninis

...

Mojtaba Seyedhosseini

Howard Zhou

Andre Araujo

VLM

436

21 Oct 2024

Taking off the Rose-Tinted Glasses: A Critical Look at Adversarial ML Through the Lens of Evasion Attacks

279

15 Oct 2024

Pair-VPR: Place-Aware Pre-training and Contrastive Pair Classification for Visual Place Recognition with Vision TransformersIEEE Robotics and Automation Letters (RA-L), 2024

Stephen Hausler

Peyman Moghadam

SSL ViT

403

09 Oct 2024

MARs: Multi-view Attention Regularizations for Patch-based Feature Recognition of Space TerrainEuropean Conference on Computer Vision (ECCV), 2024

Timothy Chase Jr

Karthik Dantu

275

07 Oct 2024

MM-R$^3$: On (In-)Consistency of Vision-Language Models (VLMs)

MM-R

^3

: On (In-)Consistency of Vision-Language Models (VLMs)

288

07 Oct 2024

Mining Your Own Secrets: Diffusion Classifier Scores for Continual Personalization of Text-to-Image Diffusion ModelsInternational Conference on Learning Representations (ICLR), 2024

Mengjie Zhao

Muhammad Jehanzeb Mirza

Lina Yao

497

01 Oct 2024

Efficient and Discriminative Image Feature Extraction for Universal Image RetrievalGerman Conference on Pattern Recognition (DAGM), 2024

216

20 Sep 2024

Optimizing CLIP Models for Image Retrieval with Maintained Joint-Embedding AlignmentSimilarity Search and Applications (SISAP), 2024

287

03 Sep 2024

AMES: Asymmetric and Memory-Efficient Similarity Estimation for Instance-level RetrievalEuropean Conference on Computer Vision (ECCV), 2024

Pavel Suma

Giorgos Kordopatis-Zilos

Ahmet Iscen

Giorgos Tolias

VLM

385

06 Aug 2024

Autonomous Improvement of Instruction Following Skills via Foundation Models

252

30 Jul 2024

LookupForensics: A Large-Scale Multi-Task Dataset for Multi-Phase Image-Based Fact Verification

Isao Echizen

202

26 Jul 2024

EchoSight: Advancing Visual-Language Models with Wiki Knowledge

Yibin Yan

Weidi Xie

RALM

233

17 Jul 2024

Benchmarking Vision Language Models for Cultural Understanding

Sjoerd van Steenkiste

294

15 Jul 2024

SK-VQA: Synthetic Knowledge Generation at Scale for Training Context-Augmented Multimodal LLMs

353

28 Jun 2024

Depth Anywhere: Enhancing 360 Monocular Depth Estimation via Perspective Distillation and Unlabeled Data AugmentationNeural Information Processing Systems (NeurIPS), 2024

Ning-Hsu Wang

Yu-Lun Liu

MDE

252

18 Jun 2024

Accurate and Fast Pixel Retrieval with Spatial and Uncertainty Aware Hypergraph Diffusion

G. An

Yuchi Huo

Sung-eui Yoon

260

17 Jun 2024

359

1,064

13 Jun 2024

DenoiseReID: Denoising Model for Representation Learning of Person Re-Identification

270

13 Jun 2024

UDON: Universal Dynamic Online distillatioN for generic image representations

Nikolaos-Antonios Ypsilantis

Kaifeng Chen

André Araujo

Ondřej Chum

172

12 Jun 2024

Accelerating Heterogeneous Federated Learning with Closed-form Classifiers

266

03 Jun 2024

Quantum Visual Feature Encoding Revisited

227

30 May 2024

Federated Learning under Partially Class-Disjoint Data via Manifold Reshaping

Jiangchao Yao

252

29 May 2024

Federated Learning with Bilateral Curation for Partially Class-Disjoint Data

Ziqing Fan

Ruipeng Zhang

Jiangchao Yao

Bo Han

Ya Zhang

Yanfeng Wang

FedML

275

29 May 2024

Recent Trends in Personalized Dialogue Generation: A Review of Datasets, Methodologies, and Evaluations

300

28 May 2024

AdapNet: Adaptive Noise-Based Network for Low-Quality Image Retrieval

Sihe Zhang

Qingdong He

Jinlong Peng

Yabiao Wang

Chengjie Wang

223

28 May 2024

Prompt-Aware Adapter: Towards Learning Adaptive Visual Tokens for Multimodal Large Language Models

Yue Zhang

Hehe Fan

Yi Yang

287

24 May 2024

No Filter: Cultural and Socioeconomic Diversity in Contrastive Vision-Language Models

Ibrahim Alabdulmohsin

VLM

265

22 May 2024

General Place Recognition Survey: Towards Real-World Autonomy

464

08 May 2024

Retrieval Robust to Object Motion Blur

Rong Zou

Marc Pollefeys

D. Rozumnyi

192

27 Apr 2024

Revisiting Relevance Feedback for CLIP-based Interactive Image Retrieval

194

25 Apr 2024

Revisiting Text-to-Image Evaluation with Gecko: On Metrics, Prompts, and Human Ratings

...

Anant Nawalgaria

Jordi Pont-Tuset

Aida Nematzadeh

EGVM

979

25 Apr 2024

Wiki-LLaVA: Hierarchical Retrieval-Augmented Generation for Multimodal LLMs

Lorenzo Baraldi

363

23 Apr 2024

On Train-Test Class Overlap and Detection for Image RetrievalComputer Vision and Pattern Recognition (CVPR), 2024

209

01 Apr 2024

Semantic Prompting with Image-Token for Continual LearningIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024

188

18 Mar 2024

N-QR: Natural Quick Response Codes for Multi-Robot Instance CorrespondenceIEEE International Conference on Robotics and Automation (ICRA), 2024

Nathan Glaser

Rajashree Ravi

Z. Kira

173

09 Mar 2024

A Generative Approach for Wikipedia-Scale Visual Entity Recognition

353

04 Mar 2024

Grounding Language Models for Visual Entity Recognition

Zilin Xiao

Ming Gong

Paola Cascante-Bonilla

259

28 Feb 2024

NocPlace: Nocturnal Visual Place Recognition via Generative and Inherited Knowledge Transfer

366

27 Feb 2024

PreFLMR: Scaling Up Fine-Grained Late-Interaction Multi-modal Retrievers

305

13 Feb 2024

Good at captioning, bad at counting: Benchmarking GPT-4V on Earth observation data

Chenhui Zhang

Sherrie Wang

283

31 Jan 2024

FedRSU: Federated Learning for Scene Flow Estimation on Roadside Units

Rui Ye

Siheng Chen

333

23 Jan 2024

Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data

672

1,406

19 Jan 2024

Image Similarity using An Ensemble of Context-Sensitive ModelsKnowledge Discovery and Data Mining (KDD), 2024

Zukang Liao

Min Chen

158

15 Jan 2024

Cross-modal Retrieval for Knowledge-based Visual Question AnsweringEuropean Conference on Information Retrieval (ECIR), 2024

Paul Lerner

Olivier Ferret

C. Guinaudeau

252

11 Jan 2024

Learning-To-Rank Approach for Identifying Everyday Objects Using a Physical-World Search Engine

247

26 Dec 2023