ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2004.01804
  4. Cited By
Google Landmarks Dataset v2 -- A Large-Scale Benchmark for
  Instance-Level Recognition and Retrieval

Google Landmarks Dataset v2 -- A Large-Scale Benchmark for Instance-Level Recognition and Retrieval

3 April 2020
Tobias Weyand
A. Araújo
Bingyi Cao
Jack Sim
ArXivPDFHTML

Papers citing "Google Landmarks Dataset v2 -- A Large-Scale Benchmark for Instance-Level Recognition and Retrieval"

50 / 212 papers shown
Title
StoryReasoning Dataset: Using Chain-of-Thought for Scene Understanding and Grounded Story Generation
StoryReasoning Dataset: Using Chain-of-Thought for Scene Understanding and Grounded Story Generation
Daniel A. P. Oliveira
D. Matos
VGen
17
0
0
15 May 2025
Learning Compatible Multi-Prize Subnetworks for Asymmetric Retrieval
Learning Compatible Multi-Prize Subnetworks for Asymmetric Retrieval
Yushuai Sun
Zikun Zhou
D. Jiang
Yaowei Wang
Jun Yu
Guangming Lu
Wenjie Pei
29
0
0
16 Apr 2025
MIEB: Massive Image Embedding Benchmark
MIEB: Massive Image Embedding Benchmark
Chenghao Xiao
Isaac Chung
Imene Kerboua
Jamie Stirling
Xin Zhang
Márton Kardos
Roman Solomatin
Noura Al Moubayed
K. Enevoldsen
Niklas Muennighoff
VLM
35
0
0
14 Apr 2025
Evolved Hierarchical Masking for Self-Supervised Learning
Evolved Hierarchical Masking for Self-Supervised Learning
Zhanzhou Feng
Shiliang Zhang
37
0
0
12 Apr 2025
Boosting multi-demographic federated learning for chest x-ray analysis using general-purpose self-supervised representations
Boosting multi-demographic federated learning for chest x-ray analysis using general-purpose self-supervised representations
Mahshad Lotfinia
Arash Tayebiarasteh
Samaneh Samiei
Mehdi Joodaki
Soroosh Tayebi Arasteh
28
0
0
11 Apr 2025
Taxonomy-Aware Evaluation of Vision-Language Models
Taxonomy-Aware Evaluation of Vision-Language Models
Vésteinn Snæbjarnarson
Kevin Du
Niklas Stoehr
Serge J. Belongie
Ryan Cotterell
Nico Lang
Stella Frank
32
0
0
07 Apr 2025
LOCORE: Image Re-ranking with Long-Context Sequence Modeling
LOCORE: Image Re-ranking with Long-Context Sequence Modeling
Zilin Xiao
Pavel Suma
Ayush Sachdeva
Hao-Jen Wang
Giorgos Kordopatis-Zilos
Giorgos Tolias
Vicente Ordonez
57
0
0
27 Mar 2025
Vision as LoRA
Vision as LoRA
Han Wang
Yongjie Ye
Bingru Li
Yuxiang Nie
Jinghui Lu
Jingqun Tang
Yanjie Wang
Can Huang
86
0
0
26 Mar 2025
Distilling Monocular Foundation Model for Fine-grained Depth Completion
Distilling Monocular Foundation Model for Fine-grained Depth Completion
Yingping Liang
Yutao Hu
Wenqi Shao
Ying Fu
MDE
42
0
0
21 Mar 2025
Prototype Perturbation for Relaxing Alignment Constraints in Backward-Compatible Learning
Prototype Perturbation for Relaxing Alignment Constraints in Backward-Compatible Learning
Zikun Zhou
Yushuai Sun
Wenjie Pei
X. Li
Yaowei Wang
CLL
75
1
0
19 Mar 2025
RFMI: Estimating Mutual Information on Rectified Flow for Text-to-Image Alignment
RFMI: Estimating Mutual Information on Rectified Flow for Text-to-Image Alignment
Chao Wang
Giulio Franzese
A. Finamore
Pietro Michiardi
64
0
0
18 Mar 2025
DeepPerception: Advancing R1-like Cognitive Visual Perception in MLLMs for Knowledge-Intensive Visual Grounding
DeepPerception: Advancing R1-like Cognitive Visual Perception in MLLMs for Knowledge-Intensive Visual Grounding
Xinyu Ma
Ziyang Ding
Zhicong Luo
C. L. P. Chen
Zonghao Guo
Derek F. Wong
Xiaoyi Feng
Maosong Sun
VLM
LRM
76
0
0
17 Mar 2025
Find your Needle: Small Object Image Retrieval via Multi-Object Attention Optimization
Mihcael Green
Matan Levy
Issar Tzachor
Dvir Samuel
N. Darshan
Rami Ben-Ari
54
0
0
10 Mar 2025
Lossless Compression of Vector IDs for Approximate Nearest Neighbor Search
Lossless Compression of Vector IDs for Approximate Nearest Neighbor Search
Daniel de Souza Severo
Giuseppe Ottaviano
Matthew Muckley
Karen Ullrich
Matthijs Douze
MQ
43
0
0
16 Jan 2025
MatchAnything: Universal Cross-Modality Image Matching with Large-Scale Pre-Training
MatchAnything: Universal Cross-Modality Image Matching with Large-Scale Pre-Training
Xingyi He He
Hao Yu
Sida Peng
Dongli Tan
Zehong Shen
Hujun Bao
Xiaowei Zhou
46
4
0
13 Jan 2025
Adaptive Blind All-in-One Image Restoration
Adaptive Blind All-in-One Image Restoration
David Serrano-Lozano
Luis Herranz
Shaolin Su
Javier Vázquez-Corral
VLM
92
0
0
27 Nov 2024
A Topic-level Self-Correctional Approach to Mitigate Hallucinations in
  MLLMs
A Topic-level Self-Correctional Approach to Mitigate Hallucinations in MLLMs
Lehan He
Zeren Chen
Zhelun Shi
Tianyu Yu
Jing Shao
Lu Sheng
MLLM
111
1
0
26 Nov 2024
Beyond Sight: Towards Cognitive Alignment in LVLM via Enriched Visual
  Knowledge
Beyond Sight: Towards Cognitive Alignment in LVLM via Enriched Visual Knowledge
Yaqi Zhao
Yuanyang Yin
Lin Li
Mingan Lin
Victor Shea-Jay Huang
Siwei Chen
Weipeng Chen
Baoqun Yin
Zenan Zhou
Wentao Zhang
75
0
0
25 Nov 2024
INQUIRE: A Natural World Text-to-Image Retrieval Benchmark
INQUIRE: A Natural World Text-to-Image Retrieval Benchmark
Edward Vendrow
Omiros Pantazis
Alexander Shepard
Gabriel J. Brostow
Kate E. Jones
Oisin Mac Aodha
Sara Beery
Grant Van Horn
VLM
36
3
0
04 Nov 2024
ManiBox: Enhancing Spatial Grasping Generalization via Scalable
  Simulation Data Generation
ManiBox: Enhancing Spatial Grasping Generalization via Scalable Simulation Data Generation
Hengkai Tan
Xuezhou Xu
Chengyang Ying
Xinyi Mao
Songming Liu
Xingxing Zhang
Hang Su
J. Zhu
41
4
0
04 Nov 2024
TIPS: Text-Image Pretraining with Spatial awareness
TIPS: Text-Image Pretraining with Spatial awareness
Kevis-Kokitsi Maninis
Kaifeng Chen
Soham Ghosh
Arjun Karpur
Koert Chen
...
Jan Dlabal
Dan Gnanapragasam
Mojtaba Seyedhosseini
Howard Zhou
Andre Araujo
VLM
35
3
0
21 Oct 2024
Taking off the Rose-Tinted Glasses: A Critical Look at Adversarial ML
  Through the Lens of Evasion Attacks
Taking off the Rose-Tinted Glasses: A Critical Look at Adversarial ML Through the Lens of Evasion Attacks
Kevin Eykholt
Farhan Ahmed
Pratik Vaishnavi
Amir Rahmati
AAML
29
0
0
15 Oct 2024
Pair-VPR: Place-Aware Pre-training and Contrastive Pair Classification for Visual Place Recognition with Vision Transformers
Pair-VPR: Place-Aware Pre-training and Contrastive Pair Classification for Visual Place Recognition with Vision Transformers
Stephen Hausler
Peyman Moghadam
SSL
ViT
29
2
0
09 Oct 2024
MARs: Multi-view Attention Regularizations for Patch-based Feature
  Recognition of Space Terrain
MARs: Multi-view Attention Regularizations for Patch-based Feature Recognition of Space Terrain
Timothy Chase Jr
Karthik Dantu
28
0
0
07 Oct 2024
MM-R$^3$: On (In-)Consistency of Multi-modal Large Language Models
  (MLLMs)
MM-R3^33: On (In-)Consistency of Multi-modal Large Language Models (MLLMs)
Shih-Han Chou
Shivam Chandhok
James J. Little
Leonid Sigal
35
0
0
07 Oct 2024
Mining Your Own Secrets: Diffusion Classifier Scores for Continual Personalization of Text-to-Image Diffusion Models
Mining Your Own Secrets: Diffusion Classifier Scores for Continual Personalization of Text-to-Image Diffusion Models
Saurav Jha
Shiqi Yang
Masato Ishii
Mengjie Zhao
Christian Simon
Muhammad Jehanzeb Mirza
Dong Gong
Lina Yao
Shusuke Takahashi
Yuki Mitsufuji
DiffM
61
2
0
01 Oct 2024
Efficient and Discriminative Image Feature Extraction for Universal
  Image Retrieval
Efficient and Discriminative Image Feature Extraction for Universal Image Retrieval
Morris Florek
David Tschirschwitz
Björn Barz
Volker Rodehorst
VLM
23
0
0
20 Sep 2024
Optimizing CLIP Models for Image Retrieval with Maintained
  Joint-Embedding Alignment
Optimizing CLIP Models for Image Retrieval with Maintained Joint-Embedding Alignment
Konstantin Schall
Kai Uwe Barthel
Nico Hezel
Klaus Jung
VLM
31
3
0
03 Sep 2024
AMES: Asymmetric and Memory-Efficient Similarity Estimation for
  Instance-level Retrieval
AMES: Asymmetric and Memory-Efficient Similarity Estimation for Instance-level Retrieval
Pavel Suma
Giorgos Kordopatis-Zilos
Ahmet Iscen
Giorgos Tolias
VLM
37
3
0
06 Aug 2024
Autonomous Improvement of Instruction Following Skills via Foundation
  Models
Autonomous Improvement of Instruction Following Skills via Foundation Models
Zhiyuan Zhou
P. Atreya
Abraham Lee
Homer Walke
Oier Mees
Sergey Levine
30
10
0
30 Jul 2024
LookupForensics: A Large-Scale Multi-Task Dataset for Multi-Phase
  Image-Based Fact Verification
LookupForensics: A Large-Scale Multi-Task Dataset for Multi-Phase Image-Based Fact Verification
Shuhan Cui
H. Nguyen
Trung-Nghia Le
Chun-Shien Lu
Isao Echizen
23
0
0
26 Jul 2024
EchoSight: Advancing Visual-Language Models with Wiki Knowledge
EchoSight: Advancing Visual-Language Models with Wiki Knowledge
Yibin Yan
Weidi Xie
RALM
30
9
0
17 Jul 2024
Benchmarking Vision Language Models for Cultural Understanding
Benchmarking Vision Language Models for Cultural Understanding
Shravan Nayak
Kanishk Jain
Rabiul Awal
Siva Reddy
Sjoerd van Steenkiste
Lisa Anne Hendricks
Karolina Stañczak
Aishwarya Agrawal
VLM
CoGe
50
24
0
15 Jul 2024
SK-VQA: Synthetic Knowledge Generation at Scale for Training
  Context-Augmented Multimodal LLMs
SK-VQA: Synthetic Knowledge Generation at Scale for Training Context-Augmented Multimodal LLMs
Xin Su
Man Luo
Kris W Pan
Tien Pei Chou
Vasudev Lal
Phillip Howard
43
3
0
28 Jun 2024
Depth Anywhere: Enhancing 360 Monocular Depth Estimation via Perspective
  Distillation and Unlabeled Data Augmentation
Depth Anywhere: Enhancing 360 Monocular Depth Estimation via Perspective Distillation and Unlabeled Data Augmentation
Ning-Hsu Wang
Yu-Lun Liu
MDE
32
4
0
18 Jun 2024
Accurate and Fast Pixel Retrieval with Spatial and Uncertainty Aware
  Hypergraph Diffusion
Accurate and Fast Pixel Retrieval with Spatial and Uncertainty Aware Hypergraph Diffusion
G. An
Yuchi Huo
Sung-eui Yoon
42
0
0
17 Jun 2024
Depth Anything V2
Depth Anything V2
Lihe Yang
Bingyi Kang
Zilong Huang
Zhen Zhao
Xiaogang Xu
Jiashi Feng
Hengshuang Zhao
DiffM
VLM
MDE
59
323
0
13 Jun 2024
DenoiseReID: Denoising Model for Representation Learning of Person
  Re-Identification
DenoiseReID: Denoising Model for Representation Learning of Person Re-Identification
Zhengrui Xu
Guan’an Wang
Xiaowen Huang
Jitao Sang
39
0
0
13 Jun 2024
UDON: Universal Dynamic Online distillatioN for generic image
  representations
UDON: Universal Dynamic Online distillatioN for generic image representations
Nikolaos-Antonios Ypsilantis
Kaifeng Chen
André Araujo
Ondřej Chum
35
3
0
12 Jun 2024
Accelerating Heterogeneous Federated Learning with Closed-form
  Classifiers
Accelerating Heterogeneous Federated Learning with Closed-form Classifiers
Eros Fani
Raffaello Camoriano
Barbara Caputo
Marco Ciccone
26
4
0
03 Jun 2024
Quantum Visual Feature Encoding Revisited
Quantum Visual Feature Encoding Revisited
Xuan-Bac Nguyen
Hoang-Quan Nguyen
Hugh Churchill
Samee U. Khan
Khoa Luu
22
9
0
30 May 2024
Federated Learning under Partially Class-Disjoint Data via Manifold
  Reshaping
Federated Learning under Partially Class-Disjoint Data via Manifold Reshaping
Ziqing Fan
Jiangchao Yao
Ruipeng Zhang
Lingjuan Lyu
Ya-Qin Zhang
Yanfeng Wang
FedML
32
2
0
29 May 2024
Federated Learning with Bilateral Curation for Partially Class-Disjoint
  Data
Federated Learning with Bilateral Curation for Partially Class-Disjoint Data
Ziqing Fan
Ruipeng Zhang
Jiangchao Yao
Bo Han
Ya-Qin Zhang
Yanfeng Wang
FedML
32
12
0
29 May 2024
Recent Trends in Personalized Dialogue Generation: A Review of Datasets,
  Methodologies, and Evaluations
Recent Trends in Personalized Dialogue Generation: A Review of Datasets, Methodologies, and Evaluations
Yi-Pei Chen
Noriki Nishida
Hideki Nakayama
Yuji Matsumoto
LLMAG
41
10
0
28 May 2024
AdapNet: Adaptive Noise-Based Network for Low-Quality Image Retrieval
AdapNet: Adaptive Noise-Based Network for Low-Quality Image Retrieval
Sihe Zhang
Qingdong He
Jinlong Peng
Yuxi Li
Zhengkai Jiang
Jiafu Wu
Mingmin Chi
Yabiao Wang
Chengjie Wang
40
0
0
28 May 2024
Prompt-Aware Adapter: Towards Learning Adaptive Visual Tokens for
  Multimodal Large Language Models
Prompt-Aware Adapter: Towards Learning Adaptive Visual Tokens for Multimodal Large Language Models
Yue Zhang
Hehe Fan
Yi Yang
43
3
0
24 May 2024
No Filter: Cultural and Socioeconomic Diversity in Contrastive
  Vision-Language Models
No Filter: Cultural and Socioeconomic Diversity in Contrastive Vision-Language Models
Angeline Pouget
Lucas Beyer
Emanuele Bugliarello
Xiao Wang
Andreas Steiner
Xiao-Qi Zhai
Ibrahim M. Alabdulmohsin
VLM
31
7
0
22 May 2024
General Place Recognition Survey: Towards Real-World Autonomy
General Place Recognition Survey: Towards Real-World Autonomy
Peng Yin
Jianhao Jiao
Shiqi Zhao
Lingyun Xu
Guoquan Huang
Howie Choset
Sebastian A. Scherer
Jianda Han
42
7
0
08 May 2024
Retrieval Robust to Object Motion Blur
Retrieval Robust to Object Motion Blur
Rong Zou
Marc Pollefeys
D. Rozumnyi
26
0
0
27 Apr 2024
Revisiting Relevance Feedback for CLIP-based Interactive Image Retrieval
Revisiting Relevance Feedback for CLIP-based Interactive Image Retrieval
Ryoya Nara
Yu-Chieh Lin
Yuji Nozawa
Youyang Ng
Goh Itoh
Osamu Torii
Yusuke Matsui
HAI
29
2
0
25 Apr 2024
12345
Next