VLMs as GeoGuessr Masters: Exceptional Performance, Hidden Biases, and Privacy Risks

16 February 2025
Jingyuan Huang
Jen-tse Huang
Ziyi Liu
Xiaoyuan Liu
Wenxuan Wang
Jieyu Zhao
Abstract

Visual-Language Models (VLMs) have shown remarkable performance across various tasks, particularly in recognizing geographic information from images. However, significant challenges remain, including biases and privacy concerns. To systematically address these issues in the context of geographic information recognition, we introduce a benchmark dataset consisting of 1,200 images paired with detailed geographic metadata. Evaluating four VLMs, we find that while these models demonstrate the ability to recognize geographic information from images, achieving up to 53.8% accuracy in city prediction, they exhibit significant regional biases. Specifically, performance is substantially higher for economically developed and densely populated regions compared to less developed (−12.5%) and sparsely populated (−17.0%) areas. Moreover, the models exhibit regional biases, frequently overpredicting certain locations; for instance, they consistently predict Sydney for images taken in Australia. The strong performance of VLMs also raises privacy concerns, particularly for users who share images online without the intent of being identified. Our code and dataset are publicly available at this https URL.

@article{huang2025_2502.11163,
  title={VLMs as GeoGuessr Masters: Exceptional Performance, Hidden Biases, and Privacy Risks},
  author={Jingyuan Huang and Jen-tse Huang and Ziyi Liu and Xiaoyuan Liu and Wenxuan Wang and Jieyu Zhao},
  journal={arXiv preprint arXiv:2502.11163},
  year={2025}
}