ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2312.14999
28
0

Leveraging Habitat Information for Fine-grained Bird Identification

22 December 2023
Tin Nguyen
Anh Nguyen
Anh Nguyen
    VLM
ArXivPDFHTML
Abstract

Traditional bird classifiers mostly rely on the visual characteristics of birds. Some prior works even train classifiers to be invariant to the background, completely discarding the living environment of birds. Instead, we are the first to explore integrating habitat information, one of the four major cues for identifying birds by ornithologists, into modern bird classifiers. We focus on two leading model types: (1) CNNs and ViTs trained on the downstream bird datasets; and (2) original, multi-modal CLIP. Training CNNs and ViTs with habitat-augmented data results in an improvement of up to +0.83 and +0.23 points on NABirds and CUB-200, respectively. Similarly, adding habitat descriptors to the prompts for CLIP yields a substantial accuracy boost of up to +0.99 and +1.1 points on NABirds and CUB-200, respectively. We find consistent accuracy improvement after integrating habitat features into the image augmentation process and into the textual descriptors of vision-language CLIP classifiers. Code is available at:this https URL.

View on arXiv
@article{nguyen2025_2312.14999,
  title={ Leveraging Habitat Information for Fine-grained Bird Identification },
  author={ Tin Nguyen and Peijie Chen and Anh Totti Nguyen },
  journal={arXiv preprint arXiv:2312.14999},
  year={ 2025 }
}
Comments on this paper