Demographic User Modeling for Social Robotics with Multimodal Pre-trained Models

15 February 2025
Hamed Rahimi
Mouad Abrini
Mahdi Khoramshahi
Mohamed Chetouani
Abstract

This paper investigates the performance of multimodal pre-trained models in user profiling tasks based on visual-linguistic demographic data. These models are critical for adapting to the needs and preferences of human users in social robotics, thereby providing personalized responses and enhancing interaction quality. First, we introduce two datasets specifically curated to represent demographic characteristics derived from user facial images. Next, we evaluate the performance of a prominent contrastive multimodal pre-trained model, CLIP, on these datasets, both in its out-of-the-box state and after fine-tuning. Initial results indicate that CLIP performs suboptimally in matching images to demographic descriptions without fine-tuning. Although fine-tuning significantly enhances its predictive capacity, the model still struggles to generalize to subtle demographic nuances. To address this, we propose adopting a masked image modeling strategy to improve generalization and better capture subtle demographic attributes. This approach offers a pathway for enhancing demographic sensitivity in multimodal user modeling tasks.
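The matching step the abstract describes, scoring one image against several candidate demographic descriptions, can be illustrated with a minimal sketch. It is not the authors' implementation: the embeddings, the two descriptions, and the temperature value are all hypothetical stand-ins, and a real pipeline would obtain the vectors from CLIP's image and text encoders. The sketch only shows the contrastive scoring rule: cosine similarity, scaled by a temperature, then a softmax over the candidate texts.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def match_probabilities(image_emb, text_embs, temperature=0.01):
    """Softmax over temperature-scaled cosine similarities.

    The temperature is an assumed value chosen for illustration only.
    """
    logits = [cosine(image_emb, t) / temperature for t in text_embs]
    top = max(logits)  # subtract the maximum for numerical stability
    exps = [math.exp(x - top) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


# Hypothetical descriptions and toy 3-dimensional embeddings (not from the paper).
descriptions = ["a photo of a young adult", "a photo of an elderly person"]
text_embs = [[1.0, 0.0, 0.2], [0.0, 1.0, 0.2]]
image_emb = [0.9, 0.1, 0.3]

probs = match_probabilities(image_emb, text_embs)
best = max(range(len(probs)), key=probs.__getitem__)
print(descriptions[best])
```

In this framing, zero-shot evaluation amounts to checking how often the highest-probability description matches the annotated one, and fine-tuning changes the encoders that produce the embeddings rather than the scoring rule.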

@article{rahimi2025_2502.10642,
  title={Demographic User Modeling for Social Robotics with Multimodal Pre-trained Models},
  author={Hamed Rahimi and Mouad Abrini and Mahdi Khoramshahi and Mohamed Chetouani},
  journal={arXiv preprint arXiv:2502.10642},
  year={2025}
}