FairEval: Evaluating Fairness in LLM-Based Recommendations with Personality Awareness

10 April 2025
Chandan Kumar Sah
Xiaoli Lian
Tony Xu
Li Zhang
Abstract

Recent advances in Large Language Models (LLMs) have enabled their application to recommender systems (RecLLMs), yet concerns remain regarding fairness across demographic and psychological user dimensions. We introduce FairEval, a novel evaluation framework to systematically assess fairness in LLM-based recommendations. FairEval integrates personality traits with eight sensitive demographic attributes, including gender, race, and age, enabling a comprehensive assessment of user-level bias. We evaluate models, including ChatGPT 4o and Gemini 1.5 Flash, on music and movie recommendations. FairEval's fairness metric, PAFS, achieves scores up to 0.9969 for ChatGPT 4o and 0.9997 for Gemini 1.5 Flash, with disparities reaching 34.79 percent. These results highlight the importance of robustness to prompt sensitivity and support more inclusive recommendation systems.

@article{sah2025_2504.07801,
  title={FairEval: Evaluating Fairness in LLM-Based Recommendations with Personality Awareness},
  author={Chandan Kumar Sah and Xiaoli Lian and Tony Xu and Li Zhang},
  journal={arXiv preprint arXiv:2504.07801},
  year={2025}
}