ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2506.04482
32
1

Understanding and Meeting Practitioner Needs When Measuring Representational Harms Caused by LLM-Based Systems

4 June 2025
Emma Harvey
Emily Sheng
Su Lin Blodgett
Alexandra Chouldechova
Jean Garcia-Gathright
Alexandra Olteanu
Hanna M. Wallach
ArXiv (abs)PDFHTML
Abstract

The NLP research community has made publicly available numerous instruments for measuring representational harms caused by large language model (LLM)-based systems. These instruments have taken the form of datasets, metrics, tools, and more. In this paper, we examine the extent to which such instruments meet the needs of practitioners tasked with evaluating LLM-based systems. Via semi-structured interviews with 12 such practitioners, we find that practitioners are often unable to use publicly available instruments for measuring representational harms. We identify two types of challenges. In some cases, instruments are not useful because they do not meaningfully measure what practitioners seek to measure or are otherwise misaligned with practitioner needs. In other cases, instruments - even useful instruments - are not used by practitioners due to practical and institutional barriers impeding their uptake. Drawing on measurement theory and pragmatic measurement, we provide recommendations for addressing these challenges to better meet practitioner needs.

View on arXiv
@article{harvey2025_2506.04482,
  title={ Understanding and Meeting Practitioner Needs When Measuring Representational Harms Caused by LLM-Based Systems },
  author={ Emma Harvey and Emily Sheng and Su Lin Blodgett and Alexandra Chouldechova and Jean Garcia-Gathright and Alexandra Olteanu and Hanna Wallach },
  journal={arXiv preprint arXiv:2506.04482},
  year={ 2025 }
}
Main:9 Pages
1 Figures
Bibliography:6 Pages
4 Tables
Appendix:3 Pages
Comments on this paper