17
0

ScreenAudit: Detecting Screen Reader Accessibility Errors in Mobile Apps Using Large Language Models

Abstract

Many mobile apps are inaccessible, thereby excluding people from their potential benefits. Existing rule-based accessibility checkers aim to mitigate these failures by identifying errors early during development but are constrained in the types of errors they can detect. We present ScreenAudit, an LLM-powered system designed to traverse mobile app screens, extract metadata and transcripts, and identify screen reader accessibility errors overlooked by existing checkers. We recruited six accessibility experts including one screen reader user to evaluate ScreenAudit's reports across 14 unique app screens. Our findings indicate that ScreenAudit achieves an average coverage of 69.2%, compared to only 31.3% with a widely-used accessibility checker. Expert feedback indicated that ScreenAudit delivered higher-quality feedback and addressed more aspects of screen reader accessibility compared to existing checkers, and that ScreenAudit would benefit app developers in real-world settings.

View on arXiv
@article{zhong2025_2504.02110,
  title={ ScreenAudit: Detecting Screen Reader Accessibility Errors in Mobile Apps Using Large Language Models },
  author={ Mingyuan Zhong and Ruolin Chen and Xia Chen and James Fogarty and Jacob O. Wobbrock },
  journal={arXiv preprint arXiv:2504.02110},
  year={ 2025 }
}
Comments on this paper