From Inductive to Deductive: LLMs-Based Qualitative Data Analysis in Requirements Engineering

27 April 2025

Abstract

Requirements Engineering (RE) is essential for developing complex and regulated software projects. Given the challenges in transforming stakeholder inputs into consistent software designs, Qualitative Data Analysis (QDA) provides a systematic approach to handling free-form data. However, traditional QDA methods are time-consuming and heavily reliant on manual effort. In this paper, we explore the use of Large Language Models (LLMs), including GPT-4, Mistral, and LLaMA-2, to improve QDA tasks in RE. Our study evaluates LLMs' performance in inductive (zero-shot) and deductive (one-shot, few-shot) annotation tasks, revealing that GPT-4 achieves substantial agreement with human analysts in deductive settings, with Cohen's Kappa scores exceeding 0.7, while zero-shot performance remains limited. Detailed, context-rich prompts significantly improve annotation accuracy and consistency, particularly in deductive scenarios, and GPT-4 demonstrates high reliability across repeated runs. These findings highlight the potential of LLMs to support QDA in RE by reducing manual effort while maintaining annotation quality. The structured labels automatically provide traceability of requirements and can be directly utilized as classes in domain models, facilitating systematic software design.

View on arXiv

@article{shah2025_2504.19384,
  title={ From Inductive to Deductive: LLMs-Based Qualitative Data Analysis in Requirements Engineering },
  author={ Syed Tauhid Ullah Shah and Mohamad Hussein and Ann Barcomb and Mohammad Moshirpour },
  journal={arXiv preprint arXiv:2504.19384},
  year={ 2025 }
}

Comments on this paper