v1v2 (latest)

Evaluating the Use of Large Language Models as Synthetic Social Agents in Social Science Research

30 September 2025

Emma Rose Madden

LLMAG

ArXiv (abs)PDF HTML

Main:22 Pages

Bibliography:1 Pages

Appendix:1 Pages

Abstract

Large Language Models (LLMs) are being increasingly used as synthetic agents in social science, in applications ranging from augmenting survey responses to powering multi-agent simulations. This paper outlines cautions that should be taken when interpreting LLM outputs and proposes a pragmatic reframing for the social sciences in which LLMs are used as high-capacity pattern matchers for quasi-predictive interpolation under explicit scope conditions and not as substitutes for probabilistic inference. Practical guardrails such as independent draws, preregistered human baselines, reliability-aware validation, and subgroup calibration, are introduced so that researchers may engage in useful prototyping and forecasting while avoiding category errors.

View on arXiv

Comments on this paper