
MentalChat16K: A Benchmark Dataset for Conversational Mental Health Assistance

Abstract

We introduce MentalChat16K, an English benchmark dataset that combines a synthetic mental health counseling dataset with anonymized transcripts of interventions between Behavioral Health Coaches and Caregivers of patients in palliative or hospice care. Covering a diverse range of conditions such as depression, anxiety, and grief, this curated dataset is designed to facilitate the development and evaluation of large language models for conversational mental health assistance. By providing a high-quality resource tailored to this critical domain, MentalChat16K aims to advance research on empathetic, personalized AI solutions that improve access to mental health support services. The dataset prioritizes patient privacy, ethical considerations, and responsible data usage. MentalChat16K gives the research community an opportunity to develop AI technologies that can positively impact mental well-being.

@article{xu2025_2503.13509,
  title={MentalChat16K: A Benchmark Dataset for Conversational Mental Health Assistance},
  author={Jia Xu and Tianyi Wei and Bojian Hou and Patryk Orzechowski and Shu Yang and Ruochen Jin and Rachael Paulbeck and Joost Wagenaar and George Demiris and Li Shen},
  journal={arXiv preprint arXiv:2503.13509},
  year={2025}
}