
Title |
|---|
![]() Re-evaluating Theory of Mind evaluation in large language modelsPhilosophical transactions of the Royal Society of London. Series B, Biological sciences (Philos Trans R Soc Lond B Biol Sci), 2025 |
![]() Leak, Cheat, Repeat: Data Contamination and Evaluation Malpractices in
Closed-Source LLMsConference of the European Chapter of the Association for Computational Linguistics (EACL), 2024 |
![]() Towards A Holistic Landscape of Situated Theory of Mind in Large
Language ModelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2023 |
![]() GPTEval: A Survey on Assessments of ChatGPT and GPT-4International Conference on Language Resources and Evaluation (LREC), 2023 |
![]() Deception Abilities Emerged in Large Language ModelsProceedings of the National Academy of Sciences of the United States of America (PNAS), 2023 |
![]() Thinking Fast and Slow in Large Language ModelsNature Computational Science (Nat. Comput. Sci.), 2022 |