Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2008.09094
Cited By
v1
v2 (latest)
Scruples: A Corpus of Community Ethical Judgments on 32,000 Real-Life Anecdotes
20 August 2020
Nicholas Lourie
Ronan Le Bras
Yejin Choi
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Scruples: A Corpus of Community Ethical Judgments on 32,000 Real-Life Anecdotes"
50 / 83 papers shown
MM-MoralBench: A MultiModal Moral Evaluation Benchmark for Large Vision-Language Models
Bei Yan
Jie M. Zhang
Zhiyuan Chen
Shiguang Shan
Xilin Chen
ELM
408
9
0
10 Apr 2026
From Competition to Coordination: Market Making as a Scalable Framework for Safe and Aligned Multi-Agent LLM Systems
Brendan Gho
Suman Muppavarapu
Afnan Shaik
Tyson Tsay
James Begin
Kevin Zhu
Archana Vaidheeswaran
Vasu Sharma
Vasu Sharma
LLMAG
216
0
0
18 Nov 2025
RoleConflictBench: A Benchmark of Role Conflict Scenarios for Evaluating LLMs' Contextual Sensitivity
Jisu Shin
Hoyun Song
Juhyun Oh
Changgeon Ko
Eunsu Kim
Chani Jung
Alice Oh
198
1
0
30 Sep 2025
MORABLES: A Benchmark for Assessing Abstract Moral Reasoning in LLMs with Fables
Matteo Marcuzzo
A. Zangari
A. Albarelli
Jose Camacho-Collados
Mohammad Taher Pilehvar
264
6
0
15 Sep 2025
EthicsMH: A Pilot Benchmark for Ethical Reasoning in Mental Health AI
Sai Kartheek Reddy Kasu
AI4MH
199
1
0
15 Sep 2025
Human Psychometric Questionnaires Mischaracterize LLM Psychology: Evidence from Generation Behavior
Dongmin Choi
Woojung Song
Jongwook Han
Eun-Ju Lee
Yohan Jo
128
1
0
12 Sep 2025
Discerning What Matters: A Multi-Dimensional Assessment of Moral Competence in LLMs
Daniel Kilov
Caroline Hendy
Secil Yanik Guyot
Aaron J. Snoswell
Seth Lazar
ELM
461
6
0
16 Jun 2025
Synthetic Socratic Debates: Examining Persona Effects on Moral Decision and Persuasion Dynamics
Jiarui Liu
Yueqi Song
Yunze Xiao
Mingqian Zheng
Lindia Tjuatja
Jana Schaich Borg
Mona Diab
Maarten Sap
249
9
0
14 Jun 2025
Multi-level Value Alignment in Agentic AI Systems: Survey and Perspectives
Wei Zeng
Hengshu Zhu
Chuan Qin
Han Wu
Yihang Cheng
...
Xiaowei Jin
Yinuo Shen
Zhenxing Wang
Feimin Zhong
Hui Xiong
AI4TS
532
0
0
11 Jun 2025
Value Portrait: Assessing Language Models' Values through Psychometrically and Ecologically Valid Items
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Jongwook Han
Dongmin Choi
Woojung Song
Eun-Ju Lee
Yohan Jo
PILM
625
0
0
02 May 2025
Auditing the Ethical Logic of Generative AI Models
W. Russell Neuman
Chad Coleman
Ali Dasdan
Safinah Ali
Manan Shah
ELM
LRM
346
4
0
24 Apr 2025
CLASH: Evaluating Language Models on Judging High-Stakes Dilemmas from Multiple Perspectives
Ayoung Lee
Ryan Sungmo Kwon
Peter Railton
Lu Wang
ELM
590
4
0
15 Apr 2025
Are Rules Meant to be Broken? Understanding Multilingual Moral Reasoning as a Computational Pipeline with UniMoral
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Shivani Kumar
David Jurgens
LRM
385
7
0
21 Feb 2025
The Goofus & Gallant Story Corpus for Practical Value Alignment
International Conference on Machine Learning and Applications (ICMLA), 2024
Md Sultan al Nahian
Tasmia Tasrin
Spencer Frazier
Mark O. Riedl
Brent Harrison
264
0
0
17 Jan 2025
Ethical Concern Identification in NLP: A Corpus of ACL Anthology Ethics Statements
North American Chapter of the Association for Computational Linguistics (NAACL), 2024
Antonia Karamolegkou
Sandrine Schiller Hansen
Ariadni Christopoulou
Filippos Stamatiou
Anne Lauscher
Anders Søgaard
191
0
0
12 Nov 2024
A Novel Psychometrics-Based Approach to Developing Professional Competency Benchmark for Large Language Models
Elena Kardanova
Alina Ivanova
Ksenia Tarasova
Taras Pashchenko
Aleksei Tikhoniuk
Elen Yusupova
Anatoly Kasprzhak
Yaroslav Kuzminov
Ekaterina Kruchinskaia
Irina Brun
414
1
0
29 Oct 2024
Extended Japanese Commonsense Morality Dataset with Masked Token and Label Enhancement
International Conference on Information and Knowledge Management (CIKM), 2024
Takumi Ohashi
Tsubasa Nakagawa
Hitoshi Iyatomi
194
0
0
12 Oct 2024
Fine-Tuning Language Models for Ethical Ambiguity: A Comparative Study of Alignment with Human Responses
Pranav Senthilkumar
Visshwa Balasubramanian
Prisha Jain
Aneesa Maity
Jonathan Lu
Kevin Zhu
212
4
0
10 Oct 2024
Intuitions of Compromise: Utilitarianism vs. Contractualism
Jared Moore
Yejin Choi
Sydney Levine
269
1
0
07 Oct 2024
DailyDilemmas: Revealing Value Preferences of LLMs with Quandaries of Daily Life
International Conference on Learning Representations (ICLR), 2024
Yu Ying Chiu
Liwei Jiang
Yejin Choi
423
43
0
03 Oct 2024
Bi-Factorial Preference Optimization: Balancing Safety-Helpfulness in Language Models
International Conference on Learning Representations (ICLR), 2024
Wenxuan Zhang
Juil Sock
Mohamed Elhoseiny
Adel Bibi
598
26
0
27 Aug 2024
CMoralEval: A Moral Evaluation Benchmark for Chinese Large Language Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Linhao Yu
Yongqi Leng
Yufei Huang
Shang Wu
Haixin Liu
...
Jinwang Song
Tingting Cui
Xiaoqing Cheng
Tao Liu
Deyi Xiong
ELM
182
11
0
19 Aug 2024
VERISCORE: Evaluating the factuality of verifiable claims in long-form text generation
Yixiao Song
Yekyung Kim
Mohit Iyyer
HILM
291
84
0
27 Jun 2024
Does Cross-Cultural Alignment Change the Commonsense Morality of Language Models?
Yuu Jinnai
384
8
0
24 Jun 2024
Navigating LLM Ethics: Advancements, Challenges, and Future Directions
AI and Ethics (AI & Ethics), 2024
Junfeng Jiao
S. Afroogh
Yiming Xu
Connor Phillips
AILaw
727
80
0
14 May 2024
Procedural Dilemma Generation for Evaluating Moral Reasoning in Humans and Language Models
Jan-Philipp Fränken
Kanishk Gandhi
Tori Qiu
Ayesha Khawaja
Noah D. Goodman
Tobias Gerstenberg
ELM
363
4
0
17 Apr 2024
SafetyPrompts: a Systematic Review of Open Datasets for Evaluating and Improving Large Language Model Safety
Paul Röttger
Fabio Pernisi
Bertie Vidgen
Dirk Hovy
ELM
KELM
428
72
0
08 Apr 2024
Harnessing the power of LLMs for normative reasoning in MASs
B. Savarimuthu
Surangika Ranathunga
Stephen Cranefield
LLMAG
315
10
0
25 Mar 2024
Multi-Task Inference: Can Large Language Models Follow Multiple Instructions at Once?
Seunghyeok Hong
Sangwon Baek
Sangdae Nam
Guijin Son
Seungone Kim
ELM
LRM
492
30
0
18 Feb 2024
Morality is Non-Binary: Building a Pluralist Moral Sentence Embedding Space using Contrastive Learning
Jeongwoo Park
Enrico Liscio
P. Murukannaiah
AILaw
325
8
0
30 Jan 2024
Cross Fertilizing Empathy from Brain to Machine as a Value Alignment Strategy
Devin Gonier
Adrian Adduci
Cassidy LoCascio
219
0
0
10 Dec 2023
MOKA: Moral Knowledge Augmentation for Moral Event Extraction
Xinliang Frederick Zhang
Winston Wu
Nick Beauchamp
Lu Wang
277
13
0
16 Nov 2023
MoCa: Measuring Human-Language Model Alignment on Causal and Moral Judgment Tasks
Neural Information Processing Systems (NeurIPS), 2023
Allen Nie
Yuhui Zhang
Atharva Amdekar
Chris Piech
Tatsunori Hashimoto
Tobias Gerstenberg
344
60
0
30 Oct 2023
Moral Sparks in Social Media Narratives
ACM Conference on Hypertext & Social Media (HT), 2023
Ruijie Xi
Munindar P. Singh
LRM
317
2
0
30 Oct 2023
EtiCor: Corpus for Analyzing LLMs for Etiquettes
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Ashutosh Dwivedi
Pradhyumna Lavania
Ashutosh Modi
236
37
0
29 Oct 2023
Do Differences in Values Influence Disagreements in Online Discussions?
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Michiel van der Meer
Piek T. J. M. Vossen
Catholijn M. Jonker
P. Murukannaiah
313
20
0
24 Oct 2023
Values, Ethics, Morals? On the Use of Moral Concepts in NLP Research
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Karina Vida
Judith Simon
Anne Lauscher
287
24
0
21 Oct 2023
The Past, Present and Better Future of Feedback Learning in Large Language Models for Subjective Human Preferences and Values
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Hannah Rose Kirk
Andrew M. Bean
Bertie Vidgen
Paul Röttger
Scott A. Hale
ALM
427
67
0
11 Oct 2023
Large Language Model Alignment: A Survey
Shangda Wu
Renren Jin
Yufei Huang
Chuang Liu
Weilong Dong
Zishan Guo
Xinwei Wu
Yan Liu
Deyi Xiong
LM&MA
451
303
0
26 Sep 2023
Probing the Moral Development of Large Language Models through Defining Issues Test
Kumar Tanmay
Aditi Khandelwal
Utkarsh Agarwal
Monojit Choudhury
LRM
332
31
0
23 Sep 2023
SafetyBench: Evaluating the Safety of Large Language Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Zhexin Zhang
Leqi Lei
Lindong Wu
Rui Sun
Yongkang Huang
Chong Long
Xiao Liu
Xuanyu Lei
Jie Tang
Shiyu Huang
LRM
LM&MA
ELM
381
196
0
13 Sep 2023
Rethinking Machine Ethics -- Can LLMs Perform Moral Reasoning through the Lens of Moral Theories?
Jingyan Zhou
Minda Hu
Junan Li
Xiaoying Zhang
Xixin Wu
Irwin King
Helen M. Meng
LRM
353
42
0
29 Aug 2023
From Instructions to Intrinsic Human Values -- A Survey of Alignment Goals for Big Models
Jing Yao
Xiaoyuan Yi
Xiting Wang
Yongfeng Zhang
Xing Xie
ALM
467
64
0
23 Aug 2023
Evaluating the Moral Beliefs Encoded in LLMs
Neural Information Processing Systems (NeurIPS), 2023
Nino Scherrer
Claudia Shi
Amir Feder
David M. Blei
289
234
0
26 Jul 2023
Let's Do a Thought Experiment: Using Counterfactuals to Improve Moral Reasoning
Xiao Ma
Swaroop Mishra
Ahmad Beirami
Alex Beutel
Jilin Chen
ELM
ReLM
LRM
195
19
0
25 Jun 2023
Knowledge of cultural moral norms in large language models
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Aida Ramezani
Yang Xu
ELM
AILaw
229
73
0
02 Jun 2023
KoSBi: A Dataset for Mitigating Social Bias Risks Towards Safer Large Language Model Application
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Hwaran Lee
Seokhee Hong
Joonsuk Park
Takyoung Kim
Gunhee Kim
Jung-Woo Ha
404
36
0
28 May 2023
SQuARe: A Large-Scale Dataset of Sensitive Questions and Acceptable Responses Created Through Human-Machine Collaboration
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Hwaran Lee
Seokhee Hong
Joonsuk Park
Takyoung Kim
M. Cha
...
Eun-Ju Lee
Yong Lim
Alice Oh
San-hee Park
Jung-Woo Ha
258
20
0
28 May 2023
NormBank: A Knowledge Bank of Situational Social Norms
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Caleb Ziems
Jane Dwivedi-Yu
Yi-Chia Wang
A. Halevy
Diyi Yang
370
64
0
26 May 2023
NormMark: A Weakly Supervised Markov Model for Socio-cultural Norm Discovery
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Farhad Moghimifar
Shilin Qu
Tongtong Wu
Yuan-Fang Li
Gholamreza Haffari
210
6
0
26 May 2023
1
2
Next
Page 1 of 2