v1v2 (latest)

Scruples: A Corpus of Community Ethical Judgments on 32,000 Real-Life Anecdotes

20 August 2020

Nicholas Lourie

Ronan Le Bras

Yejin Choi

ArXiv (abs)PDF HTML

Papers citing "Scruples: A Corpus of Community Ethical Judgments on 32,000 Real-Life Anecdotes"

50 / 83 papers shown

MM-MoralBench: A MultiModal Moral Evaluation Benchmark for Large Vision-Language Models

408

10 Apr 2026

From Competition to Coordination: Market Making as a Scalable Framework for Safe and Aligned Multi-Agent LLM Systems

Archana Vaidheeswaran

Vasu Sharma

LLMAG

216

18 Nov 2025

RoleConflictBench: A Benchmark of Role Conflict Scenarios for Evaluating LLMs' Contextual Sensitivity

198

30 Sep 2025

MORABLES: A Benchmark for Assessing Abstract Moral Reasoning in LLMs with Fables

Matteo Marcuzzo

A. Zangari

A. Albarelli

Jose Camacho-Collados

Mohammad Taher Pilehvar

264

15 Sep 2025

EthicsMH: A Pilot Benchmark for Ethical Reasoning in Mental Health AI

Sai Kartheek Reddy Kasu

AI4MH

199

15 Sep 2025

Human Psychometric Questionnaires Mischaracterize LLM Psychology: Evidence from Generation Behavior

128

12 Sep 2025

Discerning What Matters: A Multi-Dimensional Assessment of Moral Competence in LLMs

461

16 Jun 2025

Synthetic Socratic Debates: Examining Persona Effects on Moral Decision and Persuasion Dynamics

249

14 Jun 2025

Multi-level Value Alignment in Agentic AI Systems: Survey and Perspectives

...

532

11 Jun 2025

Value Portrait: Assessing Language Models' Values through Psychometrically and Ecologically Valid ItemsAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

625

02 May 2025

Auditing the Ethical Logic of Generative AI Models

346

24 Apr 2025

CLASH: Evaluating Language Models on Judging High-Stakes Dilemmas from Multiple Perspectives

590

15 Apr 2025

Are Rules Meant to be Broken? Understanding Multilingual Moral Reasoning as a Computational Pipeline with UniMoralAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

Shivani Kumar

David Jurgens

LRM

385

21 Feb 2025

The Goofus & Gallant Story Corpus for Practical Value AlignmentInternational Conference on Machine Learning and Applications (ICMLA), 2024

264

17 Jan 2025

Ethical Concern Identification in NLP: A Corpus of ACL Anthology Ethics StatementsNorth American Chapter of the Association for Computational Linguistics (NAACL), 2024

Antonia Karamolegkou

Sandrine Schiller Hansen

Ariadni Christopoulou

Filippos Stamatiou

Anne Lauscher

Anders Søgaard

191

12 Nov 2024

A Novel Psychometrics-Based Approach to Developing Professional Competency Benchmark for Large Language Models

Ekaterina Kruchinskaia

Irina Brun

414

29 Oct 2024

Extended Japanese Commonsense Morality Dataset with Masked Token and Label EnhancementInternational Conference on Information and Knowledge Management (CIKM), 2024

Takumi Ohashi

Tsubasa Nakagawa

Hitoshi Iyatomi

194

12 Oct 2024

Fine-Tuning Language Models for Ethical Ambiguity: A Comparative Study of Alignment with Human Responses

Pranav Senthilkumar

Visshwa Balasubramanian

212

10 Oct 2024

Intuitions of Compromise: Utilitarianism vs. Contractualism

Jared Moore

Yejin Choi

Sydney Levine

269

07 Oct 2024

DailyDilemmas: Revealing Value Preferences of LLMs with Quandaries of Daily LifeInternational Conference on Learning Representations (ICLR), 2024

Yu Ying Chiu

Liwei Jiang

Yejin Choi

423

03 Oct 2024

Bi-Factorial Preference Optimization: Balancing Safety-Helpfulness in Language ModelsInternational Conference on Learning Representations (ICLR), 2024

598

27 Aug 2024

CMoralEval: A Moral Evaluation Benchmark for Chinese Large Language ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2024

Yufei Huang

...

Tao Liu

Deyi Xiong

ELM

182

19 Aug 2024

VERISCORE: Evaluating the factuality of verifiable claims in long-form text generation

291

27 Jun 2024

Does Cross-Cultural Alignment Change the Commonsense Morality of Language Models?

Yuu Jinnai

384

24 Jun 2024

Navigating LLM Ethics: Advancements, Challenges, and Future DirectionsAI and Ethics (AI & Ethics), 2024

727

14 May 2024

Procedural Dilemma Generation for Evaluating Moral Reasoning in Humans and Language Models

363

17 Apr 2024

SafetyPrompts: a Systematic Review of Open Datasets for Evaluating and Improving Large Language Model Safety

Paul Röttger

428

08 Apr 2024

Harnessing the power of LLMs for normative reasoning in MASs

315

25 Mar 2024

Multi-Task Inference: Can Large Language Models Follow Multiple Instructions at Once?

492

18 Feb 2024

Morality is Non-Binary: Building a Pluralist Moral Sentence Embedding Space using Contrastive Learning

325

30 Jan 2024

Cross Fertilizing Empathy from Brain to Machine as a Value Alignment Strategy

Devin Gonier

Adrian Adduci

Cassidy LoCascio

219

10 Dec 2023

MOKA: Moral Knowledge Augmentation for Moral Event Extraction

Xinliang Frederick Zhang

Winston Wu

Nick Beauchamp

Lu Wang

277

16 Nov 2023

MoCa: Measuring Human-Language Model Alignment on Causal and Moral Judgment TasksNeural Information Processing Systems (NeurIPS), 2023

Tatsunori Hashimoto

344

30 Oct 2023

Moral Sparks in Social Media NarrativesACM Conference on Hypertext & Social Media (HT), 2023

Ruijie Xi

Munindar P. Singh

LRM

317

30 Oct 2023

EtiCor: Corpus for Analyzing LLMs for EtiquettesConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Ashutosh Dwivedi

Pradhyumna Lavania

Ashutosh Modi

236

29 Oct 2023

Do Differences in Values Influence Disagreements in Online Discussions?Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

313

24 Oct 2023

Values, Ethics, Morals? On the Use of Moral Concepts in NLP ResearchConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Karina Vida

Judith Simon

Anne Lauscher

287

21 Oct 2023

The Past, Present and Better Future of Feedback Learning in Large Language Models for Subjective Human Preferences and ValuesConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Paul Röttger

427

11 Oct 2023

Large Language Model Alignment: A Survey

451

303

26 Sep 2023

Probing the Moral Development of Large Language Models through Defining Issues Test

332

23 Sep 2023

SafetyBench: Evaluating the Safety of Large Language ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2023

Xiao Liu

381

196

13 Sep 2023

Rethinking Machine Ethics -- Can LLMs Perform Moral Reasoning through the Lens of Moral Theories?

Irwin King

353

29 Aug 2023

From Instructions to Intrinsic Human Values -- A Survey of Alignment Goals for Big Models

Xing Xie

467

23 Aug 2023

Evaluating the Moral Beliefs Encoded in LLMsNeural Information Processing Systems (NeurIPS), 2023

289

234

26 Jul 2023

Let's Do a Thought Experiment: Using Counterfactuals to Improve Moral Reasoning

195

25 Jun 2023

Knowledge of cultural moral norms in large language modelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2023

Aida Ramezani

Yang Xu

ELM AILaw

229

02 Jun 2023

KoSBi: A Dataset for Mitigating Social Bias Risks Towards Safer Large Language Model ApplicationAnnual Meeting of the Association for Computational Linguistics (ACL), 2023

404

28 May 2023

SQuARe: A Large-Scale Dataset of Sensitive Questions and Acceptable Responses Created Through Human-Machine CollaborationAnnual Meeting of the Association for Computational Linguistics (ACL), 2023

...

258

28 May 2023

NormBank: A Knowledge Bank of Situational Social NormsAnnual Meeting of the Association for Computational Linguistics (ACL), 2023

Diyi Yang

370

26 May 2023

NormMark: A Weakly Supervised Markov Model for Socio-cultural Norm DiscoveryAnnual Meeting of the Association for Computational Linguistics (ACL), 2023

210

26 May 2023