The Moral Integrity Corpus: A Benchmark for Ethical Dialogue Systems

Annual Meeting of the Association for Computational Linguistics (ACL), 2022

6 April 2022

Diyi Yang

Papers citing "The Moral Integrity Corpus: A Benchmark for Ethical Dialogue Systems"

50 / 72 papers shown

MoVa: Towards Generalizable Classification of Human Morals and Values

140

29 Sep 2025

The Sum Leaks More Than Its Parts: Compositional Privacy Risks and Mitigations in Multi-Agent Collaboration

Vaidehi Patil

Elias Stengel-Eskin

Mohit Bansal

253

16 Sep 2025

Steerable Pluralism: Pluralistic Alignment via Few-Shot Comparative Regression

Bharadwaj Ravichandran

Aaron Bray

A. Hoogs

Arslan Basharat

126

11 Aug 2025

Multi-level Value Alignment in Agentic AI Systems: Survey and Perspectives

...

527

11 Jun 2025

The Staircase of Ethics: Probing LLM Value Priorities through Multi-Step Induction to Complex Moral Dilemmas

352

23 May 2025

Auditing the Ethical Logic of Generative AI Models

329

24 Apr 2025

News is More than a Collection of Facts: Moral Frame Preserving News Summarization

Enrico Liscio

Michela Lorandi

P. Murukannaiah

301

01 Apr 2025

Societal Alignment Frameworks Can Improve LLM Alignment

...

1.1K

27 Feb 2025

Evaluating Vision-Language Models for Emotion Recognition

North American Chapter of the Association for Computational Linguistics (NAACL), 2025

Sree Bhattacharyya

James Z. Wang

VLM

373

08 Feb 2025

AI-LieDar: Examine the Trade-off Between Utility and Truthfulness in LLM Agents

North American Chapter of the Association for Computational Linguistics (NAACL), 2024

441

13 Sep 2024

CMoralEval: A Moral Evaluation Benchmark for Chinese Large Language Models

Annual Meeting of the Association for Computational Linguistics (ACL), 2024

Yufei Huang

...

Tao Liu

Deyi Xiong

ELM

167

19 Aug 2024

VIVA: A Benchmark for Vision-Grounded Decision-Making with Human Values

309

03 Jul 2024

Raising the Bar: Investigating the Values of Large Language Models via Generative Evolving Testing

728

20 Jun 2024

CELL your Model: Contrastive Explanations for Large Language Models

Ronny Luss

Erik Miehling

Amit Dhurandhar

653

17 Jun 2024

Culturally Aware and Adapted NLP: A Taxonomy and a Survey of the State of the Art

Chen Cecilia Liu

Iryna Gurevych

Anna Korhonen

634

06 Jun 2024

Beyond Human Norms: Unveiling Unique Values of Large Language Models through Interdisciplinary Approaches

Pablo Biedma

Xiaoyuan Yi

Linus Huang

Maosong Sun

Xing Xie

PILM

388

19 Apr 2024

Harnessing the power of LLMs for normative reasoning in MASs

288

25 Mar 2024

Contextual Moral Value Alignment Through Context-Based Aggregation

236

19 Mar 2024

SaGE: Evaluating Moral Consistency in Large Language Models

Vamshi Krishna Bonagiri

Sreeram Vennam

Priyanshul Govil

Ponnurangam Kumaraguru

Manas Gaur

ELM

240

21 Feb 2024

Ranking Large Language Models without Ground Truth

Karthikeyan N. Ramamurthy

HILM ALM ELM

581

21 Feb 2024

Roadmap on Incentive Compatibility for AI Alignment and Governance in Sociotechnical Systems

506

20 Feb 2024

A Note on Bias to Complete

Jia Xu

Mona Diab

327

18 Feb 2024

RENOVI: A Benchmark Towards Remediating Norm Violations in Socio-Cultural Conversations

...

305

17 Feb 2024

Massively Multi-Cultural Knowledge Acquisition & LM Benchmarking

Heng Ji

253

14 Feb 2024

GrounDial: Human-norm Grounded Safe Dialog Response Generation

195

14 Feb 2024

Improving Dialog Safety using Socially Aware Contrastive Learning

Souvik Das

Rohini Srihari

265

01 Feb 2024

SADAS: A Dialogue Assistant System Towards Remediating Norm Violations in Bilingual Socio-Cultural Conversations

Yuncheng Hua

Zhuang Li

Linhao Luo

Kadek Ananta Satriadi

...

287

29 Jan 2024

Measuring Moral Inconsistencies in Large Language Models

Vamshi Krishna Bonagiri

Sreeram Vennam

Manas Gaur

Ponnurangam Kumaraguru

319

26 Jan 2024

Building Trustworthy NeuroSymbolic AI Systems: Consistency, Reliability, Explainability, and Safety

The AI Magazine (AI Mag.), 2023

Manas Gaur

Amit P. Sheth

252

05 Dec 2023

Interpretation modeling: Social grounding of sentences by reasoning over their implicit moral judgments

Liesbeth Allein

Maria Mihaela Trușcă

Marie-Francine Moens

249

27 Nov 2023

Large Language Models in Education: Vision and Opportunities

BigData Congress [Services Society] (BSS), 2023

Wensheng Gan

316

149

22 Nov 2023

MoCa: Measuring Human-Language Model Alignment on Causal and Moral Judgment Tasks

Neural Information Processing Systems (NeurIPS), 2023

Tatsunori Hashimoto

327

30 Oct 2023

Moral Sparks in Social Media Narratives

ACM Conference on Hypertext & Social Media (HT), 2023

Ruijie Xi

Munindar P. Singh

LRM

299

30 Oct 2023

EtiCor: Corpus for Analyzing LLMs for Etiquettes

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Ashutosh Dwivedi

Pradhyumna Lavania

Ashutosh Modi

229

29 Oct 2023

NormDial: A Comparable Bilingual Synthetic Dialog Dataset for Modeling Social Norm Adherence and Violation

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Smaranda Muresan

260

23 Oct 2023

Values, Ethics, Morals? On the Use of Moral Concepts in NLP Research

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Karina Vida

Judith Simon

Anne Lauscher

286

21 Oct 2023

Denevil: Towards Deciphering and Navigating the Ethical Values of Large Language Models via Instruction Learning

International Conference on Learning Representations (ICLR), 2023

Xing Xie

283

17 Oct 2023

The Past, Present and Better Future of Feedback Learning in Large Language Models for Subjective Human Preferences and Values

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Paul Röttger

427

11 Oct 2023

Aligning Language Models with Human Preferences via a Bayesian Approach

Neural Information Processing Systems (NeurIPS), 2023

418

09 Oct 2023

STREAM: Social data and knowledge collective intelligence platform for TRaining Ethical AI Models

Ai & Society (AI & Society), 2023

Yi Zeng

244

09 Oct 2023

SYNDICOM: Improving Conversational Commonsense with Error-Injection and Natural Language Feedback

SIGDIAL Conferences (SIGDIAL), 2023

Christopher Richardson

Anirudh S. Sundar

Larry Heck

LRM

345

18 Sep 2023

SafetyBench: Evaluating the Safety of Large Language Models

Annual Meeting of the Association for Computational Linguistics (ACL), 2023

Xiao Liu

367

192

13 Sep 2023

Rethinking Machine Ethics -- Can LLMs Perform Moral Reasoning through the Lens of Moral Theories?

Irwin King

348

29 Aug 2023

From Instructions to Intrinsic Human Values -- A Survey of Alignment Goals for Big Models

Xing Xie

448

23 Aug 2023

Through the Lens of Core Competency: Survey on Evaluation of Large Language Models

China National Conference on Chinese Computational Linguistics (CNCCL), 2023

221

15 Aug 2023

Modeling Cross-Cultural Pragmatic Inference with Codenames Duet

Annual Meeting of the Association for Computational Linguistics (ACL), 2023

Diyi Yang

298

04 Jun 2023

Conflicts, Villains, Resolutions: Towards models of Narrative Media Framing

Annual Meeting of the Association for Computational Linguistics (ACL), 2023

337

03 Jun 2023

NormBank: A Knowledge Bank of Situational Social Norms

Annual Meeting of the Association for Computational Linguistics (ACL), 2023

Diyi Yang

357

26 May 2023

Training Socially Aligned Language Models on Simulated Social Interactions

International Conference on Learning Representations (ICLR), 2023

Ruibo Liu

Diyi Yang

396

26 May 2023

NormMark: A Weakly Supervised Markov Model for Socio-cultural Norm Discovery

Annual Meeting of the Association for Computational Linguistics (ACL), 2023

210

26 May 2023