Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
All Papers
0 / 0 papers shown
Title
Home
Papers
2204.03021
Cited By
The Moral Integrity Corpus: A Benchmark for Ethical Dialogue Systems
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
6 April 2022
Caleb Ziems
Jane A. Yu
Yi-Chia Wang
A. Halevy
Diyi Yang
Re-assign community
ArXiv (abs)
PDF
HTML
Github (19★)
Papers citing
"The Moral Integrity Corpus: A Benchmark for Ethical Dialogue Systems"
50 / 71 papers shown
Title
MoVa: Towards Generalizable Classification of Human Morals and Values
Ziyu Chen
Junfei Sun
Chenxi Li
Tuan Dung Nguyen
Jing Yao
Xiaoyuan Yi
Xing Xie
Chenhao Tan
Lexing Xie
52
1
0
29 Sep 2025
The Sum Leaks More Than Its Parts: Compositional Privacy Risks and Mitigations in Multi-Agent Collaboration
Vaidehi Patil
Elias Stengel-Eskin
Mohit Bansal
106
0
0
16 Sep 2025
Steerable Pluralism: Pluralistic Alignment via Few-Shot Comparative Regression
Jadie Adams
Brian Hu
Emily Veenhuis
David Joy
Bharadwaj Ravichandran
Aaron Bray
A. Hoogs
Arslan Basharat
72
1
0
11 Aug 2025
Multi-level Value Alignment in Agentic AI Systems: Survey and Perspectives
Wei Zeng
Hengshu Zhu
Chuan Qin
Han Wu
Yihang Cheng
...
Xiaowei Jin
Yinuo Shen
Zhenxing Wang
Feimin Zhong
Hui Xiong
AI4TS
325
3
0
11 Jun 2025
The Staircase of Ethics: Probing LLM Value Priorities through Multi-Step Induction to Complex Moral Dilemmas
Ya Wu
Qiang Sheng
Danding Wang
Guang Yang
Yifan Sun
Zhengjia Wang
Yuyan Bu
Juan Cao
126
4
0
23 May 2025
Auditing the Ethical Logic of Generative AI Models
W. Russell Neuman
Chad Coleman
Ali Dasdan
Safinah Ali
Manan Shah
ELM
LRM
237
4
0
24 Apr 2025
News is More than a Collection of Facts: Moral Frame Preserving News Summarization
Enrico Liscio
Michela Lorandi
P. Murukannaiah
217
1
0
01 Apr 2025
Societal Alignment Frameworks Can Improve LLM Alignment
Karolina Stañczak
Nicholas Meade
Mehar Bhatia
Hattie Zhou
Konstantin Böttinger
...
Timothy P. Lillicrap
Ana Marasović
Sylvie Delacroix
Gillian K. Hadfield
Siva Reddy
932
3
0
27 Feb 2025
AI-LieDar: Examine the Trade-off Between Utility and Truthfulness in LLM Agents
North American Chapter of the Association for Computational Linguistics (NAACL), 2024
Zhe Su
Xuhui Zhou
Sanketh Rangreji
Anubha Kabra
Julia Mendelsohn
Faeze Brahman
Maarten Sap
LLMAG
353
19
0
13 Sep 2024
CMoralEval: A Moral Evaluation Benchmark for Chinese Large Language Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Linhao Yu
Yongqi Leng
Yufei Huang
Shang Wu
Haixin Liu
...
Jinwang Song
Tingting Cui
Xiaoqing Cheng
Tao Liu
Deyi Xiong
ELM
70
9
0
19 Aug 2024
VIVA: A Benchmark for Vision-Grounded Decision-Making with Human Values
Zhe Hu
Yixiao Ren
Jing Li
Yu Yin
VLM
217
11
0
03 Jul 2024
Raising the Bar: Investigating the Values of Large Language Models via Generative Evolving Testing
Han Jiang
Xiaoyuan Yi
Zhihua Wei
Ziang Xiao
Shu Wang
Xing Xie
ELM
ALM
394
11
0
20 Jun 2024
CELL your Model: Contrastive Explanations for Large Language Models
Ronny Luss
Erik Miehling
Amit Dhurandhar
438
0
0
17 Jun 2024
Culturally Aware and Adapted NLP: A Taxonomy and a Survey of the State of the Art
Chen Cecilia Liu
Iryna Gurevych
Anna Korhonen
446
14
0
06 Jun 2024
Beyond Human Norms: Unveiling Unique Values of Large Language Models through Interdisciplinary Approaches
Pablo Biedma
Xiaoyuan Yi
Linus Huang
Maosong Sun
Xing Xie
PILM
255
9
0
19 Apr 2024
Harnessing the power of LLMs for normative reasoning in MASs
B. Savarimuthu
Surangika Ranathunga
Stephen Cranefield
LLMAG
193
7
0
25 Mar 2024
Contextual Moral Value Alignment Through Context-Based Aggregation
Pierre Dognin
Jesus Rios
Ronny Luss
Inkit Padhi
Matthew D Riemer
Miao Liu
P. Sattigeri
Manish Nagireddy
Kush R. Varshney
Djallel Bouneffouf
106
9
0
19 Mar 2024
SaGE: Evaluating Moral Consistency in Large Language Models
Vamshi Krishna Bonagiri
Sreeram Vennam
Priyanshul Govil
Ponnurangam Kumaraguru
Manas Gaur
ELM
157
0
0
21 Feb 2024
Ranking Large Language Models without Ground Truth
Amit Dhurandhar
Rahul Nair
Moninder Singh
Elizabeth M. Daly
Karthikeyan N. Ramamurthy
HILM
ALM
ELM
346
9
0
21 Feb 2024
Roadmap on Incentive Compatibility for AI Alignment and Governance in Sociotechnical Systems
Zhaowei Zhang
Fengshuo Bai
Mingzhi Wang
Haoyang Ye
Chengdong Ma
Yaodong Yang
307
6
0
20 Feb 2024
A Note on Bias to Complete
Jia Xu
Mona Diab
214
2
0
18 Feb 2024
RENOVI: A Benchmark Towards Remediating Norm Violations in Socio-Cultural Conversations
Haolan Zhan
Zhuang Li
Xiaoxi Kang
Tao Feng
Yuncheng Hua
...
Linhao Luo
Lay-Ki Soon
Zhaleh Semnani Azad
Ingrid Zukerman
Gholamreza Haffari
190
13
0
17 Feb 2024
Massively Multi-Cultural Knowledge Acquisition & LM Benchmarking
Yi R. Fung
Ruining Zhao
Jae Doo
Chenkai Sun
Heng Ji
170
42
0
14 Feb 2024
GrounDial: Human-norm Grounded Safe Dialog Response Generation
Siwon Kim
Shuyang Dai
Mohammad Kachuee
Shayan Ray
Tara Taghavi
Sungroh Yoon
103
2
0
14 Feb 2024
Improving Dialog Safety using Socially Aware Contrastive Learning
Souvik Das
Rohini Srihari
178
1
0
01 Feb 2024
SADAS: A Dialogue Assistant System Towards Remediating Norm Violations in Bilingual Socio-Cultural Conversations
Yuncheng Hua
Zhuang Li
Linhao Luo
Kadek Ananta Satriadi
Tao Feng
...
Zhuang Li
Suraj Sharma
Ingrid Zukerman
Zhaleh Semnani Azad
Gholamreza Haffari
139
2
0
29 Jan 2024
Measuring Moral Inconsistencies in Large Language Models
Vamshi Krishna Bonagiri
Sreeram Vennam
Manas Gaur
Ponnurangam Kumaraguru
152
1
0
26 Jan 2024
Building Trustworthy NeuroSymbolic AI Systems: Consistency, Reliability, Explainability, and Safety
The AI Magazine (AI Mag.), 2023
Manas Gaur
Amit P. Sheth
152
22
0
05 Dec 2023
Interpretation modeling: Social grounding of sentences by reasoning over their implicit moral judgments
Liesbeth Allein
Maria Mihaela Trucscva
Marie-Francine Moens
162
2
0
27 Nov 2023
Large Language Models in Education: Vision and Opportunities
BigData Congress [Services Society] (BSS), 2023
Wensheng Gan
Zhenlian Qi
Jiayang Wu
Chun-Wei Lin
AI4Ed
245
128
0
22 Nov 2023
MoCa: Measuring Human-Language Model Alignment on Causal and Moral Judgment Tasks
Neural Information Processing Systems (NeurIPS), 2023
Allen Nie
Yuhui Zhang
Atharva Amdekar
Chris Piech
Tatsunori Hashimoto
Tobias Gerstenberg
190
54
0
30 Oct 2023
Moral Sparks in Social Media Narratives
ACM Conference on Hypertext & Social Media (HT), 2023
Ruijie Xi
Munindar P. Singh
LRM
172
2
0
30 Oct 2023
EtiCor: Corpus for Analyzing LLMs for Etiquettes
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Ashutosh Dwivedi
Pradhyumna Lavania
Ashutosh Modi
139
33
0
29 Oct 2023
NormDial: A Comparable Bilingual Synthetic Dialog Dataset for Modeling Social Norm Adherence and Violation
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Aochong Li
Mallika Subramanian
Arkadiy Saakyan
Sky CH-Wang
Smaranda Muresan
151
20
0
23 Oct 2023
Values, Ethics, Morals? On the Use of Moral Concepts in NLP Research
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Karina Vida
Judith Simon
Anne Lauscher
163
20
0
21 Oct 2023
Denevil: Towards Deciphering and Navigating the Ethical Values of Large Language Models via Instruction Learning
International Conference on Learning Representations (ICLR), 2023
Shitong Duan
Xiaoyuan Yi
Peng Zhang
Tun Lu
Xing Xie
Ning Gu
170
23
0
17 Oct 2023
The Past, Present and Better Future of Feedback Learning in Large Language Models for Subjective Human Preferences and Values
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Hannah Rose Kirk
Andrew M. Bean
Bertie Vidgen
Paul Röttger
Scott A. Hale
ALM
271
60
0
11 Oct 2023
Aligning Language Models with Human Preferences via a Bayesian Approach
Neural Information Processing Systems (NeurIPS), 2023
Jiashuo Wang
Haozhao Wang
Shichao Sun
Wenjie Li
ALM
262
34
0
09 Oct 2023
STREAM: Social data and knowledge collective intelligence platform for TRaining Ethical AI Models
Ai & Society (AI & Society), 2023
Yuwei Wang
Enmeng Lu
Zizhe Ruan
Yao Liang
Yi Zeng
AI4TS
173
5
0
09 Oct 2023
SYNDICOM: Improving Conversational Commonsense with Error-Injection and Natural Language Feedback
SIGDIAL Conferences (SIGDIAL), 2023
Christopher Richardson
Anirudh S. Sundar
Larry Heck
LRM
198
6
0
18 Sep 2023
SafetyBench: Evaluating the Safety of Large Language Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Zhexin Zhang
Leqi Lei
Lindong Wu
Rui Sun
Yongkang Huang
Chong Long
Xiao Liu
Xuanyu Lei
Jie Tang
Shiyu Huang
LRM
LM&MA
ELM
222
159
0
13 Sep 2023
Rethinking Machine Ethics -- Can LLMs Perform Moral Reasoning through the Lens of Moral Theories?
Jingyan Zhou
Minda Hu
Junan Li
Xiaoying Zhang
Xixin Wu
Irwin King
Helen M. Meng
LRM
198
38
0
29 Aug 2023
From Instructions to Intrinsic Human Values -- A Survey of Alignment Goals for Big Models
Jing Yao
Xiaoyuan Yi
Xiting Wang
Yongfeng Zhang
Xing Xie
ALM
317
55
0
23 Aug 2023
Through the Lens of Core Competency: Survey on Evaluation of Large Language Models
China National Conference on Chinese Computational Linguistics (CNCCL), 2023
Ziyu Zhuang
Qiguang Chen
Longxuan Ma
Mingda Li
Yi Han
Yushan Qian
Haopeng Bai
Zixian Feng
Weinan Zhang
Ting Liu
ELM
129
22
0
15 Aug 2023
Modeling Cross-Cultural Pragmatic Inference with Codenames Duet
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Omar Shaikh
Caleb Ziems
William B. Held
Aryan Pariani
Fred Morstatter
Diyi Yang
164
18
0
04 Jun 2023
Conflicts, Villains, Resolutions: Towards models of Narrative Media Framing
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Lea Frermann
Jiatong Li
Shima Khanehzar
Gosia Mikołajczak
216
19
0
03 Jun 2023
NormBank: A Knowledge Bank of Situational Social Norms
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Caleb Ziems
Jane Dwivedi-Yu
Yi-Chia Wang
A. Halevy
Diyi Yang
243
55
0
26 May 2023
Training Socially Aligned Language Models on Simulated Social Interactions
International Conference on Learning Representations (ICLR), 2023
Ruibo Liu
Ruixin Yang
Chenyan Jia
Ge Zhang
Denny Zhou
Andrew M. Dai
Diyi Yang
Soroush Vosoughi
ALM
188
87
0
26 May 2023
NormMark: A Weakly Supervised Markov Model for Socio-cultural Norm Discovery
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Farhad Moghimifar
Shilin Qu
Tongtong Wu
Yuan-Fang Li
Gholamreza Haffari
121
5
0
26 May 2023
Sociocultural Norm Similarities and Differences via Situational Alignment and Explainable Textual Entailment
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Sky CH-Wang
Arkadiy Saakyan
Aochong Li
Zhou Yu
Smaranda Muresan
182
24
0
23 May 2023
1
2
Next