Training Socially Aligned Language Models on Simulated Social Interactions

26 May 2023
Ruibo Liu
Ruixin Yang
Chenyan Jia
Ge Zhang
Denny Zhou
Andrew M. Dai
Diyi Yang
Soroush Vosoughi
    ALM

Papers citing "Training Socially Aligned Language Models on Simulated Social Interactions"

16 / 16 papers shown
From Language to Action: A Review of Large Language Models as Autonomous Agents and Tool Users
Sadia Sultana Chowa
Riasad Alvi
Subhey Sadi Rahman
M. Rahman
M. R
M. Islam
Mukhtar Hussain
Sami Azam
LLMAG, LM&Ro, ELM
56
0
0
24 Aug 2025
Multi-level Value Alignment in Agentic AI Systems: Survey and Perspectives
Wei Zeng
Hengshu Zhu
Chuan Qin
Han Wu
Yihang Cheng
...
Xiaowei Jin
Yinuo Shen
Zhenxing Wang
Feimin Zhong
Hui Xiong
AI4TS
171
0
0
11 Jun 2025
From Threat to Tool: Leveraging Refusal-Aware Injection Attacks for Safety Alignment
Kyubyung Chae
Hyunbin Jin
Taesup Kim
74
0
0
07 Jun 2025
An Embarrassingly Simple Defense Against LLM Abliteration Attacks
Harethah Shairah
Hasan Hammoud
Bernard Ghanem
G. Turkiyyah
115
1
0
25 May 2025
The Call for Socially Aware Language Technologies
Diyi Yang
Dirk Hovy
David Jurgens
Barbara Plank
VLM
218
14
0
24 Feb 2025
AgentSociety: Large-Scale Simulation of LLM-Driven Generative Agents Advances Understanding of Human Behaviors and Society
J. Piao
Yuwei Yan
Jun Zhang
Nian Li
Junbo Yan
...
Fengli Xu
Fang Zhang
Ke Rong
Jun Su
Yongqian Li
AI4CE
240
37
0
12 Feb 2025
Trustworthy AI: Safety, Bias, and Privacy -- A Survey
Xingli Fang
Jianwei Li
Varun Mulchandani
Jung-Eun Kim
149
1
0
11 Feb 2025
Simulating Human-like Daily Activities with Desire-driven Autonomy
Yiding Wang
Yuxuan Chen
Fangwei Zhong
Long Ma
Yizhou Wang
251
10
0
09 Dec 2024
Beyond the Safety Bundle: Auditing the Helpful and Harmless Dataset
Khaoula Chehbouni
Jonathan Colaço-Carr
Yash More
Jackie CK Cheung
G. Farnadi
250
4
0
12 Nov 2024
Large Language Models, and LLM-Based Agents, Should Be Used to Enhance the Digital Public Sphere
Seth Lazar
Luke Thorburn
Tian Jin
Luca Belli
105
4
0
15 Oct 2024
Moral Alignment for LLM Agents
Elizaveta Tennant
Stephen Hailes
Mirco Musolesi
203
13
0
02 Oct 2024
Compromesso! Italian Many-Shot Jailbreaks Undermine the Safety of Large Language Models
Fabio Pernisi
Dirk Hovy
Paul Röttger
120
1
0
08 Aug 2024
Scaling Large Language Model-based Multi-Agent Collaboration
Chen Qian
Zihao Xie
YiFei Wang
Wei Liu
Yufan Dang
...
Zhuoyun Du
Weize Chen
Cheng Yang
Zhiyuan Liu
Maosong Sun
AI4CE, LLMAG, LM&Ro
207
101
0
11 Jun 2024
Confidence Calibration and Rationalization for LLMs via Multi-Agent Deliberation
Ruixin Yang
Dheeraj Rajagopal
S. Hayati
Bin Hu
Dongyeop Kang
LLMAG
174
9
0
14 Apr 2024
Emergence of Social Norms in Generative Agent Societies: Principles and Architecture
Siyue Ren
Zhiyao Cui
Ruiqi Song
Zhen Wang
Shuyue Hu
LLMAG
123
10
0
13 Mar 2024
COPR: Continual Human Preference Learning via Optimal Policy Regularization
Han Zhang
Lin Gui
Yu Lei
Yuanzhao Zhai
Yehong Zhang
...
Hui Wang
Yue Yu
Kam-Fai Wong
Bin Liang
Ruifeng Xu
CLL
144
5
0
22 Feb 2024