Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2402.05070
Cited By
A Roadmap to Pluralistic Alignment
7 February 2024
Taylor Sorensen
Jared Moore
Jillian R. Fisher
Mitchell L. Gordon
Niloofar Mireshghallah
Christopher Rytting
Andre Ye
Liwei Jiang
Ximing Lu
Nouha Dziri
Tim Althoff
Yejin Choi
Re-assign community
ArXiv
PDF
HTML
Papers citing
"A Roadmap to Pluralistic Alignment"
11 / 11 papers shown
Title
Evaluating the Prompt Steerability of Large Language Models
Erik Miehling
Michael Desmond
K. Ramamurthy
Elizabeth M. Daly
Pierre L. Dognin
Jesus Rios
Djallel Bouneffouf
Miao Liu
LLMSV
79
3
0
19 Nov 2024
Moral Alignment for LLM Agents
Elizaveta Tennant
Stephen Hailes
Mirco Musolesi
28
0
0
02 Oct 2024
Open-World Evaluation for Retrieving Diverse Perspectives
Hung-Ting Chen
Eunsol Choi
23
0
0
26 Sep 2024
Programming Refusal with Conditional Activation Steering
Bruce W. Lee
Inkit Padhi
K. Ramamurthy
Erik Miehling
Pierre L. Dognin
Manish Nagireddy
Amit Dhurandhar
LLMSV
87
13
0
06 Sep 2024
From Distributional to Overton Pluralism: Investigating Large Language Model Alignment
Thom Lake
Eunsol Choi
Greg Durrett
34
9
0
25 Jun 2024
Understanding the Effects of RLHF on LLM Generalisation and Diversity
Robert Kirk
Ishita Mediratta
Christoforos Nalmpantis
Jelena Luketina
Eric Hambro
Edward Grefenstette
Roberta Raileanu
AI4CE
ALM
95
63
0
10 Oct 2023
Cognitive Reframing of Negative Thoughts through Human-Language Model Interaction
Ashish Sharma
Kevin Rushton
Inna Wanyin Lin
David Wadden
Khendra G. Lucas
Adam S. Miner
Theresa Nguyen
Tim Althoff
58
45
0
04 May 2023
Generative Agents: Interactive Simulacra of Human Behavior
J. Park
Joseph C. O'Brien
Carrie J. Cai
Meredith Ringel Morris
Percy Liang
Michael S. Bernstein
LM&Ro
AI4CE
204
1,701
0
07 Apr 2023
Improving alignment of dialogue agents via targeted human judgements
Amelia Glaese
Nat McAleese
Maja Trkebacz
John Aslanides
Vlad Firoiu
...
John F. J. Mellor
Demis Hassabis
Koray Kavukcuoglu
Lisa Anne Hendricks
G. Irving
ALM
AAML
217
495
0
28 Sep 2022
Moral Mimicry: Large Language Models Produce Moral Rationalizations Tailored to Political Identity
Gabriel Simmons
84
36
0
24 Sep 2022
The Authenticity Gap in Human Evaluation
Kawin Ethayarajh
Dan Jurafsky
71
24
0
24 May 2022
1