2409.01247
Conversational Complexity for Assessing Risk in Large Language Models
2 September 2024
John Burden, Manuel Cebrian, José Hernández Orallo
Papers citing "Conversational Complexity for Assessing Risk in Large Language Models"
2 / 2 papers shown
Title
Red Teaming Language Models to Reduce Harms: Methods, Scaling Behaviors, and Lessons Learned
Deep Ganguli, Liane Lovitt, John Kernion, Amanda Askell, Yuntao Bai, ..., Nicholas Joseph, Sam McCandlish, C. Olah, Jared Kaplan, Jack Clark
216
327
0
23 Aug 2022
Training language models to follow instructions with human feedback
Long Ouyang, Jeff Wu, Xu Jiang, Diogo Almeida, Carroll L. Wainwright, ..., Amanda Askell, Peter Welinder, Paul Christiano, Jan Leike, Ryan J. Lowe
OSLM
ALM
301
11,730
0
04 Mar 2022