arXiv:2406.10415
PRISM: A Design Framework for Open-Source Foundation Model Safety
14 June 2024
Terrence Neumann
Bryan Jones
Papers citing "PRISM: A Design Framework for Open-Source Foundation Model Safety"
3 / 3 papers shown
Title
Beyond Release: Access Considerations for Generative AI Systems
Irene Solaiman
Rishi Bommasani
Dan Hendrycks
Ariel Herbert-Voss
Yacine Jernite
Aviya Skowron
Andrew Trask
60
1
0
23 Feb 2025
Assessing the Brittleness of Safety Alignment via Pruning and Low-Rank Modifications
Boyi Wei
Kaixuan Huang
Yangsibo Huang
Tinghao Xie
Xiangyu Qi
Mengzhou Xia
Prateek Mittal
Mengdi Wang
Peter Henderson
AAML
55
78
0
07 Feb 2024
Scaling Laws for Neural Language Models
Jared Kaplan
Sam McCandlish
T. Henighan
Tom B. Brown
B. Chess
R. Child
Scott Gray
Alec Radford
Jeff Wu
Dario Amodei
226
4,424
0
23 Jan 2020