Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2308.09490
Cited By
Balancing Transparency and Risk: The Security and Privacy Risks of Open-Source Machine Learning Models
18 August 2023
Dominik Hintersdorf
Lukas Struppek
Kristian Kersting
SILM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Balancing Transparency and Risk: The Security and Privacy Risks of Open-Source Machine Learning Models"
6 / 6 papers shown
Title
Unelicitable Backdoors in Language Models via Cryptographic Transformer Circuits
Andis Draguns
Andrew Gritsevskiy
S. Motwani
Charlie Rogers-Smith
Jeffrey Ladish
Christian Schroeder de Witt
40
2
0
03 Jun 2024
MM-SafetyBench: A Benchmark for Safety Evaluation of Multimodal Large Language Models
Xin Liu
Yichen Zhu
Jindong Gu
Yunshi Lan
Chao Yang
Yu Qiao
19
80
0
29 Nov 2023
Shadow Alignment: The Ease of Subverting Safely-Aligned Language Models
Xianjun Yang
Xiao Wang
Qi Zhang
Linda R. Petzold
William Yang Wang
Xun Zhao
Dahua Lin
18
160
0
04 Oct 2023
Plug & Play Attacks: Towards Robust and Flexible Model Inversion Attacks
Lukas Struppek
Dominik Hintersdorf
Antonio De Almeida Correia
Antonia Adler
Kristian Kersting
MIACV
45
62
0
28 Jan 2022
Extracting Training Data from Large Language Models
Nicholas Carlini
Florian Tramèr
Eric Wallace
Matthew Jagielski
Ariel Herbert-Voss
...
Tom B. Brown
D. Song
Ulfar Erlingsson
Alina Oprea
Colin Raffel
MLAU
SILM
267
1,798
0
14 Dec 2020
A Style-Based Generator Architecture for Generative Adversarial Networks
Tero Karras
S. Laine
Timo Aila
262
10,320
0
12 Dec 2018
1