Communities
Connect sessions
AI calendar
Organizations
Contact Sales
Search
Open menu
Home
Papers
2506.16078
Cited By
Probing the Robustness of Large Language Models Safety to Latent Perturbations
19 June 2025
Tianle Gu
Kexin Huang
Zongqi Wang
Yixu Wang
Jie Li
Yuanqi Yao
Yang Yao
Yujiu Yang
Yan Teng
Yingchun Wang
AAML
LLMSV
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Probing the Robustness of Large Language Models Safety to Latent Perturbations"
1 / 1 papers shown
Title
The Rogue Scalpel: Activation Steering Compromises LLM Safety
Anton Korznikov
Andrey V. Galichin
Alexey Dontsov
Oleg Y. Rogov
Ivan Oseledets
Elena Tutubalina
LLMSV
AAML
12
0
0
26 Sep 2025
1