Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2410.22770
Cited By
InjecGuard: Benchmarking and Mitigating Over-defense in Prompt Injection Guardrail Models
30 October 2024
H. Li
Xiaogeng Liu
SILM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"InjecGuard: Benchmarking and Mitigating Over-defense in Prompt Injection Guardrail Models"
3 / 3 papers shown
Title
JailDAM: Jailbreak Detection with Adaptive Memory for Vision-Language Model
Yi Nian
Shenzhe Zhu
Yuehan Qin
Li Li
Z. Wang
Chaowei Xiao
Yue Zhao
21
0
0
03 Apr 2025
Riddle Me This! Stealthy Membership Inference for Retrieval-Augmented Generation
A. Naseh
Yuefeng Peng
Anshuman Suri
Harsh Chaudhari
Alina Oprea
Amir Houmansadr
SILM
AAML
RALM
49
0
0
01 Feb 2025
SoK: Unifying Cybersecurity and Cybersafety of Multimodal Foundation Models with an Information Theory Approach
Ruoxi Sun
Jiamin Chang
Hammond Pearce
Chaowei Xiao
B. Li
Qi Wu
Surya Nepal
Minhui Xue
30
0
0
17 Nov 2024
1