Instructional Segment Embedding: Improving LLM Safety with Instruction Hierarchy

9 October 2024

Papers citing "Instructional Segment Embedding: Improving LLM Safety with Instruction Hierarchy"

2 / 2 papers shown

Title
The Illusion of Role Separation: Hidden Shortcuts in LLM Role Learning (and How to Fix Them) Zihao Wang Yibo Jiang Jiahao Yu Heqing Huang 33 0 0 01 May 2025
WASP: Benchmarking Web Agent Security Against Prompt Injection Attacks Ivan Evtimov Arman Zharmagambetov Aaron Grattafiori Chuan Guo Kamalika Chaudhuri AAML 33 0 0 22 Apr 2025