Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales

Terms and Conditions

Twitter GitHub LinkedIn Bluesky Youtube

© 2026 ResearchTrend.AI, All rights reserved.

Home
Papers
2408.11182
Cited By

Hide Your Malicious Goal Into Benign Narratives: Jailbreak Large Language Models through Carrier Articles

v1v2 (latest)

Hide Your Malicious Goal Into Benign Narratives: Jailbreak Large Language Models through Carrier Articles

20 August 2024

Peng Liu

ArXiv (abs)PDF HTML

Papers citing "Hide Your Malicious Goal Into Benign Narratives: Jailbreak Large Language Models through Carrier Articles"

1 / 1 papers shown

Layer-Level Self-Exposure and Patch: Affirmative Token Mitigation for Jailbreak Attack Defense

Layer-Level Self-Exposure and Patch: Affirmative Token Mitigation for Jailbreak Attack DefenseNorth American Chapter of the Association for Computational Linguistics (NAACL), 2025

Kaixiong Zhou

339

7

0

05 Jan 2025

Page 1 of 1