ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2510.07968
  4. Cited By
From Defender to Devil? Unintended Risk Interactions Induced by LLM Defenses

From Defender to Devil? Unintended Risk Interactions Induced by LLM Defenses

9 October 2025
Xiangtao Meng
Tianshuo Cong
Li Wang
Wenyu Chen
Zheng Li
Shanqing Guo
Xiaoyun Wang
    AAML
ArXiv (abs)PDFHTML

Papers citing "From Defender to Devil? Unintended Risk Interactions Induced by LLM Defenses"

1 / 1 papers shown
Title
Beyond the Trade-off: Self-Supervised Reinforcement Learning for Reasoning Models' Instruction Following
Beyond the Trade-off: Self-Supervised Reinforcement Learning for Reasoning Models' Instruction Following
Qingyu Ren
Qianyu He
Bowei Zhang
Jie Zeng
Jiaqing Liang
Yanghua Xiao
Weikang Zhou
Zeye Sun
Fei Yu
OffRLLRM
38
0
0
04 Aug 2025
1