ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2511.21050
  4. Cited By
Breaking the Safety-Capability Tradeoff: Reinforcement Learning with Verifiable Rewards Maintains Safety Guardrails in LLMs

Breaking the Safety-Capability Tradeoff: Reinforcement Learning with Verifiable Rewards Maintains Safety Guardrails in LLMs

26 November 2025
Dongkyu Derek Cho
Huan Song
Arijit Ghosh Chowdhury
Haotian An
Y. X. R. Wang
Rohit Thekkanal
Negin Sokhandan
Sharlina Keshava
Hannah R Marlowe
ArXiv (abs)PDFHTML

Papers citing "Breaking the Safety-Capability Tradeoff: Reinforcement Learning with Verifiable Rewards Maintains Safety Guardrails in LLMs"

0 / 0 papers shown

No papers found

Page 1 of 0