2511.21050
Cited By
Breaking the Safety-Capability Tradeoff: Reinforcement Learning with Verifiable Rewards Maintains Safety Guardrails in LLMs
26 November 2025
Dongkyu Derek Cho
Huan Song
Arijit Ghosh Chowdhury
Haotian An
Y. X. R. Wang
Rohit Thekkanal
Negin Sokhandan
Sharlina Keshava
Hannah R Marlowe
Papers citing
"Breaking the Safety-Capability Tradeoff: Reinforcement Learning with Verifiable Rewards Maintains Safety Guardrails in LLMs"
No papers found