arXiv: 2511.21050

Breaking the Safety-Capability Tradeoff: Reinforcement Learning with Verifiable Rewards Maintains Safety Guardrails in LLMs

26 November 2025

Dongkyu Derek Cho

Arijit Ghosh Chowdhury

Rohit Thekkanal

Negin Sokhandan

Sharlina Keshava

Hannah R Marlowe

Papers citing "Breaking the Safety-Capability Tradeoff: Reinforcement Learning with Verifiable Rewards Maintains Safety Guardrails in LLMs"

No papers found
