ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2505.10218
  4. Cited By
RAIDEN-R1: Improving Role-awareness of LLMs via GRPO with Verifiable Reward

RAIDEN-R1: Improving Role-awareness of LLMs via GRPO with Verifiable Reward

15 May 2025
Zongsheng Wang
Kaili Sun
Bowen Wu
Qun Yu
Ying Li
Baoxun Wang
ArXiv (abs)PDFHTML

Papers citing "RAIDEN-R1: Improving Role-awareness of LLMs via GRPO with Verifiable Reward"

2 / 2 papers shown
Title
Too Good to be Bad: On the Failure of LLMs to Role-Play Villains
Too Good to be Bad: On the Failure of LLMs to Role-Play Villains
Zihao Yi
Qingxuan Jiang
Ruotian Ma
Xingyu Chen
Qu Yang
...
Fanghua Ye
Ying Shen
Zhaopeng Tu
Xiaolong Li
Linus
106
1
0
07 Nov 2025
REASONING GYM: Reasoning Environments for Reinforcement Learning with Verifiable Rewards
REASONING GYM: Reasoning Environments for Reinforcement Learning with Verifiable Rewards
Zafir Stojanovski
Oliver Stanley
Joe Sharratt
Richard Jones
Abdulhakeem Adefioye
Jean Kaddour
Andreas Kopf
OffRLLRM
294
36
0
30 May 2025
1