Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2510.11062
Cited By
v1
v2 (latest)
Stronger Together: On-Policy Reinforcement Learning for Collaborative LLMs
13 October 2025
Yujie Zhao
Lanxiang Hu
Y. Wang
Minmin Hou
Hao Zhang
Ke Ding
Jishen Zhao
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (23 upvotes)
Github (4★)
Papers citing
"Stronger Together: On-Policy Reinforcement Learning for Collaborative LLMs"
1 / 1 papers shown
Title
MARFT: Multi-Agent Reinforcement Fine-Tuning
Junwei Liao
Muning Wen
Jun Wang
Weinan Zhang
OffRL
337
17
0
21 Apr 2025
1