ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2510.11062
  4. Cited By
Stronger Together: On-Policy Reinforcement Learning for Collaborative LLMs
v1v2 (latest)

Stronger Together: On-Policy Reinforcement Learning for Collaborative LLMs

13 October 2025
Yujie Zhao
Lanxiang Hu
Y. Wang
Minmin Hou
Hao Zhang
Ke Ding
Jishen Zhao
ArXiv (abs)PDFHTMLHuggingFace (23 upvotes)Github (4★)

Papers citing "Stronger Together: On-Policy Reinforcement Learning for Collaborative LLMs"

1 / 1 papers shown
Title
MARFT: Multi-Agent Reinforcement Fine-Tuning
MARFT: Multi-Agent Reinforcement Fine-Tuning
Junwei Liao
Muning Wen
Jun Wang
Weinan Zhang
OffRL
337
17
0
21 Apr 2025
1