ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2510.04695
  4. Cited By
Beyond Outcome Reward: Decoupling Search and Answering Improves LLM Agents

Beyond Outcome Reward: Decoupling Search and Answering Improves LLM Agents

6 October 2025
Yiding Wang
Zhepei Wei
Xinyu Zhu
Yu Meng
ArXiv (abs)PDFHTMLGithub (4★)

Papers citing "Beyond Outcome Reward: Decoupling Search and Answering Improves LLM Agents"

1 / 1 papers shown
Title
WebAgent-R1: Training Web Agents via End-to-End Multi-Turn Reinforcement Learning
WebAgent-R1: Training Web Agents via End-to-End Multi-Turn Reinforcement Learning
Zhepei Wei
Wenlin Yao
Yao Liu
Weizhi Zhang
Qin Lu
...
Puyang Xu
Chao Zhang
Bing Yin
Hyokun Yun
Lihong Li
OffRLCLLOnRLLRM
235
29
0
22 May 2025
1