ResearchTrend.AI
  • Papers
  • Communities
  • Organizations
  • Events
  • Blog
  • Pricing
  • Feedback
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2508.07976
  4. Cited By
Beyond Ten Turns: Unlocking Long-Horizon Agentic Search with Large-Scale Asynchronous RL
v1v2v3 (latest)

Beyond Ten Turns: Unlocking Long-Horizon Agentic Search with Large-Scale Asynchronous RL

11 August 2025
Jiaxuan Gao
Wei Fu
Minyang Xie
Shusheng Xu
Chuyi He
Zhiyu Mei
Banghua Zhu
Yi Wu
    OffRL
ArXiv (abs)PDFHTMLHuggingFace (44 upvotes)Github (392★)

Papers citing "Beyond Ten Turns: Unlocking Long-Horizon Agentic Search with Large-Scale Asynchronous RL"

8 / 8 papers shown
Title
WebSailor-V2: Bridging the Chasm to Proprietary Agents via Synthetic Data and Scalable Reinforcement Learning
WebSailor-V2: Bridging the Chasm to Proprietary Agents via Synthetic Data and Scalable Reinforcement Learning
Kuan Li
Zhongwang Zhang
Huifeng Yin
Rui Ye
Yida Zhao
...
Zhen Zhang
Yong Jiang
Pengjun Xie
Fei Huang
Jingren Zhou
0
0
0
16 Sep 2025
Scaling Agents via Continual Pre-training
Scaling Agents via Continual Pre-training
L. Su
Zhen Zhang
Guangyu Li
Zhuo Chen
Chenxi Wang
...
Chenxiong Qian
Yong Jiang
Pengjun Xie
Fei Huang
Jingren Zhou
LLMAGAIFinCLLLM&RoLRM
10
3
0
16 Sep 2025
Single-stream Policy Optimization
Single-stream Policy Optimization
Zhongwen Xu
Zihan Ding
OffRL
1
0
0
16 Sep 2025
ReSum: Unlocking Long-Horizon Search Intelligence via Context Summarization
ReSum: Unlocking Long-Horizon Search Intelligence via Context Summarization
Xixi Wu
Kuan Li
Yida Zhao
Liwen Zhang
Litu Ou
...
Fei Huang
Minhao Cheng
Shuai Wang
Hong Cheng
Jingren Zhou
LLMAGRALM
0
0
0
16 Sep 2025
Reinforcement Learning Foundations for Deep Research Systems: A Survey
Reinforcement Learning Foundations for Deep Research Systems: A Survey
Wenjun Li
Z. Chen
Jingru Lin
Hannan Cao
Wei Han
...
Zhi Zhang
Kuicai Dong
Dexun Li
Chen Zhang
Yong Liu
OffRL
0
0
0
08 Sep 2025
WebExplorer: Explore and Evolve for Training Long-Horizon Web Agents
WebExplorer: Explore and Evolve for Training Long-Horizon Web Agents
Junteng Liu
Yunji Li
Chi Zhang
Jingyang Li
Aili Chen
...
Jiayuan Song
Z. Zhu
Wenhu Chen
Pengyu Zhao
Junxian He
LLMAG
0
4
0
08 Sep 2025
MMSearch-Plus: A Simple Yet Challenging Benchmark for Multimodal Browsing Agents
MMSearch-Plus: A Simple Yet Challenging Benchmark for Multimodal Browsing Agents
Xijia Tao
Yihua Teng
Xinxing Su
Xinyu Fu
Jihao Wu
Chaofan Tao
Ziru Liu
Haoli Bai
Rui Liu
Lingpeng Kong
VLMLRM
24
0
0
29 Aug 2025
Hybrid Deep Searcher: Integrating Parallel and Sequential Search Reasoning
Hybrid Deep Searcher: Integrating Parallel and Sequential Search Reasoning
Dayoon Ko
J. Kim
Haeju Park
Sohyeon Kim
Dahyun Lee
Yongrae Jo
Gunhee Kim
Moontae Lee
Kyungjae Lee
LRMVLM
24
0
0
26 Aug 2025
1