Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales

Terms and Conditions

Twitter GitHub LinkedIn Bluesky Youtube

© 2026 ResearchTrend.AI, All rights reserved.

Home
Papers
2509.01055
Cited By

VerlTool: Towards Holistic Agentic Reinforcement Learning with Tool Use

v1v2v3 (latest)

VerlTool: Towards Holistic Agentic Reinforcement Learning with Tool Use

1 September 2025

ArXiv (abs)PDF HTML HuggingFace (61 upvotes)Github (612★)

Papers citing "VerlTool: Towards Holistic Agentic Reinforcement Learning with Tool Use"

18 / 18 papers shown

Environment Scaling for Interactive Agentic Experience Collection: A Survey

Environment Scaling for Interactive Agentic Experience Collection: A Survey

148

0

0

24 Dec 2025

On GRPO Collapse in Search-R1: The Lazy Likelihood-Displacement Death Spiral

On GRPO Collapse in Search-R1: The Lazy Likelihood-Displacement Death Spiral

Christos Thrampoulidis

37

2

0

03 Dec 2025

Agentic Policy Optimization via Instruction-Policy Co-Evolution

99

0

0

01 Dec 2025

SkyRL-Agent: Efficient RL Training for Multi-turn LLM Agent

Sumanth R. Hegde

...

Matei A. Zaharia

Joseph E. Gonzalez

122

2

0

20 Nov 2025

Agent0: Unleashing Self-Evolving Agents from Zero Data via Tool-Integrated Reasoning

LLMAG LM&Ro SyDa

717

3

0

20 Nov 2025

The Path Not Taken: RLVR Provably Learns Off the Principals

The Path Not Taken: RLVR Provably Learns Off the Principals

...

191

3

0

11 Nov 2025

Scaling Agent Learning via Experience Synthesis

Scaling Agent Learning via Experience Synthesis

...

456

1

0

05 Nov 2025

Sharpness-Controlled Group Relative Policy Optimization with Token-Level Probability Shaping

Sharpness-Controlled Group Relative Policy Optimization with Token-Level Probability Shaping

136

0

0

29 Oct 2025

Incentivizing Agentic Reasoning in LLM Judges via Tool-Integrated Reinforcement Learning

Incentivizing Agentic Reasoning in LLM Judges via Tool-Integrated Reinforcement Learning

238

5

0

27 Oct 2025

DeepAgent: A General Reasoning Agent with Scalable Toolsets

DeepAgent: A General Reasoning Agent with Scalable Toolsets

...

130

7

0

24 Oct 2025

A Comprehensive Survey on Reinforcement Learning-based Agentic Search: Foundations, Roles, Optimizations, Evaluations, and Applications

A Comprehensive Survey on Reinforcement Learning-based Agentic Search: Foundations, Roles, Optimizations, Evaluations, and Applications

Charu C. Aggarwal

555

1

0

19 Oct 2025

Agentic Entropy-Balanced Policy Optimization

Agentic Entropy-Balanced Policy Optimization

...

91

2

0

16 Oct 2025

Reinforcement Learning for Tool-Integrated Interleaved Thinking towards Cross-Domain Generalization

Reinforcement Learning for Tool-Integrated Interleaved Thinking towards Cross-Domain Generalization

116

1

0

13 Oct 2025

BrowserAgent: Building Web Agents with Human-Inspired Web Browsing Actions

BrowserAgent: Building Web Agents with Human-Inspired Web Browsing Actions

...

194

1

0

12 Oct 2025

AlphaApollo: Orchestrating Foundation Models and Professional Tools into a Self-Evolving System for Deep Agentic Reasoning

AlphaApollo: Orchestrating Foundation Models and Professional Tools into a Self-Evolving System for Deep Agentic Reasoning

...

Masashi Sugiyama

112

0

0

05 Oct 2025

QUASAR: Quantum Assembly Code Generation Using Tool-Augmented LLMs via Agentic RL

QUASAR: Quantum Assembly Code Generation Using Tool-Augmented LLMs via Agentic RL

130

2

0

01 Oct 2025

Language Models Can Learn from Verbal Feedback Without Scalar Rewards

Language Models Can Learn from Verbal Feedback Without Scalar Rewards

153

3

0

26 Sep 2025

Variational Reasoning for Language Models

Variational Reasoning for Language Models

202

0

0

26 Sep 2025