arXiv:2505.20289

VisualToolAgent (VisTA): A Reinforcement Learning Framework for Visual Tool Selection

26 May 2025

Anirudh Sundara Rajan

ArXiv (abs) · PDF · HTML · HuggingFace (10 upvotes)

Papers citing "VisualToolAgent (VisTA): A Reinforcement Learning Framework for Visual Tool Selection"

18 / 18 papers shown

Reinforcement Learning for Large Model: A Survey

Mike Zheng Shou

316

2

0

24 Dec 2025

From Illusion to Intention: Visual Rationale Learning for Vision-Language Reasoning

312

0

0

28 Nov 2025

Yo'City: Personalized and Boundless 3D Realistic City Scene Generation via Self-Critic Expansion

231

0

0

24 Nov 2025

ViPER: Empowering the Self-Evolution of Visual Perception Abilities in Vision-Language Model

...

212

3

0

28 Oct 2025

Putting on the Thinking Hats: A Survey on Chain of Thought Fine-tuning from the Perspective of Human Reasoning Mechanism

224

0

0

15 Oct 2025

RECODE: Reasoning Through Code Generation for Visual Question Answering

Ameet Talwalkar

Cordelia Schmid

172

0

0

15 Oct 2025

Beyond Seeing: Evaluating Multimodal LLMs on Tool-Enabled Image Perception, Transformation, and Reasoning

Ernesto Gabriel Hernández Montoya

...

Rakshith S Srinivasa

325

3

0

14 Oct 2025

Look Less, Reason More: Rollout-Guided Adaptive Pixel-Space Reasoning

223

2

0

02 Oct 2025

Latent Visual Reasoning

200

6

0

29 Sep 2025

Lego-Edit: A General Image Editing Framework with Model-Level Bricks and MLLM Builder

121

2

0

16 Sep 2025

CoRGI: Verified Chain-of-Thought Reasoning with Post-hoc Visual Grounding

128

0

0

01 Aug 2025

ReTool: Reinforcement Learning for Strategic Tool Use in LLMs

Chengquan Jiang

OffRL ReLM SyDa KELM LRM

496

184

0

15 Apr 2025

Vision-R1: Incentivizing Reasoning Capability in Multimodal Large Language Models

MU OffRL LRM MLLM ReLM VLM

575

353

0

09 Mar 2025

Qwen2.5-VL Technical Report

...

720

2,841

0

20 Feb 2025

OctoTools: An Agentic Framework with Extensible Tools for Complex Reasoning

173

41

0

16 Feb 2025

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

...

OffRL AI4TS LRM ReLM VLM

1.2K

5,342

0

22 Jan 2025

RedStar: Does Scaling Long-CoT Data Unlock Better Slow-Reasoning Systems?

...

ReLM OffRL LRM VLM

288

40

0

20 Jan 2025

ChartX & ChartVLM: A Versatile Benchmark and Foundation Model for Complicated Chart Reasoning

...

Junchi Yan

Yu Qiao

520

109

0

19 Feb 2024