ResearchTrend.AI
VisualToolAgent (VisTA): A Reinforcement Learning Framework for Visual Tool Selection
Versions: v1, v2 (latest)

26 May 2025
Zeyi Huang
Anirudh Sundara Rajan
Zefan Cai
Wen Xiao
Junjie Hu
Yong Jae Lee
arXiv:2505.20289 (abs, PDF, HTML) · HuggingFace (10 upvotes)

Papers citing "VisualToolAgent (VisTA): A Reinforcement Learning Framework for Visual Tool Selection"

18 / 18 papers shown
Reinforcement Learning for Large Model: A Survey
Weijia Wu
Chen Gao
Joya Chen
Kevin Lin
Qingwei Meng
Yiming Zhang
Yuke Qiu
Hong Zhou
Mike Zheng Shou
316
2
0
24 Dec 2025
From Illusion to Intention: Visual Rationale Learning for Vision-Language Reasoning
C. Wang
Haozhe Wang
Xi Chen
J. Liu
Taofeng Xue
Chong Peng
Donglian Qi
Fangzhen Lin
Yunfeng Yan
OffRL, LRM
312
0
0
28 Nov 2025
Yo'City: Personalized and Boundless 3D Realistic City Scene Generation via Self-Critic Expansion
K. Lu
S. K. Zhou
Hongbin Xu
Gang Xu
Zhifei Yang
Y. Wang
Zhen Xiao
Jieyi Long
Ming Li
231
0
0
24 Nov 2025
ViPER: Empowering the Self-Evolution of Visual Perception Abilities in Vision-Language Model
J. Zhang
Song Jin
Chuanqi Cheng
Yuhan Liu
Yankai Lin
...
Yufei Zhang
F. Jiang
G. Yin
Wei Lin
Rui Yan
VLM
212
3
0
28 Oct 2025
Putting on the Thinking Hats: A Survey on Chain of Thought Fine-tuning from the Perspective of Human Reasoning Mechanism
Xiaoshu Chen
Sihang Zhou
Ke Liang
Duanyang Yuan
Haoyuan Chen
Xiaoyu Sun
Linyuan Meng
Xinwang Liu
ReLM, LRM
224
0
0
15 Oct 2025
RECODE: Reasoning Through Code Generation for Visual Question Answering
Junhong Shen
Mu Cai
Bo Hu
Ameet Talwalkar
David A. Ross
Cordelia Schmid
Alireza Fathi
ReLM, LRM
172
0
0
15 Oct 2025
Beyond Seeing: Evaluating Multimodal LLMs on Tool-Enabled Image Perception, Transformation, and Reasoning
Xingang Guo
Utkarsh Tyagi
Advait Gosai
Paula Vergara
Ernesto Gabriel Hernández Montoya
...
Bin Hu
Yunzhong He
Bing Liu
Rakshith S Srinivasa
VLM, LRM
325
3
0
14 Oct 2025
Look Less, Reason More: Rollout-Guided Adaptive Pixel-Space Reasoning
Xuchen Li
Xuzhao Li
Jiahui Gao
Renjie Pi
Shiyu Hu
Wentao Zhang
VLM, LRM
223
2
0
02 Oct 2025
Latent Visual Reasoning
Bangzheng Li
Ximeng Sun
Jiang-Long Liu
Ze Wang
Jialian Wu
Xiaodong Yu
Hao Chen
Emad Barsoum
Muhao Chen
Zicheng Liu
LRM, VLM
200
6
0
29 Sep 2025
Lego-Edit: A General Image Editing Framework with Model-Level Bricks and MLLM Builder
Qifei Jia
Yu Liu
Yajie Chai
Xintong Yao
Qiming Lu
Y. Zhang
Runyu Shi
Y. Huang
Guoquan Zhang
LM&Ro
121
2
0
16 Sep 2025
CoRGI: Verified Chain-of-Thought Reasoning with Post-hoc Visual Grounding
Shixin Yi
Lin Shang
ReLM, LRM
128
0
0
01 Aug 2025
ReTool: Reinforcement Learning for Strategic Tool Use in LLMs
Jiazhan Feng
Shijue Huang
Xingwei Qu
Ge Zhang
Yujia Qin
Baoquan Zhong
Chengquan Jiang
Jinxin Chi
Wanjun Zhong
OffRL, ReLM, SyDa, KELM, LRM
496
184
0
15 Apr 2025
Vision-R1: Incentivizing Reasoning Capability in Multimodal Large Language Models
Hao Wu
Bohan Jia
Zijie Zhai
Shaosheng Cao
Zheyu Ye
Fei Zhao
Zhe Xu
Yao Hu
Shaohui Lin
MU, OffRL, LRM, MLLM, ReLM, VLM
575
353
0
09 Mar 2025
Qwen2.5-VL Technical Report
S. Bai
Keqin Chen
Xuejing Liu
Jialin Wang
Wenbin Ge
...
Zesen Cheng
Hang Zhang
Zhibo Yang
Haiyang Xu
Junyang Lin
VLM
720
2,841
0
20 Feb 2025
OctoTools: An Agentic Framework with Extensible Tools for Complex Reasoning
Pan Lu
Bowen Chen
Sheng Liu
Rahul Thapa
Joseph Boen
James Zou
LRM
173
41
0
16 Feb 2025
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
DeepSeek-AI
Daya Guo
Dejian Yang
Haowei Zhang
Junxiao Song
...
Shiyu Wang
S. Yu
Shunfeng Zhou
Shuting Pan
S.S. Li
OffRL, AI4TS, LRM, ReLM, VLM
1.2K
5,342
0
22 Jan 2025
RedStar: Does Scaling Long-CoT Data Unlock Better Slow-Reasoning Systems?
Haotian Xu
Xing Wu
Weinong Wang
Zhongzhi Li
Da Zheng
...
Yingying Zhang
Zhijiang Guo
Yaodong Yang
Muhan Zhang
Debing Zhang
ReLM, OffRL, LRM, VLM
288
40
0
20 Jan 2025
ChartX & ChartVLM: A Versatile Benchmark and Foundation Model for Complicated Chart Reasoning
Renqiu Xia
Bo Zhang
Hancheng Ye
Xiangchao Yan
Zijun Chen
...
Min Dou
Ding Wang
Junchi Yan
Yu Qiao
LRM
520
109
0
19 Feb 2024