ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2507.07998
  4. Cited By
PyVision: Agentic Vision with Dynamic Tooling
v1v2v3 (latest)

PyVision: Agentic Vision with Dynamic Tooling

10 July 2025
Shitian Zhao
H. Zhang
Shaoheng Lin
Ming Li
Qilong Wu
Kaipeng Zhang
Chen Wei
    LRM
ArXiv (abs)PDFHTMLHuggingFace (28 upvotes)

Papers citing "PyVision: Agentic Vision with Dynamic Tooling"

13 / 13 papers shown
CodeV: Code with Images for Faithful Visual Reasoning via Tool-Aware Policy Optimization
CodeV: Code with Images for Faithful Visual Reasoning via Tool-Aware Policy Optimization
X. Hou
Shaoyuan Xu
Manan Biyani
Mayan Li
Jia-Wei Liu
Todd C. Hollon
Bryan Wang
140
0
0
24 Nov 2025
DeepEyesV2: Toward Agentic Multimodal Model
DeepEyesV2: Toward Agentic Multimodal ModelIEEE Transactions on Audio, Speech, and Language Processing (TASLP), 2025
Jack Hong
Chenxiao Zhao
ChengLin Zhu
Weiheng Lu
Guohai Xu
Xing Yu
130
5
0
07 Nov 2025
V-Thinker: Interactive Thinking with Images
V-Thinker: Interactive Thinking with Images
Runqi Qiao
Qiuna Tan
Minghan Yang
Guanting Dong
Peiqing Yang
...
Lan Yang
Chong Sun
Chen Li
Honggang Zhang
Honggang Zhang
MLLMLRM
428
2
0
06 Nov 2025
TIR-Bench: A Comprehensive Benchmark for Agentic Thinking-with-Images Reasoning
TIR-Bench: A Comprehensive Benchmark for Agentic Thinking-with-Images Reasoning
Ming Li
Jike Zhong
Shitian Zhao
H. Zhang
Shaoheng Lin
Yuxiang Lai
Chen Wei
Konstantinos Psounis
Kaipeng Zhang
EGVMLRMVLM
468
3
0
03 Nov 2025
MGA: Memory-Driven GUI Agent for Observation-Centric Interaction
MGA: Memory-Driven GUI Agent for Observation-Centric Interaction
Weihua Cheng
Ersheng Ni
Wenlong Wang
Yifei Sun
Junming Liu
Wangyu Shen
Yirong Chen
Ding Wang
Botian Shi
LLMAGLM&Ro
296
1
0
28 Oct 2025
Empowering Real-World: A Survey on the Technology, Practice, and Evaluation of LLM-driven Industry Agents
Empowering Real-World: A Survey on the Technology, Practice, and Evaluation of LLM-driven Industry Agents
Yihong Tang
Kehai Chen
Liang Yue
Jinxin Fan
Caishen Zhou
...
Kaiyang Guo
Xingshan Zeng
Wenjing Cun
L. Shang
Min Zhang
LLMAG
158
0
0
20 Oct 2025
Video-STAR: Reinforcing Open-Vocabulary Action Recognition with Tools
Video-STAR: Reinforcing Open-Vocabulary Action Recognition with Tools
Zhenlong Yuan
Xiangyan Qu
Chengxuan Qian
Rui Chen
Jing Tang
...
Xiangxiang Chu
Dapeng Zhang
Yiwei Wang
Y. Cai
Shuo Li
VLMLRM
140
8
0
09 Oct 2025
Alignment Tipping Process: How Self-Evolution Pushes LLM Agents Off the Rails
Alignment Tipping Process: How Self-Evolution Pushes LLM Agents Off the Rails
Siwei Han
Jiaqi Liu
Yaofeng Su
Wenbo Duan
Xinyuan Liu
Cihang Xie
Mohit Bansal
Mingyu Ding
Linjun Zhang
Huaxiu Yao
137
1
0
06 Oct 2025
PhysicsMinions: Winning Gold Medals in the Latest Physics Olympiads with a Coevolutionary Multimodal Multi-Agent System
PhysicsMinions: Winning Gold Medals in the Latest Physics Olympiads with a Coevolutionary Multimodal Multi-Agent System
F. Yu
Junchi Yao
Ziyi Wang
Haiyuan Wan
Y. Huang
...
Ning Ding
Ganqu Cui
Wenlong Zhang
Wanli Ouyang
Peng Ye
LRMAI4CE
94
2
0
29 Sep 2025
Visual Programmability: A Guide for Code-as-Thought in Chart Understanding
Visual Programmability: A Guide for Code-as-Thought in Chart Understanding
Bohao Tang
Yan Ma
Fei Zhang
Jiadi Su
Ethan Chern
Zhulin Hu
Zhixin Wang
Pengfei Liu
Ya Zhang
LRM
133
0
0
11 Sep 2025
Reinforced Visual Perception with Tools
Reinforced Visual Perception with Tools
Zetong Zhou
Dongping Chen
Zixian Ma
Zhihan Hu
Mingyang Fu
Sinan Wang
Yao Wan
Zhou Zhao
Ranjay Krishna
OffRLVLMLRM
155
11
0
01 Sep 2025
WebWatcher: Breaking New Frontier of Vision-Language Deep Research Agent
WebWatcher: Breaking New Frontier of Vision-Language Deep Research Agent
Xin Guan
Peng Xia
Zhen Zhang
Xinyu Wang
Qiuchen Wang
...
Kuan Li
Yong Jiang
Pengjun Xie
Fei Huang
Jingren Zhou
328
31
0
07 Aug 2025
Geoint-R1: Formalizing Multimodal Geometric Reasoning with Dynamic Auxiliary Constructions
Geoint-R1: Formalizing Multimodal Geometric Reasoning with Dynamic Auxiliary Constructions
Jingxuan Wei
Caijun Jia
Qi Chen
Honghao He
Linzhuang Sun
Conghui He
Lijun Wu
Bihui Yu
Cheng Tan
LRM
188
3
0
05 Aug 2025
1