ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2407.10956
  4. Cited By
Spider2-V: How Far Are Multimodal Agents From Automating Data Science
  and Engineering Workflows?

Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?

15 July 2024
Ruisheng Cao
Fangyu Lei
Haoyuan Wu
Jixuan Chen
Yeqiao Fu
Hongcheng Gao
Xinzhuang Xiong
Hanchong Zhang
Yuchen Mao
Wenjing Hu
Tianbao Xie
Hongshen Xu
Danyang Zhang
Sida Wang
Ruoxi Sun
Pengcheng Yin
Caiming Xiong
Ansong Ni
Qian Liu
Victor Zhong
Lu Chen
Kai Yu
Tao Yu
ArXiv (abs)PDFHTMLHuggingFace (7 upvotes)

Papers citing "Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?"

22 / 22 papers shown
Empowering Real-World: A Survey on the Technology, Practice, and Evaluation of LLM-driven Industry Agents
Empowering Real-World: A Survey on the Technology, Practice, and Evaluation of LLM-driven Industry Agents
Yihong Tang
Kehai Chen
Liang Yue
Jinxin Fan
Caishen Zhou
...
Kaiyang Guo
Xingshan Zeng
Wenjing Cun
L. Shang
Min Zhang
LLMAG
161
0
0
20 Oct 2025
LEGOMem: Modular Procedural Memory for Multi-agent LLM Systems for Workflow Automation
LEGOMem: Modular Procedural Memory for Multi-agent LLM Systems for Workflow Automation
Dongge Han
Camille Couturier
Daniel Madrigal Diaz
Xuchao Zhang
Victor Rühle
Saravan Rajmohan
110
2
0
06 Oct 2025
GuirlVG: Incentivize GUI Visual Grounding via Empirical Exploration on Reinforcement Learning
GuirlVG: Incentivize GUI Visual Grounding via Empirical Exploration on Reinforcement Learning
Weitai Kang
Bin Lei
Gaowen Liu
Caiwen Ding
Yan Yan
152
1
0
06 Aug 2025
Large Language Model-based Data Science Agent: A Survey
Large Language Model-based Data Science Agent: A Survey
Ke Chen
Peiran Wang
Ke Chen
Xianyang Zhan
Haohan Wang
LLMAGLM&RoAI4TSLM&MAAI4CE
789
7
0
02 Aug 2025
DSBC : Data Science task Benchmarking with Context engineering
DSBC : Data Science task Benchmarking with Context engineering
Ram Mohan Rao Kadiyala
Siddhant Gupta
Jebish Purbey
Giulio Martini
Suman Debnath
Suman Debnath
Hamza Farooq
LLMAG
216
1
0
31 Jul 2025
OS-MAP: How Far Can Computer-Using Agents Go in Breadth and Depth?
OS-MAP: How Far Can Computer-Using Agents Go in Breadth and Depth?
Xuetian Chen
Yinghao Chen
Xinfeng Yuan
Zhuo Peng
Lu Chen
...
Tianbao Xie
Zhiyong Wu
Qiushi Sun
Biqing Qi
Bowen Zhou
180
3
0
25 Jul 2025
Augmented Vision-Language Models: A Systematic Review
Augmented Vision-Language Models: A Systematic Review
Anthony C Davis
Burhan Sadiq
Tianmin Shu
Chien-Ming Huang
VLMLRM
196
0
0
24 Jul 2025
Pixels, Patterns, but No Poetry: To See The World like Humans
Pixels, Patterns, but No Poetry: To See The World like Humans
Hongcheng Gao
Longxiang Zhang
Lin Xu
Jingyi Tang
X. Li
...
Xinlong Yang
Ge Wu
Balong Bi
Hongyu Chen
Wentao Zhang
MLLMLRMVLM
160
4
0
21 Jul 2025
CRABS: A syntactic-semantic pincer strategy for bounding LLM interpretation of Python notebooks
CRABS: A syntactic-semantic pincer strategy for bounding LLM interpretation of Python notebooks
Meng Li
Timothy M. McPhillips
Dingmin Wang
Shin-Rong Tsai
Bertram Ludäscher
132
0
0
15 Jul 2025
Multi-level Value Alignment in Agentic AI Systems: Survey and Perspectives
Multi-level Value Alignment in Agentic AI Systems: Survey and Perspectives
Wei Zeng
Hengshu Zhu
Chuan Qin
Han Wu
Yihang Cheng
...
Xiaowei Jin
Yinuo Shen
Zhenxing Wang
Feimin Zhong
Hui Xiong
AI4TS
440
0
0
11 Jun 2025
What Limits Virtual Agent Application? OmniBench: A Scalable Multi-Dimensional Benchmark for Essential Virtual Agent Capabilities
Wendong Bu
Yang Wu
Qifan Yu
Minghe Gao
Bingchen Miao
...
Mengze Li
Wei Ji
Juncheng Billy Li
Siliang Tang
Yueting Zhuang
ELM
226
3
0
10 Jun 2025
ScienceBoard: Evaluating Multimodal Autonomous Agents in Realistic Scientific Workflows
ScienceBoard: Evaluating Multimodal Autonomous Agents in Realistic Scientific Workflows
Qiushi Sun
Zhoumianze Liu
Chang Ma
Zichen Ding
Fangzhi Xu
...
B. Kao
Wenhai Wang
Biqing Qi
Lingpeng Kong
Zhiyong Wu
LLMAGLM&Ro
511
14
0
26 May 2025
Visual Test-time Scaling for GUI Agent Grounding
Visual Test-time Scaling for GUI Agent Grounding
Tiange Luo
Lajanugen Logeswaran
Justin Johnson
Honglak Lee
379
10
0
01 May 2025
ELT-Bench: An End-to-End Benchmark for Evaluating AI Agents on ELT Pipelines
ELT-Bench: An End-to-End Benchmark for Evaluating AI Agents on ELT Pipelines
Tengjun Jin
Yuxuan Zhu
Daniel Kang
LMTDELM
323
8
0
07 Apr 2025
LLM Agents for Education: Advances and Applications
LLM Agents for Education: Advances and Applications
Zhendong Chu
Shen Wang
Jian Xie
Tinghui Zhu
Yibo Yan
...
Aoxiao Zhong
Xuming Hu
Jing Liang
Philip S. Yu
Qingsong Wen
LLMAGELM
336
49
0
14 Mar 2025
From Hypothesis to Publication: A Comprehensive Survey of AI-Driven Research Support Systems
From Hypothesis to Publication: A Comprehensive Survey of AI-Driven Research Support SystemsConference on Empirical Methods in Natural Language Processing (EMNLP), 2025
Zekun Zhou
Xiaocheng Feng
Daigang Xu
Xiachong Feng
Ziyun Song
...
Baoxin Wang
Dayong Wu
Guoping Hu
Ting Liu
Bing Qin
AI4TS
509
7
0
03 Mar 2025
CoddLLM: Empowering Large Language Models for Data Analytics
CoddLLM: Empowering Large Language Models for Data Analytics
Jiani Zhang
Hengrui Zhang
Rishav Chakravarti
Yiqun Hu
Patrick Ng
Asterios Katsifodimos
Huzefa Rangwala
George Karypis
Alon Halevy
SyDaELM
900
5
0
01 Feb 2025
Spider 2.0: Evaluating Language Models on Real-World Enterprise Text-to-SQL Workflows
Spider 2.0: Evaluating Language Models on Real-World Enterprise Text-to-SQL WorkflowsInternational Conference on Learning Representations (ICLR), 2024
Fangyu Lei
Jixuan Chen
Yuxiao Ye
Ruisheng Cao
Dongchan Shin
...
Caiming Xiong
Ruoxi Sun
Qian Liu
Sida I. Wang
Tao Yu
LMTD
388
128
0
12 Nov 2024
COMMA: A Communicative Multimodal Multi-Agent Benchmark
COMMA: A Communicative Multimodal Multi-Agent Benchmark
Timothy Ossowski
Jixuan Chen
Danyal Maqbool
Zefan Cai
Tyler Bradshaw
Junjie Hu
VLM
538
6
0
10 Oct 2024
Navigating the Digital World as Humans Do: Universal Visual Grounding for GUI Agents
Navigating the Digital World as Humans Do: Universal Visual Grounding for GUI AgentsInternational Conference on Learning Representations (ICLR), 2024
Boyu Gou
Ruohan Wang
Boyuan Zheng
Yanan Xie
Cheng Chang
Yiheng Shu
Huan Sun
Eric Fosler-Lussier
LM&RoLLMAG
641
238
0
07 Oct 2024
Large Language Models are Good Multi-lingual Learners : When LLMs Meet Cross-lingual Prompts
Large Language Models are Good Multi-lingual Learners : When LLMs Meet Cross-lingual PromptsInternational Conference on Computational Linguistics (COLING), 2024
Teng Wang
Zhenqi He
Wing-Yin Yu
Xiaojin Fu
Xiongwei Han
LRM
456
13
0
17 Sep 2024
Text2SQL is Not Enough: Unifying AI and Databases with TAG
Text2SQL is Not Enough: Unifying AI and Databases with TAG
Asim Biswal
Liana Patel
Siddarth Jha
Amog Kamsetty
Zhifei Li
Joseph E. Gonzalez
Carlos Guestrin
Matei A. Zaharia
LMTD3DV
195
44
0
27 Aug 2024
1
Page 1 of 1