ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2410.19100
  4. Cited By
VideoWebArena: Evaluating Long Context Multimodal Agents with Video Understanding Web Tasks

VideoWebArena: Evaluating Long Context Multimodal Agents with Video Understanding Web Tasks

24 October 2024
Lawrence Jang
Yinheng Li
Charles Ding
Justin Lin
Paul Pu Liang
Dan Zhao
Rogerio Bonatti
K. Koishida
ArXivPDFHTML

Papers citing "VideoWebArena: Evaluating Long Context Multimodal Agents with Video Understanding Web Tasks"

3 / 3 papers shown
Title
RealWebAssist: A Benchmark for Long-Horizon Web Assistance with Real-World Users
RealWebAssist: A Benchmark for Long-Horizon Web Assistance with Real-World Users
Suyu Ye
Haojun Shi
Darren Shih
Hyokun Yun
Tanya Roosta
Tianmin Shu
19
0
0
14 Apr 2025
TheAgentCompany: Benchmarking LLM Agents on Consequential Real World
  Tasks
TheAgentCompany: Benchmarking LLM Agents on Consequential Real World Tasks
Frank F. Xu
Yufan Song
Boxuan Li
Yuxuan Tang
Kritanjali Jain
...
Wayne Chi
Lawrence Jang
Yiqing Xie
Shuyan Zhou
Graham Neubig
LLMAG
124
20
0
18 Dec 2024
The BrowserGym Ecosystem for Web Agent Research
The BrowserGym Ecosystem for Web Agent Research
Thibault Le Sellier De Chezelles
Maxime Gasse
Alexandre Lacoste
Alexandre Drouin
Massimo Caccia
...
Siva Reddy
Quentin Cappart
Graham Neubig
Ruslan Salakhutdinov
Nicolas Chapados
LLMAG
96
9
0
06 Dec 2024
1