Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2504.08942
Cited By
AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories
11 April 2025
Xing Han Lù
Amirhossein Kazemnejad
Nicholas Meade
Arkil Patel
Dongchan Shin
Alejandra Zambrano
Karolina Stañczak
Peter Shaw
Christopher Pal
Siva Reddy
LLMAG
Re-assign community
ArXiv
PDF
HTML
Papers citing
"AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories"
1 / 1 papers shown
Title
An Illusion of Progress? Assessing the Current State of Web Agents
Tianci Xue
Weijian Qi
Tianneng Shi
Chan Hee Song
Boyu Gou
D. Song
Huan Sun
Yu Su
LLMAG
ELM
92
4
1
02 Apr 2025
1