AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories

11 April 2025

Papers citing "AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories"


Title
An Illusion of Progress? Assessing the Current State of Web Agents
Tianci Xue, Weijian Qi, Tianneng Shi, Chan Hee Song, Boyu Gou, D. Song, Huan Sun, Yu Su
LLMAG, ELM
92 4 1
02 Apr 2025