Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2406.01623
Cited By
WebSuite: Systematically Evaluating Why Web Agents Fail
1 June 2024
Eric Li
Jim Waldo
LLMAG
Re-assign community
ArXiv
PDF
HTML
Papers citing
"WebSuite: Systematically Evaluating Why Web Agents Fail"
3 / 3 papers shown
Title
REAL: Benchmarking Autonomous Agents on Deterministic Simulations of Real Websites
Divyansh Garg
Shaun VanWeelden
Diego Caples
Andis Draguns
Nikil Ravi
...
Youngchul Joo
Jindong Gu
Charles London
Christian Schroeder de Witt
S. Motwani
39
1
0
15 Apr 2025
Magentic-One: A Generalist Multi-Agent System for Solving Complex Tasks
Adam Fourney
Gagan Bansal
Hussein Mozannar
Cheng Tan
Eduardo Salinas
...
Victor C. Dibia
Ahmed Hassan Awadallah
Ece Kamar
Rafah Hosn
Saleema Amershi
AI4CE
LRM
LLMAG
38
34
0
07 Nov 2024
HiAgent: Hierarchical Working Memory Management for Solving Long-Horizon Agent Tasks with Large Language Model
Mengkang Hu
Tianxing Chen
Qiguang Chen
Yao Mu
Wenqi Shao
Ping Luo
LM&Ro
LLMAG
RALM
29
3
0
18 Aug 2024
1