2507.22034
Cited By
UserBench: An Interactive Gym Environment for User-Centric Agents
29 July 2025
Cheng Qian
Zuxin Liu
Akshara Prabhakar
Zhiwei Liu
Jianguo Zhang
H. Chen
Heng Ji
Weiran Yao
Shelby Heinecke
Silvio Savarese
Caiming Xiong
Huan Wang
LLMAG
ELM
Papers citing
"UserBench: An Interactive Gym Environment for User-Centric Agents"
CostBench: Evaluating Multi-Turn Cost-Optimal Planning and Adaptation in Dynamic Environments for LLM Tool-Use Agents
Jiayu Liu
Cheng Qian
Zhaochen Su
Qing Zong
Shijue Huang
Bingxiang He
Yi R. Fung
LLMAG
80
0
0
04 Nov 2025
InteractComp: Evaluating Search Agents With Ambiguous Queries
Mingyi Deng
Lijun Huang
Yani Fan
Jiayi Zhang
Fashen Ren
...
Xinyu Wang
Xiangru Tang
Nan Tang
Chenglin Wu
Yuyu Luo
124
1
0
28 Oct 2025
Enterprise Deep Research: Steerable Multi-Agent Deep Research for Enterprise Analytics
Akshara Prabhakar
Roshan Ram
Zixiang Chen
Silvio Savarese
Frank Wang
Caiming Xiong
Huan Wang
Weiran Yao
134
0
0
20 Oct 2025
COMPASS: A Multi-Turn Benchmark for Tool-Mediated Planning & Preference Optimization
Tian Qin
Felix Bai
Ting-Yao Hu
Raviteja Vemulapalli
H. Koppula
...
Bowen Jin
Mert Cemri
Jiarui Lu
Zirui Wang
Meng Cao
LLMAG
119
0
0
08 Oct 2025
VitaBench: Benchmarking LLM Agents with Versatile Interactive Tasks in Real-world Applications
Wei He
Yueqing Sun
Hongyan Hao
Xueyuan Hao
Zhikang Xia
...
X. Su
Xiaodong Cai
Xunliang Cai
Yu Yang
Yunke Zhao
142
0
0
30 Sep 2025
RecoWorld: Building Simulated Environments for Agentic Recommender Systems
Fei Liu
Xinyu Lin
Hanchao Yu
Mingyuan Wu
Jianyu Wang
...
Mingze Gao
Qifan Wang
Lizhu Zhang
Benyu Zhang
Xiangjun Fan
136
2
0
12 Sep 2025
ShortageSim: Simulating Drug Shortages under Information Asymmetry
Mingxuan Cui
Yilan Jiang
Duo Zhou
Cheng Qian
Yuji Zhang
Q. Wang
104
0
0
01 Sep 2025