2410.08164
Cited By
Agent S: An Open Agentic Framework that Uses Computers Like a Human
International Conference on Learning Representations (ICLR), 2024
10 October 2024
Saaket Agashe
Jiuzhou Han
Shuyu Gan
Jiachen Yang
Ang Li
Xin Eric Wang
LLMAG
LM&Ro
AIFin
HuggingFace (25 upvotes)
Github (5250★)
Papers citing
"Agent S: An Open Agentic Framework that Uses Computers Like a Human"
50 / 52 papers shown
Title
OpenApps: Simulating Environment Variations to Measure UI-Agent Reliability
Karen Ullrich
Jingtong Su
Claudia Shi
Arjun Subramonian
Amir Bar
Ivan Evtimov
Nikolaos Tsilivis
Randall Balestriero
Julia Kempe
Mark Ibrahim
44
0
0
25 Nov 2025
WebCoach: Self-Evolving Web Agents with Cross-Session Memory Guidance
Genglin Liu
Shijie Geng
Sha Li
Hejie Cui
Sarah Zhang
Xin Liu
Tianyi Liu
CLL
367
0
0
17 Nov 2025
Learning from Online Videos at Inference Time for Computer-Use Agents
Yujian Liu
Ze Wang
Hao Chen
Ximeng Sun
X. Yu
J. Wu
Jiang-Long Liu
Emad Barsoum
Zicheng Liu
Shiyu Chang
117
0
0
06 Nov 2025
MGA: Memory-Driven GUI Agent for Observation-Centric Interaction
Weihua Cheng
Ersheng Ni
Wenlong Wang
Yifei Sun
Junming Liu
Wangyu Shen
Yirong Chen
Botian Shi
Ding Wang
LLMAG
LM&Ro
197
0
0
28 Oct 2025
LightAgent: Mobile Agentic Foundation Models
Yangqin Jiang
Chao Huang
LLMAG
90
0
0
24 Oct 2025
Beyond Reactivity: Measuring Proactive Problem Solving in LLM Agents
Gil Pasternak
Dheeraj Rajagopal
Julia White
Dhruv Atreja
Matthew Thomas
George Hurn-Maloney
Ash Lewis
LLMAG
115
0
0
22 Oct 2025
CUARewardBench: A Benchmark for Evaluating Reward Models on Computer-using Agent
Haojia Lin
Xiaoyu Tan
Yulei Qin
Zihan Xu
Yuchen Shi
...
Shaofei Cai
Siqi Cai
Chaoyou Fu
Ke Li
Xing Sun
ALM
107
0
0
21 Oct 2025
R-WoM: Retrieval-augmented World Model For Computer-use Agents
Kai Mei
Jiang Guo
Shuaichen Chang
Mingwen Dong
Dongkyu Lee
Xing Niu
Jiarong Jiang
76
0
0
13 Oct 2025
Dyna-Mind: Learning to Simulate from Experience for Better AI Agents
Xiao Yu
Baolin Peng
Michel Galley
Hao Cheng
Qianhui Wu
Janardhan Kulkarni
Suman Nath
Zhou Yu
Jianfeng Gao
LRM
AI4CE
70
0
0
10 Oct 2025
Don't Adapt Small Language Models for Tools; Adapt Tool Schemas to the Models
Jonggeun Lee
Woojung Song
Jongwook Han
Haesung Pyun
Yohan Jo
CLL
86
0
0
08 Oct 2025
Black-box Detection of LLM-generated Text Using Generalized Jensen-Shannon Divergence
Shuangyi Chen
Ashish Khisti
DeLMO
156
0
0
08 Oct 2025
A Case for Declarative LLM-friendly Interfaces for Improved Efficiency of Computer-Use Agents
Yuan Wang
Mingyu Li
Haibo Chen
LLMAG
ELM
95
0
0
06 Oct 2025
Improving Cooperation in Collaborative Embodied AI
Hima Jacob Leven Suprabha
Laxmi Nag Laxminarayan Nagesh
Ajith Nair
Alvin Reuben Amal Selvaster
Ayan Khan
...
Titouan Puech
Venkataramireddy Marella
Vishal Sonar
Alessandro Suglia
Oliver Lemon
LLMAG
69
0
0
03 Oct 2025
The Unreasonable Effectiveness of Scaling Agents for Computer Use
Gonzalo Gonzalez-Pumariega
Vincent Tu
Chih-Lun Lee
Jiachen Yang
Ang Li
Xin Eric Wang
100
2
0
02 Oct 2025
D-Artemis: A Deliberative Cognitive Framework for Mobile GUI Multi-Agents
Hongze Mi
Yibo Feng
Wenjie Lu
Y. Wang
Jinyuan Li
...
Xuelin Zhang
Haotian Luo
Di Sun
Naiqiang Tan
Gang Pan
92
0
0
26 Sep 2025
GUI-ARP: Enhancing Grounding with Adaptive Region Perception for GUI Agents
Xianhang Ye
Yiqing Li
Wei Dai
Miancan Liu
Ziyuan Chen
...
Hongbo Min
Jinkui Ren
Xiantao Zhang
Wen Yang
Zhi Jin
96
3
0
19 Sep 2025
InfraMind: A Novel Exploration-based GUI Agentic Framework for Mission-critical Industrial Management
Liangtao Lin
Zhaomeng Zhu
Tianwei Zhang
Yonggang Wen
AI4CE
113
2
0
17 Sep 2025
Interaction-Driven Browsing: A Human-in-the-Loop Conceptual Framework Informed by Human Web Browsing for Browser-Using Agents
Hyeonggeun Yun
Jinkyu Jang
97
0
0
15 Sep 2025
Agentic Lybic: Multi-Agent Execution System with Tiered Reasoning and Orchestration
Liangxuan Guo
Bin Zhu
Qingqian Tao
Kangning Liu
Xun Zhao
Xianzhe Qin
Jin Gao
Guangfu Hao
209
1
0
14 Sep 2025
FlashAdventure: A Benchmark for GUI Agents Solving Full Story Arcs in Diverse Adventure Games
Jaewoo Ahn
J. Kim
Heeseung Yun
Jaehyeon Son
Dongmin Park
Jaewoong Cho
Gunhee Kim
144
1
0
01 Sep 2025
Mobile-Agent-v3: Fundamental Agents for GUI Automation
Jiabo Ye
Xi Zhang
Haiyang Xu
Haowei Liu
Junyang Wang
...
Jitong Liao
Qi Zheng
Fei Huang
Jingren Zhou
Ming Yan
LLMAG
LM&Ro
224
28
0
21 Aug 2025
Transduction is All You Need for Structured Data Workflows
A. Gliozzo
Naweed Khan
Christodoulos Constantinides
Nandana Mihindukulasooriya
Nahuel Defosse
Gaetano Rossiello
Junkyu Lee
AI4CE
88
1
0
21 Aug 2025
UI-Venus Technical Report: Building High-performance UI Agents with RFT
Zhangxuan Gu
Zhengwen Zeng
Zhenyu Xu
Xingran Zhou
Shuheng Shen
...
Yuan Guo
Yong Deng
Zhenyu Guo
Liang Chen
Weiqiang Wang
LLMAG
LM&Ro
255
14
0
14 Aug 2025
OpenCUA: Open Foundations for Computer-Use Agents
Xinyuan Wang
Bowen Wang
Dunjie Lu
Junlin Yang
Tianbao Xie
...
Victor Zhong
Flood Sung
Y. Charles
Zhilin Yang
Tao Yu
ELM
VLM
194
22
0
12 Aug 2025
OS Agents: A Survey on MLLM-based Agents for General Computing Devices Use
Xueyu Hu
Tao Xiong
Biao Yi
Zishu Wei
Ruixuan Xiao
...
Zhou Zhao
Hongxia Yang
Fan Wu
Shengyu Zhang
Fei Wu
LLMAG
LM&Ro
AI4TS
162
28
0
06 Aug 2025
SEAgent: Self-Evolving Computer Use Agent with Autonomous Learning from Experience
Zeyi Sun
Ziyu Liu
Yuhang Zang
Yuhang Cao
Xiaoyi Dong
Tong Wu
Dahua Lin
Yuan Liu
LLMAG
199
11
0
06 Aug 2025
RoboMemory: A Brain-inspired Multi-memory Agentic Framework for Interactive Environmental Learning in Physical Embodied Systems
Mingcong Lei
Honghao Cai
Zezhou Cui
Liangchen Tan
...
Zhenglin Wan
Zhen Li
Shuguang Cui
Yiming Zhao
Yatong Han
339
0
0
02 Aug 2025
Measuring Harmfulness of Computer-Using Agents
Aaron Xuxiang Tian
Ruofan Zhang
J. Tang
Ji Wang
Tianyu Shi
Jiaxin Wen
ELM
64
0
0
31 Jul 2025
OS-MAP: How Far Can Computer-Using Agents Go in Breadth and Depth?
Xuetian Chen
Yinghao Chen
Xinfeng Yuan
Zhuo Peng
Lu Chen
...
Tianbao Xie
Zhiyong Wu
Qiushi Sun
Biqing Qi
Bowen Zhou
127
3
0
25 Jul 2025
GTA1: GUI Test-time Scaling Agent
Yan Yang
Dongxu Li
Yutong Dai
Yuhao Yang
Ziyang Luo
...
Ran Xu
Liyuan Pan
Silvio Savarese
Caiming Xiong
Junnan Li
LLMAG
326
33
0
08 Jul 2025
Multi-level Value Alignment in Agentic AI Systems: Survey and Perspectives
Wei Zeng
Hengshu Zhu
Chuan Qin
Han Wu
Yihang Cheng
...
Xiaowei Jin
Yinuo Shen
Zhenxing Wang
Feimin Zhong
Hui Xiong
AI4TS
325
3
0
11 Jun 2025
GUI-Reflection: Empowering Multimodal GUI Models with Self-Reflection Behavior
Penghao Wu
Shengnan Ma
Bo Wang
Jiaheng Yu
Lewei Lu
Ziwei Liu
179
9
0
09 Jun 2025
BIMgent: Towards Autonomous Building Modeling via Computer-use Agents
Zihan Deng
Changyu Du
Stavros Nousias
A. Borrmann
LM&Ro
AI4CE
201
4
0
08 Jun 2025
ComfyUI-Copilot: An Intelligent Assistant for Automated Workflow Development
Zhenran Xu
Xue Yang
Yiyu Wang
Qingli Hu
Zijiao Wu
L. Wang
Weihua Luo
Kaifu Zhang
Baotian Hu
Min Zhang
LLMAG
217
4
0
05 Jun 2025
How Far Are We from Generating Missing Modalities with Foundation Models?
Guanzhou Ke
Yi Xie
Xiaoli Wang
Guoqing Chao
Bo Wang
VLM
230
0
0
04 Jun 2025
RiOSWorld: Benchmarking the Risk of Multimodal Computer-Use Agents
Jingyi Yang
Shuai Shao
Dongrui Liu
Jing Shao
470
7
0
31 May 2025
UI-Evol: Automatic Knowledge Evolving for Computer Use Agents
Ziyun Zhang
Xinyi Liu
Xiaoyi Zhang
Jun Wang
Gang Chen
Yan Lu
LLMAG
264
1
0
28 May 2025
Vibe Coding vs. Agentic Coding: Fundamentals and Practical Implications of Agentic AI
Ranjan Sapkota
Konstantinos I. Roumeliotis
Manoj Karkee
307
20
0
26 May 2025
ScreenExplorer: Training a Vision-Language Model for Diverse Exploration in Open GUI World
Runliang Niu
Jinglong Ji
Yi Chang
Zhiqiang Zhang
152
0
0
25 May 2025
lmgame-Bench: How Good are LLMs at Playing Games?
Lanxiang Hu
Mingjia Huo
Yu Zhang
Haoyang Yu
Eric P. Xing
Ion Stoica
Tajana Rosing
Haojian Jin
Hao Zhang
401
8
0
21 May 2025
Scaling Computer-Use Grounding via User Interface Decomposition and Synthesis
Tianbao Xie
Jiaqi Deng
Xiaochuan Li
Junlin Yang
Haoyuan Wu
...
Yiheng Xu
Junli Wang
Doyen Sahoo
Tao Yu
Caiming Xiong
295
43
0
19 May 2025
AI Agents vs. Agentic AI: A Conceptual Taxonomy, Applications and Challenges
Information Fusion (Inf. Fusion), 2025
Ranjan Sapkota
Konstantinos I. Roumeliotis
Manoj Karkee
AI4TS
796
124
0
15 May 2025
UFO2: The Desktop AgentOS
Chaoyun Zhang
He Huang
Chiming Ni
J. Mu
Si Qin
...
Minghua Ma
Jian-Guang Lou
Qingwei Lin
Saravan Rajmohan
Dongmei Zhang
LLMAG
581
13
0
20 Apr 2025
WebLists: Extracting Structured Information From Complex Interactive Websites Using Executable LLM Agents
Arth Bohra
Manvel Saroyan
Danil Melkozerov
Vahe Karufanyan
Gabriel Maher
Pascal Weinberger
Artem Harutyunyan
Giovanni Campagna
LLMAG
232
0
0
17 Apr 2025
TongUI: Internet-Scale Trajectories from Multimodal Web Tutorials for Generalized GUI Agents
Bofei Zhang
Zirui Shang
Zhi Gao
Wang Zhang
Rui Xie
Xiaojian Ma
Tao Yuan
Xinxiao Wu
Song-Chun Zhu
Qing Li
LLMAG
308
21
0
17 Apr 2025
APIGen-MT: Agentic Pipeline for Multi-Turn Data Generation via Simulated Agent-Human Interplay
Akshara Prabhakar
Ziqiang Liu
Weiran Yao
Jianguo Zhang
Ming Zhu
...
Juan Carlos Niebles
Shelby Heinecke
Han Wang
Siyang Song
Caiming Xiong
VGen
346
57
0
04 Apr 2025
Self-Resource Allocation in Multi-Agent LLM Systems
Alfonso Amayuelas
Jingbo Yang
Saaket Agashe
Ashwin Nagarajan
Antonis Antoniades
Xinze Wang
William Wang
393
8
0
02 Apr 2025
Advancing Mobile GUI Agents: A Verifier-Driven Approach to Practical Deployment
Gaole Dai
Shiqi Jiang
Ting Cao
Yuanchun Li
Yue Yang
Rui Tan
Mo Li
Lili Qiu
376
21
0
20 Mar 2025
HARBOR: Exploring Persona Dynamics in Multi-Agent Competition
Kenan Jiang
Li Xiong
Fei Liu
334
3
0
17 Feb 2025
Visual Large Language Models for Generalized and Specialized Applications
Jiayi Zhang
Zhixin Lai
Wentao Bao
Zhen Tan
Anh Dao
Kewei Sui
Jiayi Shen
Dong Liu
Huan Liu
Yu Kong
VLM
398
32
0
06 Jan 2025