Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2508.04482
Cited By
OS Agents: A Survey on MLLM-based Agents for General Computing Devices Use
6 August 2025
Xueyu Hu
Tao Xiong
Biao Yi
Zishu Wei
Ruixuan Xiao
Yurun Chen
Jiasheng Ye
Meiling Tao
Xiangxin Zhou
Ziyu Zhao
Yuhuai Li
Shengze Xu
Shenzhi Wang
Xinchen Xu
Shuofei Qiao
Zhaokai Wang
Kun Kuang
Tieyong Zeng
Liang Wang
Jiwei Li
Yuchen Eleanor Jiang
Wangchunshu Zhou
Guoyin Wang
Keting Yin
Zhou Zhao
Hongxia Yang
Fan Wu
Shengyu Zhang
Fei Wu
LLMAG
LM&Ro
AI4TS
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (8 upvotes)
Github (178410★)
Papers citing
"OS Agents: A Survey on MLLM-based Agents for General Computing Devices Use"
24 / 24 papers shown
DualTAP: A Dual-Task Adversarial Protector for Mobile MLLM Agents
Fuyao Zhang
Jiaming Zhang
C. Wang
Xiongtao Sun
Yurong Hao
Guowei Guan
Wenjie Li
Longtao Huang
Wei Yang Bryan Lim
AAML
181
1
0
17 Nov 2025
GUI-Rise: Structured Reasoning and History Summarization for GUI Navigation
Tao Liu
Chongyu Wang
Rongjie Li
Yingchen Yu
Xuming He
Bai Song
LLMAG
LRM
109
0
0
31 Oct 2025
Measuring the Security of Mobile LLM Agents under Adversarial Prompts from Untrusted Third-Party Channels
Chenghao Du
Quanfeng Huang
Tingxuan Tang
Zihao Wang
Adwait Nadkarni
Yue Xiao
AAML
264
0
0
31 Oct 2025
ColorEcosystem: Powering Personalized, Standardized, and Trustworthy Agentic Service in massive-agent Ecosystem
Fangwen Wu
Zheng Wu
Jihong Wang
Yihao Chen
Ruiguang Pei
...
Zhihui Fu
Weiwen Liu
Zhuosheng Zhang
Weinan Zhang
Jun Wang
LM&Ro
192
0
0
24 Oct 2025
ColorAgent: Building A Robust, Personalized, and Interactive OS Agent
Ning Li
Qiqiang Lin
Zheng Wu
Xiaoyun Mo
Weiming Zhang
...
Xingyu Lou
Jun Wang
Weiwen Liu
Zhuosheng Zhang
Weinan Zhang
LLMAG
VLM
186
0
0
22 Oct 2025
Experience-Driven Exploration for Efficient API-Free AI Agents
Chenwei Tang
Jingyu Xing
Xinyu Liu
Zizhou Wang
Jiawei Du
Liangli Zhen
Jiancheng Lv
206
0
0
17 Oct 2025
Say One Thing, Do Another? Diagnosing Reasoning-Execution Gaps in VLM-Powered Mobile-Use Agents
Lingzhong Dong
Ziqi Zhou
Shuaibo Yang
Haiyue Sheng
Pengzhou Cheng
Zongru Wu
Zheng Wu
Gongshen Liu
Zhuosheng Zhang
LRM
161
0
0
02 Oct 2025
Graph2Eval: Automatic Multimodal Task Generation for Agents via Knowledge Graphs
Yurun Chen
Xavier Hu
Y. Liu
Ziqi Wang
Zeyi Liao
...
Feng Wei
Yuxi Qian
Bo Zheng
Keting Yin
Shengyu Zhang
LLMAG
237
1
0
01 Oct 2025
OceanGym: A Benchmark Environment for Underwater Embodied Agents
Yida Xue
Mingjun Mao
Xiangyuan Ru
Yuqi Zhu
Baochang Ren
...
Shumin Deng
Xinyu An
Ningyu Zhang
Ying Chen
Huajun Chen
201
0
0
30 Sep 2025
UI-S1: Advancing GUI Automation via Semi-online Reinforcement Learning
Zhengxi Lu
Jiabo Ye
Fei Tang
Yongliang Shen
Haiyang Xu
...
Weiming Lu
Ming Yan
Fei Huang
Jun Xiao
Yueting Zhuang
OffRL
OnRL
478
3
0
15 Sep 2025
UI-TARS-2 Technical Report: Advancing GUI Agent with Multi-Turn Reinforcement Learning
Haoming Wang
Haoyang Zou
Huatong Song
J. Feng
Junjie Fang
...
Xianzheng Ma
Xiaojun Xiao
X. Y. Huang
Xinjie Chen
Yidi Du
LLMAG
287
52
0
02 Sep 2025
InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency
Weiyun Wang
Zhangwei Gao
Lixin Gu
Hengjun Pu
Long Cui
...
Bowen Zhou
Kai Chen
Yu Qiao
Wenhai Wang
Gen Luo
MLLM
LRM
304
265
0
25 Aug 2025
AgentScope 1.0: A Developer-Centric Framework for Building Agentic Applications
Dawei Gao
Zitao Li
Yuexiang Xie
Weirui Kuang
Liuyi Yao
...
Weikai Liao
Farruh Isakulovich Kushnazarov
Yaliang Li
Bolin Ding
Jingren Zhou
LLMAG
AI4TS
170
2
0
22 Aug 2025
Mobile-Agent-v3: Fundamental Agents for GUI Automation
Jiabo Ye
Xi Zhang
Haiyang Xu
Haowei Liu
Junyang Wang
...
Jitong Liao
Qi Zheng
Fei Huang
Jingren Zhou
Ming Yan
LLMAG
LM&Ro
268
40
0
21 Aug 2025
Why Do Open-Source LLMs Struggle with Data Analysis? A Systematic Empirical Study
Yuqi Zhu
Yi Zhong
Jintian Zhang
Ziheng Zhang
Shuofei Qiao
Yujie Luo
Lun Du
Da Zheng
Ningyu Zhang
Huajun Chen
ELM
403
1
0
24 Jun 2025
AgentAlign: Navigating Safety Alignment in the Shift from Informative to Agentic Large Language Models
Jinchuan Zhang
Lu Yin
Yan Zhou
Songlin Hu
LLMAG
LM&Ro
214
3
0
29 May 2025
Hidden Ghost Hand: Unveiling Backdoor Vulnerabilities in MLLM-Powered Mobile GUI Agents
Pengzhou Cheng
Haowen Hu
Zheng Wu
Zongru Wu
Tianjie Ju
Zhuosheng Zhang
Zhuosheng Zhang
LLMAG
AAML
389
5
0
20 May 2025
A Survey on the Safety and Security Threats of Computer-Using Agents: JARVIS or Ultron?
Ada Chen
Yongjiang Wu
Jing Zhang
Shu Yang
Shu Yang
Jen-tse Huang
Wenxuan Wang
Wenxuan Wang
S. Wang
ELM
437
11
0
16 May 2025
EcoAgent: An Efficient Device-Cloud Collaborative Multi-Agent Framework for Mobile Automation
Biao Yi
Xavier Hu
Yexin Chen
Shengyu Zhang
Hongxia Yang
Fan Wu
LLMAG
1.2K
3
0
08 May 2025
InfiGUI-R1: Advancing Multimodal GUI Agents from Reactive Actors to Deliberative Reasoners
Yuhang Liu
Pengxiang Li
C. Xie
Xavier Hu
Xiaotian Han
Shengyu Zhang
Hongxia Yang
Fei Wu
LLMAG
LM&Ro
LRM
AI4CE
382
73
0
19 Apr 2025
A Survey of WebAgents: Towards Next-Generation AI Agents for Web Automation with Large Foundation Models
Liangbo Ning
Ziran Liang
Zhuohang Jiang
Haohao Qu
Yujuan Ding
...
Xiao Wei
Shanru Lin
Hui Liu
Philip S. Yu
Qing Li
LLMAG
LM&Ro
606
49
0
30 Mar 2025
CHOP: Mobile Operating Assistant with Constrained High-frequency Optimized Subtask Planning
Yuqi Zhou
Shuai Wang
Sunhao Dai
Qinglin Jia
Zhaocheng Du
Zhenhua Dong
Jun Xu
LM&Ro
316
4
0
05 Mar 2025
Evaluating the Robustness of Multimodal Agents Against Active Environmental Injection Attacks
Yurun Chen
Xavier Hu
Keting Yin
Juncheng Billy Li
Shengyu Zhang
AAML
275
11
0
18 Feb 2025
InfiGUIAgent: A Multimodal Generalist GUI Agent with Native Reasoning and Reflection
Yunxing Liu
Pengxiang Li
Zishu Wei
C. Xie
Xueyu Hu
Xinchen Xu
Shengyu Zhang
Xiaotian Han
Hongxia Yang
Leilei Gan
LLMAG
LRM
318
44
0
08 Jan 2025
1