Mobile-Agent-v2: Mobile Device Operation Assistant with Effective Navigation via Multi-Agent Collaboration

3 June 2024

Ming Yan

Ji Zhang

Fei Huang

Jitao Sang

LM&Ro

LLMAG


Papers citing "Mobile-Agent-v2: Mobile Device Operation Assistant with Effective Navigation via Multi-Agent Collaboration"

50 / 65 papers shown

Transforming Monolithic Foundation Models into Embodied Multi-Agent Architectures for Human-Robot Collaboration

113

30 Nov 2025

OSWorld-MCP: Benchmarking MCP Tool Invocation In Computer-Use Agents

178

28 Oct 2025

GhostEI-Bench: Do Mobile Agents Resilience to Environmental Injection in Dynamic On-Device Environments?

200

23 Oct 2025

ColorAgent: Building A Robust, Personalized, and Interactive OS Agent

...

190

22 Oct 2025

Empowering Real-World: A Survey on the Technology, Practice, and Evaluation of LLM-driven Industry Agents

...

182

20 Oct 2025

CORE: Reducing UI Exposure in Mobile Agents via Collaboration Between Cloud and Local LLMs

132

17 Oct 2025

A Survey on Agentic Multimodal Large Language Models

...

LM&Ro AIFin AI4TS LRM AI4CE

250

13 Oct 2025

Training-Free Group Relative Policy Optimization

...

264

09 Oct 2025

Cross-Embodiment Dexterous Hand Articulation Generation via Morphology-Aware Learning

165

07 Oct 2025

Agent-ScanKit: Unraveling Memory and Reasoning of Multimodal Agents via Sensitivity Perturbations

405

01 Oct 2025

Learn the Ropes, Then Trust the Wins: Self-imitation with Progressive Exploration for Agentic Reinforcement Learning

...

404

26 Sep 2025

MobileRAG: Enhancing Mobile Agent with Retrieval-Augmented Generation

121

04 Sep 2025

Succeed or Learn Slowly: Sample Efficient Off-Policy Reinforcement Learning for Mobile App Control

280

01 Sep 2025

KG-RAG: Enhancing GUI Agent Decision-Making via Knowledge Graph-Driven Retrieval-Augmented Generation

...

116

30 Aug 2025

PG-Agent: An Agent Powered by Page Graph

146

27 Aug 2025

AppAgent-Pro: A Proactive GUI Agent System for Multidomain Information Integration and User Assistance

233

26 Aug 2025

PerPilot: Personalizing VLM-based Mobile Agents via Memory and Exploration

...

25 Aug 2025

Mobile-Agent-v3: Fundamental Agents for GUI Automation

...

269

21 Aug 2025

CRAFT-GUI: Curriculum-Reinforced Agent For GUI Tasks

129

15 Aug 2025

UI-Venus Technical Report: Building High-performance UI Agents with RFT

...

338

14 Aug 2025

MVISU-Bench: Benchmarking Mobile Agents for Real-World Tasks by Multi-App, Vague, Interactive, Single-App and Unethical Instructions

148

12 Aug 2025

Uncertainty-Aware GUI Agent: Adaptive Perception through Component Recommendation and Human-in-the-Loop Refinement

Chao Hao

Shuai Wang

Kaiwen Zhou

208

06 Aug 2025

NatureGAIA: Pushing the Frontiers of GUI Agents with a Challenging Benchmark and High-Quality Trajectory Dataset

187

02 Aug 2025

MapAgent: Trajectory-Constructed Memory-Augmented Planning for Mobile Task Automation

409

29 Jul 2025

Enhancing Jailbreak Attacks on LLMs via Persona Prompts

172

28 Jul 2025

OS-MAP: How Far Can Computer-Using Agents Go in Breadth and Depth?

...

188

25 Jul 2025

GUI-G^2: Gaussian Reward Modeling for GUI Grounding

...

390

21 Jul 2025

VisualTrap: A Stealthy Backdoor Attack on GUI Agents via Visual Grounding Manipulation

339

09 Jul 2025

Mobile-R1: Towards Interactive Reinforcement Learning for VLM-Based Mobile Agent via Task-Level Rewards

...

242

25 Jun 2025

Deep Research Agents: A Systematic Examination And Roadmap

...

Youssef Attia El Hili

Jun Wang

LLMAG

305

22 Jun 2025

Towards Pervasive Distributed Agentic Generative AI -- A State of The Art

Gianni Molinari

Fabio Ciravegna

LLMAG LM&Ro AI4CE

486

16 Jun 2025

Multi-level Value Alignment in Agentic AI Systems: Survey and Perspectives

...

444

11 Jun 2025

GUI-Reflection: Empowering Multimodal GUI Models with Self-Reflection Behavior

257

09 Jun 2025

Look Before You Leap: A GUI-Critic-R1 Model for Pre-Operative Error Diagnosis in GUI Automation

...

411

05 Jun 2025

XBOUND: Exploring Capability Boundaries of Device-Control Agents at the State Level

392

27 May 2025

BacktrackAgent: Enhancing GUI Agent with Error Detection and Backtracking Mechanism

270

27 May 2025

TransBench: Breaking Barriers for Transferable Graphical User Interface Agents in Dynamic Digital Environments

Annual Meeting of the Association for Computational Linguistics (ACL), 2025

515

23 May 2025

Hidden Ghost Hand: Unveiling Backdoor Vulnerabilities in MLLM-Powered Mobile GUI Agents

406

20 May 2025

Mobile-Agent-V: A Video-Guided Approach for Effortless and Efficient Operational Knowledge Injection in Mobile Automation

476

20 May 2025

From Assistants to Adversaries: Exploring the Security Risks of Mobile LLM Agents

457

19 May 2025

Enhancing Visual Grounding for GUI Agents via Self-Evolutionary Reinforcement Learning

...

449

18 May 2025

Can Global XAI Methods Reveal Injected Bias in LLMs? SHAP vs Rule Extraction vs RuleSHAP

Francesco Sovrano

566

16 May 2025

EcoAgent: An Efficient Device-Cloud Collaborative Multi-Agent Framework for Mobile Automation

1.2K

08 May 2025

Visual Test-time Scaling for GUI Agent Grounding

395

01 May 2025

Towards Efficient Online Tuning of VLM Agents via Counterfactual Soft Reinforcement Learning

312

01 May 2025

ViMo: A Generative Visual GUI World Model for App Agents

554

15 Apr 2025

CombatVLA: An Efficient Vision-Language-Action Model for Combat Tasks in 3D Action Role-Playing Games

...

344

12 Mar 2025

CHOP: Mobile Operating Assistant with Constrained High-frequency Optimized Subtask Planning

316

05 Mar 2025

SpiritSight Agent: Advanced GUI Agent with One Look

Computer Vision and Pattern Recognition (CVPR), 2025

443

05 Mar 2025

AutoEval: A Practical Framework for Autonomous Evaluation of Mobile Agents

Jiahui Sun

Zhichao Hua

Yubin Xia

402

04 Mar 2025