Navi-plus: Managing Ambiguous GUI Navigation Tasks with Follow-up

31 March 2025
Ziming Cheng
Zhiyuan Huang
Junting Pan
Zhaohui Hou
Mingjie Zhan
Abstract

Graphical user interface (GUI) automation agents are emerging as powerful tools, enabling humans to accomplish increasingly complex tasks on smart devices. However, users often inadvertently omit key information when conveying tasks, which hinders agent performance in the current agent paradigm that does not support immediate user intervention. To address this issue, we introduce a Self-Correction GUI Navigation task that incorporates interactive information completion capabilities within GUI agents. We developed the Navi-plus dataset with GUI follow-up question-answer pairs, alongside a Dual-Stream Trajectory Evaluation method to benchmark this new capability. Our results show that agents equipped with the ability to ask GUI follow-up questions can fully recover their performance when faced with ambiguous user tasks.

@article{cheng2025_2503.24180,
  title={Navi-plus: Managing Ambiguous GUI Navigation Tasks with Follow-up},
  author={Ziming Cheng and Zhiyuan Huang and Junting Pan and Zhaohui Hou and Mingjie Zhan},
  journal={arXiv preprint arXiv:2503.24180},
  year={2025}
}