ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2205.11029
  4. Cited By
META-GUI: Towards Multi-modal Conversational Agents on Mobile GUI

META-GUI: Towards Multi-modal Conversational Agents on Mobile GUI

23 May 2022
Liangtai Sun
Xingyu Chen
Lu Chen
Tianle Dai
Zichen Zhu
Kai Yu
    LLMAG
ArXivPDFHTML

Papers citing "META-GUI: Towards Multi-modal Conversational Agents on Mobile GUI"

13 / 13 papers shown
Title
MobileSteward: Integrating Multiple App-Oriented Agents with Self-Evolution to Automate Cross-App Instructions
MobileSteward: Integrating Multiple App-Oriented Agents with Self-Evolution to Automate Cross-App Instructions
Yuxuan Liu
Hongda Sun
Wei Liu
Jian Luan
Bo Du
Rui Yan
55
2
0
24 Feb 2025
Large Language Models Empowered Personalized Web Agents
Large Language Models Empowered Personalized Web Agents
Hongru Cai
Yongqi Li
Luu Anh Tuan
Fengbin Zhu
Xiaoyu Shen
Wenjie Li
Tat-Seng Chua
LLMAG
46
12
0
22 Oct 2024
Benchmarking Mobile Device Control Agents across Diverse Configurations
Benchmarking Mobile Device Control Agents across Diverse Configurations
Juyong Lee
Taywon Min
Minyong An
Changyeon Kim
Kimin Lee
36
9
0
25 Apr 2024
Tur[k]ingBench: A Challenge Benchmark for Web Agents
Tur[k]ingBench: A Challenge Benchmark for Web Agents
Kevin Xu
Yeganeh Kordi
Kate Sanders
Yizhong Wang
Adam Byerly
Kate Sanders
Adam Byerly
Jingyu Zhang
Benjamin Van Durme
Daniel Khashabi
LLMAG
75
6
0
18 Mar 2024
Android in the Zoo: Chain-of-Action-Thought for GUI Agents
Android in the Zoo: Chain-of-Action-Thought for GUI Agents
Jiwen Zhang
Jihao Wu
Yihua Teng
Minghui Liao
Nuo Xu
Xiao Xiao
Zhongyu Wei
Duyu Tang
LLMAG
LM&Ro
37
50
0
05 Mar 2024
WebLINX: Real-World Website Navigation with Multi-Turn Dialogue
WebLINX: Real-World Website Navigation with Multi-Turn Dialogue
Xing Han Lù
Zdeněk Kasner
Siva Reddy
32
59
0
08 Feb 2024
Dual-View Visual Contextualization for Web Navigation
Dual-View Visual Contextualization for Web Navigation
Jihyung Kil
Chan Hee Song
Boyuan Zheng
Xiang Deng
Yu-Chuan Su
Wei-Lun Chao
EgoV
22
12
0
06 Feb 2024
Can Current Task-oriented Dialogue Models Automate Real-world Scenarios
  in the Wild?
Can Current Task-oriented Dialogue Models Automate Real-world Scenarios in the Wild?
Sang-Woo Lee
Sungdong Kim
Donghyeon Ko
Dong-hyun Ham
Youngki Hong
...
Wangkyo Jung
Kyunghyun Cho
Donghyun Kwak
H. Noh
W. Park
51
1
0
20 Dec 2022
DFM: Dialogue Foundation Model for Universal Large-Scale
  Dialogue-Oriented Task Learning
DFM: Dialogue Foundation Model for Universal Large-Scale Dialogue-Oriented Task Learning
Zhi Chen
Jijia Bao
Lu Chen
Yuncong Liu
Da Ma
...
Xinhsuai Dong
Fujiang Ge
Qingliang Miao
Jian-Guang Lou
Kai Yu
ALM
AI4CE
43
3
0
25 May 2022
LayoutLMv2: Multi-modal Pre-training for Visually-Rich Document
  Understanding
LayoutLMv2: Multi-modal Pre-training for Visually-Rich Document Understanding
Yang Xu
Yiheng Xu
Tengchao Lv
Lei Cui
Furu Wei
...
D. Florêncio
Cha Zhang
Wanxiang Che
Min Zhang
Lidong Zhou
ViT
MLLM
153
498
0
29 Dec 2020
FLIN: A Flexible Natural Language Interface for Web Navigation
FLIN: A Flexible Natural Language Interface for Web Navigation
Sahisnu Mazumder
Oriana Riva
LRM
48
23
0
24 Oct 2020
BiST: Bi-directional Spatio-Temporal Reasoning for Video-Grounded
  Dialogues
BiST: Bi-directional Spatio-Temporal Reasoning for Video-Grounded Dialogues
Hung Le
Doyen Sahoo
Nancy F. Chen
S. Hoi
40
30
0
20 Oct 2020
Task-Oriented Dialogue as Dataflow Synthesis
Task-Oriented Dialogue as Dataflow Synthesis
Semantic Machines
Jacob Andreas
J. Bufe
David Burkett
Charles C. Chen
...
Izabela Witoszko
Jason Wolfe
A. Wray
Yuchen Zhang
Alexander Zotov
AIFin
195
153
0
24 Sep 2020
1