2209.15099
MUG: Interactive Multimodal Grounding on User Interfaces
29 September 2022
Tao Li
Gang Li
Jingjie Zheng
Purple Wang
Yang Li
LLMAG
Papers citing
"MUG: Interactive Multimodal Grounding on User Interfaces"
5 / 5 papers shown
Title
Tur[k]ingBench: A Challenge Benchmark for Web Agents
Kevin Xu
Yeganeh Kordi
Kate Sanders
Yizhong Wang
Adam Byerly
Jingyu Zhang
Benjamin Van Durme
Daniel Khashabi
LLMAG
67
6
0
18 Mar 2024
Explore, Select, Derive, and Recall: Augmenting LLM with Human-like Memory for Mobile Task Automation
Sunjae Lee
Junyoung Choi
Jungjae Lee
Munim Hasan Wasi
Hojun Choi
Steven Y. Ko
Sangeun Oh
Insik Shin
RALM
29
6
0
04 Dec 2023
Enabling Conversational Interaction with Mobile UI using Large Language Models
Bryan Wang
Gang Li
Yang Li
171
132
0
18 Sep 2022
Analysis of Language Change in Collaborative Instruction Following
Anna Effenberger
Eva Yan
Rhia Singh
Alane Suhr
Yoav Artzi
37
13
0
09 Sep 2021
Multimodal Compact Bilinear Pooling for Visual Question Answering and Visual Grounding
Akira Fukui
Dong Huk Park
Daylen Yang
Anna Rohrbach
Trevor Darrell
Marcus Rohrbach
144
1,464
0
06 Jun 2016