ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2209.15099
  4. Cited By
MUG: Interactive Multimodal Grounding on User Interfaces

MUG: Interactive Multimodal Grounding on User Interfaces

29 September 2022
Tao Li
Gang Li
Jingjie Zheng
Purple Wang
Yang Li
    LLMAG
ArXivPDFHTML

Papers citing "MUG: Interactive Multimodal Grounding on User Interfaces"

5 / 5 papers shown
Title
Tur[k]ingBench: A Challenge Benchmark for Web Agents
Tur[k]ingBench: A Challenge Benchmark for Web Agents
Kevin Xu
Yeganeh Kordi
Kate Sanders
Yizhong Wang
Adam Byerly
Kate Sanders
Adam Byerly
Jingyu Zhang
Benjamin Van Durme
Daniel Khashabi
LLMAG
67
6
0
18 Mar 2024
Explore, Select, Derive, and Recall: Augmenting LLM with Human-like
  Memory for Mobile Task Automation
Explore, Select, Derive, and Recall: Augmenting LLM with Human-like Memory for Mobile Task Automation
Sunjae Lee
Junyoung Choi
Jungjae Lee
Munim Hasan Wasi
Hojun Choi
Steven Y. Ko
Sangeun Oh
Insik Shin
RALM
29
24
0
04 Dec 2023
Enabling Conversational Interaction with Mobile UI using Large Language
  Models
Enabling Conversational Interaction with Mobile UI using Large Language Models
Bryan Wang
Gang Li
Yang Li
171
132
0
18 Sep 2022
Analysis of Language Change in Collaborative Instruction Following
Analysis of Language Change in Collaborative Instruction Following
Anna Effenberger
Eva Yan
Rhia Singh
Alane Suhr
Yoav Artzi
37
13
0
09 Sep 2021
Multimodal Compact Bilinear Pooling for Visual Question Answering and
  Visual Grounding
Multimodal Compact Bilinear Pooling for Visual Question Answering and Visual Grounding
Akira Fukui
Dong Huk Park
Daylen Yang
Anna Rohrbach
Trevor Darrell
Marcus Rohrbach
144
1,464
0
06 Jun 2016
1