ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2402.04476
  4. Cited By
Dual-View Visual Contextualization for Web Navigation
v1v2 (latest)

Dual-View Visual Contextualization for Web Navigation

6 February 2024
Jihyung Kil
Chan Hee Song
Boyuan Zheng
Xiang Deng
Yu-Chuan Su
Wei-Lun Chao
    EgoV
ArXiv (abs)PDFHTMLGithub (971★)

Papers citing "Dual-View Visual Contextualization for Web Navigation"

13 / 13 papers shown
Fundamentals of Building Autonomous LLM Agents
Fundamentals of Building Autonomous LLM Agents
Victor de Lamo Castrillo
Habtom Kahsay Gidey
Alexander Lenz
Alois Knoll
LLMAGLM&Ro
270
5
0
10 Oct 2025
Watch and Learn: Learning to Use Computers from Online Videos
Watch and Learn: Learning to Use Computers from Online Videos
Chan Hee Song
Yiwen Song
Palash Goyal
Yu-Chuan Su
Oriana Riva
Hamid Palangi
Tomas Pfister
271
2
0
06 Oct 2025
WebMMU: A Benchmark for Multimodal Multilingual Website Understanding and Code Generation
WebMMU: A Benchmark for Multimodal Multilingual Website Understanding and Code Generation
Rabiul Awal
Mahsa Massoud
Aarash Feizi
Zichao Li
Suyuchen Wang
...
Siva Reddy
Juan A. Rodriguez
Perouz Taslakian
Spandana Gella
Sai Rajeswar
LRM
230
13
0
22 Aug 2025
OS Agents: A Survey on MLLM-based Agents for General Computing Devices Use
OS Agents: A Survey on MLLM-based Agents for General Computing Devices Use
Xueyu Hu
Tao Xiong
Biao Yi
Zishu Wei
Ruixuan Xiao
...
Zhou Zhao
Hongxia Yang
Fan Wu
Shengyu Zhang
Fei Wu
LLMAGLM&RoAI4TS
382
43
0
06 Aug 2025
Turbocharging Web Automation: The Impact of Compressed History States
Turbocharging Web Automation: The Impact of Compressed History StatesAnnual Meeting of the Association for Computational Linguistics (ACL), 2025
Xiyue Zhu
Peng Tang
Haofu Liao
Srikar Appalaraju
OffRL
294
3
0
28 Jul 2025
WebAgent-R1: Training Web Agents via End-to-End Multi-Turn Reinforcement Learning
WebAgent-R1: Training Web Agents via End-to-End Multi-Turn Reinforcement Learning
Zhepei Wei
Wenlin Yao
Yao Liu
Weizhi Zhang
Qin Lu
...
Puyang Xu
Chao Zhang
Bing Yin
Hyokun Yun
Lihong Li
OffRLCLLOnRLLRM
568
92
0
22 May 2025
A Survey of WebAgents: Towards Next-Generation AI Agents for Web Automation with Large Foundation Models
A Survey of WebAgents: Towards Next-Generation AI Agents for Web Automation with Large Foundation Models
Liangbo Ning
Ziran Liang
Zhuohang Jiang
Haohao Qu
Yujuan Ding
...
Xiao Wei
Shanru Lin
Hui Liu
Philip S. Yu
Qing Li
LLMAGLM&Ro
837
78
0
30 Mar 2025
GUI-Xplore: Empowering Generalizable GUI Agents with One Exploration
GUI-Xplore: Empowering Generalizable GUI Agents with One ExplorationComputer Vision and Pattern Recognition (CVPR), 2025
Yuchen Sun
Shanhui Zhao
Tao Yu
Hao Wen
Samith Va
Mengwei Xu
Yan Liang
Chongyang Zhang
LLMAG
415
13
0
22 Mar 2025
SpiritSight Agent: Advanced GUI Agent with One Look
SpiritSight Agent: Advanced GUI Agent with One LookComputer Vision and Pattern Recognition (CVPR), 2025
Zhiyuan Huang
Ziming Cheng
Junting Pan
Zhaohui Hou
Mingjie Zhan
LLMAG
539
15
0
05 Mar 2025
Attention-driven GUI Grounding: Leveraging Pretrained Multimodal Large
  Language Models without Fine-Tuning
Attention-driven GUI Grounding: Leveraging Pretrained Multimodal Large Language Models without Fine-TuningAAAI Conference on Artificial Intelligence (AAAI), 2024
Hai-Ming Xu
Qi Chen
Lei Wang
Lingqiao Liu
347
11
0
14 Dec 2024
The BrowserGym Ecosystem for Web Agent Research
The BrowserGym Ecosystem for Web Agent Research
Thibault Le Sellier De Chezelles
Maxime Gasse
Alexandre Lacoste
Alexandre Drouin
Massimo Caccia
...
Siva Reddy
Quentin Cappart
Graham Neubig
Ruslan Salakhutdinov
Nicolas Chapados
LLMAG
2.1K
80
0
06 Dec 2024
MMInA: Benchmarking Multihop Multimodal Internet Agents
MMInA: Benchmarking Multihop Multimodal Internet Agents
Ziniu Zhang
Ziniu Zhang
Liangyu Chen
Yu Qiao
LLMAGLM&Ro
421
42
0
15 Apr 2024
Tur[k]ingBench: A Challenge Benchmark for Web Agents
Tur[k]ingBench: A Challenge Benchmark for Web Agents
Kevin Xu
Yeganeh Kordi
Kate Sanders
Yizhong Wang
Adam Byerly
Kate Sanders
Adam Byerly
Jingyu Zhang
Benjamin Van Durme
Daniel Khashabi
LLMAG
653
16
0
18 Mar 2024
1
Page 1 of 1