Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2410.13824
Cited By
Harnessing Webpage UIs for Text-Rich Visual Understanding
17 October 2024
Junpeng Liu
Tianyue Ou
Yifan Song
Yuxiao Qu
Wai Lam
Chenyan Xiong
Wenhu Chen
Graham Neubig
Xiang Yue
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Harnessing Webpage UIs for Text-Rich Visual Understanding"
2 / 2 papers shown
Title
OmniCaptioner: One Captioner to Rule Them All
Yiting Lu
Jiakang Yuan
Zhen Li
Shitian Zhao
Qi Qin
...
Lei Bai
Zhibo Chen
Peng Gao
Bo Zhang
Peng Gao
MLLM
68
0
0
09 Apr 2025
Navigating the Digital World as Humans Do: Universal Visual Grounding for GUI Agents
Boyu Gou
Ruohan Wang
Boyuan Zheng
Yanan Xie
Cheng Chang
Yiheng Shu
Huan Sun
Yu Su
LM&Ro
LLMAG
35
1
0
07 Oct 2024
1