2402.18262
Cited By
Hierarchical Multimodal Pre-training for Visually Rich Webpage Understanding
28 February 2024
Hongshen Xu
Lu Chen
Zihan Zhao
Da Ma
Ruisheng Cao
Zichen Zhu
Kai Yu
Papers citing
"Hierarchical Multimodal Pre-training for Visually Rich Webpage Understanding"
5 / 5 papers shown
DOM-LM: Learning Generalizable Representations for HTML Documents
Xiang Deng
Prashant Shiralkar
Colin Lockard
Binxuan Huang
Huan Sun
AI4TS
AI4CE
37
37
0
25 Jan 2022
Simplified DOM Trees for Transferable Attribute Extraction from the Web
Yichao Zhou
Ying Sheng
N. Vo
Nick Edmonds
Sandeep Tata
121
28
0
07 Jan 2021
LayoutLMv2: Multi-modal Pre-training for Visually-Rich Document Understanding
Yang Xu
Yiheng Xu
Tengchao Lv
Lei Cui
Furu Wei
...
D. Florêncio
Cha Zhang
Wanxiang Che
Min Zhang
Lidong Zhou
ViT
MLLM
145
498
0
29 Dec 2020
Feature Pyramid Networks for Object Detection
Tsung-Yi Lin
Piotr Dollár
Ross B. Girshick
Kaiming He
Bharath Hariharan
Serge J. Belongie
ObjD
166
21,785
0
09 Dec 2016
Aggregated Residual Transformations for Deep Neural Networks
Saining Xie
Ross B. Girshick
Piotr Dollár
Z. Tu
Kaiming He
268
10,214
0
16 Nov 2016