2012.14740
Cited By
LayoutLMv2: Multi-modal Pre-training for Visually-Rich Document Understanding
29 December 2020
Yang Xu
Yiheng Xu
Tengchao Lv
Lei Cui
Furu Wei
Guoxin Wang
Yijuan Lu
D. Florêncio
Cha Zhang
Wanxiang Che
Min Zhang
Lidong Zhou
ViT
MLLM
Papers citing
"LayoutLMv2: Multi-modal Pre-training for Visually-Rich Document Understanding"
3 / 3 papers shown
Title
FUNSD: A Dataset for Form Understanding in Noisy Scanned Documents
Guillaume Jaume
H. K. Ekenel
Jean-Philippe Thiran
96
259
0
27 May 2019
Aggregated Residual Transformations for Deep Neural Networks
Saining Xie
Ross B. Girshick
Piotr Dollár
Z. Tu
Kaiming He
261
6,278
0
16 Nov 2016
Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation
Yonghui Wu
M. Schuster
Z. Chen
Quoc V. Le
Mohammad Norouzi
...
Alex Rudnick
Oriol Vinyals
G. Corrado
Macduff Hughes
J. Dean
AIMat
716
6,435
0
26 Sep 2016