Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2205.12029
Cited By
VLCDoC: Vision-Language Contrastive Pre-Training Model for Cross-Modal Document Classification
24 May 2022
Souhail Bakkali
Zuheng Ming
Mickael Coustaty
Marccal Rusinol
O. R. Terrades
VLM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"VLCDoC: Vision-Language Contrastive Pre-Training Model for Cross-Modal Document Classification"
3 / 3 papers shown
Title
RegCLR: A Self-Supervised Framework for Tabular Representation Learning in the Wild
Weiyao Wang
Byung-Hak Kim
Varun Ganapathi
SSL
LMTD
10
1
0
02 Nov 2022
LayoutLMv2: Multi-modal Pre-training for Visually-Rich Document Understanding
Yang Xu
Yiheng Xu
Tengchao Lv
Lei Cui
Furu Wei
...
D. Florêncio
Cha Zhang
Wanxiang Che
Min Zhang
Lidong Zhou
ViT
MLLM
137
492
0
29 Dec 2020
CDeC-Net: Composite Deformable Cascade Network for Table Detection in Document Images
Madhav Agarwal
Ajoy Mondal
C. V. Jawahar
32
61
0
25 Aug 2020
1