Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2010.02358
Cited By
VisualWordGrid: Information Extraction From Scanned Documents Using A Multimodal Approach
5 October 2020
Mohamed Kerroumi
Othmane Sayem
A. Shabou
Re-assign community
ArXiv
PDF
HTML
Papers citing
"VisualWordGrid: Information Extraction From Scanned Documents Using A Multimodal Approach"
9 / 9 papers shown
Title
ViBERTgrid BiLSTM-CRF: Multimodal Key Information Extraction from Unstructured Financial Documents
Furkan Pala
Mehmet Yasin Akpınar
Onur Deniz
Gülşen Eryiğit
17
0
0
23 Sep 2024
Exploring the Capabilities of Large Multimodal Models on Dense Text
Shuo Zhang
Biao Yang
Zhang Li
Zhiyin Ma
Yuliang Liu
Xiang Bai
VLM
34
7
0
09 May 2024
DocParser: End-to-end OCR-free Information Extraction from Visually Rich Documents
M. Dhouib
G. Bettaieb
A. Shabou
17
20
0
24 Apr 2023
Document AI: Benchmarks, Models and Applications
Lei Cui
Yiheng Xu
Tengchao Lv
Furu Wei
VLM
21
69
0
16 Nov 2021
Information Extraction from Visually Rich Documents with Font Style Embeddings
Ismail Oussaid
William Vanhuffel
Pirashanth Ratnamogan
Mhamed Hajaiej
Alexis Mathey
Thomas Gilles
16
1
0
07 Nov 2021
Position Masking for Improved Layout-Aware Document Understanding
Anik Saha
Catherine Finegan-Dollak
Ashish Verma
17
2
0
01 Sep 2021
ViBERTgrid: A Jointly Trained Multi-Modal 2D Document Representation for Key Information Extraction from Documents
Weihong Lin
Qifang Gao
Lei-huan Sun
Zhuoyao Zhong
Kaiqin Hu
Qin Ren
Qiang Huo
23
37
0
25 May 2021
LAMPRET: Layout-Aware Multimodal PreTraining for Document Understanding
Te-Lin Wu
Cheng-rong Li
Mingyang Zhang
Tao Chen
Spurthi Amba Hombaiah
Michael Bendersky
13
14
0
16 Apr 2021
A Survey of Deep Learning Approaches for OCR and Document Understanding
Nishant Subramani
Alexandre Matton
Malcolm Greaves
Adrian Lam
8
48
0
27 Nov 2020
1