VisualWordGrid: Information Extraction From Scanned Documents Using A Multimodal Approach

5 October 2020

Papers citing "VisualWordGrid: Information Extraction From Scanned Documents Using A Multimodal Approach"

9 / 9 papers shown

Title
ViBERTgrid BiLSTM-CRF: Multimodal Key Information Extraction from Unstructured Financial Documents Furkan Pala Mehmet Yasin Akpınar Onur Deniz Gülşen Eryiğit 17 0 0 23 Sep 2024
Exploring the Capabilities of Large Multimodal Models on Dense Text Shuo Zhang Biao Yang Zhang Li Zhiyin Ma Yuliang Liu Xiang Bai VLM 34 7 0 09 May 2024
DocParser: End-to-end OCR-free Information Extraction from Visually Rich Documents M. Dhouib G. Bettaieb A. Shabou 17 20 0 24 Apr 2023
Document AI: Benchmarks, Models and Applications Lei Cui Yiheng Xu Tengchao Lv Furu Wei VLM 21 69 0 16 Nov 2021
Information Extraction from Visually Rich Documents with Font Style Embeddings Ismail Oussaid William Vanhuffel Pirashanth Ratnamogan Mhamed Hajaiej Alexis Mathey Thomas Gilles 16 1 0 07 Nov 2021
Position Masking for Improved Layout-Aware Document Understanding Anik Saha Catherine Finegan-Dollak Ashish Verma 17 2 0 01 Sep 2021
ViBERTgrid: A Jointly Trained Multi-Modal 2D Document Representation for Key Information Extraction from Documents Weihong Lin Qifang Gao Lei-huan Sun Zhuoyao Zhong Kaiqin Hu Qin Ren Qiang Huo 23 37 0 25 May 2021
LAMPRET: Layout-Aware Multimodal PreTraining for Document Understanding Te-Lin Wu Cheng-rong Li Mingyang Zhang Tao Chen Spurthi Amba Hombaiah Michael Bendersky 13 14 0 16 Apr 2021
A Survey of Deep Learning Approaches for OCR and Document Understanding Nishant Subramani Alexandre Matton Malcolm Greaves Adrian Lam 8 48 0 27 Nov 2020