ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2010.02358
  4. Cited By
VisualWordGrid: Information Extraction From Scanned Documents Using A
  Multimodal Approach

VisualWordGrid: Information Extraction From Scanned Documents Using A Multimodal Approach

5 October 2020
Mohamed Kerroumi
Othmane Sayem
A. Shabou
ArXivPDFHTML

Papers citing "VisualWordGrid: Information Extraction From Scanned Documents Using A Multimodal Approach"

9 / 9 papers shown
Title
ViBERTgrid BiLSTM-CRF: Multimodal Key Information Extraction from
  Unstructured Financial Documents
ViBERTgrid BiLSTM-CRF: Multimodal Key Information Extraction from Unstructured Financial Documents
Furkan Pala
Mehmet Yasin Akpınar
Onur Deniz
Gülşen Eryiğit
17
0
0
23 Sep 2024
Exploring the Capabilities of Large Multimodal Models on Dense Text
Exploring the Capabilities of Large Multimodal Models on Dense Text
Shuo Zhang
Biao Yang
Zhang Li
Zhiyin Ma
Yuliang Liu
Xiang Bai
VLM
34
7
0
09 May 2024
DocParser: End-to-end OCR-free Information Extraction from Visually Rich
  Documents
DocParser: End-to-end OCR-free Information Extraction from Visually Rich Documents
M. Dhouib
G. Bettaieb
A. Shabou
17
20
0
24 Apr 2023
Document AI: Benchmarks, Models and Applications
Document AI: Benchmarks, Models and Applications
Lei Cui
Yiheng Xu
Tengchao Lv
Furu Wei
VLM
21
69
0
16 Nov 2021
Information Extraction from Visually Rich Documents with Font Style
  Embeddings
Information Extraction from Visually Rich Documents with Font Style Embeddings
Ismail Oussaid
William Vanhuffel
Pirashanth Ratnamogan
Mhamed Hajaiej
Alexis Mathey
Thomas Gilles
16
1
0
07 Nov 2021
Position Masking for Improved Layout-Aware Document Understanding
Position Masking for Improved Layout-Aware Document Understanding
Anik Saha
Catherine Finegan-Dollak
Ashish Verma
17
2
0
01 Sep 2021
ViBERTgrid: A Jointly Trained Multi-Modal 2D Document Representation for
  Key Information Extraction from Documents
ViBERTgrid: A Jointly Trained Multi-Modal 2D Document Representation for Key Information Extraction from Documents
Weihong Lin
Qifang Gao
Lei-huan Sun
Zhuoyao Zhong
Kaiqin Hu
Qin Ren
Qiang Huo
23
37
0
25 May 2021
LAMPRET: Layout-Aware Multimodal PreTraining for Document Understanding
LAMPRET: Layout-Aware Multimodal PreTraining for Document Understanding
Te-Lin Wu
Cheng-rong Li
Mingyang Zhang
Tao Chen
Spurthi Amba Hombaiah
Michael Bendersky
13
14
0
16 Apr 2021
A Survey of Deep Learning Approaches for OCR and Document Understanding
A Survey of Deep Learning Approaches for OCR and Document Understanding
Nishant Subramani
Alexandre Matton
Malcolm Greaves
Adrian Lam
8
48
0
27 Nov 2020
1