v1v2v3 (latest)

PICK: Processing Key Information Extraction from Documents using Improved Graph Learning-Convolutional Networks

International Conference on Pattern Recognition (ICPR), 2020

16 April 2020

ArXiv (abs)PDF HTML Github (563★)

Papers citing "PICK: Processing Key Information Extraction from Documents using Improved Graph Learning-Convolutional Networks"

50 / 69 papers shown

Document Intelligence in the Era of Large Language Models: A Survey

278

15 Oct 2025

OTCR: Optimal Transmission, Compression and Representation for Multimodal Information Extraction

122

17 Sep 2025

Seeing is Believing? Mitigating OCR Hallucinations in Multimodal Large Language Models

267

25 Jun 2025

A Survey on Vietnamese Document Analysis and Recognition: Challenges and Future Directions

Anh Le

Thanh Lam

Dung Nguyen

288

05 Jun 2025

FS-DAG: Few Shot Domain Adapting Graph Networks for Visually Rich Document UnderstandingInternational Conference on Computational Linguistics (COLING), 2025

Amit Agarwal

Srikant Panda

Kulbhushan Pachauri

287

22 May 2025

OmniParser V2: Structured-Points-of-Thought for Unified Visual Text Parsing and Its Generality to Multimodal Large Language Models

378

22 Feb 2025

GraphRevisedIE: Multimodal Information Extraction with Graph-Revised NetworkPattern Recognition (Pattern Recogn.), 2023

Panfeng Cao

Jian Wu

235

02 Oct 2024

SynJAC: Synthetic-data-driven Joint-granular Adaptation and Calibration for Domain Specific Scanned Document Key Information Extraction

323

02 Oct 2024

ViBERTgrid BiLSTM-CRF: Multimodal Key Information Extraction from Unstructured Financial Documents

183

23 Sep 2024

ViRED: Prediction of Visual Relations in Engineering DrawingsInternational Conference on Mobile Ad-hoc and Sensor Networks (ICMASN), 2024

283

02 Sep 2024

Arctic-TILT. Business Document Understanding at Sub-Billion Scale

Michał Pietruszka

...

Artur Zawłocki

Łukasz Duhr

Paweł Dyda

Michał Turski

VLM

309

08 Aug 2024

Deep Learning based Visually Rich Document Content Understanding: A Survey

571

02 Aug 2024

XFormParser: A Simple and Effective Multimodal Multilingual Semi-structured Form Parser

Xiang Li

...

Zhoujun Li

270

27 May 2024

SmartFlow: Robotic Process Automation using LLMs

21 May 2024

HRVDA: High-Resolution Visual Document AssistantComputer Vision and Pattern Recognition (CVPR), 2024

Xin Li

313

10 Apr 2024

OmniParser: A Unified Framework for Text Spotting, Key Information Extraction and Table Recognition

Yuliang Liu

Fei Huang

335

28 Mar 2024

Transformers and Language Models in Form Understanding: A Comprehensive Review of Scanned Document Analysis

289

06 Mar 2024

DocGraphLM: Documental Graph Language Model for Information Extraction

Dongsheng Wang

218

05 Jan 2024

DONUT-hole: DONUT Sparsification by Harnessing Knowledge and Optimizing Learning Efficiency

321

09 Nov 2023

Exploring OCR Capabilities of GPT-4V(ision) : A Quantitative and In-depth Evaluation

Lianwen Jin

478

25 Oct 2023

Enhancing Document Information Analysis with Multi-Task Pre-training: A Robust Approach for Information Extraction in Visually-Rich DocumentsIEEE International Joint Conference on Neural Network (IJCNN), 2023

Tofik Ali

Partha Pratim Roy

264

25 Oct 2023

GenKIE: Robust Generative Multimodal Document Key Information ExtractionConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

232

24 Oct 2023

DocTrack: A Visually-Rich Document Dataset Really Aligned with Human Eye Movement for Machine ReadingConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

167

23 Oct 2023

Long-Range Transformer Architectures for Document Understanding

219

11 Sep 2023

Improving Information Extraction on Business Documents with Specific Pre-Training TasksInternational Workshop on Document Analysis Systems (DAS), 2023

186

11 Sep 2023

Attention Where It Matters: Rethinking Visual Document Understanding with Selective Region ConcentrationIEEE International Conference on Computer Vision (ICCV), 2023

319

03 Sep 2023

DocTr: Document Transformer for Structured Information Extraction in DocumentsIEEE International Conference on Computer Vision (ICCV), 2023

252

16 Jul 2023

Transcending Traditional Boundaries: Leveraging Inter-Annotator Agreement (IAA) for Enhancing Data Management Operations (DMOps)

144

26 Jun 2023

ICDAR 2023 Competition on Structured Text Extraction from Visually-Rich Document ImagesIEEE International Conference on Document Analysis and Recognition (ICDAR), 2023

...

Jingdong Wang

222

05 Jun 2023

Layout and Task Aware Instruction Prompt for Zero-shot Document Image Question Answering

553

01 Jun 2023

GVdoc: Graph-based Visual Document Classification

Fnu Mohbat

Mohammed J Zaki

Catherine Finegan-Dollak

Ashish Verma

OOD

240

26 May 2023

RE$^2$: Region-Aware Relation Extraction from Visually Rich Documents

^2

: Region-Aware Relation Extraction from Visually Rich DocumentsNorth American Chapter of the Association for Computational Linguistics (NAACL), 2023

321

24 May 2023

Detecting automatically the layout of clinical documents to enhance the performances of downstream natural language processing

218

23 May 2023

Visual Information Extraction in the Wild: Practical Dataset and End-to-end SolutionIEEE International Conference on Document Analysis and Recognition (ICDAR), 2023

Dingkang Liang

383

12 May 2023

Doc2SoarGraph: Discrete Reasoning over Visually-Rich Table-Text Documents via Semantic-Oriented Hierarchical GraphsInternational Conference on Language Resources and Evaluation (LREC), 2023

322

03 May 2023

Information Redundancy and Biases in Public Document Information Extraction BenchmarksIEEE International Conference on Document Analysis and Recognition (ICDAR), 2023

S. Laatiri

Pirashanth Ratnamogan

175

28 Apr 2023

Large Scale Genealogical Information Extraction From Handwritten Quebec Parish RecordsInternational Journal on Document Analysis and Recognition (IJDAR), 2023

Christopher Kermorvant

251

27 Apr 2023

GeoLayoutLM: Geometric Pre-training for Visual Information ExtractionComputer Vision and Pattern Recognition (CVPR), 2023

408

21 Apr 2023

A Question-Answering Approach to Key Value Pair Extraction from Form-like Document ImagesAAAI Conference on Artificial Intelligence (AAAI), 2023

269

17 Apr 2023

Key Information Extraction in Purchase Documents using Deep Learning and Rule-based Corrections

206

07 Oct 2022

ERNIE-mmLayout: Multi-grained MultiModal Transformer for Document Understanding

...

Dianhai Yu

210

18 Sep 2022

Doc2Graph: a Task Agnostic Document Understanding Framework based on Graph Neural Networks

Andrea Gemelli

Sanket Biswas

Enrico Civitelli

Josep Lladós

S. Marinai

187

23 Aug 2022

Information Extraction from Scanned Invoice Images using Text Analysis and Layout FeaturesSignal processing. Image communication (SPIC), 2021

H. Ha

Ales Horak

152

08 Aug 2022

TRIE++: Towards End-to-End Information Extraction from Visually Rich Documents

165

14 Jul 2022

GMN: Generative Multi-modal Network for Practical Document Information ExtractionNorth American Chapter of the Association for Computational Linguistics (NAACL), 2022

163

11 Jul 2022

Bi-VLDoc: Bidirectional Vision-Language Modeling for Visually-Rich Document UnderstandingInternational Journal on Document Analysis and Recognition (IJDAR), 2022

326

27 Jun 2022

Business Document Information Extraction: Towards Practical BenchmarksConference and Labs of the Evaluation Forum (CLEF), 2022

258

20 Jun 2022

RDU: A Region-based Approach to Form-style Document Understanding

295

14 Jun 2022

FormNet: Structural Encoding beyond Sequential Modeling in Form Document Information ExtractionAnnual Meeting of the Association for Computational Linguistics (ACL), 2022

Chen-Yu Lee

Chun-Liang Li

Joshua Ainslie

Yasuhisa Fujii

Tomas Pfister

255

16 Mar 2022

LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document UnderstandingAnnual Meeting of the Association for Computational Linguistics (ACL), 2022

Jiapeng Wang

Lianwen Jin

Kai Ding

VLM

275

186

28 Feb 2022