PICK: Processing Key Information Extraction from Documents using Improved Graph Learning-Convolutional Networks

International Conference on Pattern Recognition (ICPR), 2020
16 April 2020
Wenwen Yu
Ning Lu
Xianbiao Qi
Ping Gong
Rong Xiao
ArXiv (abs) | PDF | HTML | GitHub (563★)

Papers citing "PICK: Processing Key Information Extraction from Documents using Improved Graph Learning-Convolutional Networks"

50 / 69 papers shown
Document Intelligence in the Era of Large Language Models: A Survey
Weishi Wang
Hengchang Hu
Zhijie Zhang
Zhaochen Li
Hongxin Shao
Daniel Dahlmeier
AI4TS
278
4
0
15 Oct 2025
OTCR: Optimal Transmission, Compression and Representation for Multimodal Information Extraction
Y. Li
Yajiao Wang
Wenhao Hu
Z. Zhang
Mengting Zhang
122
0
0
17 Sep 2025
Seeing is Believing? Mitigating OCR Hallucinations in Multimodal Large Language Models
Zhentao He
C. Zhang
Ziheng Wu
Z. Chen
Yufei Zhan
Yifan Li
Zhao Zhang
Xian Wang
Minghui Qiu
MLLM
267
7
0
25 Jun 2025
A Survey on Vietnamese Document Analysis and Recognition: Challenges and Future Directions
Anh Le
Thanh Lam
Dung Nguyen
288
1
0
05 Jun 2025
FS-DAG: Few Shot Domain Adapting Graph Networks for Visually Rich Document Understanding
International Conference on Computational Linguistics (COLING), 2025
Amit Agarwal
Srikant Panda
Kulbhushan Pachauri
287
14
0
22 May 2025
OmniParser V2: Structured-Points-of-Thought for Unified Visual Text Parsing and Its Generality to Multimodal Large Language Models
Wenwen Yu
Zhibo Yang
Jianqiang Wan
Sibo Song
J. Tang
Wenqing Cheng
Yunxing Liu
Xiang Bai
378
20
0
22 Feb 2025
GraphRevisedIE: Multimodal Information Extraction with Graph-Revised Network
Pattern Recognition (Pattern Recogn.), 2023
Panfeng Cao
Jian Wu
235
21
0
02 Oct 2024
SynJAC: Synthetic-data-driven Joint-granular Adaptation and Calibration for Domain Specific Scanned Document Key Information Extraction
Yihao Ding
S. Han
Zechuan Li
Hyunsuk Chung
323
3
0
02 Oct 2024
ViBERTgrid BiLSTM-CRF: Multimodal Key Information Extraction from Unstructured Financial Documents
Furkan Pala
Mehmet Yasin Akpınar
Onur Deniz
Gülşen Eryiğit
183
2
0
23 Sep 2024
ViRED: Prediction of Visual Relations in Engineering Drawings
International Conference on Mobile Ad-hoc and Sensor Networks (ICMASN), 2024
Chao Gu
Ke Lin
Yiyang Luo
Jiahui Hou
Xiang-Yang Li
283
1
0
02 Sep 2024
Arctic-TILT. Business Document Understanding at Sub-Billion Scale
Łukasz Borchmann
Michał Pietruszka
Wojciech Jaśkowski
Dawid Jurkiewicz
Piotr Halama
...
Gabriela Nowakowska
Artur Zawłocki
Łukasz Duhr
Paweł Dyda
Michał Turski
VLM
309
4
0
08 Aug 2024
Deep Learning based Visually Rich Document Content Understanding: A Survey
Muhammad Ali
Jean Lee
Salman Khan
Eduard Hovy
571
21
0
02 Aug 2024
XFormParser: A Simple and Effective Multimodal Multilingual Semi-structured Form Parser
Xianfu Cheng
Hang Zhang
Zhiqiang Wang
Xiang Li
Weixiao Zhou
...
Fei Liu
Wei Zhang
Tao Sun
Tongliang Li
Zhoujun Li
270
4
0
27 May 2024
SmartFlow: Robotic Process Automation using LLMs
Arushi Jain
Shubham Paliwal
Monika Sharma
Lovekesh Vig
Gautam M. Shroff
99
3
0
21 May 2024
HRVDA: High-Resolution Visual Document Assistant
Computer Vision and Pattern Recognition (CVPR), 2024
Chaohu Liu
Kun Yin
Haoyu Cao
Xinghua Jiang
Xin Li
Yinsong Liu
Deqiang Jiang
Xing Sun
Linli Xu
VLM
313
40
0
10 Apr 2024
OmniParser: A Unified Framework for Text Spotting, Key Information Extraction and Table Recognition
Jianqiang Wan
Sibo Song
Wenwen Yu
Yuliang Liu
Wenqing Cheng
Fei Huang
Xiang Bai
Cong Yao
Zhibo Yang
335
87
0
28 Mar 2024
Transformers and Language Models in Form Understanding: A Comprehensive Review of Scanned Document Analysis
Abdelrahman Abdallah
Daniel Eberharter
Zoe Pfister
Adam Jatowt
289
17
0
06 Mar 2024
DocGraphLM: Documental Graph Language Model for Information Extraction
Dongsheng Wang
Zhiqiang Ma
Armineh Nourbakhsh
Kang Gu
Sameena Shah
218
16
0
05 Jan 2024
DONUT-hole: DONUT Sparsification by Harnessing Knowledge and Optimizing Learning Efficiency
Azhar Shaikh
Michael Cochez
Denis Diachkov
Michiel de Rijcke
Sahar Yousefi
321
2
0
09 Nov 2023
Exploring OCR Capabilities of GPT-4V(ision): A Quantitative and In-depth Evaluation
Yongxin Shi
Dezhi Peng
Wenhui Liao
Zening Lin
Xinhong Chen
Chongyu Liu
Yuyi Zhang
Lianwen Jin
MLLM
478
58
0
25 Oct 2023
Enhancing Document Information Analysis with Multi-Task Pre-training: A Robust Approach for Information Extraction in Visually-Rich Documents
IEEE International Joint Conference on Neural Networks (IJCNN), 2023
Tofik Ali
Partha Pratim Roy
264
1
0
25 Oct 2023
GenKIE: Robust Generative Multimodal Document Key Information Extraction
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Panfeng Cao
Ye Wang
Qiang Zhang
Zaiqiao Meng
SyDa
232
9
0
24 Oct 2023
DocTrack: A Visually-Rich Document Dataset Really Aligned with Human Eye Movement for Machine Reading
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Hao Wang
Qingxuan Wang
Yue Li
Changqing Wang
Chenhui Chu
Rui Wang
VGen
167
4
0
23 Oct 2023
Long-Range Transformer Architectures for Document Understanding
Thibault Douzon
S. Duffner
Christophe Garcia
Jérémy Espinas
VLM
219
3
0
11 Sep 2023
Improving Information Extraction on Business Documents with Specific Pre-Training Tasks
International Workshop on Document Analysis Systems (DAS), 2023
Thibault Douzon
S. Duffner
Christophe Garcia
Jérémy Espinas
186
9
0
11 Sep 2023
Attention Where It Matters: Rethinking Visual Document Understanding with Selective Region Concentration
IEEE International Conference on Computer Vision (ICCV), 2023
H. Cao
Changcun Bao
Chaohu Liu
Huang-wei Chen
Kun Yin
Hao Liu
Yinsong Liu
Deqiang Jiang
Xing Sun
319
21
0
03 Sep 2023
DocTr: Document Transformer for Structured Information Extraction in Documents
IEEE International Conference on Computer Vision (ICCV), 2023
Haofu Liao
Aruni RoyChowdhury
Weijian Li
Ankan Bansal
Yuting Zhang
Zhuowen Tu
R. Satzoda
R. Manmatha
Vijay Mahadevan
252
26
0
16 Jul 2023
Transcending Traditional Boundaries: Leveraging Inter-Annotator Agreement (IAA) for Enhancing Data Management Operations (DMOps)
Damrin Kim
Namhyeok Kim
Chanjun Park
Harksoo Kim
144
1
0
26 Jun 2023
ICDAR 2023 Competition on Structured Text Extraction from Visually-Rich Document Images
IEEE International Conference on Document Analysis and Recognition (ICDAR), 2023
Wenwen Yu
Chengquan Zhang
H. Cao
Wei Hua
Bohan Li
...
Hao Fei
Dimosthenis Karatzas
Xingchao Sun
Jingdong Wang
Xiang Bai
222
22
0
05 Jun 2023
Layout and Task Aware Instruction Prompt for Zero-shot Document Image Question Answering
Wenjin Wang
Yunhao Li
Yixin Ou
Yin Zhang
VLM
553
36
0
01 Jun 2023
GVdoc: Graph-based Visual Document Classification
Fnu Mohbat
Mohammed J Zaki
Catherine Finegan-Dollak
Ashish Verma
OOD
240
2
0
26 May 2023
RE²: Region-Aware Relation Extraction from Visually Rich Documents
North American Chapter of the Association for Computational Linguistics (NAACL), 2023
Pritika Ramu
Sijia Wang
Lalla Mouatadid
Joy Rimchala
Lifu Huang
321
0
0
24 May 2023
Detecting automatically the layout of clinical documents to enhance the performances of downstream natural language processing
C. Gérardin
Perceval Wajsburt
Basile Dura
Alice Calliger
Alexandre Mouchet
X. Tannier
R. Bey
218
2
0
23 May 2023
Visual Information Extraction in the Wild: Practical Dataset and End-to-end Solution
IEEE International Conference on Document Analysis and Recognition (ICDAR), 2023
Jianfeng Kuang
Wei Hua
Dingkang Liang
Mingkun Yang
Deqiang Jiang
Bo Ren
Xiang Bai
383
60
0
12 May 2023
Doc2SoarGraph: Discrete Reasoning over Visually-Rich Table-Text Documents via Semantic-Oriented Hierarchical Graphs
International Conference on Language Resources and Evaluation (LREC), 2023
Fengbin Zhu
Chao Wang
Fuli Feng
Zifeng Ren
Moxin Li
Tat-Seng Chua
322
7
0
03 May 2023
Information Redundancy and Biases in Public Document Information Extraction Benchmarks
IEEE International Conference on Document Analysis and Recognition (ICDAR), 2023
S. Laatiri
Pirashanth Ratnamogan
Joel Tang
Laurent Lam
William Vanhuffel
Fabien Caspani
175
4
0
28 Apr 2023
Large Scale Genealogical Information Extraction From Handwritten Quebec Parish Records
International Journal on Document Analysis and Recognition (IJDAR), 2023
Solène Tarride
Martin Maarand
Mélodie Boillet
James McGrath
Eugénie Capel
H. Vézina
Christopher Kermorvant
251
16
0
27 Apr 2023
GeoLayoutLM: Geometric Pre-training for Visual Information Extraction
Computer Vision and Pattern Recognition (CVPR), 2023
Chuwei Luo
Changxu Cheng
Qi Zheng
Cong Yao
408
66
0
21 Apr 2023
A Question-Answering Approach to Key Value Pair Extraction from Form-like Document Images
AAAI Conference on Artificial Intelligence (AAAI), 2023
Kai Hu
Zhuoyuan Wu
Zhuoyao Zhong
Weihong Lin
Lei-huan Sun
Qiang Huo
269
15
0
17 Apr 2023
Key Information Extraction in Purchase Documents using Deep Learning and Rule-based Corrections
R. Arroyo
J. Yebes
E. Martínez
Hector Corrales
Javier Lorenzo
206
3
0
07 Oct 2022
ERNIE-mmLayout: Multi-grained MultiModal Transformer for Document Understanding
Wenjin Wang
Zhengjie Huang
Bin Luo
Qianglong Chen
Qiming Peng
...
Weichong Yin
Shi Feng
Yu Sun
Dianhai Yu
Yin Zhang
ViT
210
14
0
18 Sep 2022
Doc2Graph: a Task Agnostic Document Understanding Framework based on Graph Neural Networks
Andrea Gemelli
Sanket Biswas
Enrico Civitelli
Josep Lladós
S. Marinai
187
23
0
23 Aug 2022
Information Extraction from Scanned Invoice Images using Text Analysis and Layout Features
Signal Processing: Image Communication (SPIC), 2021
H. Ha
Ales Horak
152
29
0
08 Aug 2022
TRIE++: Towards End-to-End Information Extraction from Visually Rich Documents
Zhanzhan Cheng
Peng Zhang
Can Li
Qiao Liang
Yunlu Xu
Pengfei Li
Shiliang Pu
Yi Niu
Fei Wu
165
15
0
14 Jul 2022
GMN: Generative Multi-modal Network for Practical Document Information Extraction
North American Chapter of the Association for Computational Linguistics (NAACL), 2022
H. Cao
Jiefeng Ma
Antai Guo
Yiqing Hu
Hao Liu
Deqiang Jiang
Yinsong Liu
Bo Ren
163
9
0
11 Jul 2022
Bi-VLDoc: Bidirectional Vision-Language Modeling for Visually-Rich Document Understanding
International Journal on Document Analysis and Recognition (IJDAR), 2022
Chuwei Luo
Guozhi Tang
Qi Zheng
Cong Yao
Lianwen Jin
Chenliang Li
Yang Xue
Luo Si
326
25
0
27 Jun 2022
Business Document Information Extraction: Towards Practical Benchmarks
Conference and Labs of the Evaluation Forum (CLEF), 2022
Matyáš Skalický
Štěpán Šimsa
Michal Uřičář
Milan Šulc
258
14
0
20 Jun 2022
RDU: A Region-based Approach to Form-style Document Understanding
Fengbin Zhu
Chao Wang
Wenqiang Lei
Ziyang Liu
Tat-Seng Chua
295
2
0
14 Jun 2022
FormNet: Structural Encoding beyond Sequential Modeling in Form Document Information Extraction
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Chen-Yu Lee
Chun-Liang Li
Timothy Dozat
Vincent Perot
Guolong Su
Nan Hua
Joshua Ainslie
Renshen Wang
Yasuhisa Fujii
Tomas Pfister
255
90
0
16 Mar 2022
LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understanding
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Jiapeng Wang
Lianwen Jin
Kai Ding
VLM
275
186
0
28 Feb 2022