Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
All Papers
0 / 0 papers shown
Title
Home
Papers
2203.13530
Cited By
v1
v2 (latest)
Multimodal Pre-training Based on Graph Attention Network for Document Understanding
IEEE transactions on multimedia (IEEE TMM), 2022
25 March 2022
Zhenrong Zhang
Jiefeng Ma
Jun Du
Licheng Wang
Jianshu Zhang
Re-assign community
ArXiv (abs)
PDF
HTML
Github (43★)
Papers citing
"Multimodal Pre-training Based on Graph Attention Network for Document Understanding"
19 / 19 papers shown
Title
Cascaded Robust Rectification for Arbitrary Document Images
Chaoyun Wang
Quanxin Huang
I-Chao Shen
Takeo Igarashi
Nanning Zheng
Caigui Jiang
112
0
0
28 Nov 2025
OTCR: Optimal Transmission, Compression and Representation for Multimodal Information Extraction
Y. Li
Yajiao Wang
Wenhao Hu
Z. Zhang
Mengting Zhang
64
0
0
17 Sep 2025
Document Image Rectification Bases on Self-Adaptive Multitask Fusion
Heng Li
Xiangping Wu
Qingcai Chen
329
0
0
09 May 2025
Problem Solved? Information Extraction Design Space for Layout-Rich Documents using LLMs
Gaye Colakoglu
Gürkan Solmaz
Jonathan Fürst
273
4
0
25 Feb 2025
Modeling Layout Reading Order as Ordering Relations for Visually-rich Document Understanding
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Chong Zhang
Yi Tu
Yixi Zhao
Chenshu Yuan
Huan Chen
...
Mingxu Chai
Ya Guo
Huijia Zhu
Qi Zhang
Tao Gui
176
10
0
29 Sep 2024
See then Tell: Enhancing Key Information Extraction with Vision Grounding
Shuhang Liu
Zhenrong Zhang
Pengfei Hu
Jiefeng Ma
Jun Du
Qing Wang
Jianshu Zhang
Chenyu Liu
235
1
0
29 Sep 2024
DocMamba: Efficient Document Pre-training with State Space Model
AAAI Conference on Artificial Intelligence (AAAI), 2024
Pengfei Hu
Zhenrong Zhang
Jiefeng Ma
Shuhang Liu
Jun Du
Jianshu Zhang
Mamba
262
1
0
18 Sep 2024
Deep Learning based Visually Rich Document Content Understanding: A Survey
Muhammad Ali
Jean Lee
Salman Khan
Eduard Hovy
434
14
0
02 Aug 2024
SRFUND: A Multi-Granularity Hierarchical Structure Reconstruction Benchmark in Form Understanding
Jiefeng Ma
Yan Wang
Chenyu Liu
Jun Du
Yu Hu
Zhenrong Zhang
Pengfei Hu
Qing Wang
Jianshu Zhang
174
1
0
13 Jun 2024
BuDDIE: A Business Document Dataset for Multi-task Information Extraction
Ran Zmigrod
Dongsheng Wang
Mathieu Sibue
Yulong Pei
Petr Babkin
...
Antony Papadimitriou
William Watson
Zhiqiang Ma
Armineh Nourbakhsh
Sameena Shah
208
7
0
05 Apr 2024
DocLLM: A layout-aware generative language model for multimodal document understanding
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Dongsheng Wang
Natraj Raman
Mathieu Sibue
Zhiqiang Ma
Petr Babkin
Simerjot Kaur
Yulong Pei
Armineh Nourbakhsh
Xiaomo Liu
VLM
232
100
0
31 Dec 2023
Document Understanding for Healthcare Referrals
IEEE International Conference on Healthcare Informatics (ICHI), 2023
Jimit Mistry
N. Arzeno
MedIm
94
1
0
22 Sep 2023
LMDX: Language Model-based Document Information Extraction and Localization
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Vincent Perot
Kai Kang
Florian Luisier
Guolong Su
Xiaoyu Sun
...
Zifeng Wang
Jiaqi Mu
Hao Zhang
Chen-Yu Lee
Nan Hua
193
51
0
19 Sep 2023
ICDAR 2023 Competition on Structured Text Extraction from Visually-Rich Document Images
IEEE International Conference on Document Analysis and Recognition (ICDAR), 2023
Wenwen Yu
Chengquan Zhang
H. Cao
Wei Hua
Bohan Li
...
Hao Fei
Dimosthenis Karatzas
Xingchao Sun
Jingdong Wang
Xiang Bai
174
18
0
05 Jun 2023
RE
2
^2
2
: Region-Aware Relation Extraction from Visually Rich Documents
North American Chapter of the Association for Computational Linguistics (NAACL), 2023
Pritika Ramu
Sijia Wang
Lalla Mouatadid
Joy Rimchala
Lifu Huang
191
0
0
24 May 2023
Deep Unrestricted Document Image Rectification
IEEE transactions on multimedia (IEEE TMM), 2023
Hao Feng
Shaokai Liu
Jiajun Deng
Wen-gang Zhou
Houqiang Li
ViT
289
24
0
18 Apr 2023
PDFVQA: A New Dataset for Real-World VQA on PDF Documents
Yihao Ding
Siwen Luo
Hyunsuk Chung
S. Han
383
24
0
13 Apr 2023
HRDoc: Dataset and Baseline Method Toward Hierarchical Reconstruction of Document Structures
AAAI Conference on Artificial Intelligence (AAAI), 2023
Jiefeng Ma
Jun Du
Pengfei Hu
Zhenrong Zhang
Jianshu Zhang
Huihui Zhu
Cong Liu
192
18
0
24 Mar 2023
DocILE Benchmark for Document Information Localization and Extraction
IEEE International Conference on Document Analysis and Recognition (ICDAR), 2023
vStvepán vSimsa
Milan vSulc
Michal Uvrivcávr
Yash J. Patel
Ahmed Hamdi
...
Matyávs Skalický
Jivrí Matas
Antoine Doucet
Mickael Coustaty
Dimosthenis Karatzas
177
48
0
11 Feb 2023
1