2305.18721
LayoutMask: Enhance Text-Layout Interaction in Multi-modal Pre-training for Document Understanding
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
30 May 2023
Yi Tu
Ya Guo
Huan Chen
Jinyang Tang
Papers citing
"LayoutMask: Enhance Text-Layout Interaction in Multi-modal Pre-training for Document Understanding"
11 / 11 papers shown
DocPolarBERT: A Pre-trained Model for Document Understanding with Relative Polar Coordinate Encoding of Layout Structures
Benno Uthayasooriyar
Antoine Ly
Franck Vermet
Caio Corro
392
0
0
11 Jul 2025
Relation-Rich Visual Document Generator for Visual Information Extraction
Computer Vision and Pattern Recognition (CVPR), 2025
Zi-Han Jiang
Chien-Wei Lin
Wei-Hua Li
Hsuan-Tung Liu
Yi-Ren Yeh
Chu-Song Chen
301
3
0
14 Apr 2025
TextBite: A Historical Czech Document Dataset for Logical Page Segmentation
Martin Kostelník
Karel Beneš
Michal Hradiš
237
0
0
20 Mar 2025
ReLayout: Towards Real-World Document Understanding via Layout-enhanced Pre-training
International Conference on Computational Linguistics (COLING), 2024
Zhouqiang Jiang
Bowen Wang
Junhao Chen
Yuta Nakashima
289
5
0
14 Oct 2024
Modeling Layout Reading Order as Ordering Relations for Visually-rich Document Understanding
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Chong Zhang
Yi Tu
Yixi Zhao
Chenshu Yuan
Huan Chen
...
Mingxu Chai
Ya Guo
Huijia Zhu
Qi Zhang
Tao Gui
244
18
0
29 Sep 2024
DocMamba: Efficient Document Pre-training with State Space Model
AAAI Conference on Artificial Intelligence (AAAI), 2024
Pengfei Hu
Zhenrong Zhang
Jiefeng Ma
Shuhang Liu
Jun Du
Jianshu Zhang
374
4
0
18 Sep 2024
UNER: A Unified Prediction Head for Named Entity Recognition in Visually-rich Documents
ACM Multimedia (MM), 2024
Yi Tu
Chong Zhang
Ya Guo
Huan Chen
Jinyang Tang
Huijia Zhu
Tao Gui
335
5
0
02 Aug 2024
Deep Learning based Visually Rich Document Content Understanding: A Survey
Muhammad Ali
Jean Lee
Salman Khan
Eduard Hovy
571
23
0
02 Aug 2024
LayoutLLM: Layout Instruction Tuning with Large Language Models for Document Understanding
Chuwei Luo
Yufan Shen
Zhaoqing Zhu
Qi Zheng
Zhi Yu
Cong Yao
442
123
0
08 Apr 2024
On Task-personalized Multimodal Few-shot Learning for Visually-rich Document Entity Retrieval
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Jiayi Chen
H. Dai
Bo Dai
Aidong Zhang
Wei Wei
344
3
0
01 Nov 2023
Reading Order Matters: Information Extraction from Visually-rich Documents by Token Path Prediction
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Chong Zhang
Ya Guo
Yi Tu
Huan Chen
Jinyang Tang
Huijia Zhu
Tao Gui
285
34
0
17 Oct 2023