Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2408.01287
Cited By
v1
v2 (latest)
Deep Learning based Visually Rich Document Content Understanding: A Survey
2 August 2024
Muhammad Ali
Jean Lee
Salman Khan
Eduard Hovy
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Deep Learning based Visually Rich Document Content Understanding: A Survey"
7 / 7 papers shown
Title
Scaling Beyond Context: A Survey of Multimodal Retrieval-Augmented Generation for Document Understanding
Sensen Gao
Shanshan Zhao
Xu Jiang
Lunhao Duan
Yong Xien Chng
Qing-Guo Chen
Weihua Luo
Kaifu Zhang
Jia-Wang Bian
Mingming Gong
162
0
0
17 Oct 2025
Document Intelligence in the Era of Large Language Models: A Survey
Weishi Wang
Hengchang Hu
Zhijie Zhang
Zhaochen Li
Hongxin Shao
Daniel Dahlmeier
AI4TS
120
0
0
15 Oct 2025
DocPruner: A Storage-Efficient Framework for Multi-Vector Visual Document Retrieval via Adaptive Patch-Level Embedding Pruning
Yibo Yan
Guangwei Xu
Xin Zou
Shuliang Liu
James Kwok
Xuming Hu
132
4
0
28 Sep 2025
LLM/Agent-as-Data-Analyst: A Survey
Zirui Tang
Weizheng Wang
Z. Zhou
Yang Jiao
Bangrui Xu
...
Conghui He
Bin Wang
Conghui He
Xiaoyang Wang
Fan Wu
154
5
0
28 Sep 2025
Multi-Modal Vision vs. Text-Based Parsing: Benchmarking LLM Strategies for Invoice Processing
David Berghaus
Armin Berger
L. Hillebrand
K. Cvejoski
R. Sifa
44
0
0
29 Aug 2025
Multimodal Large Language Models for Text-rich Image Understanding: A Comprehensive Review
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Pei Fu
Tongkun Guan
Zining Wang
Zhentao Guo
Chen Duan
...
Boming Chen
Jiayao Ma
Qianyi Jiang
Kai Zhou
Junfeng Luo
VLM
367
1
0
23 Feb 2025
DAViD: Domain Adaptive Visually-Rich Document Understanding with Synthetic Insights
Yihao Ding
S. Han
Zechuan Li
Hyunsuk Chung
137
3
0
02 Oct 2024
1