Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales

Terms and Conditions

Twitter GitHub LinkedIn Bluesky Youtube

© 2026 ResearchTrend.AI, All rights reserved.

Home
Papers
2211.15421
Cited By

VRDU: A Benchmark for Visually-rich Document Understanding

v1v2v3 (latest)

VRDU: A Benchmark for Visually-rich Document Understanding

Knowledge Discovery and Data Mining (KDD), 2022

15 November 2022

Chen-Yu Lee

ArXiv (abs)PDF HTML

Papers citing "VRDU: A Benchmark for Visually-rich Document Understanding"

17 / 17 papers shown

ALDEN: Reinforcement Learning for Active Navigation and Evidence Gathering in Long Documents

ALDEN: Reinforcement Learning for Active Navigation and Evidence Gathering in Long Documents

Jan Philip Wahle

300

0

0

29 Oct 2025

Document Intelligence in the Era of Large Language Models: A Survey

Document Intelligence in the Era of Large Language Models: A Survey

Daniel Dahlmeier

251

3

0

15 Oct 2025

SynDoc: A Hybrid Discriminative-Generative Framework for Enhancing Synthetic Domain-Adaptive Document Key Information Extraction

SynDoc: A Hybrid Discriminative-Generative Framework for Enhancing Synthetic Domain-Adaptive Document Key Information Extraction

Soyeon Caren Han

133

0

0

27 Sep 2025

Doc-CoB: Enhancing Multi-Modal Document Understanding with Visual Chain-of-Boxes Reasoning

Doc-CoB: Enhancing Multi-Modal Document Understanding with Visual Chain-of-Boxes Reasoning

...

354

4

0

24 May 2025

Problem Solved? Information Extraction Design Space for Layout-Rich Documents using LLMs

Problem Solved? Information Extraction Design Space for Layout-Rich Documents using LLMs

Jonathan Fürst

397

4

0

25 Feb 2025

"What is the value of {templates}?" Rethinking Document Information
Extraction Datasets for LLMs

"What is the value of {templates}?" Rethinking Document Information Extraction Datasets for LLMsConference on Empirical Methods in Natural Language Processing (EMNLP), 2024

Armineh Nourbakhsh

179

6

0

20 Oct 2024

SynJAC: Synthetic-data-driven Joint-granular Adaptation and Calibration for Domain Specific Scanned Document Key Information Extraction

SynJAC: Synthetic-data-driven Joint-granular Adaptation and Calibration for Domain Specific Scanned Document Key Information Extraction

281

3

0

02 Oct 2024

Modeling Layout Reading Order as Ordering Relations for Visually-rich
Document Understanding

Modeling Layout Reading Order as Ordering Relations for Visually-rich Document UnderstandingConference on Empirical Methods in Natural Language Processing (EMNLP), 2024

Huan Chen

...

Qi Zhang

237

15

0

29 Sep 2024

Deep Learning based Visually Rich Document Content Understanding: A Survey

Deep Learning based Visually Rich Document Content Understanding: A Survey

547

21

0

02 Aug 2024

OfficeBench: Benchmarking Language Agents across Multiple Applications
for Office Automation

OfficeBench: Benchmarking Language Agents across Multiple Applications for Office Automation

Bill Yuchen Lin

286

26

0

26 Jul 2024

DocGenome: An Open Large-scale Scientific Document Benchmark for
Training and Testing Multi-modal Large Language Models

DocGenome: An Open Large-scale Scientific Document Benchmark for Training and Testing Multi-modal Large Language Models

Xiangchao Yan

Bo Zhang

...

Yongwei Wang

Bin Wang

Junchi Yan

Yu Qiao

277

28

0

17 Jun 2024

KVP10k : A Comprehensive Dataset for Key-Value Pair Extraction in
Business Documents

KVP10k : A Comprehensive Dataset for Key-Value Pair Extraction in Business Documents

Foad Abo Dahood

...

Yevgeny Burshtein

Adi Raz Goldfarb

236

3

0

01 May 2024

BuDDIE: A Business Document Dataset for Multi-task Information
Extraction

BuDDIE: A Business Document Dataset for Multi-task Information Extraction

...

Antony Papadimitriou

Armineh Nourbakhsh

277

8

0

05 Apr 2024

RealKIE: Five Novel Datasets for Enterprise Key Information Extraction

RealKIE: Five Novel Datasets for Enterprise Key Information Extraction

Benjamin Townsend

Katherine Mackowiak

Christopher Wells

318

3

0

29 Mar 2024

ANLS* -- A Universal Document Processing Metric for Generative Large
Language Models

ANLS* -- A Universal Document Processing Metric for Generative Large Language Models

Philemon Schöpf

Sebastian Stabinger

352

9

0

06 Feb 2024

DocLLM: A layout-aware generative language model for multimodal document
understanding

DocLLM: A layout-aware generative language model for multimodal document understandingAnnual Meeting of the Association for Computational Linguistics (ACL), 2023

Armineh Nourbakhsh

306

129

0

31 Dec 2023

LMDX: Language Model-based Document Information Extraction and
Localization

LMDX: Language Model-based Document Information Extraction and LocalizationAnnual Meeting of the Association for Computational Linguistics (ACL), 2023

Florian Luisier

...

Chen-Yu Lee

279

58

0

19 Sep 2023

Page 1 of 1