ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2211.15421
  4. Cited By
VRDU: A Benchmark for Visually-rich Document Understanding
v1v2v3 (latest)

VRDU: A Benchmark for Visually-rich Document Understanding

Knowledge Discovery and Data Mining (KDD), 2022
15 November 2022
Zilong Wang
Yichao Zhou
Wei Wei
Chen-Yu Lee
Sandeep Tata
ArXiv (abs)PDFHTML

Papers citing "VRDU: A Benchmark for Visually-rich Document Understanding"

17 / 17 papers shown
ALDEN: Reinforcement Learning for Active Navigation and Evidence Gathering in Long Documents
ALDEN: Reinforcement Learning for Active Navigation and Evidence Gathering in Long Documents
Tianyu Yang
Terry Ruas
Yijun Tian
Jan Philip Wahle
Daniel Kurzawe
Bela Gipp
VLM
300
0
0
29 Oct 2025
Document Intelligence in the Era of Large Language Models: A Survey
Document Intelligence in the Era of Large Language Models: A Survey
Weishi Wang
Hengchang Hu
Zhijie Zhang
Zhaochen Li
Hongxin Shao
Daniel Dahlmeier
AI4TS
251
3
0
15 Oct 2025
SynDoc: A Hybrid Discriminative-Generative Framework for Enhancing Synthetic Domain-Adaptive Document Key Information Extraction
SynDoc: A Hybrid Discriminative-Generative Framework for Enhancing Synthetic Domain-Adaptive Document Key Information Extraction
Yihao Ding
Soyeon Caren Han
Yanbei Jiang
Yan Li
Zechuan Li
Yifan Peng
SyDa
133
0
0
27 Sep 2025
Doc-CoB: Enhancing Multi-Modal Document Understanding with Visual Chain-of-Boxes Reasoning
Doc-CoB: Enhancing Multi-Modal Document Understanding with Visual Chain-of-Boxes Reasoning
Ye Mo
Zirui Shao
Kai Ye
Xianwei Mao
Bo Zhang
...
Gang Huang
Kehan Chen
Zhou Huan
Zixu Yan
Sheng Zhou
LRM
354
4
0
24 May 2025
Problem Solved? Information Extraction Design Space for Layout-Rich Documents using LLMs
Problem Solved? Information Extraction Design Space for Layout-Rich Documents using LLMs
Gaye Colakoglu
Gürkan Solmaz
Jonathan Fürst
397
4
0
25 Feb 2025
"What is the value of {templates}?" Rethinking Document Information
  Extraction Datasets for LLMs
"What is the value of {templates}?" Rethinking Document Information Extraction Datasets for LLMsConference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Ran Zmigrod
Pranav Shetty
Mathieu Sibue
Zhiqiang Ma
Armineh Nourbakhsh
Xiaomo Liu
Manuela Veloso
179
6
0
20 Oct 2024
SynJAC: Synthetic-data-driven Joint-granular Adaptation and Calibration for Domain Specific Scanned Document Key Information Extraction
SynJAC: Synthetic-data-driven Joint-granular Adaptation and Calibration for Domain Specific Scanned Document Key Information Extraction
Yihao Ding
S. Han
Zechuan Li
Hyunsuk Chung
281
3
0
02 Oct 2024
Modeling Layout Reading Order as Ordering Relations for Visually-rich
  Document Understanding
Modeling Layout Reading Order as Ordering Relations for Visually-rich Document UnderstandingConference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Chong Zhang
Yi Tu
Yixi Zhao
Chenshu Yuan
Huan Chen
...
Mingxu Chai
Ya Guo
Huijia Zhu
Qi Zhang
Tao Gui
237
15
0
29 Sep 2024
Deep Learning based Visually Rich Document Content Understanding: A Survey
Deep Learning based Visually Rich Document Content Understanding: A Survey
Muhammad Ali
Jean Lee
Salman Khan
Eduard Hovy
547
21
0
02 Aug 2024
OfficeBench: Benchmarking Language Agents across Multiple Applications
  for Office Automation
OfficeBench: Benchmarking Language Agents across Multiple Applications for Office Automation
Zilong Wang
Yuedong Cui
Li Zhong
Zimin Zhang
Da Yin
Bill Yuchen Lin
Jingbo Shang
286
26
0
26 Jul 2024
DocGenome: An Open Large-scale Scientific Document Benchmark for
  Training and Testing Multi-modal Large Language Models
DocGenome: An Open Large-scale Scientific Document Benchmark for Training and Testing Multi-modal Large Language Models
Renqiu Xia
Song Mao
Xiangchao Yan
Hongbin Zhou
Bo Zhang
...
Yongwei Wang
Bin Wang
Junchi Yan
Fei Wu
Yu Qiao
277
28
0
17 Jun 2024
KVP10k : A Comprehensive Dataset for Key-Value Pair Extraction in
  Business Documents
KVP10k : A Comprehensive Dataset for Key-Value Pair Extraction in Business Documents
O. Naparstek
Roi Pony
Inbar Shapira
Foad Abo Dahood
Ophir Azulai
...
Idan Friedman
Orit Prince
Yevgeny Burshtein
Adi Raz Goldfarb
Udi Barzelay
236
3
0
01 May 2024
BuDDIE: A Business Document Dataset for Multi-task Information
  Extraction
BuDDIE: A Business Document Dataset for Multi-task Information Extraction
Ran Zmigrod
Dongsheng Wang
Mathieu Sibue
Yulong Pei
Petr Babkin
...
Antony Papadimitriou
William Watson
Zhiqiang Ma
Armineh Nourbakhsh
Sameena Shah
277
8
0
05 Apr 2024
RealKIE: Five Novel Datasets for Enterprise Key Information Extraction
RealKIE: Five Novel Datasets for Enterprise Key Information Extraction
Benjamin Townsend
Madison May
Katherine Mackowiak
Christopher Wells
SyDa
318
3
0
29 Mar 2024
ANLS* -- A Universal Document Processing Metric for Generative Large
  Language Models
ANLS* -- A Universal Document Processing Metric for Generative Large Language Models
David Peer
Philemon Schöpf
V. Nebendahl
A. Rietzler
Sebastian Stabinger
352
9
0
06 Feb 2024
DocLLM: A layout-aware generative language model for multimodal document
  understanding
DocLLM: A layout-aware generative language model for multimodal document understandingAnnual Meeting of the Association for Computational Linguistics (ACL), 2023
Dongsheng Wang
Natraj Raman
Mathieu Sibue
Zhiqiang Ma
Petr Babkin
Simerjot Kaur
Yulong Pei
Armineh Nourbakhsh
Xiaomo Liu
VLM
306
129
0
31 Dec 2023
LMDX: Language Model-based Document Information Extraction and
  Localization
LMDX: Language Model-based Document Information Extraction and LocalizationAnnual Meeting of the Association for Computational Linguistics (ACL), 2023
Vincent Perot
Kai Kang
Florian Luisier
Guolong Su
Xiaoyu Sun
...
Zifeng Wang
Jiaqi Mu
Hao Zhang
Chen-Yu Lee
Nan Hua
279
58
0
19 Sep 2023
1
Page 1 of 1