DocVQA: A Dataset for VQA on Document Images

1 July 2020

Minesh Mathew

Dimosthenis Karatzas

C. V. Jawahar

Papers citing "DocVQA: A Dataset for VQA on Document Images"

9 / 759 papers shown

Kleister: Key Information Extraction Datasets Involving Long Documents with Complex Layouts

IEEE International Conference on Document Analysis and Recognition (ICDAR), 2021

214

114

12 May 2021

TextOCR: Towards large-scale end-to-end reasoning for arbitrary-shaped scene text

Computer Vision and Pattern Recognition (CVPR), 2021

Amanpreet Singh

257

214

12 May 2021

InfographicVQA

IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2021

373

370

26 Apr 2021

LayoutXLM: Multimodal Pre-training for Multilingual Visually-rich Document Understanding

261

165

18 Apr 2021

Going Full-TILT Boogie on Document Understanding with Text-Image-Layout Transformer

IEEE International Conference on Document Analysis and Recognition (ICDAR), 2021

346

183

18 Feb 2021

VisualMRC: Machine Reading Comprehension on Document Images

AAAI Conference on Artificial Intelligence (AAAI), 2021

Ryota Tanaka

Kyosuke Nishida

Sen Yoshida

286

188

27 Jan 2021

WebSRC: A Dataset for Web-Based Structural Reading Comprehension

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021

231

116

23 Jan 2021

LayoutLMv2: Multi-modal Pre-training for Visually-Rich Document Understanding

Annual Meeting of the Association for Computational Linguistics (ACL), 2020

...

Min Zhang

824

607

29 Dec 2020

Document Visual Question Answering Challenge 2020

211

20 Aug 2020