Evaluation of Deep Convolutional Nets for Document Image Classification and Retrieval

25 February 2015

Papers citing "Evaluation of Deep Convolutional Nets for Document Image Classification and Retrieval"

50 / 188 papers shown

Title
Class-wise and reduced calibration methods Michael Panchenko Anes Benmerzoug Miguel de Benito Delgado 21 0 0 07 Oct 2022
XDoc: Unified Pre-training for Cross-Format Document Understanding Jingye Chen Tengchao Lv Lei Cui Changrong Zhang Furu Wei 50 13 0 06 Oct 2022
ScreenQA: Large-Scale Question-Answer Pairs over Mobile App Screenshots Yu-Chung Hsiao Fedir Zubach Maria Wang Jindong Chen Victor Carbune Jason Lin Maria Wang Yun Zhu Jindong Chen RALM 160 25 0 16 Sep 2022
Augraphy: A Data Augmentation Library for Document Images Alexander Groleau Kok Wei Chee Stefan Larson Samay Maini Jonathan Boarman 38 10 0 30 Aug 2022
Doc2Graph: a Task Agnostic Document Understanding Framework based on Graph Neural Networks Andrea Gemelli Sanket Biswas Enrico Civitelli Josep Lladós S. Marinai 23 15 0 23 Aug 2022
Doc-GCN: Heterogeneous Graph Convolutional Networks for Document Layout Analysis Siwen Luo Yi Ding Siqu Long Josiah Poon S. Han GNN 45 16 0 22 Aug 2022
Just-in-Time Aggregation for Federated Learning K.R. Jayaram Ashish Verma Gegi Thomas Vinod Muthusamy FedML 33 6 0 20 Aug 2022
Understanding Long Documents with Different Position-Aware Attentions Hai Pham Guoxin Wang Yijuan Lu D. Florêncio Changrong Zhang 19 9 0 17 Aug 2022
Information Extraction from Scanned Invoice Images using Text Analysis and Layout Features H. Ha Ales Horak 25 14 0 08 Aug 2022
TRIE++: Towards End-to-End Information Extraction from Visually Rich Documents Zhanzhan Cheng Peng Zhang Can Li Qiao Liang Yunlu Xu Pengfei Li Shiliang Pu Yi Niu Fei Wu 18 10 0 14 Jul 2022
Sequence-aware multimodal page classification of Brazilian legal documents Pedro Henrique Luz de Araujo Ana Paula G. S. de Almeida Fabricio Ataides Braz Nilton Correia da Silva Flávio de Barros Vidal Teofilo de Campos 14 7 0 02 Jul 2022
Bi-VLDoc: Bidirectional Vision-Language Modeling for Visually-Rich Document Understanding Chuwei Luo Guozhi Tang Qi Zheng Cong Yao Lianwen Jin Chenliang Li Yang Xue Luo Si 27 16 0 27 Jun 2022
Business Document Information Extraction: Towards Practical Benchmarks Matyás Skalický Stepán Simsa Michal Uřičář Milan Šulc 30 9 0 20 Jun 2022
V-Doc : Visual questions answers with Documents Yihao Ding Zhe Huang Runlin Wang Yanhang Zhang Xianru Chen Yuzhong Ma Hyunsuk Chung S. Han 31 15 0 27 May 2022
VLCDoC: Vision-Language Contrastive Pre-Training Model for Cross-Modal Document Classification Souhail Bakkali Zuheng Ming Mickael Coustaty Marccal Rusinol O. R. Terrades VLM 51 30 0 24 May 2022
MATrIX -- Modality-Aware Transformer for Information eXtraction Thomas Delteil Edouard Belval Lei Chen Luis Goncalves Vijay Mahadevan 25 3 0 17 May 2022
Unified Pretraining Framework for Document Understanding Jiuxiang Gu Jason Kuen Vlad I. Morariu Handong Zhao Nikolaos Barmpalios R. Jain A. Nenkova Tong Sun 32 96 0 22 Apr 2022
LayoutLMv3: Pre-training for Document AI with Unified Text and Image Masking Yupan Huang Tengchao Lv Lei Cui Yutong Lu Furu Wei 35 432 0 18 Apr 2022
End-to-end Document Recognition and Understanding with Dessurt Brian L. Davis B. Morse Brian L. Price Chris Tensmeyer Curtis Wigington Vlad I. Morariu VLM ViT 37 73 0 30 Mar 2022
Multimodal Pre-training Based on Graph Attention Network for Document Understanding Zhenrong Zhang Jiefeng Ma Jun Du Licheng Wang Jianshu Zhang 18 37 0 25 Mar 2022
Adaptive Aggregation For Federated Learning K.R. Jayaram Vinod Muthusamy Gegi Thomas Ashish Verma Mark Purcell FedML 33 16 0 23 Mar 2022
A Survey of Historical Document Image Datasets Konstantina Nikolaidou Mathias Seuret Hamam Mokayed Marcus Liwicki 27 29 0 16 Mar 2022
DiT: Self-supervised Pre-training for Document Image Transformer Junlong Li Yiheng Xu Tengchao Lv Lei Cui Chaoxi Zhang Furu Wei ViT VLM 38 160 0 04 Mar 2022
LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understanding Jiapeng Wang Lianwen Jin Kai Ding VLM 35 138 0 28 Feb 2022
OCR-IDL: OCR Annotations for Industry Document Library Dataset Ali Furkan Biten Rubèn Pérez Tito Lluís Gómez Ernest Valveny Dimosthenis Karatzas 25 26 0 25 Feb 2022
Combining Deep Learning and Reasoning for Address Detection in Unstructured Text Documents Matthias Engelbach Dennis Klau Jens Drawehn Maximilien Kintz 11 2 0 07 Feb 2022
Text Classification Models for Form Entity Linking M. Villota C. Domínguez Jónathan Heras Eloy J. Mata Vico Pascual MedIm 26 2 0 14 Dec 2021
OCR-free Document Understanding Transformer Geewook Kim Teakgyu Hong Moonbin Yim Jeongyeon Nam Jinyoung Park Jinyeong Yim Wonseok Hwang Sangdoo Yun Dongyoon Han Seunghyun Park ViT 61 263 0 30 Nov 2021
Document AI: Benchmarks, Models and Applications Lei Cui Yiheng Xu Tengchao Lv Furu Wei VLM 24 69 0 16 Nov 2021
Information Extraction from Visually Rich Documents with Font Style Embeddings Ismail Oussaid William Vanhuffel Pirashanth Ratnamogan Mhamed Hajaiej Alexis Mathey Thomas Gilles 19 1 0 07 Nov 2021
Domain Agnostic Few-Shot Learning For Document Intelligence J. Mandivarapu Eric Hunch G. Fung OOD VLM 30 1 0 29 Oct 2021
MarkupLM: Pre-training of Text and Markup Language for Visually-rich Document Understanding Junlong Li Yiheng Xu Lei Cui Furu Wei VLM 3DGS 31 59 0 16 Oct 2021
OPAD: An Optimized Policy-based Active Learning Framework for Document Content Analysis Sumit Shekhar Bhanu Prakash Reddy Guda Ashutosh Chaubey Ishan Jindal Avanish Jain 33 0 0 01 Oct 2021
Skim-Attention: Learning to Focus via Document Layout Laura Nguyen Thomas Scialom Jacopo Staiano Benjamin Piwowarski 27 9 0 02 Sep 2021
Position Masking for Improved Layout-Aware Document Understanding Anik Saha Catherine Finegan-Dollak Ashish Verma 19 2 0 01 Sep 2021
BoundaryNet: An Attentive Deep Network with Fast Marching Distance Maps for Semi-automatic Layout Annotation Abhishek Trivedi Ravi Kiran Sarvadevabhatla 11 1 0 21 Aug 2021
BROS: A Pre-trained Language Model Focusing on Text and Layout for Better Key Information Extraction from Documents Teakgyu Hong Donghyun Kim Mingi Ji Wonseok Hwang Daehyun Nam Sungrae Park VLM 34 150 0 10 Aug 2021
StrucTexT: Structured Text Understanding with Multi-Modal Transformers Yulin Li Yuxi Qian Yuchen Yu Xiameng Qin Chengquan Zhang Yan Liu Kun Yao Junyu Han Jingtuo Liu Errui Ding 27 113 0 06 Aug 2021
Exploring Out-of-Distribution Generalization in Text Classifiers Trained on Tobacco-3482 and RVL-CDIP Stefan Larson Navtej Singh Saarthak Maheshwari Shanti Stewart U. Krishnaswamy OODD OOD 22 3 0 05 Aug 2021
Graph-based Deep Generative Modelling for Document Layout Generation Sanket Biswas Pau Riba Josep Lladós Umapada Pal 19 3 0 09 Jul 2021
Object Detection Based Handwriting Localization Yuli Wu Yucheng Hu Suting Miao 19 4 0 28 Jun 2021
DocFormer: End-to-End Transformer for Document Understanding Srikar Appalaraju Bhavan A. Jasani Bhargava Urala Kota Yusheng Xie R. Manmatha ViT 41 270 0 22 Jun 2021
SelfDoc: Self-Supervised Document Representation Learning Peizhao Li Jiuxiang Gu Jason Kuen Vlad I. Morariu Handong Zhao R. Jain Varun Manjunatha Hongfu Liu ViT SSL 28 159 0 07 Jun 2021
StructuralLM: Structural Pre-training for Form Understanding Chenliang Li Bin Bi Ming Yan Wei Wang Songfang Huang Fei Huang Luo Si LMTD AI4CE 31 131 0 24 May 2021
End-to-End Unsupervised Document Image Blind Denoising M. Gangeh Marcin Plata Hamid Motahari Nigel P. Duffy 15 11 0 19 May 2021
Separation of Powers in Federated Learning P. Cheng Kevin Eykholt Zhongshu Gu Hani Jamjoom K.R. Jayaram Enriquillo Valdez Ashish Verma FedML 26 13 0 19 May 2021
DocReader: Bounding-Box Free Training of a Document Information Extraction Model S. Klaiman Marius Lehne 21 6 0 10 May 2021
Current Status and Performance Analysis of Table Recognition in Document Images with Deep Neural Networks K. Hashmi Marcus Liwicki D. Stricker Muhammad Adnan Afzal Muhammad Ahtsham Afzal Muhammad Zeshan Afzal LMTD 40 48 0 29 Apr 2021
LayoutXLM: Multimodal Pre-training for Multilingual Visually-rich Document Understanding Yiheng Xu Tengchao Lv Lei Cui Guoxin Wang Yijuan Lu D. Florêncio Cha Zhang Furu Wei MLLM VLM 38 127 0 18 Apr 2021
LayoutParser: A Unified Toolkit for Deep Learning Based Document Image Analysis Zejiang Shen Ruochen Zhang Melissa Dell Benjamin Charles Germain Lee Jacob Carlson Weining Li 3DV 8 95 0 29 Mar 2021