ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1502.07058
  4. Cited By
Evaluation of Deep Convolutional Nets for Document Image Classification
  and Retrieval

Evaluation of Deep Convolutional Nets for Document Image Classification and Retrieval

25 February 2015
Adam W. Harley
Alex Ufkes
Konstantinos G. Derpanis
ArXivPDFHTML

Papers citing "Evaluation of Deep Convolutional Nets for Document Image Classification and Retrieval"

50 / 188 papers shown
Title
Class-wise and reduced calibration methods
Class-wise and reduced calibration methods
Michael Panchenko
Anes Benmerzoug
Miguel de Benito Delgado
21
0
0
07 Oct 2022
XDoc: Unified Pre-training for Cross-Format Document Understanding
XDoc: Unified Pre-training for Cross-Format Document Understanding
Jingye Chen
Tengchao Lv
Lei Cui
Changrong Zhang
Furu Wei
50
13
0
06 Oct 2022
ScreenQA: Large-Scale Question-Answer Pairs over Mobile App Screenshots
ScreenQA: Large-Scale Question-Answer Pairs over Mobile App Screenshots
Yu-Chung Hsiao
Fedir Zubach
Maria Wang
Jindong Chen
Victor Carbune
Jason Lin
Maria Wang
Yun Zhu
Jindong Chen
RALM
160
25
0
16 Sep 2022
Augraphy: A Data Augmentation Library for Document Images
Augraphy: A Data Augmentation Library for Document Images
Alexander Groleau
Kok Wei Chee
Stefan Larson
Samay Maini
Jonathan Boarman
38
10
0
30 Aug 2022
Doc2Graph: a Task Agnostic Document Understanding Framework based on
  Graph Neural Networks
Doc2Graph: a Task Agnostic Document Understanding Framework based on Graph Neural Networks
Andrea Gemelli
Sanket Biswas
Enrico Civitelli
Josep Lladós
S. Marinai
23
15
0
23 Aug 2022
Doc-GCN: Heterogeneous Graph Convolutional Networks for Document Layout
  Analysis
Doc-GCN: Heterogeneous Graph Convolutional Networks for Document Layout Analysis
Siwen Luo
Yi Ding
Siqu Long
Josiah Poon
S. Han
GNN
45
16
0
22 Aug 2022
Just-in-Time Aggregation for Federated Learning
Just-in-Time Aggregation for Federated Learning
K.R. Jayaram
Ashish Verma
Gegi Thomas
Vinod Muthusamy
FedML
33
6
0
20 Aug 2022
Understanding Long Documents with Different Position-Aware Attentions
Understanding Long Documents with Different Position-Aware Attentions
Hai Pham
Guoxin Wang
Yijuan Lu
D. Florêncio
Changrong Zhang
19
9
0
17 Aug 2022
Information Extraction from Scanned Invoice Images using Text Analysis
  and Layout Features
Information Extraction from Scanned Invoice Images using Text Analysis and Layout Features
H. Ha
Ales Horak
25
14
0
08 Aug 2022
TRIE++: Towards End-to-End Information Extraction from Visually Rich
  Documents
TRIE++: Towards End-to-End Information Extraction from Visually Rich Documents
Zhanzhan Cheng
Peng Zhang
Can Li
Qiao Liang
Yunlu Xu
Pengfei Li
Shiliang Pu
Yi Niu
Fei Wu
18
10
0
14 Jul 2022
Sequence-aware multimodal page classification of Brazilian legal
  documents
Sequence-aware multimodal page classification of Brazilian legal documents
Pedro Henrique Luz de Araujo
Ana Paula G. S. de Almeida
Fabricio Ataides Braz
Nilton Correia da Silva
Flávio de Barros Vidal
Teofilo de Campos
14
7
0
02 Jul 2022
Bi-VLDoc: Bidirectional Vision-Language Modeling for Visually-Rich
  Document Understanding
Bi-VLDoc: Bidirectional Vision-Language Modeling for Visually-Rich Document Understanding
Chuwei Luo
Guozhi Tang
Qi Zheng
Cong Yao
Lianwen Jin
Chenliang Li
Yang Xue
Luo Si
27
16
0
27 Jun 2022
Business Document Information Extraction: Towards Practical Benchmarks
Business Document Information Extraction: Towards Practical Benchmarks
Matyás Skalický
Stepán Simsa
Michal Uřičář
Milan Šulc
30
9
0
20 Jun 2022
V-Doc : Visual questions answers with Documents
V-Doc : Visual questions answers with Documents
Yihao Ding
Zhe Huang
Runlin Wang
Yanhang Zhang
Xianru Chen
Yuzhong Ma
Hyunsuk Chung
S. Han
31
15
0
27 May 2022
VLCDoC: Vision-Language Contrastive Pre-Training Model for Cross-Modal
  Document Classification
VLCDoC: Vision-Language Contrastive Pre-Training Model for Cross-Modal Document Classification
Souhail Bakkali
Zuheng Ming
Mickael Coustaty
Marccal Rusinol
O. R. Terrades
VLM
51
30
0
24 May 2022
MATrIX -- Modality-Aware Transformer for Information eXtraction
MATrIX -- Modality-Aware Transformer for Information eXtraction
Thomas Delteil
Edouard Belval
Lei Chen
Luis Goncalves
Vijay Mahadevan
25
3
0
17 May 2022
Unified Pretraining Framework for Document Understanding
Unified Pretraining Framework for Document Understanding
Jiuxiang Gu
Jason Kuen
Vlad I. Morariu
Handong Zhao
Nikolaos Barmpalios
R. Jain
A. Nenkova
Tong Sun
32
96
0
22 Apr 2022
LayoutLMv3: Pre-training for Document AI with Unified Text and Image
  Masking
LayoutLMv3: Pre-training for Document AI with Unified Text and Image Masking
Yupan Huang
Tengchao Lv
Lei Cui
Yutong Lu
Furu Wei
35
432
0
18 Apr 2022
End-to-end Document Recognition and Understanding with Dessurt
End-to-end Document Recognition and Understanding with Dessurt
Brian L. Davis
B. Morse
Brian L. Price
Chris Tensmeyer
Curtis Wigington
Vlad I. Morariu
VLM
ViT
37
73
0
30 Mar 2022
Multimodal Pre-training Based on Graph Attention Network for Document
  Understanding
Multimodal Pre-training Based on Graph Attention Network for Document Understanding
Zhenrong Zhang
Jiefeng Ma
Jun Du
Licheng Wang
Jianshu Zhang
18
37
0
25 Mar 2022
Adaptive Aggregation For Federated Learning
Adaptive Aggregation For Federated Learning
K.R. Jayaram
Vinod Muthusamy
Gegi Thomas
Ashish Verma
Mark Purcell
FedML
33
16
0
23 Mar 2022
A Survey of Historical Document Image Datasets
A Survey of Historical Document Image Datasets
Konstantina Nikolaidou
Mathias Seuret
Hamam Mokayed
Marcus Liwicki
27
29
0
16 Mar 2022
DiT: Self-supervised Pre-training for Document Image Transformer
DiT: Self-supervised Pre-training for Document Image Transformer
Junlong Li
Yiheng Xu
Tengchao Lv
Lei Cui
Chaoxi Zhang
Furu Wei
ViT
VLM
38
160
0
04 Mar 2022
LiLT: A Simple yet Effective Language-Independent Layout Transformer for
  Structured Document Understanding
LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understanding
Jiapeng Wang
Lianwen Jin
Kai Ding
VLM
35
138
0
28 Feb 2022
OCR-IDL: OCR Annotations for Industry Document Library Dataset
OCR-IDL: OCR Annotations for Industry Document Library Dataset
Ali Furkan Biten
Rubèn Pérez Tito
Lluís Gómez
Ernest Valveny
Dimosthenis Karatzas
25
26
0
25 Feb 2022
Combining Deep Learning and Reasoning for Address Detection in
  Unstructured Text Documents
Combining Deep Learning and Reasoning for Address Detection in Unstructured Text Documents
Matthias Engelbach
Dennis Klau
Jens Drawehn
Maximilien Kintz
11
2
0
07 Feb 2022
Text Classification Models for Form Entity Linking
Text Classification Models for Form Entity Linking
M. Villota
C. Domínguez
Jónathan Heras
Eloy J. Mata
Vico Pascual
MedIm
26
2
0
14 Dec 2021
OCR-free Document Understanding Transformer
OCR-free Document Understanding Transformer
Geewook Kim
Teakgyu Hong
Moonbin Yim
Jeongyeon Nam
Jinyoung Park
Jinyeong Yim
Wonseok Hwang
Sangdoo Yun
Dongyoon Han
Seunghyun Park
ViT
61
263
0
30 Nov 2021
Document AI: Benchmarks, Models and Applications
Document AI: Benchmarks, Models and Applications
Lei Cui
Yiheng Xu
Tengchao Lv
Furu Wei
VLM
24
69
0
16 Nov 2021
Information Extraction from Visually Rich Documents with Font Style
  Embeddings
Information Extraction from Visually Rich Documents with Font Style Embeddings
Ismail Oussaid
William Vanhuffel
Pirashanth Ratnamogan
Mhamed Hajaiej
Alexis Mathey
Thomas Gilles
19
1
0
07 Nov 2021
Domain Agnostic Few-Shot Learning For Document Intelligence
Domain Agnostic Few-Shot Learning For Document Intelligence
J. Mandivarapu
Eric Hunch
G. Fung
OOD
VLM
30
1
0
29 Oct 2021
MarkupLM: Pre-training of Text and Markup Language for Visually-rich
  Document Understanding
MarkupLM: Pre-training of Text and Markup Language for Visually-rich Document Understanding
Junlong Li
Yiheng Xu
Lei Cui
Furu Wei
VLM
3DGS
31
59
0
16 Oct 2021
OPAD: An Optimized Policy-based Active Learning Framework for Document
  Content Analysis
OPAD: An Optimized Policy-based Active Learning Framework for Document Content Analysis
Sumit Shekhar
Bhanu Prakash Reddy Guda
Ashutosh Chaubey
Ishan Jindal
Avanish Jain
33
0
0
01 Oct 2021
Skim-Attention: Learning to Focus via Document Layout
Skim-Attention: Learning to Focus via Document Layout
Laura Nguyen
Thomas Scialom
Jacopo Staiano
Benjamin Piwowarski
27
9
0
02 Sep 2021
Position Masking for Improved Layout-Aware Document Understanding
Position Masking for Improved Layout-Aware Document Understanding
Anik Saha
Catherine Finegan-Dollak
Ashish Verma
19
2
0
01 Sep 2021
BoundaryNet: An Attentive Deep Network with Fast Marching Distance Maps
  for Semi-automatic Layout Annotation
BoundaryNet: An Attentive Deep Network with Fast Marching Distance Maps for Semi-automatic Layout Annotation
Abhishek Trivedi
Ravi Kiran Sarvadevabhatla
11
1
0
21 Aug 2021
BROS: A Pre-trained Language Model Focusing on Text and Layout for
  Better Key Information Extraction from Documents
BROS: A Pre-trained Language Model Focusing on Text and Layout for Better Key Information Extraction from Documents
Teakgyu Hong
Donghyun Kim
Mingi Ji
Wonseok Hwang
Daehyun Nam
Sungrae Park
VLM
34
150
0
10 Aug 2021
StrucTexT: Structured Text Understanding with Multi-Modal Transformers
StrucTexT: Structured Text Understanding with Multi-Modal Transformers
Yulin Li
Yuxi Qian
Yuchen Yu
Xiameng Qin
Chengquan Zhang
Yan Liu
Kun Yao
Junyu Han
Jingtuo Liu
Errui Ding
27
113
0
06 Aug 2021
Exploring Out-of-Distribution Generalization in Text Classifiers Trained
  on Tobacco-3482 and RVL-CDIP
Exploring Out-of-Distribution Generalization in Text Classifiers Trained on Tobacco-3482 and RVL-CDIP
Stefan Larson
Navtej Singh
Saarthak Maheshwari
Shanti Stewart
U. Krishnaswamy
OODD
OOD
22
3
0
05 Aug 2021
Graph-based Deep Generative Modelling for Document Layout Generation
Graph-based Deep Generative Modelling for Document Layout Generation
Sanket Biswas
Pau Riba
Josep Lladós
Umapada Pal
19
3
0
09 Jul 2021
Object Detection Based Handwriting Localization
Object Detection Based Handwriting Localization
Yuli Wu
Yucheng Hu
Suting Miao
19
4
0
28 Jun 2021
DocFormer: End-to-End Transformer for Document Understanding
DocFormer: End-to-End Transformer for Document Understanding
Srikar Appalaraju
Bhavan A. Jasani
Bhargava Urala Kota
Yusheng Xie
R. Manmatha
ViT
41
270
0
22 Jun 2021
SelfDoc: Self-Supervised Document Representation Learning
SelfDoc: Self-Supervised Document Representation Learning
Peizhao Li
Jiuxiang Gu
Jason Kuen
Vlad I. Morariu
Handong Zhao
R. Jain
Varun Manjunatha
Hongfu Liu
ViT
SSL
28
159
0
07 Jun 2021
StructuralLM: Structural Pre-training for Form Understanding
StructuralLM: Structural Pre-training for Form Understanding
Chenliang Li
Bin Bi
Ming Yan
Wei Wang
Songfang Huang
Fei Huang
Luo Si
LMTD
AI4CE
31
131
0
24 May 2021
End-to-End Unsupervised Document Image Blind Denoising
End-to-End Unsupervised Document Image Blind Denoising
M. Gangeh
Marcin Plata
Hamid Motahari
Nigel P. Duffy
15
11
0
19 May 2021
Separation of Powers in Federated Learning
Separation of Powers in Federated Learning
P. Cheng
Kevin Eykholt
Zhongshu Gu
Hani Jamjoom
K.R. Jayaram
Enriquillo Valdez
Ashish Verma
FedML
26
13
0
19 May 2021
DocReader: Bounding-Box Free Training of a Document Information
  Extraction Model
DocReader: Bounding-Box Free Training of a Document Information Extraction Model
S. Klaiman
Marius Lehne
21
6
0
10 May 2021
Current Status and Performance Analysis of Table Recognition in Document
  Images with Deep Neural Networks
Current Status and Performance Analysis of Table Recognition in Document Images with Deep Neural Networks
K. Hashmi
Marcus Liwicki
D. Stricker
Muhammad Adnan Afzal
Muhammad Ahtsham Afzal
Muhammad Zeshan Afzal
LMTD
40
48
0
29 Apr 2021
LayoutXLM: Multimodal Pre-training for Multilingual Visually-rich
  Document Understanding
LayoutXLM: Multimodal Pre-training for Multilingual Visually-rich Document Understanding
Yiheng Xu
Tengchao Lv
Lei Cui
Guoxin Wang
Yijuan Lu
D. Florêncio
Cha Zhang
Furu Wei
MLLM
VLM
38
127
0
18 Apr 2021
LayoutParser: A Unified Toolkit for Deep Learning Based Document Image
  Analysis
LayoutParser: A Unified Toolkit for Deep Learning Based Document Image Analysis
Zejiang Shen
Ruochen Zhang
Melissa Dell
Benjamin Charles Germain Lee
Jacob Carlson
Weining Li
3DV
8
95
0
29 Mar 2021
Previous
1234
Next