2203.12273
DAN: a Segmentation-free Document Attention Network for Handwritten Document Recognition
23 March 2022
Denis Coquenet
Clément Chatelain
Thierry Paquet
Papers citing
"DAN: a Segmentation-free Document Attention Network for Handwritten Document Recognition"
39 / 39 papers shown
Title
CM1 - A Dataset for Evaluating Few-Shot Information Extraction with Large Vision Language Models
Fabian Wolf
Oliver Tüselmann
Arthur Matei
Lukas Hennies
Christoph Rass
Gernot A. Fink
48
0
0
07 May 2025
Classifying the Unknown: In-Context Learning for Open-Vocabulary Text and Symbol Recognition
Tom Simon
William Mocaer
Pierrick Tranouez
Clément Chatelain
Thierry Paquet
MLLM
VLM
51
0
0
09 Apr 2025
VISTA-OCR: Towards generative and interactive end to end OCR models
Laziz Hamdi
Amine Tamasna
Pascal Boisson
Thierry Paquet
38
0
0
04 Apr 2025
Multimodal LLMs for OCR, OCR Post-Correction, and Named Entity Recognition in Historical Documents
Gavin Greif
Niclas Griesshaber
Robin Greif
OffRL
43
0
0
01 Apr 2025
UniHDSA: A Unified Relation Prediction Approach for Hierarchical Document Structure Analysis
Jiawei Wang
Kai Hu
Qiang Huo
53
0
0
20 Mar 2025
Handwritten Text Recognition: A Survey
Carlos Garrido-Munoz
Antonio Ríos-Vila
Jorge Calvo-Zaragoza
101
0
0
12 Feb 2025
DocTTT: Test-Time Training for Handwritten Document Recognition Using Meta-Auxiliary Learning
Wenhao Gu
Li Gu
Ziqiang Wang
Ching Yee Suen
Yang Wang
49
0
0
22 Jan 2025
Integrating Canonical Neural Units and Multi-Scale Training for Handwritten Text Recognition
Zi-Rui Wang
24
0
0
24 Oct 2024
General Detection-based Text Line Recognition
Raphael Baena
Syrine Kalleli
Mathieu Aubry
74
0
0
25 Sep 2024
HTR-VT: Handwritten Text Recognition with Vision Transformer
Yuting Li
Dexiong Chen
Tinglong Tang
Xi Shen
ViT
21
7
0
13 Sep 2024
Self-Supervised Learning for Text Recognition: A Critical Survey
Carlos Peñarrubia
J. J. Valero-Mas
Jorge Calvo-Zaragoza
69
1
0
29 Jul 2024
DANIEL: A fast Document Attention Network for Information Extraction and Labelling of handwritten documents
Thomas Constum
Pierrick Tranouez
Thierry Paquet
21
5
0
12 Jul 2024
End-to-end information extraction in handwritten documents: Understanding Paris marriage records from 1880 to 1940
Thomas Constum
Lucas Preel
Théo Larcher
Pierrick Tranouez
Thierry Paquet
Sandra Brée
18
3
0
30 Apr 2024
Revisiting N-Gram Models: Their Impact in Modern Neural Networks for Handwritten Text Recognition
Solène Tarride
Christopher Kermorvant
29
1
0
30 Apr 2024
Multi-Page Document Visual Question Answering using Self-Attention Scoring Mechanism
Lei Kang
Rubèn Pérez Tito
Ernest Valveny
Dimosthenis Karatzas
22
5
0
29 Apr 2024
Improving Automatic Text Recognition with Language Models in the PyLaia Open-Source Library
Solène Tarride
Yoann Schneider
Marie Generali-Lince
Mélodie Boillet
Bastien Abadie
Christopher Kermorvant
26
3
0
29 Apr 2024
The Socface Project: Large-Scale Collection, Processing, and Analysis of a Century of French Censuses
Mélodie Boillet
Solène Tarride
Manon Blanco
Valentin Rigal
Yoann Schneider
Bastien Abadie
Lionel Kesztenbaum
Christopher Kermorvant
22
3
0
29 Apr 2024
Reading Order Independent Metrics for Information Extraction in Handwritten Documents
David Villanova-Aparisi
Solène Tarride
Carlos David Martínez Hinarejos
Verónica Romero
Christopher Kermorvant
Moisés Pastor
16
0
0
29 Apr 2024
GatedLexiconNet: A Comprehensive End-to-End Handwritten Paragraph Text Recognition System
Lalita Kumari
Sukhdeep Singh
V. Rathore
Anuj Sharma
42
1
0
22 Apr 2024
Spatial Context-based Self-Supervised Learning for Handwritten Text Recognition
Carlos Peñarrubia
Carlos Garrido-Munoz
J. J. Valero-Mas
Jorge Calvo-Zaragoza
24
1
0
17 Apr 2024
RoDLA: Benchmarking the Robustness of Document Layout Analysis Models
Yufan Chen
Jiaming Zhang
Kunyu Peng
Junwei Zheng
Ruiping Liu
Philip H. S. Torr
Rainer Stiefelhagen
OOD
29
5
0
21 Mar 2024
GraphKD: Exploring Knowledge Distillation Towards Document Object Detection with Structured Graph Creation
Ayan Banerjee
Sanket Biswas
Josep Lladós
Umapada Pal
30
1
0
17 Feb 2024
Sheet Music Transformer: End-To-End Optical Music Recognition Beyond Monophonic Transcription
Antonio Ríos-Vila
Jorge Calvo-Zaragoza
Thierry Paquet
36
9
0
12 Feb 2024
Hi-SAM: Marrying Segment Anything Model for Hierarchical Text Segmentation
Maoyuan Ye
Jing Zhang
Juhua Liu
Chenyu Liu
Baocai Yin
Cong Liu
Bo Du
Dacheng Tao
VLM
30
10
0
31 Jan 2024
Proceedings of the 5th International Workshop on Reading Music Systems
Jorge Calvo-Zaragoza
Alexander Pacha
Elona Shatri
11
0
0
07 Nov 2023
Exploring OCR Capabilities of GPT-4V(ision) : A Quantitative and In-depth Evaluation
Yongxin Shi
Dezhi Peng
Wenhui Liao
Zening Lin
Xinhong Chen
Chongyu Liu
Yuyi Zhang
Lianwen Jin
MLLM
21
41
0
25 Oct 2023
Leveraging Vision-Language Foundation Models for Fine-Grained Downstream Tasks
Denis Coquenet
Clément Rambour
Emanuele Dalsasso
Nicolas Thome
MLLM
CLIP
VLM
19
1
0
13 Jul 2023
Handwritten Text Recognition from Crowdsourced Annotations
Solène Tarride
Tristan Faine
Mélodie Boillet
Harold Mouchère
Christopher Kermorvant
21
4
0
19 Jun 2023
SelfDocSeg: A Self-Supervised vision-based Approach towards Document Segmentation
Subhajit Maity
Sanket Biswas
Siladittya Manna
Ayan Banerjee
Josep Lladós
Saumik Bhattacharya
Umapada Pal
34
5
0
01 May 2023
Large Scale Genealogical Information Extraction From Handwritten Quebec Parish Records
Solène Tarride
Martin Maarand
Mélodie Boillet
James McGrath
Eugénie Capel
H. Vézina
Christopher Kermorvant
22
10
0
27 Apr 2023
SIMARA: a database for key-value information extraction from full pages
Solène Tarride
Mélodie Boillet
Jean-François Moufflet
Christopher Kermorvant
11
1
0
26 Apr 2023
Key-value information extraction from full handwritten pages
Solène Tarride
Mélodie Boillet
Christopher Kermorvant
11
10
0
26 Apr 2023
MSdocTr-Lite: A Lite Transformer for Full Page Multi-script Handwriting Recognition
M. Dhiaf
Ahmed Cheikh Rouhou
Yousri Kessentini
Sinda Ben Salem
27
11
0
24 Mar 2023
Entry Separation using a Mixed Visual and Textual Language Model: Application to 19th century French Trade Directories
Bertrand Duménieu
Edwin Carlinet
N. Abadie
Joseph Chazalon
14
0
0
17 Feb 2023
Faster DAN: Multi-target Queries with Document Positional Encoding for End-to-end Handwritten Document Recognition
Denis Coquenet
Clément Chatelain
Thierry Paquet
8
9
0
25 Jan 2023
End-to-End Page-Level Assessment of Handwritten Text Recognition
Enrique Vidal
Alejandro H. Toselli
Antonio Ríos-Vila
Jorge Calvo-Zaragoza
13
16
0
14 Jan 2023
TrOCR: Transformer-based Optical Character Recognition with Pre-trained Models
Minghao Li
Tengchao Lv
Jingye Chen
Lei Cui
Yijuan Lu
D. Florêncio
Cha Zhang
Zhoujun Li
Furu Wei
ViT
93
340
0
21 Sep 2021
SPAN: a Simple Predict & Align Network for Handwritten Paragraph Recognition
Denis Coquenet
Clément Chatelain
Thierry Paquet
18
22
0
17 Feb 2021
LayoutLMv2: Multi-modal Pre-training for Visually-Rich Document Understanding
Yang Xu
Yiheng Xu
Tengchao Lv
Lei Cui
Furu Wei
...
D. Florêncio
Cha Zhang
Wanxiang Che
Min Zhang
Lidong Zhou
ViT
MLLM
142
498
0
29 Dec 2020