Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2111.02622
Cited By
Lexically Aware Semi-Supervised Learning for OCR Post-Correction
4 November 2021
Shruti Rijhwani
Daisy Rosenblum
Antonios Anastasopoulos
Graham Neubig
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Lexically Aware Semi-Supervised Learning for OCR Post-Correction"
8 / 8 papers shown
Title
RoundTripOCR: A Data Generation Technique for Enhancing Post-OCR Error Correction in Low-Resource Devanagari Languages
Harshvivek Kashid
Pushpak Bhattacharyya
79
1
0
14 Dec 2024
XTREME-UP: A User-Centric Scarce-Data Benchmark for Under-Represented Languages
Sebastian Ruder
J. Clark
Alexander Gutkin
Mihir Kale
Min Ma
...
Dan Garrette
R. Ingle
Melvin Johnson
Dmitry Panteleev
Partha P. Talukdar
ELM
22
38
0
19 May 2023
User-Centric Evaluation of OCR Systems for Kwak'wala
Shruti Rijhwani
Daisy Rosenblum
Michayla King
Antonios Anastasopoulos
Graham Neubig
11
5
0
26 Feb 2023
Algorithms for Acyclic Weighted Finite-State Automata with Failure Arcs
Anej Svete
Benjamin Dayan
Tim Vieira
Ryan Cotterell
Jason Eisner
24
1
0
17 Jan 2023
A Benchmark and Dataset for Post-OCR text correction in Sanskrit
Ayush Maheshwari
Nikhil Singh
Amrith Krishna
Ganesh Ramakrishnan
26
12
0
15 Nov 2022
Neuro-Symbolic Language Modeling with Automaton-augmented Retrieval
Uri Alon
Frank F. Xu
Junxian He
Sudipta Sengupta
Dan Roth
Graham Neubig
RALM
77
62
0
28 Jan 2022
Lacuna Reconstruction: Self-supervised Pre-training for Low-Resource Historical Document Transcription
Nikolai Vogler
J. Allen
M. Miller
Taylor Berg-Kirkpatrick
26
5
0
16 Dec 2021
Revisiting Self-Training for Neural Sequence Generation
Junxian He
Jiatao Gu
Jiajun Shen
MarcÁurelio Ranzato
SSL
LRM
244
269
0
30 Sep 2019
1