Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2304.02737
Cited By
Efficient OCR for Building a Diverse Digital History
5 April 2023
Jacob Carlson
Tom Bryan
Melissa Dell
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Efficient OCR for Building a Diverse Digital History"
11 / 11 papers shown
Title
DongbaMIE: A Multimodal Information Extraction Dataset for Evaluating Semantic Understanding of Dongba Pictograms
Xiaojun Bi
Shuo Li
Z. Wang
Fuwen Luo
Weizheng Qiao
Lu Han
Ziwei Sun
Peng Li
Yang Liu
78
0
0
05 Mar 2025
Reading the unreadable: Creating a dataset of 19th century English newspapers using image-to-text language models
Jonathan Bourne
70
0
0
24 Feb 2025
Newswire: A Large-Scale Structured Database of a Century of Historical News
Emily Silcock
Abhishek Arora
Luca DÁmico-Wong
Melissa Dell
AI4TS
GNN
37
3
0
13 Jun 2024
EfficientOCR: An Extensible, Open-Source Package for Efficiently Digitizing World Knowledge
Tom Bryan
Jacob Carlson
Abhishek Arora
Melissa Dell
18
8
0
16 Oct 2023
American Stories: A Large-Scale Structured Text Dataset of Historical U.S. Newspapers
Melissa Dell
Jacob Carlson
Tom Bryan
Emily Silcock
Abhishek Arora
Zejiang Shen
Luca DÁmico-Wong
Q. Le
Pablo Querubin
Leander Heldring
AI4TS
17
12
0
24 Aug 2023
Quantifying Character Similarity with Vision Transformers
Xinmei Yang
Abhishek Arora
Shao-Yu Jheng
Melissa Dell
19
3
0
24 May 2023
Linking Representations with Multimodal Contrastive Learning
Abhishek Arora
Xinmei Yang
Shao-Yu Jheng
Melissa Dell
19
1
0
07 Apr 2023
TrOCR: Transformer-based Optical Character Recognition with Pre-trained Models
Minghao Li
Tengchao Lv
Jingye Chen
Lei Cui
Yijuan Lu
D. Florêncio
Cha Zhang
Zhoujun Li
Furu Wei
ViT
93
340
0
21 Sep 2021
A Survey on Recent Approaches for Natural Language Processing in Low-Resource Scenarios
Michael A. Hedderich
Lukas Lange
Heike Adel
Jannik Strötgen
Dietrich Klakow
191
283
0
23 Oct 2020
Revisiting the Sibling Head in Object Detector
Guanglu Song
Yu Liu
Xiaogang Wang
ObjD
165
343
0
17 Mar 2020
Convolutional Character Networks
Linjie Xing
Zhi Tian
Weilin Huang
Matthew R. Scott
46
155
0
17 Oct 2019
1