Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2308.12477
Cited By
American Stories: A Large-Scale Structured Text Dataset of Historical U.S. Newspapers
24 August 2023
Melissa Dell
Jacob Carlson
Tom Bryan
Emily Silcock
Abhishek Arora
Zejiang Shen
Luca DÁmico-Wong
Q. Le
Pablo Querubin
Leander Heldring
AI4TS
Re-assign community
ArXiv
PDF
HTML
Papers citing
"American Stories: A Large-Scale Structured Text Dataset of Historical U.S. Newspapers"
4 / 4 papers shown
Title
The Lucie-7B LLM and the Lucie Training Dataset: Open resources for multilingual language generation
Olivier Gouvert
Julie Hunter
Jérôme Louradour
Christophe Cerisara
Evan Dufraisse
Yaya Sy
Laura Rivière
Jean-Pierre Lorré
OpenLLM-France community
75
0
0
15 Mar 2025
EfficientOCR: An Extensible, Open-Source Package for Efficiently Digitizing World Knowledge
Tom Bryan
Jacob Carlson
Abhishek Arora
Melissa Dell
18
8
0
16 Oct 2023
TrOCR: Transformer-based Optical Character Recognition with Pre-trained Models
Minghao Li
Tengchao Lv
Jingye Chen
Lei Cui
Yijuan Lu
D. Florêncio
Cha Zhang
Zhoujun Li
Furu Wei
ViT
93
340
0
21 Sep 2021
Revisiting the Sibling Head in Object Detector
Guanglu Song
Yu Liu
Xiaogang Wang
ObjD
165
343
0
17 Mar 2020
1