ResearchTrend.AI

TrOCR: Transformer-based Optical Character Recognition with Pre-trained Models

21 September 2021
Minghao Li
Tengchao Lv
Jingye Chen
Lei Cui
Yijuan Lu
D. Florêncio
Cha Zhang
Zhoujun Li
Furu Wei
    ViT

Papers citing "TrOCR: Transformer-based Optical Character Recognition with Pre-trained Models"

20 / 20 papers shown
GDI-Bench: A Benchmark for General Document Intelligence with Vision and Reasoning Decoupling
Siqi Li
Yufan Shen
Xiangnan Chen
Jiayi Chen
Hengwei Ju
...
Licheng Wen
Botian Shi
Y. Liu
Xinyu Cai
Yu Qiao
VLM
ELM
84
0
0
30 Apr 2025
AdaParse: An Adaptive Parallel PDF Parsing and Resource Scaling Engine
Carlo Siebenschuh
Kyle Hippe
Ozan Gokdemir
Alexander Brace
A. Khan
...
V. Vishwanath
R. Stevens
Arvind Ramanathan
Ian Foster
Robert Underwood
MoE
25
0
0
23 Apr 2025
Towards Making Flowchart Images Machine Interpretable
S. Kamath S
Prajwal Gatti
Yogesh Kumar
Vikash Yadav
Anand Mishra
36
4
0
29 Jan 2025
Enhancing Complex Formula Recognition with Hierarchical Detail-Focused Network
Jiale Wang
Junhui Yu
Huanyong Liu
Chenanran Kong
AIMat
35
0
0
10 Jan 2025
Type-R: Automatically Retouching Typos for Text-to-Image Generation
Wataru Shimoda
Naoto Inoue
Daichi Haraguchi
Hayato Mitani
S. Uchida
Kota Yamaguchi
DiffM
89
0
0
27 Nov 2024
MolParser: End-to-end Visual Recognition of Molecule Structures in the Wild
Xi Fang
Jiankun Wang
X. Cai
Shangqian Chen
Shuwen Yang
Lin Yao
Linfeng Zhang
Guolin Ke
35
1
0
17 Nov 2024
HATFormer: Historic Handwritten Arabic Text Recognition with Transformers
Adrian Chan
Anupam Mijar
Mehreen Saeed
Chau-Wai Wong
Akram Khater
34
0
0
03 Oct 2024
General Detection-based Text Line Recognition
Raphael Baena
Syrine Kalleli
Mathieu Aubry
35
0
0
25 Sep 2024
BMI Prediction from Handwritten English Characters Using a Convolutional Neural Network
N. T. Diba
N. Akter
S. Chowdhury
J. E. Giti
30
0
0
04 Sep 2024
StylusAI: Stylistic Adaptation for Robust German Handwritten Text Generation
Nauman Riaz
S. Saifullah
S. Agne
Andreas Dengel
Sheraz Ahmed
DiffM
18
0
0
22 Jul 2024
Semantic GUI Scene Learning and Video Alignment for Detecting Duplicate Video-based Bug Reports
Yanfu Yan
Nathan Cooper
Oscar Chaparro
Kevin Moran
Denys Poshyvanyk
22
5
0
11 Jul 2024
AnyTrans: Translate AnyText in the Image with Large Scale Models
Zhipeng Qian
Pei Zhang
Baosong Yang
Kai Fan
Yiwei Ma
Derek F. Wong
Xiaoshuai Sun
Rongrong Ji
VLM
21
1
0
17 Jun 2024
Improving Automatic Text Recognition with Language Models in the PyLaia Open-Source Library
Solène Tarride
Yoann Schneider
Marie Generali-Lince
Mélodie Boillet
Bastien Abadie
Christopher Kermorvant
21
3
0
29 Apr 2024
Mixed Text Recognition with Efficient Parameter Fine-Tuning and Transformer
Da Chang
Yu Li
36
2
0
19 Apr 2024
Enhancing Small Object Encoding in Deep Neural Networks: Introducing Fast&Focused-Net with Volume-wise Dot Product Layer
Tofik Ali
Partha Pratim Roy
ObjD
9
2
0
18 Jan 2024
Vulnerability Analysis of Transformer-based Optical Character Recognition to Adversarial Attacks
Lucas Beerens
D. Higham
13
1
0
28 Nov 2023
EfficientOCR: An Extensible, Open-Source Package for Efficiently Digitizing World Knowledge
Tom Bryan
Jacob Carlson
Abhishek Arora
Melissa Dell
13
8
0
16 Oct 2023
DTrOCR: Decoder-only Transformer for Optical Character Recognition
Masato Fujitake
28
34
0
30 Aug 2023
Fine-tuning Is a Surprisingly Effective Domain Adaptation Baseline in Handwriting Recognition
Jan Kohút
Michal Hradiš
41
7
0
13 Feb 2023
Zero-Shot Text-to-Image Generation
Aditya A. Ramesh
Mikhail Pavlov
Gabriel Goh
Scott Gray
Chelsea Voss
Alec Radford
Mark Chen
Ilya Sutskever
VLM
253
3,790
0
24 Feb 2021