Towards Unconstrained End-to-End Text Spotting

IEEE International Conference on Computer Vision (ICCV), 2019
24 August 2019
Siyang Qin
Alessandro Bissacco
Michalis Raptis
Yasuhisa Fujii
Y. Xiao
arXiv: 1908.09231

Papers citing "Towards Unconstrained End-to-End Text Spotting"

50 / 72 papers shown
When Semantics Mislead Vision: Mitigating Large Multimodal Models Hallucinations in Scene Text Spotting and Understanding
Yan Shu
Hangui Lin
Yexin Liu
Yan Zhang
Gangyan Zeng
Yan Li
Can Ma
Ser-Nam Lim
Harry Yang
Andrii Zadaianchuk
MLLM, VLM
414
6
0
05 Jun 2025
OmniParser V2: Structured-Points-of-Thought for Unified Visual Text Parsing and Its Generality to Multimodal Large Language Models
Wenwen Yu
Zhibo Yang
Jianqiang Wan
Sibo Song
J. Tang
Wenqing Cheng
Yunxing Liu
Xiang Bai
361
18
0
22 Feb 2025
Platypus: A Generalized Specialist Model for Reading Text in Various Forms
European Conference on Computer Vision (ECCV), 2024
Peng Wang
Zhaohai Li
Jun Tang
Humen Zhong
Fei Huang
Zhibo Yang
Cong Yao
VLM, ObjD
229
3
0
27 Aug 2024
FastTextSpotter: A High-Efficiency Transformer for Multilingual Scene Text Spotting
International Conference on Pattern Recognition (ICPR), 2024
Alloy Das
Sanket Biswas
Umapada Pal
Josep Lladós
Saumik Bhattacharya
354
8
0
27 Aug 2024
Layout-Agnostic Scene Text Image Synthesis with Diffusion Models
Qilong Zhangli
Jindong Jiang
Di Liu
Licheng Yu
Xiaoliang Dai
Ankit Ramchandani
Guan Pang
Dimitris N. Metaxas
Praveen Krishnan
DiffM
544
19
0
03 Jun 2024
Text Grouping Adapter: Adapting Pre-trained Text Detector for Layout Analysis
Tianci Bi
Xiaoyi Zhang
Zhizheng Zhang
Wenxuan Xie
Cuiling Lan
Yan Lu
Nanning Zheng
VLM
268
5
0
13 May 2024
VimTS: A Unified Video and Image Text Spotter for Enhancing the Cross-domain Generalization
Yuliang Liu
Mingxin Huang
Hao Yan
Linger Deng
Weijia Wu
Hao Lu
Chunhua Shen
Lianwen Jin
Xiang Bai
273
4
0
30 Apr 2024
Bridging the Gap Between End-to-End and Two-Step Text Spotting
Mingxin Huang
Hongliang Li
Yuliang Liu
Xiang Bai
Lianwen Jin
248
11
0
06 Apr 2024
OmniParser: A Unified Framework for Text Spotting, Key Information Extraction and Table Recognition
Jianqiang Wan
Sibo Song
Wenwen Yu
Yuliang Liu
Wenqing Cheng
Fei Huang
Xiang Bai
Cong Yao
Zhibo Yang
307
85
0
28 Mar 2024
TextBlockV2: Towards Precise-Detection-Free Scene Text Spotting with Pre-trained Language Model
Jiahao Lyu
Jin Wei
Gangyan Zeng
Zeng Li
Enze Xie
Wei Wang
Can Ma
VLM
316
8
0
15 Mar 2024
SwinTextSpotter v2: Towards Better Synergy for Scene Text Spotting
International Journal of Computer Vision (IJCV), 2024
Mingxin Huang
Dezhi Peng
Hongliang Li
Zhenghao Peng
Chongyu Liu
Dahua Lin
Yuliang Liu
Xiang Bai
Lianwen Jin
387
9
0
15 Jan 2024
Inverse-like Antagonistic Scene Text Spotting via Reading-Order Estimation and Dynamic Sampling
IEEE Transactions on Image Processing (TIP), 2024
Shi-Xue Zhang
Chun Yang
Xiaobin Zhu
Hongyang Zhou
Hongfa Wang
Xu-Cheng Yin
313
16
0
08 Jan 2024
Hierarchical Text Spotter for Joint Text Spotting and Layout Analysis
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Shangbang Long
Siyang Qin
Yasuhisa Fujii
Alessandro Bissacco
Michalis Raptis
243
9
0
25 Oct 2023
Harnessing the Power of Multi-Lingual Datasets for Pre-training: Towards Enhancing Text Spotting Performance
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Alloy Das
Sanket Biswas
Ayan Banerjee
Josep Lladós
Umapada Pal
Saumik Bhattacharya
361
4
0
02 Oct 2023
Diving into the Depths of Spotting Text in Multi-Domain Noisy Scenes
IEEE International Conference on Robotics and Automation (ICRA), 2023
Alloy Das
Sanket Biswas
Umapada Pal
Josep Lladós
364
4
0
01 Oct 2023
Deformation Robust Text Spotting with Geometric Prior
IEEE International Conference on Image Processing (ICIP), 2023
Xixuan Hao
Aozhong Zhang
Xianze Meng
Bin Fu
353
0
0
31 Aug 2023
bbOCR: An Open-source Multi-domain OCR Pipeline for Bengali Documents
Imam Mohammad Zulkarnain
Shayekh Bin Islam
Md. Zami Al Zunaed Farabe
Md. Mehedi Hasan Shawon
Jawaril Munshad Abedin
...
Istiak Shihab
Syed Mobassir
Md. Nazmuddoha Ansary
Asif Sushmit
Farig Sadeque
177
2
0
21 Aug 2023
Turning a CLIP Model into a Scene Text Spotter
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Wenwen Yu
Yuliang Liu
Xingkui Zhu
H. Cao
Xing Sun
Xiang Bai
VLM, CLIP
205
23
0
21 Aug 2023
ESTextSpotter: Towards Better Scene Text Spotting with Explicit Synergy in Transformer
IEEE International Conference on Computer Vision (ICCV), 2023
Mingxin Huang
Jiaxin Zhang
Dezhi Peng
Hao Lu
Can Huang
Yuliang Liu
Xiang Bai
Lianwen Jin
262
43
0
20 Aug 2023
MPDIoU: A Loss for Efficient and Accurate Bounding Box Regression
Siliang Ma
Yong Xu
178
406
0
14 Jul 2023
TextFormer: A Query-based End-to-End Text Spotter with Mixed Supervision
Machine Intelligence Research (MIR), 2023
Yukun Zhai
Xiaoqiang Zhang
Xiameng Qin
Sanyuan Zhao
Xingping Dong
Jianbing Shen
307
7
0
06 Jun 2023
ICDAR 2023 Competition on Hierarchical Text Detection and Recognition
IEEE International Conference on Document Analysis and Recognition (ICDAR), 2023
Shangbang Long
Siyang Qin
Dmitry Panteleev
Alessandro Bissacco
Yasuhisa Fujii
Michalis Raptis
VLM
227
25
0
16 May 2023
Read Pointer Meters in complex environments based on a Human-like Alignment and Recognition Algorithm
Yan Shu
Gangyan Zeng
Honglei Xu
F. Jiang
174
0
0
28 Feb 2023
SPTS v2: Single-Point Scene Text Spotting
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Yuliang Liu
Jiaxin Zhang
Dezhi Peng
Mingxin Huang
Xinyu Wang
...
Can Huang
Dahua Lin
Chunhua Shen
Xiang Bai
Lianwen Jin
VLM
460
74
0
04 Jan 2023
OCR-RTPS: An OCR-based real-time positioning system for the valet parking
Zizhang Wu
Xinyuan Chen
Jizheng Wang
Xiaoquan Wang
Y. Gan
Muqing Fang
Tianhao Xu
188
13
0
08 Dec 2022
DeepSolo: Let Transformer Decoder with Explicit Points Solo for Text Spotting
Computer Vision and Pattern Recognition (CVPR), 2022
Maoyuan Ye
Jing Zhang
Shanshan Zhao
Juhua Liu
Tongliang Liu
Bo Du
Dacheng Tao
448
106
0
19 Nov 2022
ABINet++: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Spotting
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Shancheng Fang
Zhendong Mao
Hongtao Xie
Yuxin Wang
C. Yan
Yongdong Zhang
327
76
0
19 Nov 2022
Task Grouping for Multilingual Text Recognition
Jing Huang
Kevin J. Liang
Rama Kovvuri
Tal Hassner
248
6
0
13 Oct 2022
ScreenQA: Large-Scale Question-Answer Pairs over Mobile App Screenshots
North American Chapter of the Association for Computational Linguistics (NAACL), 2022
Yu-Chung Hsiao
Fedir Zubach
Maria Wang
Jindong Chen
Victor Carbune
Jason Lin
Maria Wang
Yun Zhu
Jindong Chen
RALM
1.1K
52
0
16 Sep 2022
Out-of-Vocabulary Challenge Report
Sergi Garcia-Bordils
Andrés Mafla
Ali Furkan Biten
Oren Nuriel
Aviad Aberdam
Shai Mazor
Ron Litman
Dimosthenis Karatzas
187
23
0
14 Sep 2022
GLASS: Global to Local Attention for Scene-Text Spotting
European Conference on Computer Vision (ECCV), 2022
Roi Ronen
Shahar Tsiper
Oron Anschel
I. Lavi
Amir Markovitz
R. Manmatha
210
56
0
05 Aug 2022
Single Shot Self-Reliant Scene Text Spotter by Decoupled yet Collaborative Detection and Recognition
Jingjing Wu
Pengyuan Lyu
Guangming Lu
Chengquan Zhang
Wenjie Pei
283
3
0
15 Jul 2022
Dynamic Low-Resolution Distillation for Cost-Efficient End-to-End Text Spotting
European Conference on Computer Vision (ECCV), 2022
Ying-Cong Chen
Liang Qiao
Zhanzhan Cheng
Shiliang Pu
Yi Niu
Xi Li
340
4
0
14 Jul 2022
E^2VTS: Energy-Efficient Video Text Spotting from Unmanned Aerial Vehicles
Zhenyu Hu
Zhenyu Wu
Pengcheng Pi
Yunhe Xue
Jiayi Shen
Jianchao Tan
Xiangru Lian
Zinan Lin
Ji Liu
173
3
0
05 Jun 2022
Text Detection & Recognition in the Wild for Robot Localization
Z. Raisi
John S. Zelek
256
2
0
17 May 2022
Text Spotting Transformers
Computer Vision and Pattern Recognition (CVPR), 2022
Xiang Zhang
Yongwen Su
Subarna Tripathi
Zhuowen Tu
ViT
243
124
0
05 Apr 2022
Towards End-to-End Unified Scene Text Detection and Layout Analysis
Computer Vision and Pattern Recognition (CVPR), 2022
Shangbang Long
Siyang Qin
Dmitry Panteleev
Alessandro Bissacco
Yasuhisa Fujii
Michalis Raptis
292
118
0
28 Mar 2022
SwinTextSpotter: Scene Text Spotting via Better Synergy between Text Detection and Text Recognition
Computer Vision and Pattern Recognition (CVPR), 2022
Mingxin Huang
Yuliang Liu
Zhenghao Peng
Chongyu Liu
Dahua Lin
Shenggao Zhu
N. Yuan
Kai Ding
Lianwen Jin
ViT
231
142
0
19 Mar 2022
DEER: Detection-agnostic End-to-End Recognizer for Scene Text Spotting
Seonghyeon Kim
Seung Shin
Yoonsik Kim
Han-Cheol Cho
Taeho Kil
Jaeheung Surh
Seunghyun Park
Bado Lee
Youngmin Baek
200
9
0
10 Mar 2022
Towards Weakly-Supervised Text Spotting using a Multi-Task Transformer
Computer Vision and Pattern Recognition (CVPR), 2022
Yair Kittenplon
I. Lavi
Sharon Fogel
Yarin Bar
R. Manmatha
Pietro Perona
ViT
259
61
0
11 Feb 2022
SPTS: Single-Point Text Spotting
Dezhi Peng
Xinyu Wang
Yuliang Liu
Jiaxin Zhang
Mingxin Huang
...
Jing Li
Dahua Lin
Chunhua Shen
Xiang Bai
Lianwen Jin
ViT
428
76
0
15 Dec 2021
It's About Time: Analog Clock Reading in the Wild
Charig Yang
Weidi Xie
Andrew Zisserman
238
20
0
17 Nov 2021
TPSNet: Reverse Thinking of Thin Plate Splines for Arbitrary Shape Scene Text Representation
ACM Multimedia (ACM MM), 2021
Wei Wang
Can Ma
Jiahao Lv
Dayan Wu
Guoqing Zhao
Ning Jiang
Weiping Wang
317
42
0
25 Oct 2021
ARTS: Eliminating Inconsistency between Text Detection and Recognition with Auto-Rectification Text Spotter
Humen Zhong
Jun Tang
Wenhai Wang
Zhibo Yang
Cong Yao
Tong Lu
171
11
0
20 Oct 2021
Mask is All You Need: Rethinking Mask R-CNN for Dense and Arbitrary-Shaped Scene Text Detection
ACM Multimedia (ACM MM), 2021
Xugong Qin
Can Ma
Youhui Guo
Dayan Wu
Zhihong Tian
Ning Jiang
Hongbin Wang
Weiping Wang
227
39
0
08 Sep 2021
Comprehensive Studies for Arbitrary-shape Scene Text Detection
Pengwen Dai
Xiaochun Cao
155
2
0
25 Jul 2021
CentripetalText: An Efficient Text Instance Representation for Scene Text Detection
Tao Sheng
Jie Chen
Zheng Lian
245
28
0
13 Jul 2021
Implicit Feature Alignment: Learn to Convert Text Recognizer to Text Spotter
Computer Vision and Pattern Recognition (CVPR), 2021
Tianwei Wang
Yuanzhi Zhu
Lianwen Jin
Dezhi Peng
Zhe Li
Mengchao He
Yongpan Wang
Canjie Luo
OOD
153
12
0
10 Jun 2021
DUET: Detection Utilizing Enhancement for Text in Scanned or Captured Documents
International Conference on Pattern Recognition (ICPR), 2021
Eun-Soo Jung
HyeongGwan Son
Kyusam Oh
Yongkeun Yun
Soonhwan Kwon
Min Soo Kim
239
5
0
10 Jun 2021
A Full-Stack Search Technique for Domain Optimized Deep Learning Accelerators
International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), 2021
Dan Zhang
Safeen Huda
Ebrahim M. Songhori
Kartik Prabhu
Quoc V. Le
Anna Goldie
Azalia Mirhoseini
277
59
0
26 May 2021