Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2002.10200
Cited By
v1
v2 (latest)
ABCNet: Real-time Scene Text Spotting with Adaptive Bezier-Curve Network
Computer Vision and Pattern Recognition (CVPR), 2020
24 February 2020
Yuliang Liu
Hao Chen
Chunhua Shen
Tong He
Lianwen Jin
Liangwei Wang
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"ABCNet: Real-time Scene Text Spotting with Adaptive Bezier-Curve Network"
50 / 162 papers shown
Bharat Scene Text: A Novel Comprehensive Dataset and Benchmark for Indian Language Scene Text Understanding
Anik De
A. S. Penamakuri
Rajeev Yadav
Aditya Rathore
Harshiv Shah
Devesh Sharma
Sagar Agarwal
Pravin Kumar
Anand Mishra
179
0
0
28 Nov 2025
LRANet++: Low-Rank Approximation Network for Accurate and Efficient Text Spotting
Yuchen Su
Z. Chen
Yongkun Du
Zuxuan Wu
Hongtao Xie
Yu-Gang Jiang
177
2
0
08 Nov 2025
Bridging Vision, Language, and Mathematics: Pictographic Character Reconstruction with Bézier Curves
Zihao Wan
Pau Tong Lin Xu
Fuwen Luo
Ziyue Wang
P. Li
Yang Liu
169
1
0
29 Oct 2025
VLM-FO1: Bridging the Gap Between High-Level Reasoning and Fine-Grained Perception in VLMs
Peng Liu
H. Shen
Chunxin Fang
Zhicheng Sun
Jiajia Liao
T. Zhao
MLLM
ObjD
VLM
LRM
340
6
0
30 Sep 2025
TRUDI and TITUS: A Multi-Perspective Dataset and A Three-Stage Recognition System for Transportation Unit Identification
Emre Gülsoylu
A. Kelm
Lennart Bengtson
Matthias Hirsch
Christian Wilms
Tim Rolff
Janick Edinger
Simone Frintrop
138
0
0
04 Aug 2025
OCRGenBench: A Comprehensive Benchmark for Evaluating OCR Generative Capabilities
Peirong Zhang
Haowei Xu
Jiaxin Zhang
Guitao Xu
Xuhan Zheng
Zhenhua Yang
Junle Liu
Yuyi Zhang
Lianwen Jin
Lianwen Jin
377
4
0
20 Jul 2025
Hyper-Local Deformable Transformers for Text Spotting on Historical Maps
Knowledge Discovery and Data Mining (KDD), 2024
Yijun Lin
Yao-Yi Chiang
261
9
0
17 Jun 2025
Text-Aware Image Restoration with Diffusion Models
Jaewon Min
J. Kim
Paul Hyunbin Cho
J. Lee
Jihye Park
Minkyu Park
S. Kim
Hyunhee Park
Seungryong Kim
352
3
0
11 Jun 2025
EdgeSpotter: Multi-Scale Dense Text Spotting for Industrial Panel Monitoring
Changhong Fu
Hua Lin
Haobo Zuo
Weitong Chen
Liguo Zhang
279
0
0
08 Jun 2025
When Semantics Mislead Vision: Mitigating Large Multimodal Models Hallucinations in Scene Text Spotting and Understanding
Yan Shu
Hangui Lin
Yexin Liu
Yan Zhang
Gangyan Zeng
Yan Li
Can Ma
Ser-Nam Lim
Harry Yang
Andrii Zadaianchuk
MLLM
VLM
461
8
0
05 Jun 2025
The Devil is in Fine-tuning and Long-tailed Problems:A New Benchmark for Scene Text Detection
International Joint Conference on Artificial Intelligence (IJCAI), 2025
Tianjiao Cao
Jiahao Lyu
Weichao Zeng
Weimin Mu
Can Ma
399
2
0
21 May 2025
SemiETS: Integrating Spatial and Content Consistencies for Semi-Supervised End-to-end Text Spotting
Computer Vision and Pattern Recognition (CVPR), 2025
Dongliang Luo
Hanshen Zhu
Ziyang Zhang
Dingkang Liang
Xudong Xie
Yunxing Liu
Xiang Bai
VLM
384
2
0
14 Apr 2025
Edge Approximation Text Detector
Chuang Yang
Xu Han
T. Han
Han Han
Bingxuan Zhao
Qi Wang
348
7
0
05 Apr 2025
A Context-Driven Training-Free Network for Lightweight Scene Text Segmentation and Recognition
Ritabrata Chakraborty
Shivakumara Palaiahnakote
Umapada Pal
Cheng-Lin Liu
VLM
349
2
0
19 Mar 2025
Rethinking Lanes and Points in Complex Scenarios for Monocular 3D Lane Detection
Computer Vision and Pattern Recognition (CVPR), 2025
Yifan Chang
Junjie Huang
Xiaofeng Wang
Yun Ye
Zhujin Liang
Yi Shan
Dalong Du
Xingang Wang
3DPC
434
3
0
08 Mar 2025
A Token-level Text Image Foundation Model for Document Understanding
Tongkun Guan
Zining Wang
Pei Fu
Zhengtao Guo
Wei Shen
...
Chen Duan
Hao Sun
Qianyi Jiang
Junfeng Luo
Yunbo Wang
VLM
702
8
0
04 Mar 2025
OmniParser V2: Structured-Points-of-Thought for Unified Visual Text Parsing and Its Generality to Multimodal Large Language Models
Wenwen Yu
Zhibo Yang
Jianqiang Wan
Sibo Song
J. Tang
Wenqing Cheng
Yunxing Liu
Xiang Bai
376
19
0
22 Feb 2025
SceneVTG++: Controllable Multilingual Visual Text Generation in the Wild
Jiawei Liu
Yuanzhi Zhu
Feiyu Gao
Zhiyong Yang
P. Wang
Junyang Lin
Xinyu Wang
Wenyu Liu
DiffM
476
0
0
08 Jan 2025
First-place Solution for Streetscape Shop Sign Recognition Competition
Bin Wang
Li Jing
1.0K
1
0
06 Jan 2025
OmniDocBench: Benchmarking Diverse PDF Document Parsing with Comprehensive Annotations
Computer Vision and Pattern Recognition (CVPR), 2024
Linke Ouyang
Yuan Qu
Hongbin Zhou
Jiawei Zhu
Rui Zhang
...
Chao Xu
Bo Zhang
Ding Wang
Zhongying Tu
Bin Wang
571
72
0
10 Dec 2024
Real-Time Text Detection with Similar Mask in Traffic, Industrial, and Natural Scenes
Xu Han
Junyu Gao
Chuang Yang
Yuan Yuan
Qi Wang
279
8
0
05 Nov 2024
HIP: Hierarchical Point Modeling and Pre-training for Visual Information Extraction
Rujiao Long
Pengfei Wang
Zhibo Yang
Cong Yao
323
0
0
02 Nov 2024
Focus Entirety and Perceive Environment for Arbitrary-Shaped Text Detection
IEEE transactions on multimedia (IEEE TMM), 2024
Xu Han
Junyu Gao
Chuang Yang
Yuan Yuan
Qi Wang
307
6
0
25 Sep 2024
Spotlight Text Detector: Spotlight on Candidate Regions Like a Camera
IEEE transactions on multimedia (IEEE TMM), 2024
Xu Han
Junyu Gao
Chuang Yang
Yuan Yuan
Qi Wang
327
12
0
25 Sep 2024
Platypus: A Generalized Specialist Model for Reading Text in Various Forms
European Conference on Computer Vision (ECCV), 2024
Peng Wang
Zhaohai Li
Jun Tang
Humen Zhong
Fei Huang
Zhibo Yang
Cong Yao
VLM
ObjD
251
3
0
27 Aug 2024
FastTextSpotter: A High-Efficiency Transformer for Multilingual Scene Text Spotting
International Conference on Pattern Recognition (ICPR), 2024
Alloy Das
Sanket Biswas
Umapada Pal
Josep Lladós
Saumik Bhattacharya
375
8
0
27 Aug 2024
LaneTCA: Enhancing Video Lane Detection with Temporal Context Aggregation
Keyi Zhou
Li Li
Wengang Zhou
Yonghui Wang
Hao Feng
Houqiang Li
ViT
304
2
0
25 Aug 2024
Focus, Distinguish, and Prompt: Unleashing CLIP for Efficient and Flexible Scene Text Retrieval
ACM Multimedia (MM), 2024
Gangyan Zeng
Yuan Zhang
Jin Wei
Dongbao Yang
Peng Zhang
Yiwen Gao
Xugong Qin
Can Ma
VLM
CLIP
276
9
0
01 Aug 2024
DNTextSpotter: Arbitrary-Shaped Scene Text Spotting via Improved Denoising Training
ACM Multimedia (MM), 2024
Xi Chen
Qian Qiao
Jun Gao
Tianxiang Wu
Rahul Bhadani
Jiaqing Fan
Ziqiang Cao
Larry Head
DiffM
439
13
0
01 Aug 2024
WeCromCL: Weakly Supervised Cross-Modality Contrastive Learning for Transcription-only Supervised Text Spotting
European Conference on Computer Vision (ECCV), 2024
Jingjing Wu
Zhengyao Fang
Pengyuan Lyu
Chengquan Zhang
Fanglin Chen
Guangming Lu
Wenjie Pei
567
4
0
28 Jul 2024
Visual Text Generation in the Wild
Yuanzhi Zhu
Jiawei Liu
Feiyu Gao
Wenyu Liu
Xinggang Wang
Peng Wang
Fei Huang
Cong Yao
Zhibo Yang
DiffM
282
18
0
19 Jul 2024
SegHist: A General Segmentation-based Framework for Chinese Historical Document Text Line Detection
Xingjian Hu
Baole Wei
Liangcai Gao
Jun Wang
369
1
0
17 Jun 2024
Towards Unified Multi-granularity Text Detection with Interactive Attention
Xingyu Wan
Chengquan Zhang
Pengyuan Lyu
Sen Fan
Zihan Ni
Kun Yao
Errui Ding
Jingdong Wang
226
4
0
30 May 2024
LOGO: Video Text Spotting with Language Collaboration and Glyph Perception Model
Hongen Liu
Di Sun
Jiahao Wang
Lu Dong
Gang Pan
350
2
0
29 May 2024
DisC-GS: Discontinuity-aware Gaussian Splatting
Haoxuan Qu
Zhuoling Li
Hossein Rahmani
Yujun Cai
Jun Liu
3DGS
311
11
0
24 May 2024
Text Grouping Adapter: Adapting Pre-trained Text Detector for Layout Analysis
Tianci Bi
Xiaoyi Zhang
Zhizheng Zhang
Wenxuan Xie
Cuiling Lan
Yan Lu
Nanning Zheng
VLM
297
5
0
13 May 2024
VimTS: A Unified Video and Image Text Spotter for Enhancing the Cross-domain Generalization
Yuliang Liu
Mingxin Huang
Hao Yan
Linger Deng
Weijia Wu
Hao Lu
Chunhua Shen
Lianwen Jin
Xiang Bai
298
4
0
30 Apr 2024
Seeing Text in the Dark: Algorithm and Benchmark
Chengpei Xu
Hao Fu
Long Ma
Wenjing Jia
Chengqi Zhang
Xiwei Xu
Xiaoyu Ai
Binghao Li
Wenjie Zhang
370
17
0
13 Apr 2024
Single-image driven 3d viewpoint training data augmentation for effective wine label recognition
Yueh-Cheng Huang
Hsin-Yi Chen
Cheng-Jui Hung
Jen-Hui Chuang
Lei Li
180
0
0
12 Apr 2024
Bridging the Gap Between End-to-End and Two-Step Text Spotting
Mingxin Huang
Hongliang Li
Yuliang Liu
Xiang Bai
Lianwen Jin
260
10
0
06 Apr 2024
Draw-and-Understand: Leveraging Visual Prompts to Enable MLLMs to Comprehend What You Want
Weifeng Lin
Xinyu Wei
Ruichuan An
Shiyang Feng
Bocheng Zou
Yulin Luo
Siyuan Huang
Shanghang Zhang
Jiaming Song
VLM
542
98
0
29 Mar 2024
TextBlockV2: Towards Precise-Detection-Free Scene Text Spotting with Pre-trained Language Model
Jiahao Lyu
Jin Wei
Gangyan Zeng
Zeng Li
Enze Xie
Wei Wang
Can Ma
VLM
353
8
0
15 Mar 2024
MENTOR: Multilingual tExt detectioN TOward leaRning by analogy
IEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2023
Hsin-Ju Lin
Tsu-Chun Chung
Ching-Chun Hsiao
Pin-Yu Chen
Wei-Chen Chiu
Ching-Chun Huang
132
0
0
12 Mar 2024
ODM: A Text-Image Further Alignment Pre-training Approach for Scene Text Detection and Spotting
Chen Duan
Pei Fu
Shan Guo
Qianyi Jiang
Xiaoming Wei
VLM
369
18
0
01 Mar 2024
Hi-SAM: Marrying Segment Anything Model for Hierarchical Text Segmentation
Maoyuan Ye
Jing Zhang
Juhua Liu
Chenyu Liu
Baocai Yin
Cong Liu
Bo Du
Dacheng Tao
VLM
285
37
0
31 Jan 2024
Dynamic Relation Transformer for Contextual Text Block Detection
IEEE International Conference on Document Analysis and Recognition (ICDAR), 2024
Jiawei Wang
Shunchi Zhang
Kai Hu
Chixiang Ma
Zhuoyao Zhong
Lei-huan Sun
Qiang Huo
194
2
0
17 Jan 2024
3D Lane Detection from Front or Surround-View using Joint-Modeling & Matching
Haibin Zhou
Huabing Zhou
Jun Chang
Tao Lu
Jiayi Ma
300
3
0
16 Jan 2024
SwinTextSpotter v2: Towards Better Synergy for Scene Text Spotting
International Journal of Computer Vision (IJCV), 2024
Mingxin Huang
Dezhi Peng
Hongliang Li
Zhenghao Peng
Chongyu Liu
Dahua Lin
Yuliang Liu
Xiang Bai
Lianwen Jin
397
10
0
15 Jan 2024
Inverse-like Antagonistic Scene Text Spotting via Reading-Order Estimation and Dynamic Sampling
IEEE Transactions on Image Processing (TIP), 2024
Shi-Xue Zhang
Chun Yang
Xiaobin Zhu
Hongyang Zhou
Hongfa Wang
Xu-Cheng Yin
323
16
0
08 Jan 2024
Word length-aware text spotting: Enhancing detection and recognition in dense text image
Hao Wang
Huabing Zhou
Yanduo Zhang
Tao Lu
Jiayi Ma
253
1
0
25 Dec 2023
1
2
3
4
Next
Page 1 of 4