Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
All Papers
0 / 0 papers shown
Title
Home
Papers
1907.00945
Cited By
ICDAR2019 Robust Reading Challenge on Multi-lingual Scene Text Detection and Recognition -- RRC-MLT-2019
IEEE International Conference on Document Analysis and Recognition (ICDAR), 2019
1 July 2019
Nibal Nayef
Yash J. Patel
M. Busta
Pinaki Nath Chowdhury
Dimosthenis Karatzas
Wafa Khlif
Jirí Matas
Umapada Pal
J. Burie
Cheng-Lin Liu
J. Ogier
3DV
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"ICDAR2019 Robust Reading Challenge on Multi-lingual Scene Text Detection and Recognition -- RRC-MLT-2019"
50 / 135 papers shown
Title
Bharat Scene Text: A Novel Comprehensive Dataset and Benchmark for Indian Language Scene Text Understanding
Anik De
A. S. Penamakuri
Rajeev Yadav
Aditya Rathore
Harshiv Shah
Devesh Sharma
Sagar Agarwal
Pravin Kumar
Anand Mishra
108
0
0
28 Nov 2025
Evaluating Multimodal Large Language Models on Vertically Written Japanese Text
Keito Sasagawa
Shuhei Kurita
Daisuke Kawahara
68
0
0
19 Nov 2025
A Large-scale Dataset for Robust Complex Anime Scene Text Detection
Ziyi Dong
Yurui Zhang
Changmao Li
Naomi Rue Golding
Qing Long
64
0
0
09 Oct 2025
VLM-FO1: Bridging the Gap Between High-Level Reasoning and Fine-Grained Perception in VLMs
Peng Liu
H. Shen
Chunxin Fang
Zhicheng Sun
Jiajia Liao
T. Zhao
MLLM
ObjD
VLM
LRM
205
2
0
30 Sep 2025
TEACH: Text Encoding as Curriculum Hints for Scene Text Recognition
Xiahan Yang
Hui Zheng
VLM
90
1
0
02 Aug 2025
SAViL-Det: Semantic-Aware Vision-Language Model for Multi-Script Text Detection
Mohammed-En-Nadhir Zighem
Abdenour Hadid
VLM
88
0
0
27 Jul 2025
MSTAR: Box-free Multi-query Scene Text Retrieval with Attention Recycling
Liang Yin
Xudong Xie
Zhang Li
Xiang Bai
Yuliang Liu
LRM
282
0
0
12 Jun 2025
The OCR Quest for Generalization: Learning to recognize low-resource alphabets with model editing
Adrià Molina Rodríguez
O. R. Terrades
Josep Lladós
208
1
0
07 Jun 2025
SparseMM: Head Sparsity Emerges from Visual Concept Responses in MLLMs
Jiahui Wang
Z. Liu
Yongming Rao
Jiwen Lu
VLM
LRM
445
3
0
05 Jun 2025
TextSR: Diffusion Super-Resolution with Multilingual OCR Guidance
Keren Ye
Ignacio Garcia Dorado
Michalis Raptis
M. Delbracio
Irene Zhu
P. Milanfar
Hossein Talebi
245
1
0
29 May 2025
SATORI-R1: Incentivizing Multimodal Reasoning through Explicit Visual Anchoring
Chuming Shen
Wei Wei
Xiaoye Qu
Yu Cheng
LRM
398
8
0
25 May 2025
The Devil is in Fine-tuning and Long-tailed Problems:A New Benchmark for Scene Text Detection
International Joint Conference on Artificial Intelligence (IJCAI), 2025
Tianjiao Cao
Jiahao Lyu
Weichao Zeng
Weimin Mu
Can Ma
279
1
0
21 May 2025
DanceText: A Training-Free Layered Framework for Controllable Multilingual Text Transformation in Images
Zhenyu Yu
Mohd Yamani Idna Idris
Pei Wang
Yuelong Xia
Rizwan Qureshi
Shaina Raza
Aman Chadha
Yong Xiang
Zhixiang Chen
DiffM
228
1
0
18 Apr 2025
DocSAM: Unified Document Image Segmentation via Query Decomposition and Heterogeneous Mixed Learning
Computer Vision and Pattern Recognition (CVPR), 2025
Xiao-Hui Li
Fei Yin
Cheng-Lin Liu
285
2
0
05 Apr 2025
Accurate Scene Text Recognition with Efficient Model Scaling and Cloze Self-Distillation
Computer Vision and Pattern Recognition (CVPR), 2025
Andrea Maracani
Savas Ozkan
Sijun Cho
Hyowon Kim
Eunchung Noh
Jeongwon Min
Cho Jung Min
Dookun Park
Mete Ozay
374
1
0
20 Mar 2025
A Context-Driven Training-Free Network for Lightweight Scene Text Segmentation and Recognition
Ritabrata Chakraborty
Shivakumara Palaiahnakote
Umapada Pal
Cheng-Lin Liu
VLM
258
1
0
19 Mar 2025
SceneVTG++: Controllable Multilingual Visual Text Generation in the Wild
Jiawei Liu
Yuanzhi Zhu
Feiyu Gao
Zhiyong Yang
P. Wang
Junyang Lin
Xinyu Wang
Wenyu Liu
DiffM
313
0
0
08 Jan 2025
First-place Solution for Streetscape Shop Sign Recognition Competition
Bin Wang
Li Jing
963
0
0
06 Jan 2025
TextSSR: Diffusion-based Data Synthesis for Scene Text Recognition
Xingsong Ye
Yongkun Du
Yunbo Tao
Z. Chen
DiffM
417
2
0
02 Dec 2024
Text Image Generation for Low-Resource Languages with Dual Translation Learning
Chihiro Noguchi
Shun Fukuda
Shoichiro Mihara
Masao Yamanaka
DiffM
192
0
0
26 Sep 2024
Spotlight Text Detector: Spotlight on Candidate Regions Like a Camera
IEEE transactions on multimedia (IEEE TMM), 2024
Xu Han
Junyu Gao
Chuang Yang
Yuan Yuan
Qi Wang
281
8
0
25 Sep 2024
Leveraging Text Localization for Scene Text Removal via Text-aware Masked Image Modeling
European Conference on Computer Vision (ECCV), 2024
Zixiao Wang
Hongtao Xie
Yuxin Wang
Yadong Qu
Fengjun Guo
Pengwei Liu
DiffM
185
1
0
20 Sep 2024
VL-Reader: Vision and Language Reconstructor is an Effective Scene Text Recognizer
ACM Multimedia (MM), 2024
Humen Zhong
Zhibo Yang
Zhaohai Li
Peng Wang
Jun Tang
Wenqing Cheng
Cong Yao
236
5
0
18 Sep 2024
FastTextSpotter: A High-Efficiency Transformer for Multilingual Scene Text Spotting
International Conference on Pattern Recognition (ICPR), 2024
Alloy Das
Sanket Biswas
Umapada Pal
Josep Lladós
Saumik Bhattacharya
260
6
0
27 Aug 2024
Decoder Pre-Training with only Text for Scene Text Recognition
ACM Multimedia (MM), 2024
Shuai Zhao
Yongkun Du
Zhineng Chen
Yu-Gang Jiang
142
6
0
11 Aug 2024
Focus, Distinguish, and Prompt: Unleashing CLIP for Efficient and Flexible Scene Text Retrieval
ACM Multimedia (MM), 2024
Gangyan Zeng
Yuan Zhang
Jin Wei
Dongbao Yang
Peng Zhang
Yiwen Gao
Xugong Qin
Can Ma
VLM
CLIP
202
7
0
01 Aug 2024
Self-Supervised Learning for Text Recognition: A Critical Survey
International Journal of Computer Vision (IJCV), 2024
Carlos Peñarrubia
J. J. Valero-Mas
Jorge Calvo-Zaragoza
364
4
0
29 Jul 2024
MOoSE: Multi-Orientation Sharing Experts for Open-set Scene Text Recognition
Chang Liu
Simon Corbillé
Elisa H Barney Smith
171
0
0
26 Jul 2024
Out of Length Text Recognition with Sub-String Matching
Yongkun Du
Zhineng Chen
Caiyan Jia
Xieping Gao
Yu-Gang Jiang
461
5
0
17 Jul 2024
How Control Information Influences Multilingual Text Image Generation and Editing?
Boqiang Zhang
Zuan Gao
Yadong Qu
Hongtao Xie
DiffM
282
7
0
16 Jul 2024
AnyTrans: Translate AnyText in the Image with Large Scale Models
Zhipeng Qian
Pei Zhang
Baosong Yang
Kai Fan
Yiwei Ma
Yang Li
Xiaoshuai Sun
Rongrong Ji
VLM
224
3
0
17 Jun 2024
A Survey on Multi-modal Machine Translation: Tasks, Methods and Challenges
Huangjun Shen
Liangying Shao
Wenbo Li
Zhibin Lan
Zhanyu Liu
Jinsong Su
290
4
0
21 May 2024
MTVQA: Benchmarking Multilingual Text-Centric Visual Question Answering
Jingqun Tang
Qi-dong Liu
Yongjie Ye
Jinghui Lu
Shubo Wei
...
Hao Liu
Xiang Bai
Can Huang
Xiang Bai
Can Huang
706
48
0
20 May 2024
The First Swahili Language Scene Text Detection and Recognition Dataset
IEEE International Conference on Document Analysis and Recognition (ICDAR), 2024
Fadila Wendigoundi Douamba
Jianjun Song
Ling Fu
Yuliang Liu
Xiang Bai
215
1
0
19 May 2024
FPDIoU Loss: A Loss Function for Efficient Bounding Box Regression of Rotated Object Detection
Image and Vision Computing (IVC), 2024
Siliang Ma
Yong Xu
233
7
0
16 May 2024
HAAP: Vision-context Hierarchical Attention Autoregressive with Adaptive Permutation for Scene Text Recognition
Honghui Chen
Yuhang Qiu
Jiabao Wang
Pingping Chen
Nam Ling
180
0
0
15 May 2024
Self-Supervised Pre-training with Symmetric Superimposition Modeling for Scene Text Recognition
International Joint Conference on Artificial Intelligence (IJCAI), 2024
Zuan Gao
Yuxin Wang
Yadong Qu
Boqiang Zhang
Zixiao Wang
Jianjun Xu
Hongtao Xie
ViT
163
12
0
09 May 2024
Exploring the Capabilities of Large Multimodal Models on Dense Text
IEEE International Conference on Document Analysis and Recognition (ICDAR), 2024
Shuo Zhang
Biao Yang
Zhang Li
Zhiyin Ma
Yuliang Liu
Xiang Bai
VLM
193
12
0
09 May 2024
Draw-and-Understand: Leveraging Visual Prompts to Enable MLLMs to Comprehend What You Want
Weifeng Lin
Xinyu Wei
Ruichuan An
Shiyang Feng
Bocheng Zou
Yulin Luo
Siyuan Huang
Shanghang Zhang
Jiaming Song
VLM
379
84
0
29 Mar 2024
IndicSTR12: A Dataset for Indic Scene Text Recognition
Harsh Lunia
Ajoy Mondal
C. V. Jawahar
139
3
0
12 Mar 2024
Open-Vocabulary Scene Text Recognition via Pseudo-Image Labeling and Margin Loss
Xuhua Ren
Hengcan Shi
Jin Li
VLM
218
0
0
12 Mar 2024
TextMonkey: An OCR-Free Large Multimodal Model for Understanding Document
Yuliang Liu
Biao Yang
Qiang Liu
Zhang Li
Zhiyin Ma
Shuo Zhang
Xiang Bai
MLLM
VLM
303
147
0
07 Mar 2024
Visual Text Meets Low-level Vision: A Comprehensive Survey on Visual Text Processing
Yan Shu
Weichao Zeng
Zhenhang Li
Fangmin Zhao
Can Ma
229
8
0
05 Feb 2024
An Empirical Study of Scaling Law for OCR
Miao Rang
Zhenni Bi
Chuanjian Liu
Yunhe Wang
Kai Han
376
12
0
29 Dec 2023
Cross-Lingual Learning in Multilingual Scene Text Recognition
Jeonghun Baek
Yusuke Matsui
Kiyoharu Aizawa
186
1
0
17 Dec 2023
Bridging Synthetic and Real Worlds for Pre-training Scene Text Detectors
European Conference on Computer Vision (ECCV), 2023
Tongkun Guan
Wei Shen
Xuehang Yang
Xuehui Wang
Yunbo Wang
276
8
0
08 Dec 2023
Enhancing Scene Text Detectors with Realistic Text Image Synthesis Using Diffusion Models
Ling Fu
Zijie Wu
Yingying Zhu
Yuliang Liu
Xiang Bai
173
0
0
28 Nov 2023
Scene Text Image Super-resolution based on Text-conditional Diffusion Models
Chihiro Noguchi
Shun Fukuda
Masao Yamanaka
DiffM
200
21
0
16 Nov 2023
Exploring OCR Capabilities of GPT-4V(ision) : A Quantitative and In-depth Evaluation
Yongxin Shi
Dezhi Peng
Wenhui Liao
Zening Lin
Xinhong Chen
Chongyu Liu
Yuyi Zhang
Lianwen Jin
MLLM
379
53
0
25 Oct 2023
Box2Poly: Memory-Efficient Polygon Prediction of Arbitrarily Shaped and Rotated Text
AAAI Conference on Artificial Intelligence (AAAI), 2023
Xuyang Chen
Dong Wang
Konrad Schindler
Mingwei Sun
Yongliang Wang
Nicolo Savioli
Liqiu Meng
222
1
0
20 Sep 2023
1
2
3
Next