RoadText-1K: Text Detection & Recognition Dataset for Driving Videos


19 May 2020
S. Reddy
Minesh Mathew
Lluís Gómez
Marçal Rusiñol
Dimosthenis Karatzas
C. V. Jawahar

Papers citing "RoadText-1K: Text Detection & Recognition Dataset for Driving Videos"

18 / 18 papers shown
When Semantics Mislead Vision: Mitigating Large Multimodal Models Hallucinations in Scene Text Spotting and Understanding
Yan Shu
Hangui Lin
Yexin Liu
Yan Zhang
Gangyan Zeng
Yan Li
Yu Zhou
Ser-Nam Lim
Harry Yang
N. Sebe
MLLM
VLM
65
0
0
05 Jun 2025
VidText: Towards Comprehensive Evaluation for Video Text Understanding
Zhoufaran Yang
Yan Shu
Zhifei Yang
Yan Zhang
Yu-Hong Li
K. Lu
Gangyan Zeng
Shaohui Liu
Yu Zhou
N. Sebe
CoGe
52
0
0
28 May 2025
LOGO: Video Text Spotting with Language Collaboration and Glyph Perception Model
Hongen Liu
Di Sun
Jiahao Wang
Yi Liu
Gang Pan
81
0
0
29 May 2024
DSText V2: A Comprehensive Video Text Spotting Dataset for Dense and Small Text
Weijia Wu
Yiming Zhang
Yefei He
Luoming Zhang
Zhenyu Lou
Hong Zhou
Xiang Bai
95
6
0
29 Nov 2023
PEAN: A Diffusion-Based Prior-Enhanced Attention Network for Scene Text Image Super-Resolution
Zuoyan Zhao
Hui Xue
Pengfei Fang
Shipeng Zhu
DiffM
64
4
0
29 Nov 2023
Diving into the Depths of Spotting Text in Multi-Domain Noisy Scenes
Alloy Das
Sanket Biswas
Umapada Pal
Josep Lladós
87
4
0
01 Oct 2023
STEP -- Towards Structured Scene-Text Spotting
Sergi Garcia-Bordils
Dimosthenis Karatzas
Marçal Rusiñol
87
2
0
05 Sep 2023
PBFormer: Capturing Complex Scene Text Shape with Polynomial Band Transformer
Ruijin Liu
Ning Lu
Dapeng Chen
Cheng Li
Zejian Yuan
Wei Peng
70
3
0
29 Aug 2023
Reading Between the Lanes: Text VideoQA on the Road
George Tom
Minesh Mathew
Sergi Garcia
Dimosthenis Karatzas
C. V. Jawahar
88
8
0
08 Jul 2023
A Large Cross-Modal Video Retrieval Dataset with Reading Comprehension
Weijia Wu
Yuzhong Zhao
Zhuangzi Li
Jiahong Li
Hong Zhou
Mike Zheng Shou
Xiang Bai
82
22
0
05 May 2023
Scalable Mask Annotation for Video Text Spotting
Haibin He
Jing Zhang
Mengyang Xu
Juhua Liu
Bo Du
Dacheng Tao
129
15
0
02 May 2023
ICDAR 2023 Video Text Reading Competition for Dense and Small Text
Weijia Wu
Yuzhong Zhao
Zhuangzi Li
Jiahong Li
Mike Zheng Shou
Umapada Pal
Dimosthenis Karatzas
Xiang Bai
81
7
0
10 Apr 2023
Video text tracking for dense and small text based on pp-yoloe-r and sort algorithm
Hongen Liu
75
1
0
31 Mar 2023
End-to-End Video Text Spotting with Transformer
Weijia Wu
Yuanqiang Cai
Chunhua Shen
Debing Zhang
Ying Fu
Hong Zhou
Ping Luo
ViT
108
25
0
20 Mar 2022
Transfer Learning for Scene Text Recognition in Indian Languages
Sanjana Gunna
Rohit Saluja
C. V. Jawahar
VLM
85
13
0
10 Jan 2022
A Bilingual, OpenWorld Video Text Dataset and End-to-end Video Text Spotter with Transformer
Weijia Wu
Yuanqiang Cai
Debing Zhang
Sibo Wang
Zhuang Li
Jiahong Li
Yejun Tang
Hong Zhou
76
31
0
09 Dec 2021
Video Text Tracking With a Spatio-Temporal Complementary Model
Yuzhe Gao
Xing Li
Jiajian Zhang
Yu Zhou
Dian Jin
Jing Wang
Shenggao Zhu
Xiang Bai
66
17
0
09 Nov 2021
Weakly-Supervised Domain Adaptation of Deep Regression Trackers via Reinforced Knowledge Distillation
Matteo Dunnhofer
N. Martinel
C. Micheloni
70
16
0
26 Mar 2021