Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2112.04888
Cited By
A Bilingual, OpenWorld Video Text Dataset and End-to-end Video Text Spotter with Transformer
9 December 2021
Weijia Wu
Yuanqiang Cai
Debing Zhang
Sibo Wang
Zhuang Li
Jiahong Li
Yejun Tang
Hong Zhou
Re-assign community
ArXiv
PDF
HTML
Papers citing
"A Bilingual, OpenWorld Video Text Dataset and End-to-end Video Text Spotter with Transformer"
11 / 11 papers shown
Title
MovieBench: A Hierarchical Movie Level Dataset for Long Video Generation
Weijia Wu
Mingyu Liu
Zeyu Zhu
Xi Xia
Haoen Feng
Wen Wang
Kevin Qinghong Lin
Chunhua Shen
Mike Zheng Shou
DiffM
VGen
114
1
0
22 Nov 2024
Scene-Text Grounding for Text-Based Video Question Answering
Sheng Zhou
Junbin Xiao
Xun Yang
Peipei Song
Dan Guo
Angela Yao
Meng Wang
Tat-Seng Chua
87
1
0
22 Sep 2024
GloTSFormer: Global Video Text Spotting Transformer
Hang Wang
Yanjie Wang
Yang Li
Can Huang
27
0
0
08 Jan 2024
Video text tracking for dense and small text based on pp-yoloe-r and sort algorithm
Hongen Liu
13
0
0
31 Mar 2023
Real-time End-to-End Video Text Spotter with Contrastive Representation Learning
Wejia Wu
Zhuang Li
Jiahong Li
Chunhua Shen
Hong Zhou
Size Li
Zhongyuan Wang
Ping Luo
AI4TS
21
8
0
18 Jul 2022
Explore Faster Localization Learning For Scene Text Detection
Yuzhong Zhao
Yuanqiang Cai
Weijia Wu
Weiqiang Wang
ViT
21
14
0
04 Jul 2022
End-to-End Video Text Spotting with Transformer
Weijia Wu
Yuanqiang Cai
Chunhua Shen
Debing Zhang
Ying Fu
Hong Zhou
Ping Luo
ViT
35
24
0
20 Mar 2022
Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction without Convolutions
Wenhai Wang
Enze Xie
Xiang Li
Deng-Ping Fan
Kaitao Song
Ding Liang
Tong Lu
Ping Luo
Ling Shao
ViT
263
3,604
0
24 Feb 2021
TransTrack: Multiple Object Tracking with Transformer
Pei Sun
Jinkun Cao
Yi-Xin Jiang
Rufeng Zhang
Enze Xie
Zehuan Yuan
Changhu Wang
Ping Luo
ViT
VOT
241
564
0
31 Dec 2020
TVR: A Large-Scale Dataset for Video-Subtitle Moment Retrieval
Jie Lei
Licheng Yu
Tamara L. Berg
Mohit Bansal
106
275
0
24 Jan 2020
COCO-Text: Dataset and Benchmark for Text Detection and Recognition in Natural Images
Andreas Veit
Tomas Matera
Lukás Neumann
Jirí Matas
Serge J. Belongie
180
515
0
26 Jan 2016
1