A Bilingual, OpenWorld Video Text Dataset and End-to-end Video Text Spotter with Transformer

9 December 2021
Weijia Wu
Yuanqiang Cai
Debing Zhang
Sibo Wang
Zhuang Li
Jiahong Li
Yejun Tang
Hong Zhou
arXiv:2112.04888

Papers citing "A Bilingual, OpenWorld Video Text Dataset and End-to-end Video Text Spotter with Transformer"

11 / 11 papers shown
MovieBench: A Hierarchical Movie Level Dataset for Long Video Generation
Weijia Wu
Mingyu Liu
Zeyu Zhu
Xi Xia
Haoen Feng
Wen Wang
Kevin Qinghong Lin
Chunhua Shen
Mike Zheng Shou
DiffM
VGen
114
1
0
22 Nov 2024
Scene-Text Grounding for Text-Based Video Question Answering
Sheng Zhou
Junbin Xiao
Xun Yang
Peipei Song
Dan Guo
Angela Yao
Meng Wang
Tat-Seng Chua
87
1
0
22 Sep 2024
GloTSFormer: Global Video Text Spotting Transformer
Hang Wang
Yanjie Wang
Yang Li
Can Huang
27
0
0
08 Jan 2024
Video text tracking for dense and small text based on pp-yoloe-r and sort algorithm
Hongen Liu
13
0
0
31 Mar 2023
Real-time End-to-End Video Text Spotter with Contrastive Representation Learning
Weijia Wu
Zhuang Li
Jiahong Li
Chunhua Shen
Hong Zhou
Size Li
Zhongyuan Wang
Ping Luo
AI4TS
21
8
0
18 Jul 2022
Explore Faster Localization Learning For Scene Text Detection
Yuzhong Zhao
Yuanqiang Cai
Weijia Wu
Weiqiang Wang
ViT
21
14
0
04 Jul 2022
End-to-End Video Text Spotting with Transformer
Weijia Wu
Yuanqiang Cai
Chunhua Shen
Debing Zhang
Ying Fu
Hong Zhou
Ping Luo
ViT
35
24
0
20 Mar 2022
Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction without Convolutions
Wenhai Wang
Enze Xie
Xiang Li
Deng-Ping Fan
Kaitao Song
Ding Liang
Tong Lu
Ping Luo
Ling Shao
ViT
263
3,604
0
24 Feb 2021
TransTrack: Multiple Object Tracking with Transformer
Peize Sun
Jinkun Cao
Yi Jiang
Rufeng Zhang
Enze Xie
Zehuan Yuan
Changhu Wang
Ping Luo
ViT
VOT
241
564
0
31 Dec 2020
TVR: A Large-Scale Dataset for Video-Subtitle Moment Retrieval
Jie Lei
Licheng Yu
Tamara L. Berg
Mohit Bansal
106
275
0
24 Jan 2020
COCO-Text: Dataset and Benchmark for Text Detection and Recognition in Natural Images
Andreas Veit
Tomas Matera
Lukáš Neumann
Jiří Matas
Serge J. Belongie
180
515
0
26 Jan 2016