ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2207.04491
  4. Cited By
DPText-DETR: Towards Better Scene Text Detection with Dynamic Points in
  Transformer
v1v2 (latest)

DPText-DETR: Towards Better Scene Text Detection with Dynamic Points in Transformer

AAAI Conference on Artificial Intelligence (AAAI), 2022
10 July 2022
Maoyuan Ye
Jing Zhang
Shanshan Zhao
Juhua Liu
Bo Du
Dacheng Tao
    ViT
ArXiv (abs)PDFHTMLGithub (189★)

Papers citing "DPText-DETR: Towards Better Scene Text Detection with Dynamic Points in Transformer"

40 / 40 papers shown
SFA: Scan, Focus, and Amplify toward Guidance-aware Answering for Video TextVQA
SFA: Scan, Focus, and Amplify toward Guidance-aware Answering for Video TextVQA
Haibin He
Qihuang Zhong
Juhua Liu
Bo Du
Peng Wang
Jing Zhang
145
0
0
25 Nov 2025
LRANet++: Low-Rank Approximation Network for Accurate and Efficient Text Spotting
LRANet++: Low-Rank Approximation Network for Accurate and Efficient Text Spotting
Yuchen Su
Z. Chen
Yongkun Du
Zuxuan Wu
Hongtao Xie
Yu-Gang Jiang
177
2
0
08 Nov 2025
Exploring OCR-augmented Generation for Bilingual VQA
Exploring OCR-augmented Generation for Bilingual VQA
JoonHo Lee
Sunho Park
VLM
132
1
0
02 Oct 2025
Why Stop at Words? Unveiling the Bigger Picture through Line-Level OCR
Why Stop at Words? Unveiling the Bigger Picture through Line-Level OCR
Shashank Vempati
Nishit Anand
Gaurav Talebailkar
Arpan Garai
Chetan Arora
151
0
0
29 Aug 2025
SAViL-Det: Semantic-Aware Vision-Language Model for Multi-Script Text Detection
SAViL-Det: Semantic-Aware Vision-Language Model for Multi-Script Text Detection
Mohammed-En-Nadhir Zighem
Abdenour Hadid
VLM
113
0
0
27 Jul 2025
Text-Aware Image Restoration with Diffusion Models
Text-Aware Image Restoration with Diffusion Models
Jaewon Min
J. Kim
Paul Hyunbin Cho
J. Lee
Jihye Park
Minkyu Park
S. Kim
Hyunhee Park
Seungryong Kim
352
3
0
11 Jun 2025
The Devil is in Fine-tuning and Long-tailed Problems:A New Benchmark for Scene Text Detection
The Devil is in Fine-tuning and Long-tailed Problems:A New Benchmark for Scene Text DetectionInternational Joint Conference on Artificial Intelligence (IJCAI), 2025
Tianjiao Cao
Jiahao Lyu
Weichao Zeng
Weimin Mu
Can Ma
396
2
0
21 May 2025
Beyond the Frame: Generating 360° Panoramic Videos from Perspective Videos
Beyond the Frame: Generating 360° Panoramic Videos from Perspective Videos
Rundong Luo
Matthew Wallingford
Ali Farhadi
Noah Snavely
Wei-Chiu Ma
VGen
458
9
0
10 Apr 2025
DocSAM: Unified Document Image Segmentation via Query Decomposition and Heterogeneous Mixed Learning
DocSAM: Unified Document Image Segmentation via Query Decomposition and Heterogeneous Mixed LearningComputer Vision and Pattern Recognition (CVPR), 2025
Xiao-Hui Li
Fei Yin
Cheng-Lin Liu
350
5
0
05 Apr 2025
TextInPlace: Indoor Visual Place Recognition in Repetitive Structures with Scene Text Spotting and Verification
TextInPlace: Indoor Visual Place Recognition in Repetitive Structures with Scene Text Spotting and Verification
Huaqi Tao
Bingxi Liu
Calvin Chen
Tingjun Huang
He Li
Jinqiang Cui
Hong Zhang
494
1
0
09 Mar 2025
PLATTER: A Page-Level Handwritten Text Recognition System for Indic Scripts
PLATTER: A Page-Level Handwritten Text Recognition System for Indic Scripts
Badri Vishal Kasuba
Dhruv Kudale
Venkatapathy Subramanian
P. Chaudhuri
Ganesh Ramakrishnan
344
4
0
10 Feb 2025
DocVLM: Make Your VLM an Efficient Reader
DocVLM: Make Your VLM an Efficient ReaderComputer Vision and Pattern Recognition (CVPR), 2024
Mor Shpigel Nacson
Aviad Aberdam
Roy Ganz
Elad Ben Avraham
Alona Golts
Yair Kittenplon
Shai Mazor
Ron Litman
VLM
762
0
0
11 Dec 2024
Spotlight Text Detector: Spotlight on Candidate Regions Like a Camera
Spotlight Text Detector: Spotlight on Candidate Regions Like a CameraIEEE transactions on multimedia (IEEE TMM), 2024
Xu Han
Junyu Gao
Chuang Yang
Yuan Yuan
Qi Wang
326
11
0
25 Sep 2024
FastTextSpotter: A High-Efficiency Transformer for Multilingual Scene Text Spotting
FastTextSpotter: A High-Efficiency Transformer for Multilingual Scene Text SpottingInternational Conference on Pattern Recognition (ICPR), 2024
Alloy Das
Sanket Biswas
Umapada Pal
Josep Lladós
Saumik Bhattacharya
374
8
0
27 Aug 2024
DNTextSpotter: Arbitrary-Shaped Scene Text Spotting via Improved Denoising Training
DNTextSpotter: Arbitrary-Shaped Scene Text Spotting via Improved Denoising TrainingACM Multimedia (MM), 2024
Xi Chen
Qian Qiao
Jun Gao
Tianxiang Wu
Rahul Bhadani
Jiaqing Fan
Ziqiang Cao
Larry Head
DiffM
439
13
0
01 Aug 2024
SegHist: A General Segmentation-based Framework for Chinese Historical
  Document Text Line Detection
SegHist: A General Segmentation-based Framework for Chinese Historical Document Text Line Detection
Xingjian Hu
Baole Wei
Liangcai Gao
Jun Wang
369
1
0
17 Jun 2024
Towards Unified Multi-granularity Text Detection with Interactive
  Attention
Towards Unified Multi-granularity Text Detection with Interactive Attention
Xingyu Wan
Chengquan Zhang
Pengyuan Lyu
Sen Fan
Zihan Ni
Kun Yao
Errui Ding
Jingdong Wang
226
4
0
30 May 2024
VimTS: A Unified Video and Image Text Spotter for Enhancing the
  Cross-domain Generalization
VimTS: A Unified Video and Image Text Spotter for Enhancing the Cross-domain Generalization
Yuliang Liu
Mingxin Huang
Hao Yan
Linger Deng
Weijia Wu
Hao Lu
Chunhua Shen
Lianwen Jin
Xiang Bai
298
4
0
30 Apr 2024
Seeing Text in the Dark: Algorithm and Benchmark
Seeing Text in the Dark: Algorithm and Benchmark
Chengpei Xu
Hao Fu
Long Ma
Wenjing Jia
Chengqi Zhang
Xiwei Xu
Xiaoyu Ai
Binghao Li
Wenjie Zhang
370
17
0
13 Apr 2024
Bridging the Gap Between End-to-End and Two-Step Text Spotting
Bridging the Gap Between End-to-End and Two-Step Text Spotting
Mingxin Huang
Hongliang Li
Yuliang Liu
Xiang Bai
Lianwen Jin
259
10
0
06 Apr 2024
CPN: Complementary Proposal Network for Unconstrained Text Detection
CPN: Complementary Proposal Network for Unconstrained Text Detection
Longhuang Wu
Shangxuan Tian
Youxin Wang
Pengfei Xiong
229
2
0
18 Feb 2024
Hi-SAM: Marrying Segment Anything Model for Hierarchical Text
  Segmentation
Hi-SAM: Marrying Segment Anything Model for Hierarchical Text Segmentation
Maoyuan Ye
Jing Zhang
Juhua Liu
Chenyu Liu
Baocai Yin
Cong Liu
Bo Du
Dacheng Tao
VLM
285
37
0
31 Jan 2024
BPDO:Boundary Points Dynamic Optimization for Arbitrary Shape Scene Text
  Detection
BPDO:Boundary Points Dynamic Optimization for Arbitrary Shape Scene Text Detection
Jinzhi Zheng
Libo Zhang
Yanjun Wu
Chen Zhao
267
5
0
18 Jan 2024
SwinTextSpotter v2: Towards Better Synergy for Scene Text Spotting
SwinTextSpotter v2: Towards Better Synergy for Scene Text SpottingInternational Journal of Computer Vision (IJCV), 2024
Mingxin Huang
Dezhi Peng
Hongliang Li
Zhenghao Peng
Chongyu Liu
Dahua Lin
Yuliang Liu
Xiang Bai
Lianwen Jin
397
10
0
15 Jan 2024
Inverse-like Antagonistic Scene Text Spotting via Reading-Order
  Estimation and Dynamic Sampling
Inverse-like Antagonistic Scene Text Spotting via Reading-Order Estimation and Dynamic SamplingIEEE Transactions on Image Processing (TIP), 2024
Shi-Xue Zhang
Chun Yang
Xiaobin Zhu
Hongyang Zhou
Hongfa Wang
Xu-Cheng Yin
322
16
0
08 Jan 2024
Bridging Synthetic and Real Worlds for Pre-training Scene Text Detectors
Bridging Synthetic and Real Worlds for Pre-training Scene Text DetectorsEuropean Conference on Computer Vision (ECCV), 2023
Tongkun Guan
Wei Shen
Xuehang Yang
Xuehui Wang
Yunbo Wang
362
9
0
08 Dec 2023
Enhancing Scene Text Detectors with Realistic Text Image Synthesis Using
  Diffusion Models
Enhancing Scene Text Detectors with Realistic Text Image Synthesis Using Diffusion Models
Ling Fu
Zijie Wu
Yingying Zhu
Yuliang Liu
Xiang Bai
242
0
0
28 Nov 2023
Hierarchical Text Spotter for Joint Text Spotting and Layout Analysis
Hierarchical Text Spotter for Joint Text Spotting and Layout AnalysisIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Shangbang Long
Siyang Qin
Yasuhisa Fujii
Alessandro Bissacco
Michalis Raptis
254
9
0
25 Oct 2023
Box2Poly: Memory-Efficient Polygon Prediction of Arbitrarily Shaped and Rotated Text
Box2Poly: Memory-Efficient Polygon Prediction of Arbitrarily Shaped and Rotated TextAAAI Conference on Artificial Intelligence (AAAI), 2023
Xuyang Chen
Dong Wang
Konrad Schindler
Mingwei Sun
Yongliang Wang
Nicolo Savioli
Liqiu Meng
289
2
0
20 Sep 2023
MixNet: Toward Accurate Detection of Challenging Scene Text in the Wild
MixNet: Toward Accurate Detection of Challenging Scene Text in the Wild
Yu Zeng
J. Hsieh
Xuzhao Li
Ming-Ching Chang
421
23
0
23 Aug 2023
SRFormer: Text Detection Transformer with Incorporated Segmentation and
  Regression
SRFormer: Text Detection Transformer with Incorporated Segmentation and RegressionAAAI Conference on Artificial Intelligence (AAAI), 2023
Qingwen Bu
Sungrae Park
Minsoo Khang
Yi-Bin Cheng
291
16
0
21 Aug 2023
Turning a CLIP Model into a Scene Text Spotter
Turning a CLIP Model into a Scene Text SpotterIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Wenwen Yu
Yuliang Liu
Xingkui Zhu
H. Cao
Xing Sun
Xiang Bai
VLMCLIP
217
23
0
21 Aug 2023
ESTextSpotter: Towards Better Scene Text Spotting with Explicit Synergy
  in Transformer
ESTextSpotter: Towards Better Scene Text Spotting with Explicit Synergy in TransformerIEEE International Conference on Computer Vision (ICCV), 2023
Mingxin Huang
Jiaxin Zhang
Dezhi Peng
Hao Lu
Can Huang
Yuliang Liu
Xiang Bai
Lianwen Jin
268
44
0
20 Aug 2023
LRANet: Towards Accurate and Efficient Scene Text Detection with
  Low-Rank Approximation Network
LRANet: Towards Accurate and Efficient Scene Text Detection with Low-Rank Approximation NetworkAAAI Conference on Artificial Intelligence (AAAI), 2023
Yuchen Su
Zhineng Chen
Zhiwen Shao
Yuning Du
Zhilong Ji
Jinfeng Bai
Yong Zhou
Yuxi Jiang
347
19
0
27 Jun 2023
DeepSolo++: Let Transformer Decoder with Explicit Points Solo for
  Multilingual Text Spotting
DeepSolo++: Let Transformer Decoder with Explicit Points Solo for Multilingual Text Spotting
Maoyuan Ye
Jing Zhang
Shanshan Zhao
Juhua Liu
Tongliang Liu
Bo Du
Dacheng Tao
260
7
0
31 May 2023
Scalable Mask Annotation for Video Text Spotting
Scalable Mask Annotation for Video Text Spotting
Haibin He
Jing Zhang
Mengyang Xu
Juhua Liu
Bo Du
Dacheng Tao
257
17
0
02 May 2023
OmniForce: On Human-Centered, Large Model Empowered and Cloud-Edge
  Collaborative AutoML System
OmniForce: On Human-Centered, Large Model Empowered and Cloud-Edge Collaborative AutoML System
Chao Xue
Wen Liu
Shunxing Xie
Zhenfang Wang
Jiaxing Li
...
Shi-Yong Chen
Yibing Zhan
Jing Zhang
Chaoyue Wang
Dacheng Tao
272
4
0
01 Mar 2023
SPTS v2: Single-Point Scene Text Spotting
SPTS v2: Single-Point Scene Text SpottingIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Yuliang Liu
Jiaxin Zhang
Dezhi Peng
Mingxin Huang
Xinyu Wang
...
Can Huang
Dahua Lin
Chunhua Shen
Xiang Bai
Lianwen Jin
VLM
474
77
0
04 Jan 2023
Aggregated Text Transformer for Scene Text Detection
Aggregated Text Transformer for Scene Text Detection
Zhao Zhou
Xiangcheng Du
Yingbin Zheng
Cheng Jin
ViT
266
1
0
25 Nov 2022
DeepSolo: Let Transformer Decoder with Explicit Points Solo for Text
  Spotting
DeepSolo: Let Transformer Decoder with Explicit Points Solo for Text SpottingComputer Vision and Pattern Recognition (CVPR), 2022
Maoyuan Ye
Jing Zhang
Shanshan Zhao
Juhua Liu
Tongliang Liu
Bo Du
Dacheng Tao
484
109
0
19 Nov 2022
1
Page 1 of 1