2008.00714
Cited By
AE TextSpotter: Learning Visual and Linguistic Representation for Ambiguous Text Spotting
3 August 2020
Wenhai Wang
Xuebo Liu
Xiaozhong Ji
Enze Xie
Ding Liang
Zhibo Yang
Tong Lu
Chunhua Shen
Ping Luo
Github (68★)
Papers citing "AE TextSpotter: Learning Visual and Linguistic Representation for Ambiguous Text Spotting"
13 / 13 papers shown
When Semantics Mislead Vision: Mitigating Large Multimodal Models Hallucinations in Scene Text Spotting and Understanding
Yan Shu
Hangui Lin
Yexin Liu
Yan Zhang
Gangyan Zeng
Yan Li
Yu Zhou
Ser-Nam Lim
Harry Yang
N. Sebe
MLLM
VLM
79
0
0
05 Jun 2025
HIP: Hierarchical Point Modeling and Pre-training for Visual Information Extraction
Rujiao Long
Pengfei Wang
Zhibo Yang
Cong Yao
79
0
0
02 Nov 2024
TextBlockV2: Towards Precise-Detection-Free Scene Text Spotting with Pre-trained Language Model
Jiahao Lyu
Jin Wei
Gangyan Zeng
Zeng Li
Enze Xie
Wei Wang
Yu Zhou
VLM
87
3
0
15 Mar 2024
SwinTextSpotter v2: Towards Better Synergy for Scene Text Spotting
Mingxin Huang
Dezhi Peng
Hongliang Li
Zhenghao Peng
Chongyu Liu
Dahua Lin
Yuliang Liu
Xiang Bai
Lianwen Jin
196
1
0
15 Jan 2024
Harnessing the Power of Multi-Lingual Datasets for Pre-training: Towards Enhancing Text Spotting Performance
Alloy Das
Sanket Biswas
Ayan Banerjee
Josep Lladós
Umapada Pal
Saumik Bhattacharya
117
3
0
02 Oct 2023
Deformation Robust Text Spotting with Geometric Prior
Xixuan Hao
Aozhong Zhang
Xianze Meng
Bin Fu
104
0
0
31 Aug 2023
MPDIoU: A Loss for Efficient and Accurate Bounding Box Regression
Siliang Ma
Yong Xu
75
226
0
14 Jul 2023
TextFormer: A Query-based End-to-End Text Spotter with Mixed Supervision
Yukun Zhai
Xiaoqiang Zhang
Xiameng Qin
Sanyuan Zhao
Xingping Dong
Jianbing Shen
94
4
0
06 Jun 2023
DeepSolo++: Let Transformer Decoder with Explicit Points Solo for Multilingual Text Spotting
Maoyuan Ye
Jing Zhang
Shanshan Zhao
Juhua Liu
Tongliang Liu
Bo Du
Dacheng Tao
73
3
0
31 May 2023
SwinTextSpotter: Scene Text Spotting via Better Synergy between Text Detection and Text Recognition
Mingxin Huang
Yuliang Liu
Zhenghao Peng
Chongyu Liu
Dahua Lin
Shenggao Zhu
N. Yuan
Kai Ding
Lianwen Jin
ViT
85
103
0
19 Mar 2022
GroupLink: An End-to-end Multitask Method for Word Grouping and Relation Extraction in Form Understanding
Zilong Wang
Mingjie Zhan
Houxing Ren
Zhaohui Hou
Yuwei Wu
Xingyan Zhang
Ding Liang
37
1
0
10 May 2021
Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction without Convolutions
Wenhai Wang
Enze Xie
Xiang Li
Deng-Ping Fan
Kaitao Song
Ding Liang
Tong Lu
Ping Luo
Ling Shao
ViT
614
3,761
0
24 Feb 2021
Synthetic-to-Real Unsupervised Domain Adaptation for Scene Text Detection in the Wild
Weijia Wu
Ning Lu
Enze Xie
86
22
0
03 Sep 2020