ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2008.00714
  4. Cited By
AE TextSpotter: Learning Visual and Linguistic Representation for
  Ambiguous Text Spotting
v1v2v3v4v5 (latest)

AE TextSpotter: Learning Visual and Linguistic Representation for Ambiguous Text Spotting

3 August 2020
Wenhai Wang
Xuebo Liu
Xiaozhong Ji
Enze Xie
Ding Liang
Zhibo Yang
Tong Lu
Chunhua Shen
Ping Luo
ArXiv (abs)PDFHTMLGithub (68★)

Papers citing "AE TextSpotter: Learning Visual and Linguistic Representation for Ambiguous Text Spotting"

13 / 13 papers shown
Title
When Semantics Mislead Vision: Mitigating Large Multimodal Models Hallucinations in Scene Text Spotting and Understanding
When Semantics Mislead Vision: Mitigating Large Multimodal Models Hallucinations in Scene Text Spotting and Understanding
Yan Shu
Hangui Lin
Yexin Liu
Yan Zhang
Gangyan Zeng
Yan Li
Yu Zhou
Ser-Nam Lim
Harry Yang
N. Sebe
MLLMVLM
79
0
0
05 Jun 2025
HIP: Hierarchical Point Modeling and Pre-training for Visual Information
  Extraction
HIP: Hierarchical Point Modeling and Pre-training for Visual Information Extraction
Rujiao Long
Pengfei Wang
Zhibo Yang
Cong Yao
79
0
0
02 Nov 2024
TextBlockV2: Towards Precise-Detection-Free Scene Text Spotting with
  Pre-trained Language Model
TextBlockV2: Towards Precise-Detection-Free Scene Text Spotting with Pre-trained Language Model
Jiahao Lyu
Jin Wei
Gangyan Zeng
Zeng Li
Enze Xie
Wei Wang
Yu Zhou
VLM
87
3
0
15 Mar 2024
SwinTextSpotter v2: Towards Better Synergy for Scene Text Spotting
SwinTextSpotter v2: Towards Better Synergy for Scene Text Spotting
Mingxin Huang
Dezhi Peng
Hongliang Li
Zhenghao Peng
Chongyu Liu
Dahua Lin
Yuliang Liu
Xiang Bai
Lianwen Jin
194
1
0
15 Jan 2024
Harnessing the Power of Multi-Lingual Datasets for Pre-training: Towards
  Enhancing Text Spotting Performance
Harnessing the Power of Multi-Lingual Datasets for Pre-training: Towards Enhancing Text Spotting Performance
Alloy Das
Sanket Biswas
Ayan Banerjee
Josep Lladós
Umapada Pal
Saumik Bhattacharya
117
3
0
02 Oct 2023
Deformation Robust Text Spotting with Geometric Prior
Deformation Robust Text Spotting with Geometric Prior
Xixuan Hao
Aozhong Zhang
Xianze Meng
Bin Fu
104
0
0
31 Aug 2023
MPDIoU: A Loss for Efficient and Accurate Bounding Box Regression
MPDIoU: A Loss for Efficient and Accurate Bounding Box Regression
Siliang Ma
Yong Xu
75
226
0
14 Jul 2023
TextFormer: A Query-based End-to-End Text Spotter with Mixed Supervision
TextFormer: A Query-based End-to-End Text Spotter with Mixed Supervision
Yukun Zhai
Xiaoqiang Zhang
Xiameng Qin
Sanyuan Zhao
Xingping Dong
Jianbing Shen
92
4
0
06 Jun 2023
DeepSolo++: Let Transformer Decoder with Explicit Points Solo for
  Multilingual Text Spotting
DeepSolo++: Let Transformer Decoder with Explicit Points Solo for Multilingual Text Spotting
Maoyuan Ye
Jing Zhang
Shanshan Zhao
Juhua Liu
Tongliang Liu
Bo Du
Dacheng Tao
73
3
0
31 May 2023
SwinTextSpotter: Scene Text Spotting via Better Synergy between Text
  Detection and Text Recognition
SwinTextSpotter: Scene Text Spotting via Better Synergy between Text Detection and Text Recognition
Mingxin Huang
Yuliang Liu
Zhenghao Peng
Chongyu Liu
Dahua Lin
Shenggao Zhu
N. Yuan
Kai Ding
Lianwen Jin
ViT
85
103
0
19 Mar 2022
GroupLink: An End-to-end Multitask Method for Word Grouping and Relation
  Extraction in Form Understanding
GroupLink: An End-to-end Multitask Method for Word Grouping and Relation Extraction in Form Understanding
Zilong Wang
Mingjie Zhan
Houxing Ren
Zhaohui Hou
Yuwei Wu
Xingyan Zhang
Ding Liang
37
1
0
10 May 2021
Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction
  without Convolutions
Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction without Convolutions
Wenhai Wang
Enze Xie
Xiang Li
Deng-Ping Fan
Kaitao Song
Ding Liang
Tong Lu
Ping Luo
Ling Shao
ViT
612
3,761
0
24 Feb 2021
Synthetic-to-Real Unsupervised Domain Adaptation for Scene Text
  Detection in the Wild
Synthetic-to-Real Unsupervised Domain Adaptation for Scene Text Detection in the Wild
Weijia Wu
Ning Lu
Enze Xie
83
22
0
03 Sep 2020
1