Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2303.13095
Cited By
Modeling Entities as Semantic Points for Visual Information Extraction in the Wild
23 March 2023
Zhibo Yang
Rujiao Long
Pengfei Wang
Sibo Song
Humen Zhong
Wenqing Cheng
X. Bai
Cong Yao
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Modeling Entities as Semantic Points for Visual Information Extraction in the Wild"
15 / 15 papers shown
Title
A Simple yet Effective Layout Token in Large Language Models for Document Understanding
Zhaoqing Zhu
Chuwei Luo
Zirui Shao
Feiyu Gao
Hangdi Xing
Qi Zheng
Ji Zhang
50
0
0
24 Mar 2025
OmniParser V2: Structured-Points-of-Thought for Unified Visual Text Parsing and Its Generality to Multimodal Large Language Models
Wenwen Yu
Zhibo Yang
Jianqiang Wan
Sibo Song
J. Tang
Wenqing Cheng
Y. Liu
Xiang Bai
43
1
0
22 Feb 2025
HIP: Hierarchical Point Modeling and Pre-training for Visual Information Extraction
Rujiao Long
Pengfei Wang
Zhibo Yang
Cong Yao
24
0
0
02 Nov 2024
Modeling Layout Reading Order as Ordering Relations for Visually-rich Document Understanding
Chong Zhang
Yi Tu
Yixi Zhao
Chenshu Yuan
Huan Chen
...
Mingxu Chai
Ya Guo
Huijia Zhu
Qi Zhang
Tao Gui
31
2
0
29 Sep 2024
UNER: A Unified Prediction Head for Named Entity Recognition in Visually-rich Documents
Yi Tu
Chong Zhang
Ya Guo
Huan Chen
Jinyang Tang
Huijia Zhu
Qi Zhang
30
3
0
02 Aug 2024
SRFUND: A Multi-Granularity Hierarchical Structure Reconstruction Benchmark in Form Understanding
Jiefeng Ma
Yan Wang
Chenyu Liu
Jun Du
Yu Hu
Zhenrong Zhang
Pengfei Hu
Qing Wang
Jianshu Zhang
29
0
0
13 Jun 2024
StrucTexTv3: An Efficient Vision-Language Model for Text-rich Image Perception, Comprehension, and Beyond
Pengyuan Lyu
Yulin Li
Hao Zhou
Weihong Ma
Xingyu Wan
...
Liang Wu
Chengquan Zhang
Kun Yao
Errui Ding
Jingdong Wang
36
7
0
31 May 2024
OmniParser: A Unified Framework for Text Spotting, Key Information Extraction and Table Recognition
Jianqiang Wan
Sibo Song
Wenwen Yu
Yuliang Liu
Wenqing Cheng
Fei Huang
Xiang Bai
Cong Yao
Zhibo Yang
37
26
0
28 Mar 2024
UniVIE: A Unified Label Space Approach to Visual Information Extraction from Form-like Documents
Kai Hu
Jiawei Wang
Weihong Lin
Zhuoyao Zhong
Lei-huan Sun
Qiang Huo
16
1
0
17 Jan 2024
PEneo: Unifying Line Extraction, Line Grouping, and Entity Linking for End-to-end Document Pair Extraction
Zening Lin
Jiapeng Wang
Teng Li
Wenhui Liao
Dayi Huang
Longfei Xiong
Lianwen Jin
14
2
0
07 Jan 2024
LORE++: Logical Location Regression Network for Table Structure Recognition with Pre-training
Rujiao Long
Hangdi Xing
Zhibo Yang
Qi Zheng
Zhi Yu
Cong Yao
Fei Huang
17
4
0
03 Jan 2024
Vision Grid Transformer for Document Layout Analysis
Cheng Da
Chuwei Luo
Qi Zheng
Cong Yao
ViT
16
27
0
29 Aug 2023
LORE: Logical Location Regression Network for Table Structure Recognition
Hangdi Xing
Feiyu Gao
Rujiao Long
Jiajun Bu
Qi Zheng
Liangcheng Li
Cong Yao
Zhi Yu
LMTD
25
19
0
07 Mar 2023
LayoutLMv2: Multi-modal Pre-training for Visually-Rich Document Understanding
Yang Xu
Yiheng Xu
Tengchao Lv
Lei Cui
Furu Wei
...
D. Florêncio
Cha Zhang
Wanxiang Che
Min Zhang
Lidong Zhou
ViT
MLLM
142
492
0
29 Dec 2020
FUNSD: A Dataset for Form Understanding in Noisy Scanned Documents
Guillaume Jaume
H. K. Ekenel
Jean-Philippe Thiran
115
353
0
27 May 2019
1