2110.00061
PubTables-1M: Towards comprehensive table extraction from unstructured documents
30 September 2021
B. Smock
Rohith Pesala
Robin Abraham
LMTD
HuggingFace (2 upvotes)
Github (2612★)
Papers citing
"PubTables-1M: Towards comprehensive table extraction from unstructured documents"
50 / 72 papers shown
Title
NVIDIA Nemotron Parse 1.1
Kateryna Chumachenko
Amala Sanjay Deshmukh
Jarno Seppänen
Ilia Karmanov
Chia-Chih Chen
...
Sandip Bhaskar
Timo Roman
Karan Sapra
Andrew Tao
Bryan Catanzaro
246
0
0
25 Nov 2025
Hierarchical structure understanding in complex tables with VLLMs: a benchmark and experiments
Luca Bindini
Simone Giovannini
S. Marinai
Valeria Nardoni
Kimiya Noor Ali
LMTD
88
0
0
11 Nov 2025
NVIDIA Nemotron Nano V2 VL
Nvidia
Amala Sanjay Deshmukh
Kateryna Chumachenko
Tuomas Rintamaki
Matthieu Le
...
Krzysztof Pawelec
Michael Evans
Katherine Luna
Jie Lou
Erick Galinkin
VLM
232
1
0
06 Nov 2025
GranViT: A Fine-Grained Vision Model With Autoregressive Perception For MLLMs
Guanghao Zheng
Bowen Shi
Mingxing Xu
Ruoyu Sun
Peisen Zhao
...
Wenrui Dai
Junni Zou
Hongkai Xiong
Xiaopeng Zhang
Qi Tian
VLM
103
0
0
23 Oct 2025
Exploring OCR-augmented Generation for Bilingual VQA
JoonHo Lee
Sunho Park
VLM
76
0
0
02 Oct 2025
SCORE: A Semantic Evaluation Framework for Generative Document Parsing
Renyu Li
Antonio Jimeno-Yepes
Yao You
Kamil Pluciński
Maximilian Operlejn
Crag Wolfe
57
1
0
16 Sep 2025
MindVL: Towards Efficient and Effective Training of Multimodal Large Language Models on Ascend NPUs
Feilong Chen
Y. Liu
Yi Huang
Hao Wang
Miren Tian
Ya-Qi Yu
Minghui Liao
Jihao Wu
MLLM
VLM
227
1
0
15 Sep 2025
Granite Embedding R2 Models
Parul Awasthy
Aashka Trivedi
Yulong Li
Meet Doshi
Riyaz Ahmad Bhat
...
Salim Roukos
David D. Cox
Luis A. Lastras
Jaydeep Sen
Radu Florian
AI4TS
120
4
0
26 Aug 2025
Extracting Information from Scientific Literature via Visual Table Question Answering Models
Dongyoun Kim
Hyung-do Choi
Youngsun Jang
John Kim
LMTD
48
0
0
26 Aug 2025
From Surface to Semantics: Semantic Structure Parsing for Table-Centric Document Analysis
Xuan Li
Jialiang Dong
Raymond Wong
LMTD
91
0
0
14 Aug 2025
Synthetic Data Augmentation for Table Detection: Re-evaluating TableNet's Performance with Automatically Generated Document Images
Automation, Control, and Information Technology (ACIT), 2025
Krishna Sahukara
Zineddine Bettouche
Andreas Fischer
LMTD
ViT
109
0
0
17 Jun 2025
Atomic Reasoning for Scientific Table Claim Verification
Yuji Zhang
Qingyun Wang
Cheng Qian
Jiateng Liu
Chenkai Sun
Denghui Zhang
Tarek Abdelzaher
Chengxiang Zhai
Preslav Nakov
Heng Ji
LMTD
LRM
151
5
0
08 Jun 2025
FS-DAG: Few Shot Domain Adapting Graph Networks for Visually Rich Document Understanding
International Conference on Computational Linguistics (COLING), 2025
Amit Agarwal
Srikant Panda
Kulbhushan Pachauri
119
12
0
22 May 2025
Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Hao Feng
Shu Wei
Xiang Fei
Wei Shi
Yingdong Han
...
Qi Liu
Chunhui Lin
Jingqun Tang
Hao Liu
Can Huang
251
15
0
20 May 2025
TableCenterNet: A one-stage network for table structure recognition
Anyi Xiao
Cihui Yang
LMTD
246
0
0
24 Apr 2025
Title block detection and information extraction for enhanced building drawings search
Alessio Lombardi
Li Duan
Ahmed Elnagar
Ahmed Zaalouk
Khalid Ismail
Edlira Vakaj
166
0
0
11 Apr 2025
DocSAM: Unified Document Image Segmentation via Query Decomposition and Heterogeneous Mixed Learning
Computer Vision and Pattern Recognition (CVPR), 2025
Xiao-Hui Li
Fei Yin
Cheng-Lin Liu
241
2
0
05 Apr 2025
AnnoPage Dataset: Dataset of Non-Textual Elements in Documents with Fine-Grained Categorization
Martin Kiss
Michal Hradiš
Martina Dvořáková
Václav Jiroušek
Filip Kersch
262
1
0
28 Mar 2025
SPRINT: Script-agnostic Structure Recognition in Tables
IEEE International Conference on Document Analysis and Recognition (ICDAR), 2025
Dhruv Kudale
Badri Vishal Kasuba
Venkatapathy Subramanian
P. Chaudhuri
Ganesh Ramakrishnan
LMTD
258
2
0
15 Mar 2025
SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion
A. Nassar
Andres Marafioti
Matteo Omenetti
Maksym Lysak
Nikolaos Livathinos
...
Yusik Kim
A. Said Gurbuz
Michele Dolfi
Miquel Farré
Peter W. J. Staar
226
26
0
14 Mar 2025
HCT-QA: A Benchmark for Question Answering on Human-Centric Tables
M. Ahmad
Zan Naeem
Michaël Aupetit
A. Elmagarmid
M. Eltabakh
Xiasong Ma
M. Ouzzani
Chaoyi Ruan
LMTD
921
1
0
09 Mar 2025
OmniParser V2: Structured-Points-of-Thought for Unified Visual Text Parsing and Its Generality to Multimodal Large Language Models
Wenwen Yu
Zhibo Yang
Jianqiang Wan
Sibo Song
J. Tang
Wenqing Cheng
Yunxing Liu
Xiang Bai
279
12
0
22 Feb 2025
Granite Vision: a lightweight, open-source multimodal model for enterprise Intelligence
Granite Vision Team
Leonid Karlinsky
Assaf Arbelle
Abraham Daniels
A. Nassar
...
Sriram Raghavan
Tanveer Syeda-Mahmood
Peter W. J. Staar
Tal Drory
Rogerio Feris
VLM
AI4TS
382
11
0
14 Feb 2025
MMDocBench: Benchmarking Large Vision-Language Models for Fine-Grained Visual Document Understanding
Fengbin Zhu
Ziyang Liu
Xiang Yao Ng
Haohui Wu
Wenjie Wang
Fuli Feng
Chao Wang
Huanbo Luan
Tat-Seng Chua
VLM
177
10
0
25 Oct 2024
See then Tell: Enhancing Key Information Extraction with Vision Grounding
Shuhang Liu
Zhenrong Zhang
Pengfei Hu
Jiefeng Ma
Jun Du
Qing Wang
Jianshu Zhang
Chenyu Liu
206
1
0
29 Sep 2024
UniTabNet: Bridging Vision and Language Models for Enhanced Table Structure Recognition
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Zhenrong Zhang
Shuhang Liu
Pengfei Hu
Jiefeng Ma
Jun Du
Jianshu Zhang
Yu Hu
LMTD
206
4
0
20 Sep 2024
PdfTable: A Unified Toolkit for Deep Learning-Based Table Extraction
Lei Sheng
Shuai-Shuai Xu
LMTD
153
0
0
08 Sep 2024
READoc: A Unified Benchmark for Realistic Document Structured Extraction
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Zichao Li
Aizier Abulaiti
Yaojie Lu
Xuanang Chen
Jia Zheng
Hongyu Lin
Xianpei Han
Le Sun
319
6
0
08 Sep 2024
Latent Diffusion for Guided Document Table Generation
IEEE International Conference on Document Analysis and Recognition (ICDAR), 2024
Syed Jawwad Haider Hamdani
S. Saifullah
S. Agne
Andreas Dengel
Sheraz Ahmed
146
2
0
19 Aug 2024
HySem: A context length optimized LLM pipeline for unstructured tabular extraction
Narayanan PP
A. P. N. Iyer
210
1
0
18 Aug 2024
UDA: A Benchmark Suite for Retrieval Augmented Generation in Real-world Document Analysis
Yulong Hui
Yao Lu
Huanchen Zhang
RALM
241
33
0
21 Jun 2024
Efficient Prompting for LLM-based Generative Internet of Things
IEEE Internet of Things Journal (IEEE IoT J.), 2024
Bin Xiao
B. Kantarci
Jiawen Kang
Dusit Niyato
Mohsen Guizani
247
41
0
14 Jun 2024
TabPedia: Towards Comprehensive Visual Table Understanding with Concept Synergy
Weichao Zhao
Hao Feng
Qi Liu
Jingqun Tang
Shubo Wei
...
Lei Liao
Yongjie Ye
Hao Liu
Houqiang Li
Can Huang
LMTD
229
45
0
03 Jun 2024
SEMv3: A Fast and Robust Approach to Table Separation Line Detection
Chunxia Qin
Zhenrong Zhang
Pengfei Hu
Chenyu Liu
Jie Ma
Jun Du
LMTD
200
8
0
20 May 2024
End-to-End Semi-Supervised approach with Modulated Object Queries for Table Detection in Documents
Iqraa Ehsan
Tahira Shehzadi
Didier Stricker
Muhammad Zeshan Afzal
LMTD
138
8
0
08 May 2024
Towards End-to-End Semi-Supervised Table Detection with Semantic Aligned Matching Transformer
Tahira Shehzadi
Shalini Sarode
Didier Stricker
Muhammad Zeshan Afzal
LMTD
281
5
0
30 Apr 2024
The Socface Project: Large-Scale Collection, Processing, and Analysis of a Century of French Censuses
Mélodie Boillet
Solène Tarride
Manon Blanco
Valentin Rigal
Yoann Schneider
Bastien Abadie
Lionel Kesztenbaum
Christopher Kermorvant
166
6
0
29 Apr 2024
A Hybrid Approach for Document Layout Analysis in Document images
Tahira Shehzadi
Didier Stricker
Muhammad Zeshan Afzal
182
11
0
27 Apr 2024
Synthesizing Realistic Data for Table Recognition
Qiyu Hou
Jun Wang
Meixuan Qiao
Lujun Tian
LMTD
167
2
0
17 Apr 2024
JDocQA: Japanese Document Question Answering Dataset for Generative Language Models
Eri Onami
Shuhei Kurita
Taiki Miyanishi
Taro Watanabe
200
8
0
28 Mar 2024
UniTable: Towards a Unified Framework for Table Recognition via Self-Supervised Pretraining
Sheng-Hsuan Peng
Aishwarya Chakravarthy
Seongmin Lee
Xiaojing Wang
Rajarajeswari Balasubramaniyan
Duen Horng Chau
LMTD
189
4
0
07 Mar 2024
Enhancing Vision-Language Pre-training with Rich Supervisions
Yuan Gao
Kunyu Shi
Pengkai Zhu
Edouard Belval
Oren Nuriel
Srikar Appalaraju
Shabnam Ghadar
Vijay Mahadevan
Zhuowen Tu
Stefano Soatto
VLM
CLIP
326
15
0
05 Mar 2024
ClusterTabNet: Supervised clustering method for table detection and table structure recognition
IEEE International Conference on Document Analysis and Recognition (ICDAR), 2024
Marek Polewczyk
Marco Spinaci
LMTD
186
2
0
12 Feb 2024
LORE++: Logical Location Regression Network for Table Structure Recognition with Pre-training
Rujiao Long
Hangdi Xing
Zhibo Yang
Qi Zheng
Zhi Yu
Cong Yao
Fei Huang
172
11
0
03 Jan 2024
TDeLTA: A Light-weight and Robust Table Detection Method based on Learning Text Arrangement
Yang Fan
Xiangping Wu
Qingcai Chen
Heng Li
Yan Huang
Zhixiang Cai
Qitian Wu
LMTD
144
0
0
18 Dec 2023
Object Recognition from Scientific Document based on Compartment Refinement Framework
SN Computer Science (SCS), 2023
Jinghong Li
Wen Gu
Koichi Ota
Shinobu Hasegawa
206
4
0
14 Dec 2023
Rethinking Detection Based Table Structure Recognition for Visually Rich Document Images
Bin Xiao
Murat Simsek
B. Kantarci
Ala Abu Alkheir
LMTD
248
1
0
01 Dec 2023
DSG: An End-to-End Document Structure Generator
Johannes Rausch
Gentiana Rashiti
Maxim Gusev
Ce Zhang
Stefan Feuerriegel
180
4
0
13 Oct 2023
Unveiling Document Structures with YOLOv5 Layout Detection
Herman Sugiharto
Yorissa Silviana
Langa Khumalo
123
1
0
29 Sep 2023
Bridging the Performance Gap between DETR and R-CNN for Graphical Object Detection in Document Images
Tahira Shehzadi
K. Hashmi
D. Stricker
Marcus Liwicki
Muhammad Zeshan Afzal
190
8
0
23 Jun 2023