DocSAM: Unified Document Image Segmentation via Query Decomposition and Heterogeneous Mixed Learning. Computer Vision and Pattern Recognition (CVPR), 2025 |
SFDLA: Source-Free Document Layout Analysis. IEEE International Conference on Document Analysis and Recognition (ICDAR), 2025 |
A Simple yet Effective Layout Token in Large Language Models for Document Understanding. Computer Vision and Pattern Recognition (CVPR), 2025 |
Object Recognition from Scientific Document based on Compartment Refinement Framework. SN Computer Science (SCS), 2023 |