"What is the value of {templates}?" Rethinking Document Information
Extraction Datasets for LLMs. Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024.
Modeling Layout Reading Order as Ordering Relations for Visually-rich
Document Understanding. Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024.
DocLLM: A layout-aware generative language model for multimodal document
understanding. Annual Meeting of the Association for Computational Linguistics (ACL), 2023.
LMDX: Language Model-based Document Information Extraction and
Localization. Annual Meeting of the Association for Computational Linguistics (ACL), 2023.