2505.14381
Cited By
SCAN: Semantic Document Layout Analysis for Textual and Visual Retrieval-Augmented Generation
20 May 2025
Yuyang Dong
Nobuhiro Ueda
Krisztián Boros
Daiki Ito
Takuya Sera
Masafumi Oyamada
VLM
Papers citing "SCAN: Semantic Document Layout Analysis for Textual and Visual Retrieval-Augmented Generation"
4 / 4 papers shown
Title
VDocRAG: Retrieval-Augmented Generation over Visually-Rich Documents
Ryota Tanaka
Taichi Iki
Taku Hasegawa
Kyosuke Nishida
Kuniko Saito
Jun Suzuki
VLM
122
6
0
14 Apr 2025
olmOCR: Unlocking Trillions of Tokens in PDFs with Vision Language Models
Jake Poznanski
Aman Rangapur
Jon Borchardt
Jason Dunkelberger
Regan Huff
Daniel Lin
Christopher Wilhelm
Kyle Lo
Luca Soldaini
192
7
0
25 Feb 2025
OCRBench v2: An Improved Benchmark for Evaluating Large Multimodal Models on Visual Text Localization and Reasoning
Ling Fu
Biao Yang
Zhebin Kuang
Jiajun Song
Yuzhe Li
...
Jingqun Tang
Wei Chen
Lianwen Jin
Yunxing Liu
Xiang Bai
131
22
0
31 Dec 2024
VisRAG: Vision-based Retrieval-augmented Generation on Multi-modality Documents
S. Yu
C. Tang
Bokai Xu
Junbo Cui
Junhao Ran
...
Zhenghao Liu
Shuo Wang
Xu Han
Zhiyuan Liu
Maosong Sun
VLM
212
39
0
14 Oct 2024