Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2404.19024
Cited By
Multi-Page Document Visual Question Answering using Self-Attention Scoring Mechanism
29 April 2024
Lei Kang
Rubèn Pérez Tito
Ernest Valveny
Dimosthenis Karatzas
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Multi-Page Document Visual Question Answering using Self-Attention Scoring Mechanism"
2 / 2 papers shown
Title
Pix2Struct: Screenshot Parsing as Pretraining for Visual Language Understanding
Kenton Lee
Mandar Joshi
Iulia Turc
Hexiang Hu
Fangyu Liu
Julian Martin Eisenschlos
Urvashi Khandelwal
Peter Shaw
Ming-Wei Chang
Kristina Toutanova
CLIP
VLM
158
262
0
07 Oct 2022
Big Bird: Transformers for Longer Sequences
Manzil Zaheer
Guru Guruganesh
Kumar Avinava Dubey
Joshua Ainslie
Chris Alberti
...
Philip Pham
Anirudh Ravula
Qifan Wang
Li Yang
Amr Ahmed
VLM
249
2,009
0
28 Jul 2020
1