2405.06636
Cited By
Federated Document Visual Question Answering: A Pilot Study
10 May 2024
Khanh Nguyen
Dimosthenis Karatzas
FedML
Papers citing
"Federated Document Visual Question Answering: A Pilot Study"
3 / 3 papers shown
Pix2Struct: Screenshot Parsing as Pretraining for Visual Language Understanding
Kenton Lee
Mandar Joshi
Iulia Turc
Hexiang Hu
Fangyu Liu
Julian Martin Eisenschlos
Urvashi Khandelwal
Peter Shaw
Ming-Wei Chang
Kristina Toutanova
CLIP
VLM
158
262
0
07 Oct 2022
Downstream Datasets Make Surprisingly Good Pretraining Corpora
Kundan Krishna
Saurabh Garg
Jeffrey P. Bigham
Zachary Chase Lipton
38
30
0
28 Sep 2022
Masked Autoencoders Are Scalable Vision Learners
Kaiming He
Xinlei Chen
Saining Xie
Yanghao Li
Piotr Dollár
Ross B. Girshick
ViT
TPM
258
7,412
0
11 Nov 2021