CLIPPO: Image-and-Language Understanding from Pixels Only. Computer Vision and Pattern Recognition (CVPR), 2022
Language Modelling with Pixels. International Conference on Learning Representations (ICLR), 2022
Robust Open-Vocabulary Translation from Visual Text Representations. Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021