Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2205.03873
Cited By
Multimodal Semi-Supervised Learning for Text Recognition
8 May 2022
Aviad Aberdam
Roy Ganz
Shai Mazor
Ron Litman
VLM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Multimodal Semi-Supervised Learning for Text Recognition"
17 / 17 papers shown
Title
Masked Self-Supervised Pre-Training for Text Recognition Transformers on Large-Scale Datasets
Martin Kiss
Michal Hradiš
34
0
0
28 Mar 2025
TAP-VL: Text Layout-Aware Pre-training for Enriched Vision-Language Models
Jonathan Fhima
Elad Ben Avraham
Oren Nuriel
Yair Kittenplon
Roy Ganz
Aviad Aberdam
Ron Litman
VLM
26
1
0
07 Nov 2024
Self-Supervised Learning for Text Recognition: A Critical Survey
Carlos Peñarrubia
J. J. Valero-Mas
Jorge Calvo-Zaragoza
69
1
0
29 Jul 2024
Self-supervised Pre-training of Text Recognizers
M. Kišš
Michal Hradiš
SSL
29
0
0
01 May 2024
Question Aware Vision Transformer for Multimodal Reasoning
Roy Ganz
Yair Kittenplon
Aviad Aberdam
Elad Ben Avraham
Oren Nuriel
Shai Mazor
Ron Litman
24
20
0
08 Feb 2024
GRAM: Global Reasoning for Multi-Page VQA
Tsachi Blau
Sharon Fogel
Roi Ronen
Alona Golts
Roy Ganz
Elad Ben Avraham
Aviad Aberdam
Shahar Tsiper
Ron Litman
16
12
0
07 Jan 2024
CLIPAG: Towards Generator-Free Text-to-Image Generation
Roy Ganz
Michael Elad
VLM
18
7
0
29 Jun 2023
FuseCap: Leveraging Large Language Models for Enriched Fused Image Captions
Noam Rotstein
David Bensaid
Shaked Brody
Roy Ganz
Ron Kimmel
VLM
11
26
0
28 May 2023
Fine-tuning Is a Surprisingly Effective Domain Adaptation Baseline in Handwriting Recognition
Jan Kohút
Michal Hradiš
50
7
0
13 Feb 2023
CLIPTER: Looking at the Bigger Picture in Scene Text Recognition
Aviad Aberdam
David Bensaid
Alona Golts
Roy Ganz
Oren Nuriel
Royee Tichauer
Shai Mazor
Ron Litman
VLM
CLIP
19
11
0
18 Jan 2023
Towards Models that Can See and Read
Roy Ganz
Oren Nuriel
Aviad Aberdam
Yair Kittenplon
Shai Mazor
Ron Litman
14
13
0
18 Jan 2023
SoftCTC -- Semi-Supervised Learning for Text Recognition using Soft Pseudo-Labels
M. Kišš
Michal Hradiš
Karel Beneš
Petr Buchal
Michal Kula
50
4
0
05 Dec 2022
Masked Vision-Language Transformers for Scene Text Recognition
Jie Wu
Ying Peng
Shenmin Zhang
Weigang Qi
Jian Andrew Zhang
27
3
0
09 Nov 2022
Out-of-Vocabulary Challenge Report
Sergi Garcia-Bordils
Andrés Mafla
Ali Furkan Biten
Oren Nuriel
Aviad Aberdam
Shai Mazor
Ron Litman
Dimosthenis Karatzas
9
16
0
14 Sep 2022
TextAdaIN: Paying Attention to Shortcut Learning in Text Recognizers
Oren Nuriel
Sharon Fogel
Ron Litman
16
9
0
09 May 2021
Mean teachers are better role models: Weight-averaged consistency targets improve semi-supervised deep learning results
Antti Tarvainen
Harri Valpola
OOD
MoMe
244
1,279
0
06 Mar 2017
COCO-Text: Dataset and Benchmark for Text Detection and Recognition in Natural Images
Andreas Veit
Tomas Matera
Lukás Neumann
Jirí Matas
Serge J. Belongie
175
515
0
26 Jan 2016
1