ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2205.03873
  4. Cited By
Multimodal Semi-Supervised Learning for Text Recognition

Multimodal Semi-Supervised Learning for Text Recognition

8 May 2022
Aviad Aberdam
Roy Ganz
Shai Mazor
Ron Litman
    VLM
ArXivPDFHTML

Papers citing "Multimodal Semi-Supervised Learning for Text Recognition"

17 / 17 papers shown
Title
Masked Self-Supervised Pre-Training for Text Recognition Transformers on Large-Scale Datasets
Masked Self-Supervised Pre-Training for Text Recognition Transformers on Large-Scale Datasets
Martin Kiss
Michal Hradiš
26
0
0
28 Mar 2025
TAP-VL: Text Layout-Aware Pre-training for Enriched Vision-Language
  Models
TAP-VL: Text Layout-Aware Pre-training for Enriched Vision-Language Models
Jonathan Fhima
Elad Ben Avraham
Oren Nuriel
Yair Kittenplon
Roy Ganz
Aviad Aberdam
Ron Litman
VLM
26
1
0
07 Nov 2024
Self-Supervised Learning for Text Recognition: A Critical Survey
Self-Supervised Learning for Text Recognition: A Critical Survey
Carlos Peñarrubia
J. J. Valero-Mas
Jorge Calvo-Zaragoza
69
1
0
29 Jul 2024
Self-supervised Pre-training of Text Recognizers
Self-supervised Pre-training of Text Recognizers
M. Kišš
Michal Hradiš
SSL
29
0
0
01 May 2024
Question Aware Vision Transformer for Multimodal Reasoning
Question Aware Vision Transformer for Multimodal Reasoning
Roy Ganz
Yair Kittenplon
Aviad Aberdam
Elad Ben Avraham
Oren Nuriel
Shai Mazor
Ron Litman
24
8
0
08 Feb 2024
GRAM: Global Reasoning for Multi-Page VQA
GRAM: Global Reasoning for Multi-Page VQA
Tsachi Blau
Sharon Fogel
Roi Ronen
Alona Golts
Roy Ganz
Elad Ben Avraham
Aviad Aberdam
Shahar Tsiper
Ron Litman
16
12
0
07 Jan 2024
CLIPAG: Towards Generator-Free Text-to-Image Generation
CLIPAG: Towards Generator-Free Text-to-Image Generation
Roy Ganz
Michael Elad
VLM
15
7
0
29 Jun 2023
FuseCap: Leveraging Large Language Models for Enriched Fused Image
  Captions
FuseCap: Leveraging Large Language Models for Enriched Fused Image Captions
Noam Rotstein
David Bensaid
Shaked Brody
Roy Ganz
Ron Kimmel
VLM
11
26
0
28 May 2023
Fine-tuning Is a Surprisingly Effective Domain Adaptation Baseline in Handwriting Recognition
Fine-tuning Is a Surprisingly Effective Domain Adaptation Baseline in Handwriting Recognition
Jan Kohút
Michal Hradiš
47
7
0
13 Feb 2023
CLIPTER: Looking at the Bigger Picture in Scene Text Recognition
CLIPTER: Looking at the Bigger Picture in Scene Text Recognition
Aviad Aberdam
David Bensaid
Alona Golts
Roy Ganz
Oren Nuriel
Royee Tichauer
Shai Mazor
Ron Litman
VLM
CLIP
19
11
0
18 Jan 2023
Towards Models that Can See and Read
Towards Models that Can See and Read
Roy Ganz
Oren Nuriel
Aviad Aberdam
Yair Kittenplon
Shai Mazor
Ron Litman
14
13
0
18 Jan 2023
SoftCTC -- Semi-Supervised Learning for Text Recognition using Soft
  Pseudo-Labels
SoftCTC -- Semi-Supervised Learning for Text Recognition using Soft Pseudo-Labels
M. Kišš
Michal Hradiš
Karel Beneš
Petr Buchal
Michal Kula
50
4
0
05 Dec 2022
Masked Vision-Language Transformers for Scene Text Recognition
Masked Vision-Language Transformers for Scene Text Recognition
Jie Wu
Ying Peng
Shenmin Zhang
Weigang Qi
Jian Andrew Zhang
24
3
0
09 Nov 2022
Out-of-Vocabulary Challenge Report
Out-of-Vocabulary Challenge Report
Sergi Garcia-Bordils
Andrés Mafla
Ali Furkan Biten
Oren Nuriel
Aviad Aberdam
Shai Mazor
Ron Litman
Dimosthenis Karatzas
9
16
0
14 Sep 2022
TextAdaIN: Paying Attention to Shortcut Learning in Text Recognizers
TextAdaIN: Paying Attention to Shortcut Learning in Text Recognizers
Oren Nuriel
Sharon Fogel
Ron Litman
14
8
0
09 May 2021
Mean teachers are better role models: Weight-averaged consistency
  targets improve semi-supervised deep learning results
Mean teachers are better role models: Weight-averaged consistency targets improve semi-supervised deep learning results
Antti Tarvainen
Harri Valpola
OOD
MoMe
244
1,279
0
06 Mar 2017
COCO-Text: Dataset and Benchmark for Text Detection and Recognition in
  Natural Images
COCO-Text: Dataset and Benchmark for Text Detection and Recognition in Natural Images
Andreas Veit
Tomas Matera
Lukás Neumann
Jirí Matas
Serge J. Belongie
172
458
0
26 Jan 2016
1