Image Captioning with Very Scarce Supervised Data: Adversarial Semi-Supervised Learning Approach

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2019
5 September 2019
Dong-Jin Kim
Jinsoo Choi
Tae-Hyun Oh
In So Kweon
SSL, VLM

Papers citing "Image Captioning with Very Scarce Supervised Data: Adversarial Semi-Supervised Learning Approach"

36 / 36 papers shown
How Vision-Language Tasks Benefit from Large Pre-trained Models: A Survey
Yayun Qi
Hongxi Li
Yiqi Song
Xinxiao Wu
Jiebo Luo
LRM, VLM
188
1
0
11 Dec 2024
Preserving Multi-Modal Capabilities of Pre-trained VLMs for Improving Vision-Linguistic Compositionality
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Youngtaek Oh
Jae-Won Cho
Dong-Jin Kim
In So Kweon
Junmo Kim
VLM, CoGe, CLIP
396
14
0
07 Oct 2024
IFCap: Image-like Retrieval and Frequency-based Entity Filtering for Zero-shot Captioning
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Soeun Lee
Si-Woo Kim
Taewhan Kim
Dong-Jin Kim
CLIP, VLM
289
12
0
26 Sep 2024
Surveying the Landscape of Image Captioning Evaluation: A Comprehensive Taxonomy, Trends and Metrics Analysis
Uri Berger
Gabriel Stanovsky
Omri Abend
Lea Frermann
540
0
0
09 Aug 2024
GazeXplain: Learning to Predict Natural Language Explanations of Visual Scanpaths
European Conference on Computer Vision (ECCV), 2024
Xianyu Chen
Ming Jiang
Qi Zhao
294
11
0
05 Aug 2024
The Solution for the CVPR2023 NICE Image Captioning Challenge
Xiangyu Wu
Yi Gao
Hailiang Zhang
Yang Yang
Weili Guo
Jianfeng Lu
347
1
0
10 Oct 2023
Multimodal Data Augmentation for Image Captioning using Diffusion Models
Changrong Xiao
S. Xu
Kunpeng Zhang
DiffM
230
19
0
03 May 2023
Prefix tuning for automated audio captioning
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Minkyu Kim
Kim Sung-Bin
Tae-Hyun Oh
465
55
0
30 Mar 2023
ENInst: Enhancing Weakly-supervised Low-shot Instance Segmentation
Pattern Recognition (Pattern Recogn.), 2023
Moon Ye-Bin
D. Choi
Y. Kwon
Junsik Kim
Tae-Hyun Oh
ISeg
524
8
0
20 Feb 2023
Semi-Supervised Image Captioning by Adversarially Propagating Labeled Data
IEEE Access (IEEE Access), 2023
Dong-Jin Kim
Tae-Hyun Oh
Jinsoo Choi
In So Kweon
SSL, VLM
184
10
0
26 Jan 2023
Signing Outside the Studio: Benchmarking Background Robustness for Continuous Sign Language Recognition
British Machine Vision Conference (BMVC), 2022
Youngjoon Jang
Youngtaek Oh
Jae-Won Cho
Dong-Jin Kim
Joon Son Chung
In So Kweon
SLR
239
18
0
01 Nov 2022
What Should the System Do Next?: Operative Action Captioning for Estimating System Actions
Taiki Nakamura
Seiya Kawano
Akishige Yuguchi
Yasutomo Kawanishi
Koichiro Yoshino
272
0
0
06 Oct 2022
REST: REtrieve & Self-Train for generative action recognition
Adrian Bulat
Enrique Sanchez
Brais Martínez
Georgios Tzimiropoulos
VLM
330
4
0
29 Sep 2022
Paraphrasing Is All You Need for Novel Object Captioning
Neural Information Processing Systems (NeurIPS), 2022
Cheng Yang
Yifan Hao
Wanshu Fan
Ruslan Salakhutdinov
Louis-Philippe Morency
Yu-Chiang Frank Wang
236
6
0
25 Sep 2022
Generative Bias for Robust Visual Question Answering
Computer Vision and Pattern Recognition (CVPR), 2022
Jae-Won Cho
Dong-Jin Kim
H. Ryu
In So Kweon
OOD, CML
454
34
0
01 Aug 2022
Intra-agent speech permits zero-shot task acquisition
Neural Information Processing Systems (NeurIPS), 2022
Chen Yan
Federico Carnevale
Petko Georgiev
Adam Santoro
Aurelia Guy
Alistair Muldal
Chia-Chun Hung
Josh Abramson
Timothy Lillicrap
Greg Wayne
LM&Ro
309
10
0
07 Jun 2022
Prompt-based Learning for Unpaired Image Captioning
IEEE Transactions on Multimedia (IEEE TMM), 2022
Peipei Zhu
Tianlin Li
Lin Zhu
Zhenglong Sun
Weishi Zheng
Yaowei Wang
Chen Chen
VLM
280
50
0
26 May 2022
Unpaired Image Captioning by Image-level Weakly-Supervised Visual Concept Recognition
IEEE Transactions on Multimedia (IEEE TMM), 2022
Peipei Zhu
Tianlin Li
Yong Luo
Zhenglong Sun
Wei-Shi Zheng
Yaowei Wang
Chen Chen
270
16
0
07 Mar 2022
A Fistful of Words: Learning Transferable Visual Models from Bag-of-Words Supervision
Ajinkya Tejankar
Maziar Sanjabi
Bichen Wu
Saining Xie
Madian Khabsa
Hamed Pirsiavash
Hamed Firooz
VLM
349
20
0
27 Dec 2021
ZeroCap: Zero-Shot Image-to-Text Generation for Visual-Semantic Arithmetic
Computer Vision and Pattern Recognition (CVPR), 2021
Yoad Tewel
Yoav Shalev
Idan Schwartz
Lior Wolf
VLM
475
250
0
29 Nov 2021
Single-Modal Entropy based Active Learning for Visual Question Answering
British Machine Vision Conference (BMVC), 2021
Dong-Jin Kim
Jae-Won Cho
Jinsoo Choi
Yunjae Jung
In So Kweon
220
14
0
21 Oct 2021
R$^3$Net:Relation-embedded Representation Reconstruction Network for Change Captioning
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Yunbin Tu
Liang Li
C. Yan
Shengxiang Gao
Zhengtao Yu
315
29
0
20 Oct 2021
ACP++: Action Co-occurrence Priors for Human-Object Interaction Detection
IEEE Transactions on Image Processing (TIP), 2021
Dong-Jin Kim
Xiao Sun
Jinsoo Choi
Stephen Lin
In So Kweon
237
25
0
09 Sep 2021
LabOR: Labeling Only if Required for Domain Adaptive Semantic Segmentation
IEEE International Conference on Computer Vision (ICCV), 2021
Inkyu Shin
Dong-Jin Kim
Jae-Won Cho
Sanghyun Woo
Kwanyong Park
In So Kweon
328
68
0
12 Aug 2021
MCDAL: Maximum Classifier Discrepancy for Active Learning
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2021
Jae-Won Cho
Dong-Jin Kim
Yunjae Jung
In So Kweon
419
56
0
23 Jul 2021
From Show to Tell: A Survey on Deep Learning-based Image Captioning
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021
Matteo Stefanini
Marcella Cornia
Lorenzo Baraldi
S. Cascianelli
G. Fiameni
Rita Cucchiara
3DV, VLM, MLLM
585
373
0
14 Jul 2021
Exploring Semantic Relationships for Unpaired Image Captioning
Fenglin Liu
Meng Gao
Tianhao Zhang
Yuexian Zou
389
7
0
20 Jun 2021
DASO: Distribution-Aware Semantics-Oriented Pseudo-label for Imbalanced Semi-Supervised Learning
Computer Vision and Pattern Recognition (CVPR), 2021
Youngtaek Oh
Dong-Jin Kim
In So Kweon
306
87
0
10 Jun 2021
Removing Word-Level Spurious Alignment between Images and Pseudo-Captions in Unsupervised Image Captioning
Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2021
Ukyo Honda
Yoshitaka Ushiku
Atsushi Hashimoto
Taro Watanabe
Yuji Matsumoto
322
26
0
28 Apr 2021
Dealing with Missing Modalities in the Visual Question Answer-Difference Prediction Task through Knowledge Distillation
Jae-Won Cho
Dong-Jin Kim
Jinsoo Choi
Yunjae Jung
In So Kweon
VLM
166
19
0
13 Apr 2021
More Photos are All You Need: Semi-Supervised Learning for Fine-Grained Sketch Based Image Retrieval
Computer Vision and Pattern Recognition (CVPR), 2021
A. Bhunia
Pinaki Nath Chowdhury
Aneeshan Sain
Yongxin Yang
Tao Xiang
Yi-Zhe Song
GAN, SSL
298
69
0
25 Mar 2021
VisualGPT: Data-efficient Adaptation of Pretrained Language Models for Image Captioning
Computer Vision and Pattern Recognition (CVPR), 2021
Jun Chen
Han Guo
Kai Yi
Boyang Albert Li
Mohamed Elhoseiny
VLM
607
288
0
20 Feb 2021
Adversarial Training for Code Retrieval with Question-Description Relevance Regularization
Findings (Findings), 2020
Jie Zhao
Huan Sun
359
5
0
19 Oct 2020
Dense Relational Image Captioning via Multi-task Triple-Stream Networks
Dong-Jin Kim
Tae-Hyun Oh
Jinsoo Choi
In So Kweon
398
39
0
08 Oct 2020
Detecting Human-Object Interactions with Action Co-occurrence Priors
European Conference on Computer Vision (ECCV), 2020
Dong-Jin Kim
Xiao Sun
Jinsoo Choi
Stephen Lin
In So Kweon
363
133
0
17 Jul 2020
Detection and Description of Change in Visual Streams
Davis Gilton
Ruotian Luo
Rebecca Willett
Gregory Shakhnarovich
AI4TS
233
4
0
27 Mar 2020