Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1909.02201
Cited By
v1
v2 (latest)
Image Captioning with Very Scarce Supervised Data: Adversarial Semi-Supervised Learning Approach
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2019
5 September 2019
Dong-Jin Kim
Jinsoo Choi
Tae-Hyun Oh
In So Kweon
SSL
VLM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Image Captioning with Very Scarce Supervised Data: Adversarial Semi-Supervised Learning Approach"
36 / 36 papers shown
How Vision-Language Tasks Benefit from Large Pre-trained Models: A Survey
Yayun Qi
Hongxi Li
Yiqi Song
Xinxiao Wu
Jiebo Luo
LRM
VLM
188
1
0
11 Dec 2024
Preserving Multi-Modal Capabilities of Pre-trained VLMs for Improving Vision-Linguistic Compositionality
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Youngtaek Oh
Jae-Won Cho
Dong-Jin Kim
In So Kweon
Junmo Kim
VLM
CoGe
CLIP
396
14
0
07 Oct 2024
IFCap: Image-like Retrieval and Frequency-based Entity Filtering for Zero-shot Captioning
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Soeun Lee
Si-Woo Kim
Taewhan Kim
Dong-Jin Kim
CLIP
VLM
289
12
0
26 Sep 2024
Surveying the Landscape of Image Captioning Evaluation: A Comprehensive Taxonomy, Trends and Metrics Analysis
Uri Berger
Gabriel Stanovsky
Omri Abend
Lea Frermann
540
0
0
09 Aug 2024
GazeXplain: Learning to Predict Natural Language Explanations of Visual Scanpaths
European Conference on Computer Vision (ECCV), 2024
Xianyu Chen
Ming Jiang
Qi Zhao
294
11
0
05 Aug 2024
The Solution for the CVPR2023 NICE Image Captioning Challenge
Xiangyu Wu
Yi Gao
Hailiang Zhang
Yang Yang
Weili Guo
Jianfeng Lu
347
1
0
10 Oct 2023
Multimodal Data Augmentation for Image Captioning using Diffusion Models
Changrong Xiao
S. Xu
Kunpeng Zhang
DiffM
230
19
0
03 May 2023
Prefix tuning for automated audio captioning
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Minkyu Kim
Kim Sung-Bin
Tae-Hyun Oh
465
55
0
30 Mar 2023
ENInst: Enhancing Weakly-supervised Low-shot Instance Segmentation
Pattern Recognition (Pattern Recogn.), 2023
Moon Ye-Bin
D. Choi
Y. Kwon
Junsik Kim
Tae-Hyun Oh
ISeg
524
8
0
20 Feb 2023
Semi-Supervised Image Captioning by Adversarially Propagating Labeled Data
IEEE Access (IEEE Access), 2023
Dong-Jin Kim
Tae-Hyun Oh
Jinsoo Choi
In So Kweon
SSL
VLM
184
10
0
26 Jan 2023
Signing Outside the Studio: Benchmarking Background Robustness for Continuous Sign Language Recognition
British Machine Vision Conference (BMVC), 2022
Youngjoon Jang
Youngtaek Oh
Jae-Won Cho
Dong-Jin Kim
Joon Son Chung
In So Kweon
SLR
239
18
0
01 Nov 2022
What Should the System Do Next?: Operative Action Captioning for Estimating System Actions
Taiki Nakamura
Seiya Kawano
Akishige Yuguchi
Yasutomo Kawanishi
Koichiro Yoshino
272
0
0
06 Oct 2022
REST: REtrieve & Self-Train for generative action recognition
Adrian Bulat
Enrique Sanchez
Brais Martínez
Georgios Tzimiropoulos
VLM
330
4
0
29 Sep 2022
Paraphrasing Is All You Need for Novel Object Captioning
Neural Information Processing Systems (NeurIPS), 2022
Cheng Yang
Yifan Hao
Wanshu Fan
Ruslan Salakhutdinov
Louis-Philippe Morency
Yu-Chiang Frank Wang
236
6
0
25 Sep 2022
Generative Bias for Robust Visual Question Answering
Computer Vision and Pattern Recognition (CVPR), 2022
Jae-Won Cho
Dong-Jin Kim
H. Ryu
In So Kweon
OOD
CML
454
34
0
01 Aug 2022
Intra-agent speech permits zero-shot task acquisition
Neural Information Processing Systems (NeurIPS), 2022
Chen Yan
Federico Carnevale
Petko Georgiev
Adam Santoro
Aurelia Guy
Alistair Muldal
Chia-Chun Hung
Josh Abramson
Timothy Lillicrap
Greg Wayne
LM&Ro
309
10
0
07 Jun 2022
Prompt-based Learning for Unpaired Image Captioning
IEEE transactions on multimedia (IEEE TMM), 2022
Peipei Zhu
Tianlin Li
Lin Zhu
Zhenglong Sun
Weishi Zheng
Yaowei Wang
Chen Chen
VLM
280
50
0
26 May 2022
Unpaired Image Captioning by Image-level Weakly-Supervised Visual Concept Recognition
IEEE transactions on multimedia (IEEE TMM), 2022
Peipei Zhu
Tianlin Li
Yong Luo
Zhenglong Sun
Wei-Shi Zheng
Yaowei Wang
Chen Chen
270
16
0
07 Mar 2022
A Fistful of Words: Learning Transferable Visual Models from Bag-of-Words Supervision
Ajinkya Tejankar
Maziar Sanjabi
Bichen Wu
Saining Xie
Madian Khabsa
Hamed Pirsiavash
Hamed Firooz
VLM
349
20
0
27 Dec 2021
ZeroCap: Zero-Shot Image-to-Text Generation for Visual-Semantic Arithmetic
Computer Vision and Pattern Recognition (CVPR), 2021
Yoad Tewel
Yoav Shalev
Idan Schwartz
Lior Wolf
VLM
475
250
0
29 Nov 2021
Single-Modal Entropy based Active Learning for Visual Question Answering
British Machine Vision Conference (BMVC), 2021
Dong-Jin Kim
Jae-Won Cho
Jinsoo Choi
Yunjae Jung
In So Kweon
220
14
0
21 Oct 2021
R
3
^3
3
Net:Relation-embedded Representation Reconstruction Network for Change Captioning
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Yunbin Tu
Liang Li
C. Yan
Shengxiang Gao
Zhengtao Yu
315
29
0
20 Oct 2021
ACP++: Action Co-occurrence Priors for Human-Object Interaction Detection
IEEE Transactions on Image Processing (TIP), 2021
Dong-Jin Kim
Xiao Sun
Jinsoo Choi
Stephen Lin
In So Kweon
237
25
0
09 Sep 2021
LabOR: Labeling Only if Required for Domain Adaptive Semantic Segmentation
IEEE International Conference on Computer Vision (ICCV), 2021
Inkyu Shin
Dong-Jin Kim
Jae-Won Cho
Sanghyun Woo
Kwanyong Park
In So Kweon
328
68
0
12 Aug 2021
MCDAL: Maximum Classifier Discrepancy for Active Learning
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2021
Jae-Won Cho
Dong-Jin Kim
Yunjae Jung
In So Kweon
419
56
0
23 Jul 2021
From Show to Tell: A Survey on Deep Learning-based Image Captioning
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021
Matteo Stefanini
Marcella Cornia
Lorenzo Baraldi
S. Cascianelli
G. Fiameni
Rita Cucchiara
3DV
VLM
MLLM
585
373
0
14 Jul 2021
Exploring Semantic Relationships for Unpaired Image Captioning
Fenglin Liu
Meng Gao
Tianhao Zhang
Yuexian Zou
389
7
0
20 Jun 2021
DASO: Distribution-Aware Semantics-Oriented Pseudo-label for Imbalanced Semi-Supervised Learning
Computer Vision and Pattern Recognition (CVPR), 2021
Youngtaek Oh
Dong-Jin Kim
In So Kweon
306
87
0
10 Jun 2021
Removing Word-Level Spurious Alignment between Images and Pseudo-Captions in Unsupervised Image Captioning
Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2021
Ukyo Honda
Yoshitaka Ushiku
Atsushi Hashimoto
Taro Watanabe
Yuji Matsumoto
322
26
0
28 Apr 2021
Dealing with Missing Modalities in the Visual Question Answer-Difference Prediction Task through Knowledge Distillation
Jae-Won Cho
Dong-Jin Kim
Jinsoo Choi
Yunjae Jung
In So Kweon
VLM
166
19
0
13 Apr 2021
More Photos are All You Need: Semi-Supervised Learning for Fine-Grained Sketch Based Image Retrieval
Computer Vision and Pattern Recognition (CVPR), 2021
A. Bhunia
Pinaki Nath Chowdhury
Aneeshan Sain
Yongxin Yang
Tao Xiang
Yi-Zhe Song
GAN
SSL
298
69
0
25 Mar 2021
VisualGPT: Data-efficient Adaptation of Pretrained Language Models for Image Captioning
Computer Vision and Pattern Recognition (CVPR), 2021
Jun Chen
Han Guo
Kai Yi
Boyang Albert Li
Mohamed Elhoseiny
VLM
607
288
0
20 Feb 2021
Adversarial Training for Code Retrieval with Question-Description Relevance Regularization
Findings (Findings), 2020
Jie Zhao
Huan Sun
359
5
0
19 Oct 2020
Dense Relational Image Captioning via Multi-task Triple-Stream Networks
Dong-Jin Kim
Tae-Hyun Oh
Jinsoo Choi
In So Kweon
398
39
0
08 Oct 2020
Detecting Human-Object Interactions with Action Co-occurrence Priors
European Conference on Computer Vision (ECCV), 2020
Dong-Jin Kim
Xiao Sun
Jinsoo Choi
Stephen Lin
In So Kweon
363
133
0
17 Jul 2020
Detection and Description of Change in Visual Streams
Davis Gilton
Ruotian Luo
Rebecca Willett
Gregory Shakhnarovich
AI4TS
233
4
0
27 Mar 2020
1
Page 1 of 1