ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1804.03803
  4. Cited By
Decoupled Novel Object Captioner
v1v2 (latest)

Decoupled Novel Object Captioner

11 April 2018
Yuehua Wu
Linchao Zhu
Lu Jiang
Yi Yang
ArXiv (abs)PDFHTML

Papers citing "Decoupled Novel Object Captioner"

27 / 27 papers shown
Improving Cross-modal Alignment with Synthetic Pairs for Text-only Image
  Captioning
Improving Cross-modal Alignment with Synthetic Pairs for Text-only Image CaptioningAAAI Conference on Artificial Intelligence (AAAI), 2023
Zhiyue Liu
Jinyuan Liu
Fanrong Ma
CLIPVLM
252
20
0
14 Dec 2023
RCA-NOC: Relative Contrastive Alignment for Novel Object Captioning
RCA-NOC: Relative Contrastive Alignment for Novel Object Captioning
Jiashuo Fan
Yaoyuan Liang
Leyao Liu
Shao-Lun Huang
Lei Zhang
256
6
0
11 Dec 2023
DeCap: Decoding CLIP Latents for Zero-Shot Captioning via Text-Only
  Training
DeCap: Decoding CLIP Latents for Zero-Shot Captioning via Text-Only TrainingInternational Conference on Learning Representations (ICLR), 2023
Wei Li
Linchao Zhu
Longyin Wen
Yi Yang
VLM
220
119
0
06 Mar 2023
Vision+X: A Survey on Multimodal Learning in the Light of Data
Vision+X: A Survey on Multimodal Learning in the Light of DataIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Ye Zhu
Yuehua Wu
Andrii Zadaianchuk
Yan Yan
354
38
0
05 Oct 2022
Paraphrasing Is All You Need for Novel Object Captioning
Paraphrasing Is All You Need for Novel Object CaptioningNeural Information Processing Systems (NeurIPS), 2022
Cheng Yang
Yifan Hao
Wanshu Fan
Ruslan Salakhutdinov
Louis-Philippe Morency
Yu-Chiang Frank Wang
180
6
0
25 Sep 2022
"This is my unicorn, Fluffy": Personalizing frozen vision-language
  representations
"This is my unicorn, Fluffy": Personalizing frozen vision-language representationsEuropean Conference on Computer Vision (ECCV), 2022
Niv Cohen
Rinon Gal
E. Meirom
Gal Chechik
Yuval Atzmon
VLMMLLM
348
102
0
04 Apr 2022
NOC-REK: Novel Object Captioning with Retrieved Vocabulary from External
  Knowledge
NOC-REK: Novel Object Captioning with Retrieved Vocabulary from External KnowledgeComputer Vision and Pattern Recognition (CVPR), 2022
D. Vo
Hong Chen
Akihiro Sugimoto
Hideki Nakayama
192
19
0
28 Mar 2022
Zero-shot Natural Language Video Localization
Zero-shot Natural Language Video LocalizationIEEE International Conference on Computer Vision (ICCV), 2021
Jinwoo Nam
Daechul Ahn
Luan Tuyen Chau
S. Ha
Jonghyun Choi
345
54
0
29 Aug 2021
Caption Generation on Scenes with Seen and Unseen Object Categories
Caption Generation on Scenes with Seen and Unseen Object CategoriesImage and Vision Computing (IVC), 2021
B. Demirel
R. G. Cinbis
VLM
270
2
0
13 Aug 2021
From Show to Tell: A Survey on Deep Learning-based Image Captioning
From Show to Tell: A Survey on Deep Learning-based Image CaptioningIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021
Matteo Stefanini
Marcella Cornia
Lorenzo Baraldi
S. Cascianelli
G. Fiameni
Rita Cucchiara
3DVVLMMLLM
435
344
0
14 Jul 2021
Saying the Unseen: Video Descriptions via Dialog Agents
Saying the Unseen: Video Descriptions via Dialog AgentsIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021
Ye Zhu
Yu Wu
Yi Yang
Yan Yan
212
8
0
26 Jun 2021
Learning to Select: A Fully Attentive Approach for Novel Object
  Captioning
Learning to Select: A Fully Attentive Approach for Novel Object CaptioningInternational Conference on Multimedia Retrieval (ICMR), 2021
Marco Cagrandi
Marcella Cornia
Matteo Stefanini
Lorenzo Baraldi
Rita Cucchiara
145
9
0
02 Jun 2021
ClawCraneNet: Leveraging Object-level Relation for Text-based Video
  Segmentation
ClawCraneNet: Leveraging Object-level Relation for Text-based Video Segmentation
Chen Liang
Yu Wu
Yawei Luo
Yi Yang
VOS
336
33
0
19 Mar 2021
Conceptual 12M: Pushing Web-Scale Image-Text Pre-Training To Recognize
  Long-Tail Visual Concepts
Conceptual 12M: Pushing Web-Scale Image-Text Pre-Training To Recognize Long-Tail Visual ConceptsComputer Vision and Pattern Recognition (CVPR), 2021
Soravit Changpinyo
P. Sharma
Nan Ding
Radu Soricut
VLM
1.1K
1,353
0
17 Feb 2021
ECOL-R: Encouraging Copying in Novel Object Captioning with
  Reinforcement Learning
ECOL-R: Encouraging Copying in Novel Object Captioning with Reinforcement LearningConference of the European Chapter of the Association for Computational Linguistics (EACL), 2021
Yufei Wang
Ian D. Wood
Stephen Wan
Mark Johnson
151
8
0
25 Jan 2021
Diverse Image Captioning with Context-Object Split Latent Spaces
Diverse Image Captioning with Context-Object Split Latent SpacesNeural Information Processing Systems (NeurIPS), 2020
Shweta Mahajan
Stefan Roth
194
46
0
02 Nov 2020
VIVO: Visual Vocabulary Pre-Training for Novel Object Captioning
VIVO: Visual Vocabulary Pre-Training for Novel Object Captioning
Xiaowei Hu
Xi Yin
Kevin Qinghong Lin
Lijuan Wang
Guang Dai
Jianfeng Gao
Zicheng Liu
VLM
223
58
0
28 Sep 2020
Describing Unseen Videos via Multi-Modal Cooperative Dialog Agents
Describing Unseen Videos via Multi-Modal Cooperative Dialog Agents
Ye Zhu
Yu Wu
Yi Yang
Yan Yan
255
13
0
18 Aug 2020
Egoshots, an ego-vision life-logging dataset and semantic fidelity
  metric to evaluate diversity in image captioning models
Egoshots, an ego-vision life-logging dataset and semantic fidelity metric to evaluate diversity in image captioning modelsInternational Conference on Learning Representations (ICLR), 2020
Pranav Agarwal
Alejandro Betancourt
V. Panagiotou
Natalia Díaz Rodríguez
EGVM
213
11
0
26 Mar 2020
Captioning Images with Novel Objects via Online Vocabulary Expansion
Captioning Images with Novel Objects via Online Vocabulary Expansion
Mikihiro Tanaka
Tatsuya Harada
3DV
210
2
0
06 Mar 2020
Cascaded Revision Network for Novel Object Captioning
Cascaded Revision Network for Novel Object Captioning
Qianyu Feng
Yu Wu
Hehe Fan
C. Yan
Yezhou Yang
129
38
0
06 Aug 2019
Image Captioning with Unseen Objects
Image Captioning with Unseen ObjectsBritish Machine Vision Conference (BMVC), 2019
B. Demirel
R. G. Cinbis
Nazli Ikizler-Cinbis
VLM
215
17
0
31 Jul 2019
Pointing Novel Objects in Image Captioning
Pointing Novel Objects in Image Captioning
Yehao Li
Ting Yao
Yingwei Pan
Hongyang Chao
Tao Mei
197
73
0
25 Apr 2019
nocaps: novel object captioning at scale
nocaps: novel object captioning at scale
Harsh Agrawal
Karan Desai
Yufei Wang
Xinlei Chen
Rishabh Jain
Mark Johnson
Dhruv Batra
Devi Parikh
Stefan Lee
Peter Anderson
VLM
355
583
0
20 Dec 2018
Intention Oriented Image Captions with Guiding Objects
Intention Oriented Image Captions with Guiding ObjectsComputer Vision and Pattern Recognition (CVPR), 2018
Yue Zheng
Yali Li
Shengjin Wang
185
56
0
19 Nov 2018
Holistic Multi-modal Memory Network for Movie Question Answering
Holistic Multi-modal Memory Network for Movie Question Answering
Anran Wang
Anh Tuan Luu
Chuan-Sheng Foo
Erik Cambria
Yi Tay
V. Chandrasekhar
175
20
0
12 Nov 2018
Learning to Compose Topic-Aware Mixture of Experts for Zero-Shot Video
  Captioning
Learning to Compose Topic-Aware Mixture of Experts for Zero-Shot Video CaptioningAAAI Conference on Artificial Intelligence (AAAI), 2018
Yoonchang Sung
Jiawei Wu
Da Zhang
Yu-Chuan Su
Erfaun Noorani
224
39
0
07 Nov 2018
1