ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1611.01646
  4. Cited By
Boosting Image Captioning with Attributes

Boosting Image Captioning with Attributes

5 November 2016
Ting Yao
Yingwei Pan
Yehao Li
Zhaofan Qiu
Tao Mei
    VLM
ArXivPDFHTML

Papers citing "Boosting Image Captioning with Attributes"

50 / 222 papers shown
Title
Cross Modification Attention Based Deliberation Model for Image
  Captioning
Cross Modification Attention Based Deliberation Model for Image Captioning
Zheng Lian
Yanan Zhang
Haichang Li
Rui Wang
Xiaohui Hu
19
4
0
17 Sep 2021
A Survey on Temporal Sentence Grounding in Videos
A Survey on Temporal Sentence Grounding in Videos
Xiaohan Lan
Yitian Yuan
Xin Eric Wang
Zhi Wang
Wenwu Zhu
27
47
0
16 Sep 2021
Label-Attention Transformer with Geometrically Coherent Objects for
  Image Captioning
Label-Attention Transformer with Geometrically Coherent Objects for Image Captioning
Shikha Dubey
Farrukh Olimov
M. Rafique
Joonmo Kim
M. Jeon
ViT
17
37
0
16 Sep 2021
From General to Specific: Informative Scene Graph Generation via Balance
  Adjustment
From General to Specific: Informative Scene Graph Generation via Balance Adjustment
Yuyu Guo
Lianli Gao
Xuanhan Wang
Yuxuan Hu
Xing Xu
Xu Lu
Heng Tao Shen
Jingkuan Song
58
84
0
30 Aug 2021
Similar Scenes arouse Similar Emotions: Parallel Data Augmentation for
  Stylized Image Captioning
Similar Scenes arouse Similar Emotions: Parallel Data Augmentation for Stylized Image Captioning
Guodun Li
Yuchen Zhai
Zehao Lin
Yin Zhang
48
21
0
26 Aug 2021
Auto-Parsing Network for Image Captioning and Visual Question Answering
Auto-Parsing Network for Image Captioning and Visual Question Answering
Xu Yang
Chongyang Gao
Hanwang Zhang
Jianfei Cai
11
35
0
24 Aug 2021
X-modaler: A Versatile and High-performance Codebase for Cross-modal
  Analytics
X-modaler: A Versatile and High-performance Codebase for Cross-modal Analytics
Yehao Li
Yingwei Pan
Jingwen Chen
Ting Yao
Tao Mei
VLM
19
31
0
18 Aug 2021
Dual Graph Convolutional Networks with Transformer and Curriculum
  Learning for Image Captioning
Dual Graph Convolutional Networks with Transformer and Curriculum Learning for Image Captioning
Xinzhi Dong
Chengjiang Long
Wenju Xu
Chunxia Xiao
ViT
69
66
0
05 Aug 2021
From Show to Tell: A Survey on Deep Learning-based Image Captioning
From Show to Tell: A Survey on Deep Learning-based Image Captioning
Matteo Stefanini
Marcella Cornia
Lorenzo Baraldi
S. Cascianelli
G. Fiameni
Rita Cucchiara
3DV
VLM
MLLM
55
254
0
14 Jul 2021
Saying the Unseen: Video Descriptions via Dialog Agents
Saying the Unseen: Video Descriptions via Dialog Agents
Ye Zhu
Yu Wu
Yi Yang
Yan Yan
22
6
0
26 Jun 2021
A Picture May Be Worth a Hundred Words for Visual Question Answering
A Picture May Be Worth a Hundred Words for Visual Question Answering
Yusuke Hirota
Noa Garcia
Mayu Otani
Chenhui Chu
Yuta Nakashima
Ittetsu Taniguchi
Takao Onoye
ViT
8
5
0
25 Jun 2021
Exploring Semantic Relationships for Unpaired Image Captioning
Exploring Semantic Relationships for Unpaired Image Captioning
Fenglin Liu
Meng Gao
Tianhao Zhang
Yuexian Zou
16
7
0
20 Jun 2021
Conversational Fashion Image Retrieval via Multiturn Natural Language
  Feedback
Conversational Fashion Image Retrieval via Multiturn Natural Language Feedback
Yifei Yuan
W. Lam
13
43
0
08 Jun 2021
SGNet: A Super-class Guided Network for Image Classification and Object
  Detection
SGNet: A Super-class Guided Network for Image Classification and Object Detection
Kaidong Li
Ningning Wang
Yiju Yang
Guanghui Wang
92
22
0
26 Apr 2021
Detector-Free Weakly Supervised Grounding by Separation
Detector-Free Weakly Supervised Grounding by Separation
Assaf Arbelle
Sivan Doveh
Amit Alfassy
J. Shtok
Guy Lev
...
Kate Saenko
S. Ullman
Raja Giryes
Rogerio Feris
Leonid Karlinsky
35
23
0
20 Apr 2021
Knowledge driven Description Synthesis for Floor Plan Interpretation
Knowledge driven Description Synthesis for Floor Plan Interpretation
Shreya Goyal
Chiranjoy Chattopadhyay
Gaurav Bhatnagar
3DV
23
12
0
15 Mar 2021
A Discriminative Vectorial Framework for Multi-modal Feature
  Representation
A Discriminative Vectorial Framework for Multi-modal Feature Representation
Lei Gao
L. Guan
14
11
0
09 Mar 2021
Image Captioning using Multiple Transformers for Self-Attention
  Mechanism
Image Captioning using Multiple Transformers for Self-Attention Mechanism
Farrukh Olimov
Shikha Dubey
Labina Shrestha
Tran Trung Tin
M. Jeon
ViT
23
2
0
14 Feb 2021
Scheduled Sampling in Vision-Language Pretraining with Decoupled
  Encoder-Decoder Network
Scheduled Sampling in Vision-Language Pretraining with Decoupled Encoder-Decoder Network
Yehao Li
Yingwei Pan
Ting Yao
Jingwen Chen
Tao Mei
VLM
21
52
0
27 Jan 2021
CPTR: Full Transformer Network for Image Captioning
CPTR: Full Transformer Network for Image Captioning
Wei Liu
Sihan Chen
Longteng Guo
Xinxin Zhu
Jing Liu
ViT
10
141
0
26 Jan 2021
Macroscopic Control of Text Generation for Image Captioning
Macroscopic Control of Text Generation for Image Captioning
Zhangzi Zhu
Tianlei Wang
Hong Qu
19
4
0
20 Jan 2021
LCEval: Learned Composite Metric for Caption Evaluation
LCEval: Learned Composite Metric for Caption Evaluation
Naeha Sharif
Lyndon White
Bennamoun
Wei Liu
Syed Afaq Ali Shah
21
8
0
24 Dec 2020
SubICap: Towards Subword-informed Image Captioning
SubICap: Towards Subword-informed Image Captioning
Naeha Sharif
Bennamoun
Wei Liu
Syed Afaq Ali Shah
22
2
0
24 Dec 2020
AutoCaption: Image Captioning with Neural Architecture Search
AutoCaption: Image Captioning with Neural Architecture Search
Xinxin Zhu
Weining Wang
Longteng Guo
Jing Liu
24
9
0
16 Dec 2020
Improving Image Captioning by Leveraging Intra- and Inter-layer Global
  Representation in Transformer Network
Improving Image Captioning by Leveraging Intra- and Inter-layer Global Representation in Transformer Network
Jiayi Ji
Yunpeng Luo
Xiaoshuai Sun
Fuhai Chen
Gen Luo
Yongjian Wu
Yue Gao
Rongrong Ji
ViT
41
170
0
13 Dec 2020
SuperOCR: A Conversion from Optical Character Recognition to Image
  Captioning
SuperOCR: A Conversion from Optical Character Recognition to Image Captioning
Baohua Sun
Michael Lin
Hao Sha
Lin Yang
13
5
0
21 Nov 2020
Dual Attention on Pyramid Feature Maps for Image Captioning
Dual Attention on Pyramid Feature Maps for Image Captioning
Litao Yu
Jian Andrew Zhang
Qiang Wu
16
47
0
02 Nov 2020
Boost Image Captioning with Knowledge Reasoning
Boost Image Captioning with Knowledge Reasoning
Feicheng Huang
Zhixin Li
Haiyang Wei
Canlong Zhang
Huifang Ma
6
25
0
02 Nov 2020
Teacher-Critical Training Strategies for Image Captioning
Teacher-Critical Training Strategies for Image Captioning
Yiqing Huang
Jiansheng Chen
VLM
10
8
0
30 Sep 2020
Where is the Model Looking At?--Concentrate and Explain the Network
  Attention
Where is the Model Looking At?--Concentrate and Explain the Network Attention
Wenjia Xu
Jiuniu Wang
Yang Wang
Guangluan Xu
Wei Dai
Yirong Wu
XAI
16
17
0
29 Sep 2020
Image Captioning with Attention for Smart Local Tourism using
  EfficientNet
Image Captioning with Attention for Smart Local Tourism using EfficientNet
D. H. Fudholi
Yurio Windiatmoko
Nurdi Afrianto
Prastyo Eko Susanto
Magfirah Suyuti
A. Hidayatullah
R. Rahmadi
3DH
6
10
0
18 Sep 2020
Denoising Large-Scale Image Captioning from Alt-text Data using Content
  Selection Models
Denoising Large-Scale Image Captioning from Alt-text Data using Content Selection Models
Khyathi Raghavi Chandu
Piyush Sharma
Soravit Changpinyo
Ashish V. Thapliyal
Radu Soricut
DiffM
VLM
27
3
0
10 Sep 2020
Relative Attribute Classification with Deep Rank SVM
Relative Attribute Classification with Deep Rank SVM
Sara Atito Ali Ahmed
Berrin Yanikoglu
12
5
0
09 Sep 2020
Describing Unseen Videos via Multi-Modal Cooperative Dialog Agents
Describing Unseen Videos via Multi-Modal Cooperative Dialog Agents
Ye Zhu
Yu Wu
Yi Yang
Yan Yan
11
13
0
18 Aug 2020
Describe What to Change: A Text-guided Unsupervised Image-to-Image
  Translation Approach
Describe What to Change: A Text-guided Unsupervised Image-to-Image Translation Approach
Yahui Liu
Marco De Nadai
Deng Cai
Huayang Li
Xavier Alameda-Pineda
N. Sebe
Bruno Lepri
38
59
0
10 Aug 2020
Fine-Grained Image Captioning with Global-Local Discriminative Objective
Fine-Grained Image Captioning with Global-Local Discriminative Objective
Jie Wu
Tianshui Chen
Hefeng Wu
Zhi Yang
Guangchun Luo
Liang Lin
20
59
0
21 Jul 2020
Length-Controllable Image Captioning
Length-Controllable Image Captioning
Chaorui Deng
Ning Ding
Mingkui Tan
Qi Wu
VLM
17
56
0
19 Jul 2020
Compare and Reweight: Distinctive Image Captioning Using Similar Images
  Sets
Compare and Reweight: Distinctive Image Captioning Using Similar Images Sets
Jiuniu Wang
Wenjia Xu
Qingzhong Wang
Antoni B. Chan
29
44
0
14 Jul 2020
Image Captioning with Compositional Neural Module Networks
Image Captioning with Compositional Neural Module Networks
Junjiao Tian
Jean Oh
9
11
0
10 Jul 2020
PathGAN: Local Path Planning with Attentive Generative Adversarial
  Networks
PathGAN: Local Path Planning with Attentive Generative Adversarial Networks
Dooseop Choi
Seung-Jun Han
Kyoung‐Wook Min
Jeongdan Choi
GAN
11
5
0
08 Jul 2020
Diverse and Styled Image Captioning Using SVD-Based Mixture of Recurrent
  Experts
Diverse and Styled Image Captioning Using SVD-Based Mixture of Recurrent Experts
Marzi Heidari
M. Ghatee
A. Nickabadi
Arash Pourhasan Nezhad
DiffM
MoE
19
1
0
07 Jul 2020
Auto-captions on GIF: A Large-scale Video-sentence Dataset for
  Vision-language Pre-training
Auto-captions on GIF: A Large-scale Video-sentence Dataset for Vision-language Pre-training
Yingwei Pan
Yehao Li
Jianjie Luo
Jun Xu
Ting Yao
Tao Mei
24
57
0
05 Jul 2020
A Transformer-based Audio Captioning Model with Keyword Estimation
A Transformer-based Audio Captioning Model with Keyword Estimation
Yuma Koizumi
Ryo Masumura
Kyosuke Nishida
Masahiro Yasuda
Shoichiro Saito
11
54
0
01 Jul 2020
Improving Image Captioning with Better Use of Captions
Improving Image Captioning with Better Use of Captions
Zhan Shi
Xu Zhou
Xipeng Qiu
Xiao-Dan Zhu
22
121
0
21 Jun 2020
X-Linear Attention Networks for Image Captioning
X-Linear Attention Networks for Image Captioning
Yingwei Pan
Ting Yao
Yehao Li
Tao Mei
20
509
0
31 Mar 2020
A Better Variant of Self-Critical Sequence Training
A Better Variant of Self-Critical Sequence Training
Ruotian Luo
BDL
22
37
0
22 Mar 2020
Deconfounded Image Captioning: A Causal Retrospect
Deconfounded Image Captioning: A Causal Retrospect
Xu Yang
Hanwang Zhang
Jianfei Cai
CML
10
116
0
09 Mar 2020
Exploring and Distilling Cross-Modal Information for Image Captioning
Exploring and Distilling Cross-Modal Information for Image Captioning
Fenglin Liu
Xuancheng Ren
Yuanxin Liu
Kai Lei
Xu Sun
ViT
27
51
0
28 Feb 2020
Gaussian Smoothen Semantic Features (GSSF) -- Exploring the Linguistic
  Aspects of Visual Captioning in Indian Languages (Bengali) Using MSCOCO
  Framework
Gaussian Smoothen Semantic Features (GSSF) -- Exploring the Linguistic Aspects of Visual Captioning in Indian Languages (Bengali) Using MSCOCO Framework
C. Sur
11
7
0
16 Feb 2020
MRRC: Multiple Role Representation Crossover Interpretation for Image
  Captioning With R-CNN Feature Distribution Composition (FDC)
MRRC: Multiple Role Representation Crossover Interpretation for Image Captioning With R-CNN Feature Distribution Composition (FDC)
C. Sur
23
16
0
15 Feb 2020
Previous
12345
Next