ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1906.05963
  4. Cited By
Image Captioning: Transforming Objects into Words

Image Captioning: Transforming Objects into Words

14 June 2019
Simão Herdade
Armin Kappeler
K. Boakye
Joao Soares
    ViT
ArXivPDFHTML

Papers citing "Image Captioning: Transforming Objects into Words"

50 / 161 papers shown
Title
Neural Attention for Image Captioning: Review of Outstanding Methods
Neural Attention for Image Captioning: Review of Outstanding Methods
Zanyar Zohourianshahzadi
Jugal Kalita
VLM
21
45
0
29 Nov 2021
ZeroCap: Zero-Shot Image-to-Text Generation for Visual-Semantic
  Arithmetic
ZeroCap: Zero-Shot Image-to-Text Generation for Visual-Semantic Arithmetic
Yoad Tewel
Yoav Shalev
Idan Schwartz
Lior Wolf
VLM
32
192
0
29 Nov 2021
Scaling Up Vision-Language Pre-training for Image Captioning
Scaling Up Vision-Language Pre-training for Image Captioning
Xiaowei Hu
Zhe Gan
Jianfeng Wang
Zhengyuan Yang
Zicheng Liu
Yumao Lu
Lijuan Wang
MLLM
VLM
28
246
0
24 Nov 2021
L-Verse: Bidirectional Generation Between Image and Text
L-Verse: Bidirectional Generation Between Image and Text
Taehoon Kim
Gwangmo Song
Sihaeng Lee
Sangyun Kim
Yewon Seo
Soonyoung Lee
S. Kim
Honglak Lee
Kyunghoon Bae
15
24
0
22 Nov 2021
ClipCap: CLIP Prefix for Image Captioning
ClipCap: CLIP Prefix for Image Captioning
Ron Mokady
Amir Hertz
Amit H. Bermano
CLIP
VLM
17
652
0
18 Nov 2021
LTD: Low Temperature Distillation for Robust Adversarial Training
LTD: Low Temperature Distillation for Robust Adversarial Training
Erh-Chung Chen
Che-Rung Lee
AAML
19
26
0
03 Nov 2021
Bangla Image Caption Generation through CNN-Transformer based
  Encoder-Decoder Network
Bangla Image Caption Generation through CNN-Transformer based Encoder-Decoder Network
Yuansan Liu
MD Abdullah Al Nasim
Sourav Saha
Faria Afrin
Raisa Mallik
Sathishkumar Samiappan
ViT
6
11
0
24 Oct 2021
Exploiting Cross-Modal Prediction and Relation Consistency for
  Semi-Supervised Image Captioning
Exploiting Cross-Modal Prediction and Relation Consistency for Semi-Supervised Image Captioning
Yang Yang
H. Wei
Hengshu Zhu
Dianhai Yu
Hui Xiong
Jian Yang
SSL
4
33
0
22 Oct 2021
Topic Scene Graph Generation by Attention Distillation from Caption
Topic Scene Graph Generation by Attention Distillation from Caption
Wenbin Wang
R. Wang
X. Chen
DiffM
17
14
0
12 Oct 2021
End-to-End Supermask Pruning: Learning to Prune Image Captioning Models
End-to-End Supermask Pruning: Learning to Prune Image Captioning Models
J. Tan
C. Chan
Joon Huang Chuah
VLM
49
16
0
07 Oct 2021
Geometry Attention Transformer with Position-aware LSTMs for Image
  Captioning
Geometry Attention Transformer with Position-aware LSTMs for Image Captioning
Chi-Yin Wang
Yulin Shen
Luping Ji
ViT
39
49
0
01 Oct 2021
HSVA: Hierarchical Semantic-Visual Adaptation for Zero-Shot Learning
HSVA: Hierarchical Semantic-Visual Adaptation for Zero-Shot Learning
Shiming Chen
Guosen Xie
Yang Liu
Qinmu Peng
Baigui Sun
Hao Li
Xinge You
Ling Shao
8
124
0
30 Sep 2021
Geometry-Entangled Visual Semantic Transformer for Image Captioning
Geometry-Entangled Visual Semantic Transformer for Image Captioning
Ling Cheng
Wei Wei
Feida Zhu
Yong-jin Liu
C. Miao
ViT
16
3
0
29 Sep 2021
Label-Attention Transformer with Geometrically Coherent Objects for
  Image Captioning
Label-Attention Transformer with Geometrically Coherent Objects for Image Captioning
Shikha Dubey
Farrukh Olimov
M. Rafique
Joonmo Kim
M. Jeon
ViT
15
37
0
16 Sep 2021
Bornon: Bengali Image Captioning with Transformer-based Deep learning
  approach
Bornon: Bengali Image Captioning with Transformer-based Deep learning approach
Faisal Muhammad Shah
Mayeesha Humaira
Md Abidur Rahman Khan Jim
Amit Saha Ami
Shimul Paul
13
17
0
11 Sep 2021
We went to look for meaning and all we got were these lousy
  representations: aspects of meaning representation for computational
  semantics
We went to look for meaning and all we got were these lousy representations: aspects of meaning representation for computational semantics
Simon Dobnik
R. Cooper
Adam Ek
Bill Noble
Staffan Larsson
N. Ilinykh
Vladislav Maraev
Vidya Somashekarappa
14
0
0
10 Sep 2021
LAViTeR: Learning Aligned Visual and Textual Representations Assisted by
  Image and Caption Generation
LAViTeR: Learning Aligned Visual and Textual Representations Assisted by Image and Caption Generation
Mohammad Abuzar Shaikh
Zhanghexuan Ji
Dana Moukheiber
Yan Shen
S. Srihari
Mingchen Gao
VLM
9
1
0
04 Sep 2021
Auto-Parsing Network for Image Captioning and Visual Question Answering
Auto-Parsing Network for Image Captioning and Visual Question Answering
Xu Yang
Chongyang Gao
Hanwang Zhang
Jianfei Cai
9
35
0
24 Aug 2021
Dual Graph Convolutional Networks with Transformer and Curriculum
  Learning for Image Captioning
Dual Graph Convolutional Networks with Transformer and Curriculum Learning for Image Captioning
Xinzhi Dong
Chengjiang Long
Wenju Xu
Chunxia Xiao
ViT
69
66
0
05 Aug 2021
Question-controlled Text-aware Image Captioning
Question-controlled Text-aware Image Captioning
Anwen Hu
Shizhe Chen
Qin Jin
11
15
0
04 Aug 2021
ReFormer: The Relational Transformer for Image Captioning
ReFormer: The Relational Transformer for Image Captioning
Xuewen Yang
Yingru Liu
Xin Wang
ViT
17
54
0
29 Jul 2021
From Show to Tell: A Survey on Deep Learning-based Image Captioning
From Show to Tell: A Survey on Deep Learning-based Image Captioning
Matteo Stefanini
Marcella Cornia
Lorenzo Baraldi
S. Cascianelli
G. Fiameni
Rita Cucchiara
3DV
VLM
MLLM
55
254
0
14 Jul 2021
Case Relation Transformer: A Crossmodal Language Generation Model for
  Fetching Instructions
Case Relation Transformer: A Crossmodal Language Generation Model for Fetching Instructions
Motonari Kambara
K. Sugiura
ViT
11
6
0
02 Jul 2021
Productivity, Portability, Performance: Data-Centric Python
Productivity, Portability, Performance: Data-Centric Python
Yiheng Wang
Yao Zhang
Yanzhang Wang
Yan Wan
Jiao Wang
Zhongyuan Wu
Yuhao Yang
Bowen She
49
94
0
01 Jul 2021
Egocentric Image Captioning for Privacy-Preserved Passive Dietary Intake
  Monitoring
Egocentric Image Captioning for Privacy-Preserved Passive Dietary Intake Monitoring
Jianing Qiu
F. P. Lo
Xiao Gu
M. Jobarteh
Wenyan Jia
...
M. McCrory
Edward Sazonov
Mingui Sun
Gary Frost
Benny P. L. Lo
EgoV
30
18
0
01 Jul 2021
Neural Fashion Image Captioning : Accounting for Data Diversity
Neural Fashion Image Captioning : Accounting for Data Diversity
Gilles Hacheme
Nouréini Sayouti
12
12
0
23 Jun 2021
TCIC: Theme Concepts Learning Cross Language and Vision for Image
  Captioning
TCIC: Theme Concepts Learning Cross Language and Vision for Image Captioning
Zhihao Fan
Zhongyu Wei
Siyuan Wang
Ruize Wang
Zejun Li
Haijun Shan
Xuanjing Huang
16
26
0
21 Jun 2021
Trust It or Not: Confidence-Guided Automatic Radiology Report Generation
Trust It or Not: Confidence-Guided Automatic Radiology Report Generation
Yixin Wang
Zihao Lin
Zhe Xu
Haoyu Dong
Jiang Tian
Jie Luo
Zhongchao Shi
Yang Zhang
Jianping Fan
Zhiqiang He
UQCV
MedIm
36
12
0
21 Jun 2021
All You Can Embed: Natural Language based Vehicle Retrieval with
  Spatio-Temporal Transformers
All You Can Embed: Natural Language based Vehicle Retrieval with Spatio-Temporal Transformers
Carmelo Scribano
D. Sapienza
Giorgia Franchini
M. Verucchi
Marko Bertogna
26
4
0
18 Jun 2021
Learning to Select: A Fully Attentive Approach for Novel Object
  Captioning
Learning to Select: A Fully Attentive Approach for Novel Object Captioning
Marco Cagrandi
Marcella Cornia
Matteo Stefanini
Lorenzo Baraldi
Rita Cucchiara
11
9
0
02 Jun 2021
Learning Domain Adaptation with Model Calibration for Surgical Report
  Generation in Robotic Surgery
Learning Domain Adaptation with Model Calibration for Surgical Report Generation in Robotic Surgery
Mengya Xu
Mobarakol Islam
C. Lim
Hongliang Ren
OOD
MedIm
21
29
0
31 Mar 2021
Describing and Localizing Multiple Changes with Transformers
Describing and Localizing Multiple Changes with Transformers
Yue Qiu
Shintaro Yamamoto
Kodai Nakashima
Ryota Suzuki
K. Iwata
Hirokatsu Kataoka
Y. Satoh
25
55
0
25 Mar 2021
Context-Aware Layout to Image Generation with Enhanced Object Appearance
Context-Aware Layout to Image Generation with Enhanced Object Appearance
Sen He
Wentong Liao
M. Yang
Yongxin Yang
Yi-Zhe Song
Bodo Rosenhahn
Tao Xiang
DiffM
VLM
22
52
0
22 Mar 2021
Let Your Heart Speak in its Mother Tongue: Multilingual Captioning of
  Cardiac Signals
Let Your Heart Speak in its Mother Tongue: Multilingual Captioning of Cardiac Signals
Dani Kiyasseh
T. Zhu
David A. Clifton
17
0
0
19 Mar 2021
Enhanced Modality Transition for Image Captioning
Enhanced Modality Transition for Image Captioning
Ziwei Wang
Yadan Luo
Zi Huang
6
0
0
23 Feb 2021
Image Captioning using Multiple Transformers for Self-Attention
  Mechanism
Image Captioning using Multiple Transformers for Self-Attention Mechanism
Farrukh Olimov
Shikha Dubey
Labina Shrestha
Tran Trung Tin
M. Jeon
ViT
18
2
0
14 Feb 2021
The Singleton Fallacy: Why Current Critiques of Language Models Miss the
  Point
The Singleton Fallacy: Why Current Critiques of Language Models Miss the Point
Magnus Sahlgren
F. Carlsson
20
26
0
08 Feb 2021
CPTR: Full Transformer Network for Image Captioning
CPTR: Full Transformer Network for Image Captioning
Wei Liu
Sihan Chen
Longteng Guo
Xinxin Zhu
Jing Liu
ViT
10
141
0
26 Jan 2021
ECOL-R: Encouraging Copying in Novel Object Captioning with
  Reinforcement Learning
ECOL-R: Encouraging Copying in Novel Object Captioning with Reinforcement Learning
Yufei Wang
Ian D. Wood
Stephen Wan
Mark Johnson
20
7
0
25 Jan 2021
Fast Sequence Generation with Multi-Agent Reinforcement Learning
Fast Sequence Generation with Multi-Agent Reinforcement Learning
Longteng Guo
Jing Liu
Xinxin Zhu
Hanqing Lu
LRM
53
6
0
24 Jan 2021
Context-aware Attentional Pooling (CAP) for Fine-grained Visual
  Classification
Context-aware Attentional Pooling (CAP) for Fine-grained Visual Classification
Ardhendu Behera
Zachary Wharton
Pradeep Ruwan Padmasiri Galbokka Hewage
Asish Bera
59
108
0
17 Jan 2021
Regional Attention Network (RAN) for Head Pose and Fine-grained Gesture
  Recognition
Regional Attention Network (RAN) for Head Pose and Fine-grained Gesture Recognition
Ardhendu Behera
Zachary Wharton
Morteza Ghahremani
S. Kumar
Nikolaos Bessis
3DH
11
11
0
17 Jan 2021
Dual-Level Collaborative Transformer for Image Captioning
Dual-Level Collaborative Transformer for Image Captioning
Yunpeng Luo
Jiayi Ji
Xiaoshuai Sun
Liujuan Cao
Yongjian Wu
Feiyue Huang
Chia-Wen Lin
Rongrong Ji
ViT
14
274
0
16 Jan 2021
SubICap: Towards Subword-informed Image Captioning
SubICap: Towards Subword-informed Image Captioning
Naeha Sharif
Bennamoun
Wei Liu
Syed Afaq Ali Shah
22
2
0
24 Dec 2020
Improving Image Captioning by Leveraging Intra- and Inter-layer Global
  Representation in Transformer Network
Improving Image Captioning by Leveraging Intra- and Inter-layer Global Representation in Transformer Network
Jiayi Ji
Yunpeng Luo
Xiaoshuai Sun
Fuhai Chen
Gen Luo
Yongjian Wu
Yue Gao
Rongrong Ji
ViT
41
170
0
13 Dec 2020
Image Captioning with Context-Aware Auxiliary Guidance
Image Captioning with Context-Aware Auxiliary Guidance
Zeliang Song
Xiaofei Zhou
Zhendong Mao
Jianlong Tan
20
31
0
10 Dec 2020
AdaBins: Depth Estimation using Adaptive Bins
AdaBins: Depth Estimation using Adaptive Bins
S. Bhat
Ibraheem Alhashim
Peter Wonka
3DV
MDE
ViT
6
834
0
28 Nov 2020
Structural and Functional Decomposition for Personality Image Captioning
  in a Communication Game
Structural and Functional Decomposition for Personality Image Captioning in a Communication Game
Minh-Thu Nguyen
Duy Phung
Minh Hoai
Thien Huu Nguyen
17
4
0
17 Nov 2020
Improving Factual Completeness and Consistency of Image-to-Text
  Radiology Report Generation
Improving Factual Completeness and Consistency of Image-to-Text Radiology Report Generation
Yasuhide Miura
Yuhao Zhang
Emily Bao Tsai
C. Langlotz
Dan Jurafsky
MedIm
149
156
0
20 Oct 2020
Multimodal Research in Vision and Language: A Review of Current and
  Emerging Trends
Multimodal Research in Vision and Language: A Review of Current and Emerging Trends
Shagun Uppal
Sarthak Bhagat
Devamanyu Hazarika
Navonil Majumdar
Soujanya Poria
Roger Zimmermann
Amir Zadeh
18
6
0
19 Oct 2020
Previous
1234
Next