1603.03925
Image Captioning with Semantic Attention
12 March 2016
Quanzeng You
Hailin Jin
Zhaowen Wang
Chen Fang
Jiebo Luo
VLM
Papers citing "Image Captioning with Semantic Attention"
50 / 193 papers shown
Title
Deep Learning Approaches on Image Captioning: A Review
Taraneh Ghandi
H. Pourreza
H. Mahyar
VLM
13
89
0
31 Jan 2022
MMLatch: Bottom-up Top-down Fusion for Multimodal Sentiment Analysis
Georgios Paraskevopoulos
Efthymios Georgiou
Alexandros Potamianos
19
26
0
24 Jan 2022
An Integrated Approach for Video Captioning and Applications
Soheyla Amirian
T. Taha
Khaled Rasheed
H. Arabnia
26
1
0
23 Jan 2022
A Survey of Natural Language Generation
Chenhe Dong
Yinghui Li
Haifan Gong
M. Chen
Junxin Li
Ying Shen
Min Yang
3DV
24
43
0
22 Dec 2021
MuMuQA: Multimedia Multi-Hop News Question Answering via Cross-Media Knowledge Extraction and Grounding
Revanth Reddy Gangi Reddy
Xilin Rui
Manling Li
Xudong Lin
Haoyang Wen
...
Mohit Bansal
Avirup Sil
Shih-Fu Chang
A. Schwing
Heng Ji
17
31
0
20 Dec 2021
Injecting Semantic Concepts into End-to-End Image Captioning
Zhiyuan Fang
Jianfeng Wang
Xiaowei Hu
Lin Liang
Zhe Gan
Lijuan Wang
Yezhou Yang
Zicheng Liu
ViT
VLM
21
86
0
09 Dec 2021
Visual Persuasion in COVID-19 Social Media Content: A Multi-Modal Characterization
Mesut Erhan Unal
Adriana Kovashka
Wen-Ting Chung
Yu-Ru Lin
13
4
0
05 Dec 2021
Neural Attention for Image Captioning: Review of Outstanding Methods
Zanyar Zohourianshahzadi
Jugal Kalita
VLM
27
45
0
29 Nov 2021
Real-time Instance Segmentation of Surgical Instruments using Attention and Multi-scale Feature Fusion
Juan Carlos Angeles Ceron
Gilberto Ochoa-Ruiz
Leonardo Chang
Sharib Ali
16
36
0
09 Nov 2021
Tip-Adapter: Training-free CLIP-Adapter for Better Vision-Language Modeling
Renrui Zhang
Rongyao Fang
Wei Zhang
Peng Gao
Kunchang Li
Jifeng Dai
Yu Qiao
Hongsheng Li
VLM
189
385
0
06 Nov 2021
Bornon: Bengali Image Captioning with Transformer-based Deep learning approach
Faisal Muhammad Shah
Mayeesha Humaira
Md Abidur Rahman Khan Jim
Amit Saha Ami
Shimul Paul
21
17
0
11 Sep 2021
Attentive Neural Controlled Differential Equations for Time-series Classification and Forecasting
Sheo Yon Jhin
H. Shin
Seoyoung Hong
Solhee Park
Noseong Park
AI4TS
19
22
0
04 Sep 2021
Group-based Distinctive Image Captioning with Memory Attention
Jiuniu Wang
Wenjia Xu
Qingzhong Wang
Antoni B. Chan
8
18
0
20 Aug 2021
Dual Graph Convolutional Networks with Transformer and Curriculum Learning for Image Captioning
Xinzhi Dong
Chengjiang Long
Wenju Xu
Chunxia Xiao
ViT
73
66
0
05 Aug 2021
The Who in XAI: How AI Background Shapes Perceptions of AI Explanations
Upol Ehsan
Samir Passi
Q. V. Liao
Larry Chan
I-Hsiang Lee
Michael J. Muller
Mark O. Riedl
29
85
0
28 Jul 2021
Adversarial Reinforced Instruction Attacker for Robust Vision-Language Navigation
Bingqian Lin
Yi Zhu
Yanxin Long
Xiaodan Liang
QiXiang Ye
Liang Lin
AAML
39
16
0
23 Jul 2021
From Show to Tell: A Survey on Deep Learning-based Image Captioning
Matteo Stefanini
Marcella Cornia
Lorenzo Baraldi
S. Cascianelli
G. Fiameni
Rita Cucchiara
3DV
VLM
MLLM
64
254
0
14 Jul 2021
Understanding and Evaluating Racial Biases in Image Captioning
Dora Zhao
Angelina Wang
Olga Russakovsky
19
134
0
16 Jun 2021
Recent advances and clinical applications of deep learning in medical image analysis
Xuxin Chen
Ximing Wang
Kecheng Zhang
K. Fung
T. Thai
K. Moore
Robert S. Mannel
Hong Liu
B. Zheng
Y. Qiu
OOD
18
570
0
27 May 2021
A Comprehensive Survey on Community Detection with Deep Learning
Xing Su
Shan Xue
Fanzhen Liu
Jia Wu
Jian Yang
...
Cécile Paris
Surya Nepal
Di Jin
Quan Z. Sheng
Philip S. Yu
GNN
17
321
0
26 May 2021
Visual Navigation with Spatial Attention
Bar Mayo
Tamir Hazan
A. Tal
EgoV
21
73
0
20 Apr 2021
Compressing Visual-linguistic Model via Knowledge Distillation
Zhiyuan Fang
Jianfeng Wang
Xiaowei Hu
Lijuan Wang
Yezhou Yang
Zicheng Liu
VLM
31
96
0
05 Apr 2021
Dual Attention-in-Attention Model for Joint Rain Streak and Raindrop Removal
Kaihao Zhang
Dongxu Li
Wenhan Luo
Wenqi Ren
19
73
0
12 Mar 2021
Causal Attention for Vision-Language Tasks
Xu Yang
Hanwang Zhang
Guojun Qi
Jianfei Cai
CML
28
148
0
05 Mar 2021
Efficient Palm-Line Segmentation with U-Net Context Fusion Module
Toan Pham Van
S. T. Nguyen
Linh Doan Bao
Ngoc N. Tran
Ta Minh Thanh
24
6
0
24 Feb 2021
Progressive Transformer-Based Generation of Radiology Reports
Farhad Nooralahzadeh
Nicolas Andres Perez Gonzalez
T. Frauenfelder
Koji Fujimoto
Michael Krauthammer
ViT
MedIm
15
84
0
19 Feb 2021
HDMI: High-order Deep Multiplex Infomax
Baoyu Jing
Chanyoung Park
Hanghang Tong
98
163
0
15 Feb 2021
Improved Bengali Image Captioning via deep convolutional neural network based encoder-decoder model
Mohammad Faiyaz Khan
S. M. S. Shifath
Md. Saiful Islam
VLM
25
18
0
14 Feb 2021
Diagnostic Captioning: A Survey
John Pavlopoulos
Vasiliki Kougia
Ion Androutsopoulos
D. Papamichail
3DV
MedIm
89
26
0
18 Jan 2021
KM-BART: Knowledge Enhanced Multimodal BART for Visual Commonsense Generation
Yiran Xing
Z. Shi
Zhao Meng
Gerhard Lakemeyer
Yunpu Ma
Roger Wattenhofer
VLM
64
40
0
02 Jan 2021
BERT-hLSTMs: BERT and Hierarchical LSTMs for Visual Storytelling
Jing Su
Qingyun Dai
Frank Guerin
Mian Zhou
22
24
0
03 Dec 2020
TSP: Temporally-Sensitive Pretraining of Video Encoders for Localization Tasks
Humam Alwassel
Silvio Giancola
Bernard Ghanem
30
123
0
23 Nov 2020
Dual Attention on Pyramid Feature Maps for Image Captioning
Litao Yu
Jian Andrew Zhang
Qiang Wu
16
47
0
02 Nov 2020
Melody-Conditioned Lyrics Generation with SeqGANs
Yihao Chen
Alexander Lerch
GAN
MGen
26
29
0
28 Oct 2020
New Ideas and Trends in Deep Multimodal Content Understanding: A Review
Wei-Neng Chen
Weiping Wang
Li Liu
M. Lew
VLM
112
31
0
16 Oct 2020
Improving Text Generation with Student-Forcing Optimal Transport
Guoyin Wang
Chunyuan Li
Jianqiao Li
Hao Fu
Yuh-Chen Lin
...
Ruiyi Zhang
Wenlin Wang
Dinghan Shen
Qian Yang
Lawrence Carin
OT
22
17
0
12 Oct 2020
Teacher-Critical Training Strategies for Image Captioning
Yiqing Huang
Jiansheng Chen
VLM
21
8
0
30 Sep 2020
Multi-Modal Reasoning Graph for Scene-Text Based Fine-Grained Image Classification and Retrieval
Andrés Mafla
S. Dey
Ali Furkan Biten
Lluís Gómez
Dimosthenis Karatzas
21
25
0
21 Sep 2020
Towards Unique and Informative Captioning of Images
Zeyu Wang
Berthy T. Feng
Karthik Narasimhan
Olga Russakovsky
17
37
0
08 Sep 2020
Counting from Sky: A Large-scale Dataset for Remote Sensing Object Counting and A Benchmark Method
Guangshuai Gao
Qingjie Liu
Yunhong Wang
13
53
0
28 Aug 2020
Neural Learning of One-of-Many Solutions for Combinatorial Problems in Structured Output Spaces
Yatin Nandwani
Deepanshu Jindal
Mausam
Parag Singla
16
13
0
27 Aug 2020
SBAT: Video Captioning with Sparse Boundary-Aware Transformer
Tao Jin
Siyu Huang
Ming Chen
Yingming Li
Zhongfei Zhang
30
52
0
23 Jul 2020
Improving Image Captioning with Better Use of Captions
Zhan Shi
Xu Zhou
Xipeng Qiu
Xiao-Dan Zhu
24
121
0
21 Jun 2020
Auxiliary Signal-Guided Knowledge Encoder-Decoder for Medical Report Generation
Mingjie Li
Fuyu Wang
Xiaojun Chang
Xiaodan Liang
MedIm
21
101
0
06 Jun 2020
A Survey on Transfer Learning in Natural Language Processing
Zaid Alyafeai
Maged S. Alshaibani
Irfan Ahmad
24
72
0
31 May 2020
Attention-guided Context Feature Pyramid Network for Object Detection
Junxu Cao
Qi Chen
Jun Guo
Ruichao Shi
ObjD
22
86
0
23 May 2020
A Spatio-Temporal Spot-Forecasting Framework for Urban Traffic Prediction
Rodrigo de Medrano
J. Aznarte
AI4TS
11
23
0
31 Mar 2020
Better Captioning with Sequence-Level Exploration
Jia Chen
Qin Jin
37
12
0
08 Mar 2020
Adaptive Offline Quintuplet Loss for Image-Text Matching
Tianlang Chen
Jiajun Deng
Jiebo Luo
181
68
0
07 Mar 2020
Show, Edit and Tell: A Framework for Editing Image Captions
Fawaz Sammani
Luke Melas-Kyriazi
KELM
DiffM
45
59
0
06 Mar 2020