1909.02489
Cited By
Stack-VS: Stacked Visual-Semantic Attention for Image Caption Generation
IEEE Access, 2019
5 September 2019
Wei Wei
Ling Cheng
Xian-Ling Mao
Guangyou Zhou
Feida Zhu
DiffM
Papers citing
"Stack-VS: Stacked Visual-Semantic Attention for Image Caption Generation"
6 / 6 papers shown
DiffCap: Exploring Continuous Diffusion on Image Captioning
Yufeng He
Zefan Cai
Xu Gan
Baobao Chang
DiffM
253
13
0
20 May 2023
CLIP-Diffusion-LM: Apply Diffusion Model on Image Captioning
Shi-You Xu
VLM
DiffM
224
18
0
10 Oct 2022
Declaration-based Prompt Tuning for Visual Question Answering
International Joint Conference on Artificial Intelligence (IJCAI), 2022
Yuhang Liu
Wei Wei
Daowan Peng
Feida Zhu
MLLM
VLM
196
21
0
05 May 2022
A Review on Methods and Applications in Multimodal Deep Learning
Summaira Jabeen
Xi Li
Muhammad Shoib Amin
Abdul Jabbar
VLM
HAI
316
173
0
18 Feb 2022
Caption Generation on Scenes with Seen and Unseen Object Categories
Image and Vision Computing (IVC), 2021
B. Demirel
R. G. Cinbis
VLM
389
2
0
13 Aug 2021
Recent Advances and Trends in Multimodal Deep Learning: A Review
Jabeen Summaira
Xi Li
Amin Muhammad Shoib
Songyuan Li
Abdul Jabbar
HAI
374
71
0
24 May 2021