Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2307.09915
Cited By
Embedded Heterogeneous Attention Transformer for Cross-lingual Image Captioning
19 July 2023
Zijie Song
Zhenzhen Hu
Yuanen Zhou
Ye Zhao
Richang Hong
Meng Wang
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Embedded Heterogeneous Attention Transformer for Cross-lingual Image Captioning"
3 / 3 papers shown
Title
From Show to Tell: A Survey on Deep Learning-based Image Captioning
Matteo Stefanini
Marcella Cornia
Lorenzo Baraldi
S. Cascianelli
G. Fiameni
Rita Cucchiara
3DV
VLM
MLLM
51
244
0
14 Jul 2021
Normalized and Geometry-Aware Self-Attention Network for Image Captioning
Longteng Guo
Jing Liu
Xinxin Zhu
Peng Yao
Shichen Lu
Hanqing Lu
ViT
106
164
0
19 Mar 2020
Heterogeneous Graph Transformer
Ziniu Hu
Yuxiao Dong
Kuansan Wang
Yizhou Sun
167
1,157
0
03 Mar 2020
1