Exploring and Distilling Cross-Modal Information for Image Captioning

28 February 2020

Xuancheng Ren

Papers citing "Exploring and Distilling Cross-Modal Information for Image Captioning"

Title | Authors | Tags | Counts | Date
Memory Reviving, Continuing Learning and Beyond: Evaluation of Pre-trained Encoders and Decoders for Multimodal Machine Translation | Zhuang Yu, Shiliang Sun, Jing Zhao, Tengfei Song, Hao-Yu Yang | - | 48 0 0 | 25 Apr 2025
A Systematic Review of Deep Learning-based Research on Radiology Report Generation | Chang Liu, Yuanhe Tian, Yan Song | MedIm | 29 15 0 | 23 Nov 2023
Prophet Attention: Predicting Attention with Future Attention for Image Captioning | Fenglin Liu, Xuancheng Ren, Xian Wu, Wei Fan, Yuexian Zou, Xu Sun | - | 24 46 0 | 19 Oct 2022
Graph-in-Graph Network for Automatic Gene Ontology Description Generation | Fenglin Liu, Bang-ju Yang, Chenyu You, Xian Wu, Shen Ge, Adelaide Woicik, Sheng Wang | GNN | 28 4 0 | 10 Jun 2022
AlignTransformer: Hierarchical Alignment of Visual Regions and Disease Tags for Medical Report Generation | Di You, Fenglin Liu, Shen Ge, Xiaoxia Xie, Jing Zhang, Xian Wu | ViT, MedIm | 18 106 0 | 18 Mar 2022
Deep Learning Approaches on Image Captioning: A Review | Taraneh Ghandi, H. Pourreza, H. Mahyar | VLM | 8 89 0 | 31 Jan 2022
Auto-Encoding Knowledge Graph for Unsupervised Medical Report Generation | Fenglin Liu, Chenyu You, Xian Wu, Shen Ge, Sheng Wang, Xu Sun | MedIm | 81 91 0 | 08 Nov 2021
Audio-Oriented Multimodal Machine Comprehension: Task, Dataset and Model | Zhiqi Huang, Fenglin Liu, Xian Wu, Shen Ge, Helin Wang, Wei Fan, Yuexian Zou | AuLLM | 21 2 0 | 04 Jul 2021
Improving Image Captioning by Leveraging Intra- and Inter-layer Global Representation in Transformer Network | Jiayi Ji, Yunpeng Luo, Xiaoshuai Sun, Fuhai Chen, Gen Luo, Yongjian Wu, Yue Gao, Rongrong Ji | ViT | 41 170 0 | 13 Dec 2020
Visual Agreement Regularized Training for Multi-Modal Machine Translation | Pengcheng Yang, Boxing Chen, Pei Zhang, Xu Sun | - | 74 30 0 | 27 Dec 2019
Aligning Visual Regions and Textual Concepts for Semantic-Grounded Image Representations | Fenglin Liu, Yuanxin Liu, Xuancheng Ren, Xiaodong He, Xu Sun | VLM | 26 81 0 | 15 May 2019
simNet: Stepwise Image-Topic Merging Network for Generating Detailed and Comprehensive Image Captions | Fenglin Liu, Xuancheng Ren, Yuanxin Liu, Houfeng Wang, Xu Sun | - | 95 65 0 | 27 Aug 2018