ResearchTrend.AI
Exploring and Distilling Cross-Modal Information for Image Captioning


28 February 2020
Fenglin Liu
Xuancheng Ren
Yuanxin Liu
Kai Lei
Xu Sun
    ViT

Papers citing "Exploring and Distilling Cross-Modal Information for Image Captioning"

12 / 12 papers shown
Memory Reviving, Continuing Learning and Beyond: Evaluation of Pre-trained Encoders and Decoders for Multimodal Machine Translation
Zhuang Yu
Shiliang Sun
Jing Zhao
Tengfei Song
Hao-Yu Yang
48
0
0
25 Apr 2025
A Systematic Review of Deep Learning-based Research on Radiology Report Generation
Chang Liu
Yuanhe Tian
Yan Song
MedIm
29
15
0
23 Nov 2023
Prophet Attention: Predicting Attention with Future Attention for Image Captioning
Fenglin Liu
Xuancheng Ren
Xian Wu
Wei Fan
Yuexian Zou
Xu Sun
24
46
0
19 Oct 2022
Graph-in-Graph Network for Automatic Gene Ontology Description Generation
Fenglin Liu
Bang-ju Yang
Chenyu You
Xian Wu
Shen Ge
Adelaide Woicik
Sheng Wang
GNN
28
4
0
10 Jun 2022
AlignTransformer: Hierarchical Alignment of Visual Regions and Disease Tags for Medical Report Generation
Di You
Fenglin Liu
Shen Ge
Xiaoxia Xie
Jing Zhang
Xian Wu
ViT
MedIm
18
106
0
18 Mar 2022
Deep Learning Approaches on Image Captioning: A Review
Taraneh Ghandi
H. Pourreza
H. Mahyar
VLM
8
89
0
31 Jan 2022
Auto-Encoding Knowledge Graph for Unsupervised Medical Report Generation
Fenglin Liu
Chenyu You
Xian Wu
Shen Ge
Sheng Wang
Xu Sun
MedIm
81
91
0
08 Nov 2021
Audio-Oriented Multimodal Machine Comprehension: Task, Dataset and Model
Zhiqi Huang
Fenglin Liu
Xian Wu
Shen Ge
Helin Wang
Wei Fan
Yuexian Zou
AuLLM
21
2
0
04 Jul 2021
Improving Image Captioning by Leveraging Intra- and Inter-layer Global Representation in Transformer Network
Jiayi Ji
Yunpeng Luo
Xiaoshuai Sun
Fuhai Chen
Gen Luo
Yongjian Wu
Yue Gao
Rongrong Ji
ViT
41
170
0
13 Dec 2020
Visual Agreement Regularized Training for Multi-Modal Machine Translation
Pengcheng Yang
Boxing Chen
Pei Zhang
Xu Sun
74
30
0
27 Dec 2019
Aligning Visual Regions and Textual Concepts for Semantic-Grounded Image Representations
Fenglin Liu
Yuanxin Liu
Xuancheng Ren
Xiaodong He
Xu Sun
VLM
26
81
0
15 May 2019
simNet: Stepwise Image-Topic Merging Network for Generating Detailed and Comprehensive Image Captions
Fenglin Liu
Xuancheng Ren
Yuanxin Liu
Houfeng Wang
Xu Sun
95
65
0
27 Aug 2018