MAT: A Multimodal Attentive Translator for Image Captioning

International Joint Conference on Artificial Intelligence (IJCAI), 2017
18 February 2017
Chang Liu
F. Sun
Changhu Wang
Feng Wang
Alan Yuille

Papers citing "MAT: A Multimodal Attentive Translator for Image Captioning"

17 / 17 papers shown
OSIC: A New One-Stage Image Captioner Coined
International Joint Conference on Artificial Intelligence (IJCAI), 2022
Bo Wang
Zhao Zhang
Ming Zhao
Xiaojie Jin
Mingliang Xu
Meng Wang
VLM
184
6
0
04 Nov 2022
Geometry-Entangled Visual Semantic Transformer for Image Captioning
Ling Cheng
Wei Wei
Feida Zhu
Yong Liu
Chunyan Miao
ViT
154
3
0
29 Sep 2021
LocalDrop: A Hybrid Regularization for Deep Neural Networks
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021
Ziqing Lu
Chang Xu
Bo Du
Takashi Ishida
Guang Dai
Masashi Sugiyama
177
17
0
01 Mar 2021
Gaussian Smoothen Semantic Features (GSSF) -- Exploring the Linguistic Aspects of Visual Captioning in Indian Languages (Bengali) Using MSCOCO Framework
C. Sur
210
7
0
16 Feb 2020
MRRC: Multiple Role Representation Crossover Interpretation for Image Captioning With R-CNN Feature Distribution Composition (FDC)
Multimedia Tools and Applications (MTA), 2020
C. Sur
119
17
0
15 Feb 2020
aiTPR: Attribute Interaction-Tensor Product Representation for Image Caption
Neural Processing Letters (NPL), 2020
C. Sur
95
10
0
27 Jan 2020
CRUR: Coupled-Recurrent Unit for Unification, Conceptualization and Context Capture for Language Representation -- A Generalization of Bi Directional LSTM
Multimedia Tools and Applications (MTA), 2019
C. Sur
BDL
140
6
0
22 Nov 2019
On Architectures for Including Visual Information in Neural Language Models for Image Description
Marc Tanti
Albert Gatt
K. Camilleri
VLM
104
2
0
09 Nov 2019
Compositional Generalization in Image Captioning
Conference on Computational Natural Language Learning (CoNLL), 2019
Mitja Nikolaus
Mostafa Abdou
Matthew Lamm
Rahul Aralikatte
Desmond Elliott
CoGe
235
49
0
10 Sep 2019
Stack-VS: Stacked Visual-Semantic Attention for Image Caption Generation
IEEE Access, 2019
Wei Wei
Ling Cheng
Xian-Ling Mao
Guangyou Zhou
Feida Zhu
DiffM
147
24
0
05 Sep 2019
Hindi Visual Genome: A Dataset for Multimodal English-to-Hindi Machine Translation
Journal of Computacion y Sistemas (JCYS), 2019
Shantipriya Parida
Ondrej Bojar
S. Dash
134
65
0
21 Jul 2019
Image Captioning based on Deep Learning Methods: A Survey
Yiyu Wang
Jungang Xu
Yingfei Sun
Xianpei Han
VLM
92
8
0
20 May 2019
Multi-modal gated recurrent units for image description
Xuelong Li
Aihong Yuan
Xiaoqiang Lu
GAN
116
28
0
20 Apr 2019
A sequential guiding network with attention for image captioning
Daouda Sow
Zengchang Qin
Mouhamed Niasse
T. Wan
214
3
0
01 Nov 2018
Unpaired Image Captioning by Language Pivoting
European Conference on Computer Vision (ECCV), 2018
Jiuxiang Gu
Shafiq Joty
Jianfei Cai
G. Wang
221
88
0
14 Mar 2018
Stack-Captioning: Coarse-to-Fine Learning for Image Captioning
Jiuxiang Gu
Jianfei Cai
G. Wang
Tsuhan Chen
207
187
0
11 Sep 2017
Recurrent Multimodal Interaction for Referring Image Segmentation
Chenxi Liu
Zhe Lin
Xiaohui Shen
Jimei Yang
Xin Lu
Alan Yuille
EgoV
205
268
0
23 Mar 2017