ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1412.6632
  4. Cited By
Deep Captioning with Multimodal Recurrent Neural Networks (m-RNN)

Deep Captioning with Multimodal Recurrent Neural Networks (m-RNN)

20 December 2014
Junhua Mao
W. Xu
Yi Yang
Jiang Wang
Zhiheng Huang
Alan Yuille
    VLM
ArXivPDFHTML

Papers citing "Deep Captioning with Multimodal Recurrent Neural Networks (m-RNN)"

50 / 417 papers shown
Title
Aesthetic Image Captioning From Weakly-Labelled Photographs
Aesthetic Image Captioning From Weakly-Labelled Photographs
Koustav Ghosal
A. Rana
A. Smolic
17
25
0
29 Aug 2019
Adversarial Representation Learning for Text-to-Image Matching
Adversarial Representation Learning for Text-to-Image Matching
N. Sarafianos
Xiang Xu
I. Kakadiaris
GAN
30
185
0
28 Aug 2019
Sequential Latent Spaces for Modeling the Intention During Diverse Image
  Captioning
Sequential Latent Spaces for Modeling the Intention During Diverse Image Captioning
J. Aneja
Harsh Agrawal
Dhruv Batra
A. Schwing
BDL
VLM
18
66
0
22 Aug 2019
Dynamic Stale Synchronous Parallel Distributed Training for Deep
  Learning
Dynamic Stale Synchronous Parallel Distributed Training for Deep Learning
Xing Zhao
Aijun An
Junfeng Liu
B. Chen
18
57
0
16 Aug 2019
Semi Supervised Phrase Localization in a Bidirectional Caption-Image
  Retrieval Framework
Semi Supervised Phrase Localization in a Bidirectional Caption-Image Retrieval Framework
Deepan Das
Noor Mohammed Ghouse
Shashank Verma
Yin Li
11
4
0
08 Aug 2019
Image Captioning using Facial Expression and Attention
Image Captioning using Facial Expression and Attention
Omid Mohamad Nezami
Mark Dras
Stephen Wan
Cécile Paris
CVBM
17
8
0
08 Aug 2019
Scene-based Factored Attention for Image Captioning
Scene-based Factored Attention for Image Captioning
Chen Shen
Rongrong Ji
Fuhai Chen
Xiaoshuai Sun
Xiangming Li
11
0
0
07 Aug 2019
Cascaded Revision Network for Novel Object Captioning
Cascaded Revision Network for Novel Object Captioning
Qianyu Feng
Yu Wu
Hehe Fan
C. Yan
Yezhou Yang
18
35
0
06 Aug 2019
Image Captioning with Unseen Objects
Image Captioning with Unseen Objects
B. Demirel
R. G. Cinbis
Nazli Ikizler-Cinbis
VLM
11
16
0
31 Jul 2019
An Empirical Study on Leveraging Scene Graphs for Visual Question
  Answering
An Empirical Study on Leveraging Scene Graphs for Visual Question Answering
Cheng Zhang
Wei-Lun Chao
D. Xuan
23
50
0
28 Jul 2019
Learning Visual Actions Using Multiple Verb-Only Labels
Learning Visual Actions Using Multiple Verb-Only Labels
Michael Wray
Dima Damen
15
7
0
25 Jul 2019
Trends in Integration of Vision and Language Research: A Survey of
  Tasks, Datasets, and Methods
Trends in Integration of Vision and Language Research: A Survey of Tasks, Datasets, and Methods
Aditya Mogadala
M. Kalimuthu
Dietrich Klakow
VLM
15
132
0
22 Jul 2019
Image Captioning: Transforming Objects into Words
Image Captioning: Transforming Objects into Words
Simão Herdade
Armin Kappeler
K. Boakye
Joao Soares
ViT
8
462
0
14 Jun 2019
Figure Captioning with Reasoning and Sequence-Level Training
Figure Captioning with Reasoning and Sequence-Level Training
Charles C. Chen
Ruiyi Zhang
Eunyee Koh
Sungchul Kim
Scott D. Cohen
Tong Yu
Ryan Rossi
Razvan Bunescu
AIMat
15
38
0
07 Jun 2019
Context-Aware Visual Policy Network for Fine-Grained Image Captioning
Context-Aware Visual Policy Network for Fine-Grained Image Captioning
Zhengjun Zha
Daqing Liu
Hanwang Zhang
Yongdong Zhang
Feng Wu
12
119
0
06 Jun 2019
Vision-to-Language Tasks Based on Attributes and Attention Mechanism
Vision-to-Language Tasks Based on Attributes and Attention Mechanism
Xuelong Li
Aihong Yuan
Xiaoqiang Lu
11
37
0
29 May 2019
Multimodal Transformer with Multi-View Visual Representation for Image
  Captioning
Multimodal Transformer with Multi-View Visual Representation for Image Captioning
Jun-chen Yu
Jing Li
Zhou Yu
Qingming Huang
ViT
11
374
0
20 May 2019
AI in the media and creative industries
AI in the media and creative industries
Giuseppe Amato
Malte Behrmann
Frédéric Bimbot
Baptiste Caramiaux
Fabrizio Falchi
...
Andrew Perkis
R. Redondo
Enrico Turrin
T. Viéville
Emmanuel Vincent
16
41
0
10 May 2019
3G structure for image caption generation
3G structure for image caption generation
Aihong Yuan
Xuelong Li
Xiaoqiang Lu
11
34
0
21 Apr 2019
Saliency-Guided Attention Network for Image-Sentence Matching
Saliency-Guided Attention Network for Image-Sentence Matching
Zhong Ji
Haoran Wang
J. Han
Yanwei Pang
8
85
0
20 Apr 2019
Multi-modal gated recurrent units for image description
Multi-modal gated recurrent units for image description
Xuelong Li
Aihong Yuan
Xiaoqiang Lu
GAN
19
26
0
20 Apr 2019
Challenges and Prospects in Vision and Language Research
Challenges and Prospects in Vision and Language Research
Kushal Kafle
Robik Shrestha
Christopher Kanan
14
41
0
19 Apr 2019
Self-critical n-step Training for Image Captioning
Self-critical n-step Training for Image Captioning
Junlong Gao
Shiqi Wang
Shanshe Wang
Siwei Ma
Wen Gao
14
55
0
15 Apr 2019
Factor Graph Attention
Factor Graph Attention
Idan Schwartz
Seunghak Yu
Tamir Hazan
A. Schwing
19
110
0
11 Apr 2019
A Simple Baseline for Audio-Visual Scene-Aware Dialog
A Simple Baseline for Audio-Visual Scene-Aware Dialog
Idan Schwartz
A. Schwing
Tamir Hazan
19
69
0
11 Apr 2019
Reasoning Visual Dialogs with Structural and Partial Observations
Reasoning Visual Dialogs with Structural and Partial Observations
Zilong Zheng
Wenguan Wang
Siyuan Qi
Song-Chun Zhu
30
117
0
11 Apr 2019
Align2Ground: Weakly Supervised Phrase Grounding Guided by Image-Caption
  Alignment
Align2Ground: Weakly Supervised Phrase Grounding Guided by Image-Caption Alignment
Samyak Datta
Karan Sikka
Anirban Roy
Karuna Ahuja
Devi Parikh
Ajay Divakaran
6
101
0
27 Mar 2019
Recurrent Back-Projection Network for Video Super-Resolution
Recurrent Back-Projection Network for Video Super-Resolution
Muhammad Haris
Gregory Shakhnarovich
Norimichi Ukita
SupR
20
430
0
25 Mar 2019
Neural Sequential Phrase Grounding (SeqGROUND)
Neural Sequential Phrase Grounding (SeqGROUND)
Pelin Dogan
Leonid Sigal
Markus Gross
ObjD
14
51
0
18 Mar 2019
Image captioning with weakly-supervised attention penalty
Image captioning with weakly-supervised attention penalty
Jiayun Li
M. K. Ebrahimpour
Azadeh Moghtaderi
Yen-Yun Yu
12
5
0
06 Mar 2019
JECL: Joint Embedding and Cluster Learning for Image-Text Pairs
JECL: Joint Embedding and Cluster Learning for Image-Text Pairs
Sean T. Yang
Kuan-Hao Huang
Bill Howe
VLM
14
3
0
04 Jan 2019
CLEVR-Ref+: Diagnosing Visual Reasoning with Referring Expressions
CLEVR-Ref+: Diagnosing Visual Reasoning with Referring Expressions
Runtao Liu
Chenxi Liu
Yutong Bai
Alan Yuille
NAI
ObjD
22
121
0
03 Jan 2019
Transfer learning from language models to image caption generators:
  Better models may not transfer better
Transfer learning from language models to image caption generators: Better models may not transfer better
Marc Tanti
Albert Gatt
K. Camilleri
VLM
18
3
0
01 Jan 2019
Coupled Recurrent Network (CRN)
Coupled Recurrent Network (CRN)
Lin Sun
K. Jia
Yuejia Shen
Silvio Savarese
Dit-Yan Yeung
Bertram E. Shi
21
4
0
25 Dec 2018
Attend More Times for Image Captioning
Attend More Times for Image Captioning
Jiajun Du
Yu Qin
Hongtao Lu
Yonghua Zhang
VLM
16
5
0
08 Dec 2018
An Attempt towards Interpretable Audio-Visual Video Captioning
An Attempt towards Interpretable Audio-Visual Video Captioning
Yapeng Tian
Chenxiao Guan
Justin Goodman
Marc Moore
Chenliang Xu
17
20
0
07 Dec 2018
Layer Flexible Adaptive Computational Time
Layer Flexible Adaptive Computational Time
Lida Zhang
Abdolghani Ebrahimi
Diego Klabjan
AI4CE
20
1
0
06 Dec 2018
Neural Rejuvenation: Improving Deep Network Training by Enhancing
  Computational Resource Utilization
Neural Rejuvenation: Improving Deep Network Training by Enhancing Computational Resource Utilization
Siyuan Qiao
Zhe-nan Lin
Jianming Zhang
Alan Yuille
8
23
0
02 Dec 2018
Intention Oriented Image Captions with Guiding Objects
Intention Oriented Image Captions with Guiding Objects
Yue Zheng
Yali Li
Shengjin Wang
11
55
0
19 Nov 2018
Image Captioning Based on a Hierarchical Attention Mechanism and Policy
  Gradient Optimization
Image Captioning Based on a Hierarchical Attention Mechanism and Policy Gradient Optimization
Shiyang Yan
Yuan Xie
F. Wu
Jeremy S. Smith
Wenjin Lu
Bailing Zhang
14
5
0
13 Nov 2018
A sequential guiding network with attention for image captioning
A sequential guiding network with attention for image captioning
Daouda Sow
Zengchang Qin
Mouhamed Niasse
T. Wan
19
3
0
01 Nov 2018
Session-based Recommendation with Graph Neural Networks
Session-based Recommendation with Graph Neural Networks
Shu Wu
Yuyuan Tang
Yanqiao Zhu
Liang Wang
Xing Xie
T. Tan
GNN
14
1,536
0
01 Nov 2018
Gated Hierarchical Attention for Image Captioning
Gated Hierarchical Attention for Image Captioning
Qingzhong Wang
Antoni B. Chan
16
18
0
30 Oct 2018
Using Deep Learning for price prediction by exploiting stationary limit
  order book features
Using Deep Learning for price prediction by exploiting stationary limit order book features
Avraam Tsantekidis
Nikolaos Passalis
Anastasios Tefas
J. Kanniainen
M. Gabbouj
Alexandros Iosifidis
OOD
16
88
0
23 Oct 2018
A Comprehensive Survey of Deep Learning for Image Captioning
A Comprehensive Survey of Deep Learning for Image Captioning
Md. Zakir Hossain
Ferdous Sohel
M. Shiratuddin
Hamid Laga
VLM
3DV
16
760
0
06 Oct 2018
Zoom-RNN: A Novel Method for Person Recognition Using Recurrent Neural
  Networks
Zoom-RNN: A Novel Method for Person Recognition Using Recurrent Neural Networks
Sina Mokhtarzadeh Azar
Sajjad Azami
Mina Ghadimi Atigh
Mohammad Javadi
A. Nickabadi
13
2
0
24 Sep 2018
Neural Approaches to Conversational AI
Neural Approaches to Conversational AI
Jianfeng Gao
Michel Galley
Lihong Li
32
668
0
21 Sep 2018
Lessons learned in multilingual grounded language learning
Lessons learned in multilingual grounded language learning
Ákos Kádár
Desmond Elliott
Marc-Alexandre Côté
Grzegorz Chrupała
A. Alishahi
VLM
12
24
0
20 Sep 2018
Image Captioning based on Deep Reinforcement Learning
Image Captioning based on Deep Reinforcement Learning
Haichao Shi
Peng Li
Bo Wang
Zhenyu Wang
15
25
0
13 Sep 2018
End-to-end Image Captioning Exploits Multimodal Distributional
  Similarity
End-to-end Image Captioning Exploits Multimodal Distributional Similarity
Pranava Madhyastha
Josiah Wang
Lucia Specia
CoGe
12
7
0
11 Sep 2018
Previous
123456789
Next