ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1810.04020
  4. Cited By
A Comprehensive Survey of Deep Learning for Image Captioning

A Comprehensive Survey of Deep Learning for Image Captioning

6 October 2018
Md. Zakir Hossain
Ferdous Sohel
M. Shiratuddin
Hamid Laga
    VLM
    3DV
ArXivPDFHTML

Papers citing "A Comprehensive Survey of Deep Learning for Image Captioning"

28 / 228 papers shown
Title
C3VQG: Category Consistent Cyclic Visual Question Generation
C3VQG: Category Consistent Cyclic Visual Question Generation
Shagun Uppal
Anish Madan
Sarthak Bhagat
Yi Yu
R. Shah
13
20
0
15 May 2020
Multiple Visual-Semantic Embedding for Video Retrieval from Query
  Sentence
Multiple Visual-Semantic Embedding for Video Retrieval from Query Sentence
Huy Manh Nguyen
Tomo Miyazaki
Yoshihiro Sugaya
S. Omachi
32
1
0
16 Apr 2020
Context-Aware Group Captioning via Self-Attention and Contrastive
  Features
Context-Aware Group Captioning via Self-Attention and Contrastive Features
Zhuowan Li
Quan Hung Tran
Long Mai
Zhe-nan Lin
Alan Yuille
VLM
6
44
0
07 Apr 2020
DAISI: Database for AI Surgical Instruction
DAISI: Database for AI Surgical Instruction
Edgar Rojas-Muñoz
K. Couperus
J. Wachs
6
17
0
22 Mar 2020
The Four Dimensions of Social Network Analysis: An Overview of Research
  Methods, Applications, and Software Tools
The Four Dimensions of Social Network Analysis: An Overview of Research Methods, Applications, and Software Tools
David Camacho
Á. Panizo-Lledot
Gema Bello Orgaz
A. González-Pardo
Erik Cambria
8
235
0
21 Feb 2020
Captioning Images Taken by People Who Are Blind
Captioning Images Taken by People Who Are Blind
Danna Gurari
Yinan Zhao
Meng Zhang
Nilavra Bhattacharya
17
181
0
20 Feb 2020
UIT-ViIC: A Dataset for the First Evaluation on Vietnamese Image
  Captioning
UIT-ViIC: A Dataset for the First Evaluation on Vietnamese Image Captioning
Q. Lam
Q. Le
Kiet Van Nguyen
N. Nguyen
15
19
0
01 Feb 2020
Background Hardly Matters: Understanding Personality Attribution in Deep
  Residual Networks
Background Hardly Matters: Understanding Personality Attribution in Deep Residual Networks
Gabrielle Ras
R. Dotsch
L. Ambrogioni
Umut Güçlü
Marcel van Gerven
FAtt
11
0
0
20 Dec 2019
Meaning guided video captioning
Meaning guided video captioning
Rushi J. Babariya
Toru Tamaki
14
3
0
12 Dec 2019
Improved Few-Shot Visual Classification
Improved Few-Shot Visual Classification
Peyman Bateni
Raghav Goyal
Vaden Masrani
Frank D. Wood
Leonid Sigal
VLM
11
228
0
07 Dec 2019
Better Understanding Hierarchical Visual Relationship for Image Caption
Better Understanding Hierarchical Visual Relationship for Image Caption
Z. Fei
14
0
0
04 Dec 2019
Event Recognition with Automatic Album Detection based on Sequential
  Processing, Neural Attention and Image Captioning
Event Recognition with Automatic Album Detection based on Sequential Processing, Neural Attention and Image Captioning
Andrey V. Savchenko
6
1
0
25 Nov 2019
Orderless Recurrent Models for Multi-label Classification
Orderless Recurrent Models for Multi-label Classification
V. O. Yazici
Abel Gonzalez-Garcia
Arnau Ramisa
Bartlomiej Twardowski
Joost van de Weijer
SSL
9
92
0
22 Nov 2019
On Architectures for Including Visual Information in Neural Language
  Models for Image Description
On Architectures for Including Visual Information in Neural Language Models for Image Description
Marc Tanti
Albert Gatt
K. Camilleri
VLM
22
2
0
09 Nov 2019
Text-to-Image Synthesis Based on Machine Generated Captions
Text-to-Image Synthesis Based on Machine Generated Captions
Marco Menardi
Alex Falcon
Saida S. Mohamed
Lorenzo Seidenari
G. Serra
A. Bimbo
C. Tasso
17
0
0
09 Oct 2019
Language is Power: Representing States Using Natural Language in
  Reinforcement Learning
Language is Power: Representing States Using Natural Language in Reinforcement Learning
Erez Schwartz
Guy Tennenholtz
Chen Tessler
Shie Mannor
8
12
0
02 Oct 2019
Image Captioning using Facial Expression and Attention
Image Captioning using Facial Expression and Attention
Omid Mohamad Nezami
Mark Dras
Stephen Wan
Cécile Paris
CVBM
17
8
0
08 Aug 2019
Trends in Integration of Vision and Language Research: A Survey of
  Tasks, Datasets, and Methods
Trends in Integration of Vision and Language Research: A Survey of Tasks, Datasets, and Methods
Aditya Mogadala
M. Kalimuthu
Dietrich Klakow
VLM
15
132
0
22 Jul 2019
Remaining Useful Lifetime Prediction via Deep Domain Adaptation
Remaining Useful Lifetime Prediction via Deep Domain Adaptation
P. Costa
A. Akçay
Yingqian Zhang
U. Kaymak
AI4CE
14
258
0
17 Jul 2019
"My Way of Telling a Story": Persona based Grounded Story Generation
"My Way of Telling a Story": Persona based Grounded Story Generation
Shrimai Prabhumoye
Khyathi Raghavi Chandu
Ruslan Salakhutdinov
A. Black
13
35
0
14 Jun 2019
A Review of Deep Learning with Special Emphasis on Architectures,
  Applications and Recent Trends
A Review of Deep Learning with Special Emphasis on Architectures, Applications and Recent Trends
Saptarshi Sengupta
Sanchita Basak
P. Saikia
Sayak Paul
Vasilios Tsalavoutis
Frederick Ditliac Atiah
V. Ravi
R. Peters
AI4CE
18
324
0
30 May 2019
SuperCaptioning: Image Captioning Using Two-dimensional Word Embedding
SuperCaptioning: Image Captioning Using Two-dimensional Word Embedding
Baohua Sun
L. Yang
Michael Lin
Charles Young
Patrick Dong
Wenhan Zhang
Jason Dong
VLM
8
8
0
25 May 2019
Deep Unified Multimodal Embeddings for Understanding both Content and
  Users in Social Media Networks
Deep Unified Multimodal Embeddings for Understanding both Content and Users in Social Media Networks
Karan Sikka
Lucas Van Bramer
Ajay Divakaran
12
1
0
17 May 2019
Scalable Deep Learning on Distributed Infrastructures: Challenges,
  Techniques and Tools
Scalable Deep Learning on Distributed Infrastructures: Challenges, Techniques and Tools
R. Mayer
Hans-Arno Jacobsen
GNN
19
186
0
27 Mar 2019
A Framework for Decoding Event-Related Potentials from Text
A Framework for Decoding Event-Related Potentials from Text
Shaorong Yan
A. White
14
0
0
27 Feb 2019
A Black-box Attack on Neural Networks Based on Swarm Evolutionary
  Algorithm
A Black-box Attack on Neural Networks Based on Swarm Evolutionary Algorithm
Xiaolei Liu
Yuheng Luo
Xiaosong Zhang
Qingxin Zhu
AAML
14
16
0
26 Jan 2019
A Survey of the Usages of Deep Learning in Natural Language Processing
A Survey of the Usages of Deep Learning in Natural Language Processing
Dan Otter
Julian R. Medina
Jugal Kalita
VLM
22
11
0
27 Jul 2018
Learning Attributes Equals Multi-Source Domain Generalization
Learning Attributes Equals Multi-Source Domain Generalization
Chuang Gan
Tianbao Yang
Boqing Gong
OOD
150
197
0
03 May 2016
Previous
12345