ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1711.09151
  4. Cited By
Convolutional Image Captioning

Convolutional Image Captioning

24 November 2017
J. Aneja
Aditya Deshpande
Alex Schwing
    VLM
ArXiv (abs)PDFHTML

Papers citing "Convolutional Image Captioning"

50 / 105 papers shown
Title
ReCap: Event-Aware Image Captioning with Article Retrieval and Semantic Gaussian Normalization
ReCap: Event-Aware Image Captioning with Article Retrieval and Semantic Gaussian Normalization
T. Nguyen
Thanh-Tung Phan-Nguyen
Gia-Huy Dinh
Lam-Huy Nguyen
M. Tran
T. Le
60
0
0
01 Sep 2025
Group-based Distinctive Image Captioning with Memory Difference Encoding and Attention
Group-based Distinctive Image Captioning with Memory Difference Encoding and AttentionInternational Journal of Computer Vision (IJCV), 2024
Jiuniu Wang
Wenjia Xu
Qingzhong Wang
Antoni B. Chan
315
0
0
03 Apr 2025
ChatBEV: A Visual Language Model that Understands BEV Maps
ChatBEV: A Visual Language Model that Understands BEV Maps
Qingyao Xu
Tian Jin
Guang Chen
Yanfeng Wang
Yujiao Shi
307
2
0
18 Mar 2025
Pixels to Prose: Understanding the art of Image Captioning
Pixels to Prose: Understanding the art of Image Captioning
Hrishikesh Singh
Aarti Sharma
Millie Pant
3DVVLM
186
2
0
28 Aug 2024
Surveying the Landscape of Image Captioning Evaluation: A Comprehensive Taxonomy, Trends and Metrics Analysis
Surveying the Landscape of Image Captioning Evaluation: A Comprehensive Taxonomy, Trends and Metrics Analysis
Uri Berger
Gabriel Stanovsky
Omri Abend
Lea Frermann
328
0
0
09 Aug 2024
Compressed Image Captioning using CNN-based Encoder-Decoder Framework
Compressed Image Captioning using CNN-based Encoder-Decoder Framework
Md Alif
Mahmudul Hasan
Shovon Bhowmick
195
2
0
28 Apr 2024
Context-Guided Spatio-Temporal Video Grounding
Context-Guided Spatio-Temporal Video GroundingComputer Vision and Pattern Recognition (CVPR), 2024
Xin Gu
Hengrui Fan
Yan Huang
Tiejian Luo
Libo Zhang
228
37
0
03 Jan 2024
Survey of Social Bias in Vision-Language Models
Survey of Social Bias in Vision-Language Models
Nayeon Lee
Yejin Bang
Holy Lovenia
Samuel Cahyawijaya
Wenliang Dai
Pascale Fung
VLM
337
29
0
24 Sep 2023
Diagnosing Human-object Interaction Detectors
Diagnosing Human-object Interaction DetectorsInternational Journal of Computer Vision (IJCV), 2023
Fangrui Zhu
Yiming Xie
Weidi Xie
Huaizu Jiang
197
10
0
16 Aug 2023
MMNet: Multi-Collaboration and Multi-Supervision Network for Sequential
  Deepfake Detection
MMNet: Multi-Collaboration and Multi-Supervision Network for Sequential Deepfake DetectionIEEE Transactions on Information Forensics and Security (IEEE TIFS), 2023
Ruiyang Xia
Decheng Liu
Jie Li
Lin Yuan
N. Wang
Xinbo Gao
139
34
0
06 Jul 2023
GEST: the Graph of Events in Space and Time as a Common Representation
  between Vision and Language
GEST: the Graph of Events in Space and Time as a Common Representation between Vision and Language
Mihai Masala
Nicolae Cudlenco
Traian Rebedea
Marius Leordeanu
152
0
0
22 May 2023
Image-to-Text Translation for Interactive Image Recognition: A
  Comparative User Study with Non-Expert Users
Image-to-Text Translation for Interactive Image Recognition: A Comparative User Study with Non-Expert UsersJournal of Information Processing (JIP), 2023
Wataru Kawabe
Yusuke Sugano
VLM
136
2
0
11 May 2023
Multi-modal Machine Learning in Engineering Design: A Review and Future
  Directions
Multi-modal Machine Learning in Engineering Design: A Review and Future DirectionsJournal of Computing and Information Science in Engineering (JCISE), 2023
Binyang Song
Ruilin Zhou
Faez Ahmed
AI4CE
312
63
0
14 Feb 2023
Overcoming Catastrophic Forgetting by XAI
Overcoming Catastrophic Forgetting by XAI
Giang Nguyen
162
0
0
25 Nov 2022
Improving Radiology Summarization with Radiograph and Anatomy Prompts
Improving Radiology Summarization with Radiograph and Anatomy PromptsAnnual Meeting of the Association for Computational Linguistics (ACL), 2022
Jinpeng Hu
Zhihong Chen
Yang Liu
Xiang Wan
Tsung-Hui Chang
MedIm
148
10
0
15 Oct 2022
M^4I: Multi-modal Models Membership Inference
M^4I: Multi-modal Models Membership InferenceNeural Information Processing Systems (NeurIPS), 2022
Pingyi Hu
Zihan Wang
Ruoxi Sun
Hu Wang
Minhui Xue
189
35
0
15 Sep 2022
Facial Expression Recognition and Image Description Generation in
  Vietnamese
Facial Expression Recognition and Image Description Generation in VietnameseFuzzy Systems and Data Mining (FSDM), 2022
Khang Nhut Lam
Kim Thi-Thanh Nguyen
Loc Huu Nguy
Jugal Kalita
3DHCVBM
150
1
0
12 Aug 2022
Aesthetic Attributes Assessment of Images with AMANv2 and DPC-CaptionsV2
Aesthetic Attributes Assessment of Images with AMANv2 and DPC-CaptionsV2
Xinghui Zhou
Xin Jin
Jianwen Lv
Heng Huang
Ming Mao
Shuai Cui
CoGe
99
0
0
09 Aug 2022
Retrieval-Augmented Transformer for Image Captioning
Retrieval-Augmented Transformer for Image CaptioningInternational Conference on Content-Based Multimedia Indexing (CBMI), 2022
Sara Sarto
Marcella Cornia
Lorenzo Baraldi
Rita Cucchiara
168
68
0
26 Jul 2022
Are metrics measuring what they should? An evaluation of image
  captioning task metrics
Are metrics measuring what they should? An evaluation of image captioning task metricsSignal processing. Image communication (SPIC), 2022
Othón González-Chávez
Guillermo Ruiz
Daniela Moctezuma
Tania A. Ramirez-delreal
199
9
0
04 Jul 2022
Measuring Representational Harms in Image Captioning
Measuring Representational Harms in Image CaptioningConference on Fairness, Accountability and Transparency (FAccT), 2022
Angelina Wang
Solon Barocas
Kristen Laird
Hanna M. Wallach
194
60
0
14 Jun 2022
Beyond Greedy Search: Tracking by Multi-Agent Reinforcement
  Learning-based Beam Search
Beyond Greedy Search: Tracking by Multi-Agent Reinforcement Learning-based Beam SearchIEEE Transactions on Image Processing (IEEE TIP), 2022
Tianlin Li
Zhe Chen
Bo Jiang
Jin Tang
Bin Luo
Dacheng Tao
296
21
0
19 May 2022
Diverse Image Captioning with Grounded Style
Diverse Image Captioning with Grounded StyleGerman Conference on Pattern Recognition (GCPR), 2022
Franz Klein
Shweta Mahajan
S. Roth
159
8
0
03 May 2022
Controllable Image Captioning
Luka Maxwell
280
0
0
28 Apr 2022
On Distinctive Image Captioning via Comparing and Reweighting
On Distinctive Image Captioning via Comparing and ReweightingIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Jiuniu Wang
Wenjia Xu
Qingzhong Wang
Antoni B. Chan
153
23
0
08 Apr 2022
CaMEL: Mean Teacher Learning for Image Captioning
CaMEL: Mean Teacher Learning for Image CaptioningInternational Conference on Pattern Recognition (ICPR), 2022
Manuele Barraco
Matteo Stefanini
Marcella Cornia
S. Cascianelli
Lorenzo Baraldi
Rita Cucchiara
ViTVLM
180
37
0
21 Feb 2022
Deep Learning Approaches on Image Captioning: A Review
Deep Learning Approaches on Image Captioning: A ReviewACM Computing Surveys (ACM CSUR), 2022
Taraneh Ghandi
H. Pourreza
H. Mahyar
VLM
310
141
0
31 Jan 2022
Do Smart Glasses Dream of Sentimental Visions? Deep Emotionship Analysis
  for Eyewear Devices
Do Smart Glasses Dream of Sentimental Visions? Deep Emotionship Analysis for Eyewear DevicesProceedings of the ACM on Interactive Mobile Wearable and Ubiquitous Technologies (IMWUT), 2022
Yingying Zhao
Yuhu Chang
Yutian Lu
Yujiang Wang
Mingzhi Dong
...
Robert P. Dick
Fan Yang
Tun Lu
Ning Gu
L. Shang
172
15
0
24 Jan 2022
An Integrated Approach for Video Captioning and Applications
An Integrated Approach for Video Captioning and Applications
Soheyla Amirian
T. Taha
Khaled Rasheed
H. Arabnia
117
1
0
23 Jan 2022
Interactive Attention AI to translate low light photos to captions for
  night scene understanding in women safety
Interactive Attention AI to translate low light photos to captions for night scene understanding in women safety
A. Rajagopal
V. Nirmala
Arun Muthuraj Vedamanickam
152
0
0
04 Jan 2022
Neural Attention for Image Captioning: Review of Outstanding Methods
Neural Attention for Image Captioning: Review of Outstanding Methods
Zanyar Zohourianshahzadi
Jugal Kalita
VLM
168
57
0
29 Nov 2021
ZeroCap: Zero-Shot Image-to-Text Generation for Visual-Semantic
  Arithmetic
ZeroCap: Zero-Shot Image-to-Text Generation for Visual-Semantic ArithmeticComputer Vision and Pattern Recognition (CVPR), 2021
Yoad Tewel
Yoav Shalev
Idan Schwartz
Lior Wolf
VLM
295
230
0
29 Nov 2021
Cross Modification Attention Based Deliberation Model for Image
  Captioning
Cross Modification Attention Based Deliberation Model for Image Captioning
Zheng Lian
Yanan Zhang
Haichang Li
Rui Wang
Xiaohui Hu
102
7
0
17 Sep 2021
Bornon: Bengali Image Captioning with Transformer-based Deep learning
  approach
Bornon: Bengali Image Captioning with Transformer-based Deep learning approach
Faisal Muhammad Shah
Mayeesha Humaira
Md Abidur Rahman Khan Jim
Amit Saha Ami
Shimul Paul
103
20
0
11 Sep 2021
Journalistic Guidelines Aware News Image Captioning
Journalistic Guidelines Aware News Image CaptioningConference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Xuewen Yang
Svebor Karaman
Joel R. Tetreault
Alex Jaimes
206
32
0
07 Sep 2021
Group-based Distinctive Image Captioning with Memory Attention
Group-based Distinctive Image Captioning with Memory Attention
Jiuniu Wang
Wenjia Xu
Qingzhong Wang
Antoni B. Chan
168
19
0
20 Aug 2021
X-modaler: A Versatile and High-performance Codebase for Cross-modal
  Analytics
X-modaler: A Versatile and High-performance Codebase for Cross-modal Analytics
Yehao Li
Yingwei Pan
Jingwen Chen
Ting Yao
Tao Mei
VLM
166
36
0
18 Aug 2021
Image Retrieval on Real-life Images with Pre-trained Vision-and-Language
  Models
Image Retrieval on Real-life Images with Pre-trained Vision-and-Language ModelsIEEE International Conference on Computer Vision (ICCV), 2021
Zheyuan Liu
Cristian Rodriguez-Opazo
Damien Teney
Stephen Gould
VLM
238
282
0
09 Aug 2021
ReFormer: The Relational Transformer for Image Captioning
ReFormer: The Relational Transformer for Image CaptioningACM Multimedia (ACM MM), 2021
Xuewen Yang
Yingru Liu
Xin Wang
ViT
183
62
0
29 Jul 2021
From Show to Tell: A Survey on Deep Learning-based Image Captioning
From Show to Tell: A Survey on Deep Learning-based Image CaptioningIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021
Matteo Stefanini
Marcella Cornia
Lorenzo Baraldi
S. Cascianelli
G. Fiameni
Rita Cucchiara
3DVVLMMLLM
379
342
0
14 Jul 2021
Multi-Modal Image Captioning for the Visually Impaired
Multi-Modal Image Captioning for the Visually ImpairedNorth American Chapter of the Association for Computational Linguistics (NAACL), 2021
Hiba Ahsan
Nikita Bhalla
Daivat Bhatt
Kaivankumar Shah
147
27
0
17 May 2021
Discrete-continuous Action Space Policy Gradient-based Attention for
  Image-Text Matching
Discrete-continuous Action Space Policy Gradient-based Attention for Image-Text MatchingComputer Vision and Pattern Recognition (CVPR), 2021
Shiyang Yan
Li Yu
Yuan Xie
234
34
0
21 Apr 2021
Automatic Generation of Descriptive Titles for Video Clips Using Deep
  Learning
Automatic Generation of Descriptive Titles for Video Clips Using Deep Learning
Soheyla Amirian
Khaled Rasheed
T. Taha
H. Arabnia
VLMVGen
95
26
0
07 Apr 2021
Dynamic Attention guided Multi-Trajectory Analysis for Single Object
  Tracking
Dynamic Attention guided Multi-Trajectory Analysis for Single Object Tracking
Tianlin Li
Zhe Chen
Jin Tang
Bin Luo
Yaowei Wang
Yonghong Tian
Feng Wu
182
49
0
30 Mar 2021
Analysis of Convolutional Decoder for Image Caption Generation
Analysis of Convolutional Decoder for Image Caption Generation
Sulabh Katiyar
S. Borgohain
99
0
0
08 Mar 2021
Comparative evaluation of CNN architectures for Image Caption Generation
Comparative evaluation of CNN architectures for Image Caption GenerationInternational Journal of Advanced Computer Science and Applications (IJACSA), 2021
Sulabh Katiyar
S. Borgohain
111
27
0
23 Feb 2021
Image Captioning using Deep Stacked LSTMs, Contextual Word Embeddings
  and Data Augmentation
Image Captioning using Deep Stacked LSTMs, Contextual Word Embeddings and Data Augmentation
Sulabh Katiyar
S. Borgohain
VLM
128
15
0
22 Feb 2021
Intrinsic Image Captioning Evaluation
Intrinsic Image Captioning Evaluation
Chao Zeng
Sam Kwong
83
1
0
14 Dec 2020
Robust Image Captioning
Robust Image Captioning
Daniel Yarnell
Xian Wang
93
0
0
06 Dec 2020
Dual Attention on Pyramid Feature Maps for Image Captioning
Dual Attention on Pyramid Feature Maps for Image CaptioningIEEE transactions on multimedia (TMM), 2020
Litao Yu
Jian Zhang
Qiang Wu
280
56
0
02 Nov 2020
123
Next