Image Captioning and Visual Question Answering Based on Attributes and External Knowledge

9 March 2016

Qi Wu

Chunhua Shen

A. Hengel

Peng Wang

A. Dick

Papers citing "Image Captioning and Visual Question Answering Based on Attributes and External Knowledge"

Think Hierarchically, Act Dynamically: Hierarchical Multi-modal Fusion and Reasoning for Vision-and-Language Navigation Junrong Yue Y. Zhang Chuan Qin Bo Li Xiaomin Lie Xinlei Yu Wenxin Zhang Zhendong Zhao 51 0 0 23 Apr 2025
The curse of language biases in remote sensing VQA: the role of spatial attributes, language diversity, and the need for clear evaluation Christel Chappuis Eliot Walt Vincent Mendez Sylvain Lobry B. L. Saux D. Tuia 23 3 0 28 Nov 2023
What do you MEME? Generating Explanations for Visual Semantic Role Labelling in Memes Shivam Sharma Siddhant Agarwal Tharun Suresh Preslav Nakov Md. Shad Akhtar Tanmoy Chakraborty VLM 20 18 0 01 Dec 2022
A survey on the development status and application prospects of knowledge graph in smart grids Jian Wang Xi Wang Chaoqun Ma Lei Kou 25 74 0 02 Nov 2022
Cross-modal Semantic Enhanced Interaction for Image-Sentence Retrieval Xuri Ge Fuhai Chen Songpei Xu Fuxiang Tao J. Jose 25 26 0 17 Oct 2022
Image Captioning based on Feature Refinement and Reflective Decoding G. Alabduljabbar Hafida Benhidour Said Kerrache 3DV 14 3 0 16 Jun 2022
Learning to Answer Questions in Dynamic Audio-Visual Scenarios Guangyao Li Yake Wei Yapeng Tian Chenliang Xu Ji-Rong Wen Di Hu 29 136 0 26 Mar 2022
ACORT: A Compact Object Relation Transformer for Parameter Efficient Image Captioning J. Tan Y. Tan C. Chan Joon Huang Chuah VLM ViT 24 15 0 11 Feb 2022
Knowledge-based Embodied Question Answering Sinan Tan Mengmeng Ge Di Guo Huaping Liu F. Sun 24 20 0 16 Sep 2021
DAFNe: A One-Stage Anchor-Free Approach for Oriented Object Detection Steven Lang Fabrizio G. Ventola Kristian Kersting 31 14 0 13 Sep 2021
Explain Me the Painting: Multi-Topic Knowledgeable Art Description Generation Zechen Bai Yuta Nakashima Noa Garcia 66 43 0 13 Sep 2021
Image-to-Image Retrieval by Learning Similarity between Scene Graphs Sangwoong Yoon Woo-Young Kang Sungwook Jeon SeongEun Lee C. Han Jonghun Park Eun-Sol Kim 3DH 29 39 0 29 Dec 2020
Dual ResGCN for Balanced Scene Graph Generation Jingyi Zhang Yong Zhang Baoyuan Wu Yanbo Fan Fumin Shen Heng Tao Shen 21 12 0 09 Nov 2020
New Ideas and Trends in Deep Multimodal Content Understanding: A Review Wei-Neng Chen Weiping Wang Li Liu M. Lew VLM 112 31 0 16 Oct 2020
Gaussian Smoothen Semantic Features (GSSF) -- Exploring the Linguistic Aspects of Visual Captioning in Indian Languages (Bengali) Using MSCOCO Framework C. Sur 11 7 0 16 Feb 2020
MRRC: Multiple Role Representation Crossover Interpretation for Image Captioning With R-CNN Feature Distribution Composition (FDC) C. Sur 25 16 0 15 Feb 2020
A Review on Intelligent Object Perception Methods Combining Knowledge-based Reasoning and Machine Learning Filippos Gouidis Alexandros Vassiliades T. Patkos Antonis Argyros Nick Bassiliades Dimitris Plexousakis OCL 29 12 0 26 Dec 2019
DualVD: An Adaptive Dual Encoding Model for Deep Visual Understanding in Visual Dialogue X. Jiang J. Yu Zengchang Qin Yingying Zhuang Xingxing Zhang Yue Hu Qi Wu 15 70 0 17 Nov 2019
Fashion IQ: A New Dataset Towards Retrieving Images by Natural Language Feedback Hui Wu Yupeng Gao Xiaoxiao Guo Ziad Al-Halah Steven J. Rennie Kristen Grauman Rogerio Feris EgoV 14 63 0 30 May 2019
Pedestrian Attribute Recognition: A Survey Xiao Wang Shaofei Zheng Rui Yang Aihua Zheng Zhe Chen Jin Tang B. Luo CVBM 26 127 0 22 Jan 2019
A Comprehensive Survey of Deep Learning for Image Captioning Md. Zakir Hossain Ferdous Sohel M. Shiratuddin Hamid Laga VLM 3DV 28 760 0 06 Oct 2018
R-VQA: Learning Visual Relation Facts with Semantic Attention for Visual Question Answering Pan Lu Lei Ji Wei Zhang Nan Duan M. Zhou Jianyong Wang CoGe 17 79 0 24 May 2018
Defoiling Foiled Image Captions Pranava Madhyastha Josiah Wang Lucia Specia 22 9 0 16 May 2018
AI Challenger : A Large-scale Dataset for Going Deeper in Image Understanding Jiahong Wu He Zheng Bo-Lu Zhao Yixin Li Baoming Yan ... Shipei Zhou G. Lin Yanwei Fu Yizhou Wang Yonggang Wang VLM 30 149 0 17 Nov 2017
End-to-end Concept Word Detection for Video Captioning, Retrieval, and Question Answering Youngjae Yu Hyungjin Ko Jongwook Choi Gunhee Kim 6 229 0 10 Oct 2016