Stacked Attention Networks for Image Question Answering

7 November 2015

Li Deng

Papers citing "Stacked Attention Networks for Image Question Answering"

50 / 217 papers shown

Title
Faithful Multimodal Explanation for Visual Question Answering Jialin Wu Raymond J. Mooney 11 90 0 08 Sep 2018
Interpretable Visual Question Answering by Reasoning on Dependency Trees Qingxing Cao Bailin Li Xiaodan Liang Liang Lin 27 55 0 06 Sep 2018
Image Classification for Arabic: Assessing the Accuracy of Direct English to Arabic Translations Abdulkareem Alsudais VLM 25 4 0 13 Jul 2018
Topic-Guided Attention for Image Captioning Zhihao Zhu Zhan Xue Zejian Yuan 16 23 0 10 Jul 2018
Learning Visual Knowledge Memory Networks for Visual Question Answering Zhou Su Chen Zhu Yinpeng Dong Dongqi Cai Yurong Chen Jianguo Li 31 62 0 13 Jun 2018
R-VQA: Learning Visual Relation Facts with Semantic Attention for Visual Question Answering Pan Lu Lei Ji Wei Zhang Nan Duan M. Zhou Jianyong Wang CoGe 17 79 0 24 May 2018
Stacked Semantic-Guided Attention Model for Fine-Grained Zero-Shot Learning Yunlong Yu Zhong Ji Yanwei Fu Jichang Guo Yanwei Pang Zhongfei Zhang VLM 16 27 0 21 May 2018
Deep Ordinal Hashing with Spatial Attention Lu Jin Xiangbo Shu Kai Li Zechao Li Guo-Jun Qi Jinhui Tang 35 78 0 07 May 2018
Learn To Pay Attention Saumya Jetley Nicholas A. Lord Namhoon Lee Philip H. S. Torr 64 437 0 06 Apr 2018
Improved Fusion of Visual and Language Representations by Dense Symmetric Co-Attention for Visual Question Answering Duy-Kien Nguyen Takayuki Okatani 22 279 0 03 Apr 2018
Unsupervised Textual Grounding: Linking Words to Image Concepts Raymond A. Yeh Minh Do A. Schwing 22 40 0 29 Mar 2018
Motion-Appearance Co-Memory Networks for Video Question Answering J. Gao Runzhou Ge Kan Chen Ram Nevatia 27 240 0 29 Mar 2018
Referring Relationships Ranjay Krishna Ines Chami Michael S. Bernstein Li Fei-Fei 22 94 0 28 Mar 2018
Transparency by Design: Closing the Gap Between Performance and Interpretability in Visual Reasoning David Mascharka Philip Tran Ryan Soklaski Arjun Majumdar 31 207 0 14 Mar 2018
Compositional Attention Networks for Machine Reasoning Drew A. Hudson Christopher D. Manning BDL OOD LRM 21 572 0 08 Mar 2018
Multimodal Explanations: Justifying Decisions and Pointing to the Evidence Dong Huk Park Lisa Anne Hendricks Zeynep Akata Anna Rohrbach Bernt Schiele Trevor Darrell Marcus Rohrbach 35 418 0 15 Feb 2018
Interactive Grounded Language Acquisition and Generalization in a 2D World Haonan Yu Haichao Zhang W. Xu LLMAG LM&Ro 14 77 0 31 Jan 2018
Tell-and-Answer: Towards Explainable Visual Question Answering using Attributes and Captions Qing Li Jianlong Fu D. Yu Tao Mei Jiebo Luo FAtt XAI CoGe 46 60 0 27 Jan 2018
DVQA: Understanding Data Visualizations via Question Answering Kushal Kafle Brian L. Price Scott D. Cohen Christopher Kanan AIMat 33 363 0 24 Jan 2018
Incorporating External Knowledge to Answer Open-Domain Visual Questions with Dynamic Memory Networks Guohao Li Hang Su Wenwu Zhu 28 46 0 03 Dec 2017
Don't Just Assume; Look and Answer: Overcoming Priors for Visual Question Answering Aishwarya Agrawal Dhruv Batra Devi Parikh Aniruddha Kembhavi OOD 51 581 0 01 Dec 2017
HashGAN:Attention-aware Deep Adversarial Hashing for Cross Modal Retrieval Xi Zhang Siyu Zhou Jiashi Feng Hanjiang Lai Bo Li Yan Pan Jian Yin Shuicheng Yan GAN 24 55 0 26 Nov 2017
Survey of Recent Advances in Visual Question Answering Supriya Pandhre Shagun Sodhani 8 14 0 24 Sep 2017
FiLM: Visual Reasoning with a General Conditioning Layer Ethan Perez Florian Strub H. D. Vries Vincent Dumoulin Aaron Courville FAtt AIMat OffRL AI4CE 70 2,144 0 22 Sep 2017
Multi-scale Deep Learning Architectures for Person Re-identification Xuelin Qian Yanwei Fu Yu-Gang Jiang Tao Xiang Xiangyang Xue 17 276 0 15 Sep 2017
Variational Reasoning for Question Answering with Knowledge Graph Yuyu Zhang H. Dai Zornitsa Kozareva Alex Smola Le Song 15 467 0 12 Sep 2017
VQS: Linking Segmentations to Questions and Answers for Supervised Attention in VQA and Question-Focused Semantic Segmentation Chuang Gan Yandong Li Haoxiang Li Chen Sun Boqing Gong 22 126 0 15 Aug 2017
Tips and Tricks for Visual Question Answering: Learnings from the 2017 Challenge Damien Teney Peter Anderson Xiaodong He A. Hengel 45 380 0 09 Aug 2017
GPLAC: Generalizing Vision-Based Robotic Skills using Weakly Labeled Images Avi Singh Larry Yang Sergey Levine 14 23 0 07 Aug 2017
Multi-modal Factorized Bilinear Pooling with Co-Attention Learning for Visual Question Answering Zhou Yu Jun-chen Yu Jianping Fan Dacheng Tao 41 663 0 04 Aug 2017
Dual-Glance Model for Deciphering Social Relationships Junnan Li Yongkang Wong Qi Zhao Mohan S. Kankanhalli 14 77 0 02 Aug 2017
MDNet: A Semantically and Visually Interpretable Medical Image Diagnosis Network Zizhao Zhang Yuanpu Xie Fuyong Xing M. McGough L. Yang MedIm 13 301 0 08 Jul 2017
Best of Both Worlds: Transferring Knowledge from Discriminative Learning to a Generative Visual Dialog Model Jiasen Lu A. Kannan Jianwei Yang Devi Parikh Dhruv Batra BDL 15 136 0 05 Jun 2017
Multimodal Machine Learning: A Survey and Taxonomy T. Baltrušaitis Chaitanya Ahuja Louis-Philippe Morency 13 2,856 0 26 May 2017
MUTAN: Multimodal Tucker Fusion for Visual Question Answering H. Ben-younes Rémi Cadène Matthieu Cord Nicolas Thome 44 578 0 18 May 2017
The Forgettable-Watcher Model for Video Question Answering Hongyang Xue Zhou Zhao Deng Cai 19 9 0 03 May 2017
AMC: Attention guided Multi-modal Correlation Learning for Image Search Kan Chen Trung Bui Chen Fang Zhaowen Wang Ram Nevatia 35 38 0 03 Apr 2017
It Takes Two to Tango: Towards Theory of AI's Mind Arjun Chandrasekaran Deshraj Yadav Prithvijit Chattopadhyay Viraj Prabhu Devi Parikh 28 53 0 03 Apr 2017
An Analysis of Visual Question Answering Algorithms Kushal Kafle Christopher Kanan 19 230 0 28 Mar 2017
Recurrent Multimodal Interaction for Referring Image Segmentation Chenxi Liu Zhe-nan Lin Xiaohui Shen Jimei Yang Xin Lu Alan Yuille EgoV 36 234 0 23 Mar 2017
VQABQ: Visual Question Answering by Basic Questions Jia-Hong Huang Modar Alfadly Bernard Ghanem 9 24 0 19 Mar 2017
Multi-Context Attention for Human Pose Estimation Xiao Chu Wei Yang Wanli Ouyang Cheng Ma Alan Yuille Xiaogang Wang 3DH 16 640 0 24 Feb 2017
Task-driven Visual Saliency and Attention-based Visual Question Answering Yuetan Lin Zhangyang Pang Donghui Wang Yueting Zhuang 29 26 0 22 Feb 2017
Learning Spatial Regularization with Image-level Supervisions for Multi-label Image Classification Feng Zhu Hongsheng Li Wanli Ouyang Nenghai Yu Xiaogang Wang 19 337 0 20 Feb 2017
Person Search with Natural Language Description Shuang Li Tong Xiao Hongsheng Li Bolei Zhou Dayu Yue Xiaogang Wang 19 385 0 19 Feb 2017
Deep Reinforcement Learning: An Overview Yuxi Li OffRL VLM 104 1,502 0 25 Jan 2017
Aspect-augmented Adversarial Networks for Domain Adaptation Yuan Zhang Regina Barzilay Tommi Jaakkola 30 96 0 01 Jan 2017
The VQA-Machine: Learning How to Use Existing Vision Algorithms to Answer New Questions Peng Wang Qi Wu Chunhua Shen A. Hengel OOD 18 86 0 16 Dec 2016
Attentive Explanations: Justifying Decisions and Pointing to the Evidence Dong Huk Park Lisa Anne Hendricks Zeynep Akata Bernt Schiele Trevor Darrell Marcus Rohrbach AAML 16 79 0 14 Dec 2016
Paying More Attention to Attention: Improving the Performance of Convolutional Neural Networks via Attention Transfer Sergey Zagoruyko N. Komodakis 14 2,546 0 12 Dec 2016