1511.02274
Stacked Attention Networks for Image Question Answering
7 November 2015
Zichao Yang
Xiaodong He
Jianfeng Gao
Li Deng
Alex Smola
BDL
Papers citing
"Stacked Attention Networks for Image Question Answering"
50 / 217 papers shown
Faithful Multimodal Explanation for Visual Question Answering
Jialin Wu
Raymond J. Mooney
11
90
0
08 Sep 2018
Interpretable Visual Question Answering by Reasoning on Dependency Trees
Qingxing Cao
Bailin Li
Xiaodan Liang
Liang Lin
27
55
0
06 Sep 2018
Image Classification for Arabic: Assessing the Accuracy of Direct English to Arabic Translations
Abdulkareem Alsudais
VLM
25
4
0
13 Jul 2018
Topic-Guided Attention for Image Captioning
Zhihao Zhu
Zhan Xue
Zejian Yuan
16
23
0
10 Jul 2018
Learning Visual Knowledge Memory Networks for Visual Question Answering
Zhou Su
Chen Zhu
Yinpeng Dong
Dongqi Cai
Yurong Chen
Jianguo Li
31
62
0
13 Jun 2018
R-VQA: Learning Visual Relation Facts with Semantic Attention for Visual Question Answering
Pan Lu
Lei Ji
Wei Zhang
Nan Duan
M. Zhou
Jianyong Wang
CoGe
17
79
0
24 May 2018
Stacked Semantic-Guided Attention Model for Fine-Grained Zero-Shot Learning
Yunlong Yu
Zhong Ji
Yanwei Fu
Jichang Guo
Yanwei Pang
Zhongfei Zhang
VLM
16
27
0
21 May 2018
Deep Ordinal Hashing with Spatial Attention
Lu Jin
Xiangbo Shu
Kai Li
Zechao Li
Guo-Jun Qi
Jinhui Tang
35
78
0
07 May 2018
Learn To Pay Attention
Saumya Jetley
Nicholas A. Lord
Namhoon Lee
Philip H. S. Torr
64
437
0
06 Apr 2018
Improved Fusion of Visual and Language Representations by Dense Symmetric Co-Attention for Visual Question Answering
Duy-Kien Nguyen
Takayuki Okatani
22
279
0
03 Apr 2018
Unsupervised Textual Grounding: Linking Words to Image Concepts
Raymond A. Yeh
Minh Do
A. Schwing
22
40
0
29 Mar 2018
Motion-Appearance Co-Memory Networks for Video Question Answering
J. Gao
Runzhou Ge
Kan Chen
Ram Nevatia
27
240
0
29 Mar 2018
Referring Relationships
Ranjay Krishna
Ines Chami
Michael S. Bernstein
Li Fei-Fei
22
94
0
28 Mar 2018
Transparency by Design: Closing the Gap Between Performance and Interpretability in Visual Reasoning
David Mascharka
Philip Tran
Ryan Soklaski
Arjun Majumdar
31
207
0
14 Mar 2018
Compositional Attention Networks for Machine Reasoning
Drew A. Hudson
Christopher D. Manning
BDL
OOD
LRM
21
572
0
08 Mar 2018
Multimodal Explanations: Justifying Decisions and Pointing to the Evidence
Dong Huk Park
Lisa Anne Hendricks
Zeynep Akata
Anna Rohrbach
Bernt Schiele
Trevor Darrell
Marcus Rohrbach
35
418
0
15 Feb 2018
Interactive Grounded Language Acquisition and Generalization in a 2D World
Haonan Yu
Haichao Zhang
W. Xu
LLMAG
LM&Ro
14
77
0
31 Jan 2018
Tell-and-Answer: Towards Explainable Visual Question Answering using Attributes and Captions
Qing Li
Jianlong Fu
D. Yu
Tao Mei
Jiebo Luo
FAtt
XAI
CoGe
46
60
0
27 Jan 2018
DVQA: Understanding Data Visualizations via Question Answering
Kushal Kafle
Brian L. Price
Scott D. Cohen
Christopher Kanan
AIMat
33
363
0
24 Jan 2018
Incorporating External Knowledge to Answer Open-Domain Visual Questions with Dynamic Memory Networks
Guohao Li
Hang Su
Wenwu Zhu
28
46
0
03 Dec 2017
Don't Just Assume; Look and Answer: Overcoming Priors for Visual Question Answering
Aishwarya Agrawal
Dhruv Batra
Devi Parikh
Aniruddha Kembhavi
OOD
51
581
0
01 Dec 2017
HashGAN:Attention-aware Deep Adversarial Hashing for Cross Modal Retrieval
Xi Zhang
Siyu Zhou
Jiashi Feng
Hanjiang Lai
Bo Li
Yan Pan
Jian Yin
Shuicheng Yan
GAN
24
55
0
26 Nov 2017
Survey of Recent Advances in Visual Question Answering
Supriya Pandhre
Shagun Sodhani
8
14
0
24 Sep 2017
FiLM: Visual Reasoning with a General Conditioning Layer
Ethan Perez
Florian Strub
H. D. Vries
Vincent Dumoulin
Aaron Courville
FAtt
AIMat
OffRL
AI4CE
70
2,144
0
22 Sep 2017
Multi-scale Deep Learning Architectures for Person Re-identification
Xuelin Qian
Yanwei Fu
Yu-Gang Jiang
Tao Xiang
Xiangyang Xue
17
276
0
15 Sep 2017
Variational Reasoning for Question Answering with Knowledge Graph
Yuyu Zhang
H. Dai
Zornitsa Kozareva
Alex Smola
Le Song
15
467
0
12 Sep 2017
VQS: Linking Segmentations to Questions and Answers for Supervised Attention in VQA and Question-Focused Semantic Segmentation
Chuang Gan
Yandong Li
Haoxiang Li
Chen Sun
Boqing Gong
22
126
0
15 Aug 2017
Tips and Tricks for Visual Question Answering: Learnings from the 2017 Challenge
Damien Teney
Peter Anderson
Xiaodong He
A. Hengel
45
380
0
09 Aug 2017
GPLAC: Generalizing Vision-Based Robotic Skills using Weakly Labeled Images
Avi Singh
Larry Yang
Sergey Levine
14
23
0
07 Aug 2017
Multi-modal Factorized Bilinear Pooling with Co-Attention Learning for Visual Question Answering
Zhou Yu
Jun-chen Yu
Jianping Fan
Dacheng Tao
41
663
0
04 Aug 2017
Dual-Glance Model for Deciphering Social Relationships
Junnan Li
Yongkang Wong
Qi Zhao
Mohan S. Kankanhalli
14
77
0
02 Aug 2017
MDNet: A Semantically and Visually Interpretable Medical Image Diagnosis Network
Zizhao Zhang
Yuanpu Xie
Fuyong Xing
M. McGough
L. Yang
MedIm
13
301
0
08 Jul 2017
Best of Both Worlds: Transferring Knowledge from Discriminative Learning to a Generative Visual Dialog Model
Jiasen Lu
A. Kannan
Jianwei Yang
Devi Parikh
Dhruv Batra
BDL
15
136
0
05 Jun 2017
Multimodal Machine Learning: A Survey and Taxonomy
T. Baltrušaitis
Chaitanya Ahuja
Louis-Philippe Morency
13
2,856
0
26 May 2017
MUTAN: Multimodal Tucker Fusion for Visual Question Answering
H. Ben-younes
Rémi Cadène
Matthieu Cord
Nicolas Thome
44
578
0
18 May 2017
The Forgettable-Watcher Model for Video Question Answering
Hongyang Xue
Zhou Zhao
Deng Cai
19
9
0
03 May 2017
AMC: Attention guided Multi-modal Correlation Learning for Image Search
Kan Chen
Trung Bui
Chen Fang
Zhaowen Wang
Ram Nevatia
35
38
0
03 Apr 2017
It Takes Two to Tango: Towards Theory of AI's Mind
Arjun Chandrasekaran
Deshraj Yadav
Prithvijit Chattopadhyay
Viraj Prabhu
Devi Parikh
28
53
0
03 Apr 2017
An Analysis of Visual Question Answering Algorithms
Kushal Kafle
Christopher Kanan
19
230
0
28 Mar 2017
Recurrent Multimodal Interaction for Referring Image Segmentation
Chenxi Liu
Zhe-nan Lin
Xiaohui Shen
Jimei Yang
Xin Lu
Alan Yuille
EgoV
36
234
0
23 Mar 2017
VQABQ: Visual Question Answering by Basic Questions
Jia-Hong Huang
Modar Alfadly
Bernard Ghanem
9
24
0
19 Mar 2017
Multi-Context Attention for Human Pose Estimation
Xiao Chu
Wei Yang
Wanli Ouyang
Cheng Ma
Alan Yuille
Xiaogang Wang
3DH
16
640
0
24 Feb 2017
Task-driven Visual Saliency and Attention-based Visual Question Answering
Yuetan Lin
Zhangyang Pang
Donghui Wang
Yueting Zhuang
29
26
0
22 Feb 2017
Learning Spatial Regularization with Image-level Supervisions for Multi-label Image Classification
Feng Zhu
Hongsheng Li
Wanli Ouyang
Nenghai Yu
Xiaogang Wang
19
337
0
20 Feb 2017
Person Search with Natural Language Description
Shuang Li
Tong Xiao
Hongsheng Li
Bolei Zhou
Dayu Yue
Xiaogang Wang
19
385
0
19 Feb 2017
Deep Reinforcement Learning: An Overview
Yuxi Li
OffRL
VLM
104
1,502
0
25 Jan 2017
Aspect-augmented Adversarial Networks for Domain Adaptation
Yuan Zhang
Regina Barzilay
Tommi Jaakkola
30
96
0
01 Jan 2017
The VQA-Machine: Learning How to Use Existing Vision Algorithms to Answer New Questions
Peng Wang
Qi Wu
Chunhua Shen
A. Hengel
OOD
18
86
0
16 Dec 2016
Attentive Explanations: Justifying Decisions and Pointing to the Evidence
Dong Huk Park
Lisa Anne Hendricks
Zeynep Akata
Bernt Schiele
Trevor Darrell
Marcus Rohrbach
AAML
16
79
0
14 Dec 2016
Paying More Attention to Attention: Improving the Performance of Convolutional Neural Networks via Attention Transfer
Sergey Zagoruyko
N. Komodakis
14
2,546
0
12 Dec 2016