ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1511.02274
  4. Cited By
Stacked Attention Networks for Image Question Answering

Stacked Attention Networks for Image Question Answering

7 November 2015
Zichao Yang
Xiaodong He
Jianfeng Gao
Li Deng
Alex Smola
    BDL
ArXivPDFHTML

Papers citing "Stacked Attention Networks for Image Question Answering"

50 / 217 papers shown
Title
Faithful Multimodal Explanation for Visual Question Answering
Faithful Multimodal Explanation for Visual Question Answering
Jialin Wu
Raymond J. Mooney
11
90
0
08 Sep 2018
Interpretable Visual Question Answering by Reasoning on Dependency Trees
Interpretable Visual Question Answering by Reasoning on Dependency Trees
Qingxing Cao
Bailin Li
Xiaodan Liang
Liang Lin
27
55
0
06 Sep 2018
Image Classification for Arabic: Assessing the Accuracy of Direct
  English to Arabic Translations
Image Classification for Arabic: Assessing the Accuracy of Direct English to Arabic Translations
Abdulkareem Alsudais
VLM
25
4
0
13 Jul 2018
Topic-Guided Attention for Image Captioning
Topic-Guided Attention for Image Captioning
Zhihao Zhu
Zhan Xue
Zejian Yuan
16
23
0
10 Jul 2018
Learning Visual Knowledge Memory Networks for Visual Question Answering
Learning Visual Knowledge Memory Networks for Visual Question Answering
Zhou Su
Chen Zhu
Yinpeng Dong
Dongqi Cai
Yurong Chen
Jianguo Li
31
62
0
13 Jun 2018
R-VQA: Learning Visual Relation Facts with Semantic Attention for Visual
  Question Answering
R-VQA: Learning Visual Relation Facts with Semantic Attention for Visual Question Answering
Pan Lu
Lei Ji
Wei Zhang
Nan Duan
M. Zhou
Jianyong Wang
CoGe
17
79
0
24 May 2018
Stacked Semantic-Guided Attention Model for Fine-Grained Zero-Shot
  Learning
Stacked Semantic-Guided Attention Model for Fine-Grained Zero-Shot Learning
Yunlong Yu
Zhong Ji
Yanwei Fu
Jichang Guo
Yanwei Pang
Zhongfei Zhang
VLM
16
27
0
21 May 2018
Deep Ordinal Hashing with Spatial Attention
Deep Ordinal Hashing with Spatial Attention
Lu Jin
Xiangbo Shu
Kai Li
Zechao Li
Guo-Jun Qi
Jinhui Tang
35
78
0
07 May 2018
Learn To Pay Attention
Learn To Pay Attention
Saumya Jetley
Nicholas A. Lord
Namhoon Lee
Philip H. S. Torr
64
437
0
06 Apr 2018
Improved Fusion of Visual and Language Representations by Dense
  Symmetric Co-Attention for Visual Question Answering
Improved Fusion of Visual and Language Representations by Dense Symmetric Co-Attention for Visual Question Answering
Duy-Kien Nguyen
Takayuki Okatani
22
279
0
03 Apr 2018
Unsupervised Textual Grounding: Linking Words to Image Concepts
Unsupervised Textual Grounding: Linking Words to Image Concepts
Raymond A. Yeh
Minh Do
A. Schwing
22
40
0
29 Mar 2018
Motion-Appearance Co-Memory Networks for Video Question Answering
Motion-Appearance Co-Memory Networks for Video Question Answering
J. Gao
Runzhou Ge
Kan Chen
Ram Nevatia
27
240
0
29 Mar 2018
Referring Relationships
Referring Relationships
Ranjay Krishna
Ines Chami
Michael S. Bernstein
Li Fei-Fei
22
94
0
28 Mar 2018
Transparency by Design: Closing the Gap Between Performance and
  Interpretability in Visual Reasoning
Transparency by Design: Closing the Gap Between Performance and Interpretability in Visual Reasoning
David Mascharka
Philip Tran
Ryan Soklaski
Arjun Majumdar
31
207
0
14 Mar 2018
Compositional Attention Networks for Machine Reasoning
Compositional Attention Networks for Machine Reasoning
Drew A. Hudson
Christopher D. Manning
BDL
OOD
LRM
21
572
0
08 Mar 2018
Multimodal Explanations: Justifying Decisions and Pointing to the
  Evidence
Multimodal Explanations: Justifying Decisions and Pointing to the Evidence
Dong Huk Park
Lisa Anne Hendricks
Zeynep Akata
Anna Rohrbach
Bernt Schiele
Trevor Darrell
Marcus Rohrbach
35
418
0
15 Feb 2018
Interactive Grounded Language Acquisition and Generalization in a 2D
  World
Interactive Grounded Language Acquisition and Generalization in a 2D World
Haonan Yu
Haichao Zhang
W. Xu
LLMAG
LM&Ro
14
77
0
31 Jan 2018
Tell-and-Answer: Towards Explainable Visual Question Answering using
  Attributes and Captions
Tell-and-Answer: Towards Explainable Visual Question Answering using Attributes and Captions
Qing Li
Jianlong Fu
D. Yu
Tao Mei
Jiebo Luo
FAtt
XAI
CoGe
46
60
0
27 Jan 2018
DVQA: Understanding Data Visualizations via Question Answering
DVQA: Understanding Data Visualizations via Question Answering
Kushal Kafle
Brian L. Price
Scott D. Cohen
Christopher Kanan
AIMat
33
363
0
24 Jan 2018
Incorporating External Knowledge to Answer Open-Domain Visual Questions
  with Dynamic Memory Networks
Incorporating External Knowledge to Answer Open-Domain Visual Questions with Dynamic Memory Networks
Guohao Li
Hang Su
Wenwu Zhu
28
46
0
03 Dec 2017
Don't Just Assume; Look and Answer: Overcoming Priors for Visual
  Question Answering
Don't Just Assume; Look and Answer: Overcoming Priors for Visual Question Answering
Aishwarya Agrawal
Dhruv Batra
Devi Parikh
Aniruddha Kembhavi
OOD
51
581
0
01 Dec 2017
HashGAN:Attention-aware Deep Adversarial Hashing for Cross Modal
  Retrieval
HashGAN:Attention-aware Deep Adversarial Hashing for Cross Modal Retrieval
Xi Zhang
Siyu Zhou
Jiashi Feng
Hanjiang Lai
Bo Li
Yan Pan
Jian Yin
Shuicheng Yan
GAN
24
55
0
26 Nov 2017
Survey of Recent Advances in Visual Question Answering
Survey of Recent Advances in Visual Question Answering
Supriya Pandhre
Shagun Sodhani
8
14
0
24 Sep 2017
FiLM: Visual Reasoning with a General Conditioning Layer
FiLM: Visual Reasoning with a General Conditioning Layer
Ethan Perez
Florian Strub
H. D. Vries
Vincent Dumoulin
Aaron Courville
FAtt
AIMat
OffRL
AI4CE
70
2,144
0
22 Sep 2017
Multi-scale Deep Learning Architectures for Person Re-identification
Multi-scale Deep Learning Architectures for Person Re-identification
Xuelin Qian
Yanwei Fu
Yu-Gang Jiang
Tao Xiang
Xiangyang Xue
17
276
0
15 Sep 2017
Variational Reasoning for Question Answering with Knowledge Graph
Variational Reasoning for Question Answering with Knowledge Graph
Yuyu Zhang
H. Dai
Zornitsa Kozareva
Alex Smola
Le Song
15
467
0
12 Sep 2017
VQS: Linking Segmentations to Questions and Answers for Supervised
  Attention in VQA and Question-Focused Semantic Segmentation
VQS: Linking Segmentations to Questions and Answers for Supervised Attention in VQA and Question-Focused Semantic Segmentation
Chuang Gan
Yandong Li
Haoxiang Li
Chen Sun
Boqing Gong
22
126
0
15 Aug 2017
Tips and Tricks for Visual Question Answering: Learnings from the 2017
  Challenge
Tips and Tricks for Visual Question Answering: Learnings from the 2017 Challenge
Damien Teney
Peter Anderson
Xiaodong He
A. Hengel
45
380
0
09 Aug 2017
GPLAC: Generalizing Vision-Based Robotic Skills using Weakly Labeled
  Images
GPLAC: Generalizing Vision-Based Robotic Skills using Weakly Labeled Images
Avi Singh
Larry Yang
Sergey Levine
14
23
0
07 Aug 2017
Multi-modal Factorized Bilinear Pooling with Co-Attention Learning for
  Visual Question Answering
Multi-modal Factorized Bilinear Pooling with Co-Attention Learning for Visual Question Answering
Zhou Yu
Jun-chen Yu
Jianping Fan
Dacheng Tao
41
663
0
04 Aug 2017
Dual-Glance Model for Deciphering Social Relationships
Dual-Glance Model for Deciphering Social Relationships
Junnan Li
Yongkang Wong
Qi Zhao
Mohan S. Kankanhalli
14
77
0
02 Aug 2017
MDNet: A Semantically and Visually Interpretable Medical Image Diagnosis
  Network
MDNet: A Semantically and Visually Interpretable Medical Image Diagnosis Network
Zizhao Zhang
Yuanpu Xie
Fuyong Xing
M. McGough
L. Yang
MedIm
13
301
0
08 Jul 2017
Best of Both Worlds: Transferring Knowledge from Discriminative Learning
  to a Generative Visual Dialog Model
Best of Both Worlds: Transferring Knowledge from Discriminative Learning to a Generative Visual Dialog Model
Jiasen Lu
A. Kannan
Jianwei Yang
Devi Parikh
Dhruv Batra
BDL
15
136
0
05 Jun 2017
Multimodal Machine Learning: A Survey and Taxonomy
Multimodal Machine Learning: A Survey and Taxonomy
T. Baltrušaitis
Chaitanya Ahuja
Louis-Philippe Morency
13
2,856
0
26 May 2017
MUTAN: Multimodal Tucker Fusion for Visual Question Answering
MUTAN: Multimodal Tucker Fusion for Visual Question Answering
H. Ben-younes
Rémi Cadène
Matthieu Cord
Nicolas Thome
44
578
0
18 May 2017
The Forgettable-Watcher Model for Video Question Answering
The Forgettable-Watcher Model for Video Question Answering
Hongyang Xue
Zhou Zhao
Deng Cai
19
9
0
03 May 2017
AMC: Attention guided Multi-modal Correlation Learning for Image Search
AMC: Attention guided Multi-modal Correlation Learning for Image Search
Kan Chen
Trung Bui
Chen Fang
Zhaowen Wang
Ram Nevatia
35
38
0
03 Apr 2017
It Takes Two to Tango: Towards Theory of AI's Mind
It Takes Two to Tango: Towards Theory of AI's Mind
Arjun Chandrasekaran
Deshraj Yadav
Prithvijit Chattopadhyay
Viraj Prabhu
Devi Parikh
28
53
0
03 Apr 2017
An Analysis of Visual Question Answering Algorithms
An Analysis of Visual Question Answering Algorithms
Kushal Kafle
Christopher Kanan
19
230
0
28 Mar 2017
Recurrent Multimodal Interaction for Referring Image Segmentation
Recurrent Multimodal Interaction for Referring Image Segmentation
Chenxi Liu
Zhe-nan Lin
Xiaohui Shen
Jimei Yang
Xin Lu
Alan Yuille
EgoV
36
234
0
23 Mar 2017
VQABQ: Visual Question Answering by Basic Questions
VQABQ: Visual Question Answering by Basic Questions
Jia-Hong Huang
Modar Alfadly
Bernard Ghanem
9
24
0
19 Mar 2017
Multi-Context Attention for Human Pose Estimation
Multi-Context Attention for Human Pose Estimation
Xiao Chu
Wei Yang
Wanli Ouyang
Cheng Ma
Alan Yuille
Xiaogang Wang
3DH
16
640
0
24 Feb 2017
Task-driven Visual Saliency and Attention-based Visual Question
  Answering
Task-driven Visual Saliency and Attention-based Visual Question Answering
Yuetan Lin
Zhangyang Pang
Donghui Wang
Yueting Zhuang
29
26
0
22 Feb 2017
Learning Spatial Regularization with Image-level Supervisions for
  Multi-label Image Classification
Learning Spatial Regularization with Image-level Supervisions for Multi-label Image Classification
Feng Zhu
Hongsheng Li
Wanli Ouyang
Nenghai Yu
Xiaogang Wang
19
337
0
20 Feb 2017
Person Search with Natural Language Description
Person Search with Natural Language Description
Shuang Li
Tong Xiao
Hongsheng Li
Bolei Zhou
Dayu Yue
Xiaogang Wang
19
385
0
19 Feb 2017
Deep Reinforcement Learning: An Overview
Deep Reinforcement Learning: An Overview
Yuxi Li
OffRL
VLM
104
1,502
0
25 Jan 2017
Aspect-augmented Adversarial Networks for Domain Adaptation
Aspect-augmented Adversarial Networks for Domain Adaptation
Yuan Zhang
Regina Barzilay
Tommi Jaakkola
30
96
0
01 Jan 2017
The VQA-Machine: Learning How to Use Existing Vision Algorithms to
  Answer New Questions
The VQA-Machine: Learning How to Use Existing Vision Algorithms to Answer New Questions
Peng Wang
Qi Wu
Chunhua Shen
A. Hengel
OOD
18
86
0
16 Dec 2016
Attentive Explanations: Justifying Decisions and Pointing to the
  Evidence
Attentive Explanations: Justifying Decisions and Pointing to the Evidence
Dong Huk Park
Lisa Anne Hendricks
Zeynep Akata
Bernt Schiele
Trevor Darrell
Marcus Rohrbach
AAML
16
79
0
14 Dec 2016
Paying More Attention to Attention: Improving the Performance of
  Convolutional Neural Networks via Attention Transfer
Paying More Attention to Attention: Improving the Performance of Convolutional Neural Networks via Attention Transfer
Sergey Zagoruyko
N. Komodakis
14
2,546
0
12 Dec 2016
Previous
12345
Next