ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1606.03556
  4. Cited By
Human Attention in Visual Question Answering: Do Humans and Deep
  Networks Look at the Same Regions?

Human Attention in Visual Question Answering: Do Humans and Deep Networks Look at the Same Regions?

11 June 2016
Abhishek Das
Harsh Agrawal
C. L. Zitnick
Devi Parikh
Dhruv Batra
ArXivPDFHTML

Papers citing "Human Attention in Visual Question Answering: Do Humans and Deep Networks Look at the Same Regions?"

50 / 230 papers shown
Title
Pointing Novel Objects in Image Captioning
Pointing Novel Objects in Image Captioning
Yehao Li
Ting Yao
Yingwei Pan
Hongyang Chao
Tao Mei
22
69
0
25 Apr 2019
Challenges and Prospects in Vision and Language Research
Challenges and Prospects in Vision and Language Research
Kushal Kafle
Robik Shrestha
Christopher Kanan
14
41
0
19 Apr 2019
Salient Object Detection in the Deep Learning Era: An In-Depth Survey
Salient Object Detection in the Deep Learning Era: An In-Depth Survey
Wenguan Wang
Qiuxia Lai
H. Fu
Jianbing Shen
Haibin Ling
Ruigang Yang
32
608
0
19 Apr 2019
Factor Graph Attention
Factor Graph Attention
Idan Schwartz
Seunghak Yu
Tamir Hazan
A. Schwing
19
110
0
11 Apr 2019
Two Body Problem: Collaborative Visual Task Completion
Two Body Problem: Collaborative Visual Task Completion
Unnat Jain
Luca Weihs
Eric Kolve
Mohammad Rastegari
Svetlana Lazebnik
Ali Farhadi
A. Schwing
Aniruddha Kembhavi
14
70
0
11 Apr 2019
A Simple Baseline for Audio-Visual Scene-Aware Dialog
A Simple Baseline for Audio-Visual Scene-Aware Dialog
Idan Schwartz
A. Schwing
Tamir Hazan
19
69
0
11 Apr 2019
Can You Explain That? Lucid Explanations Help Human-AI Collaborative
  Image Retrieval
Can You Explain That? Lucid Explanations Help Human-AI Collaborative Image Retrieval
Arijit Ray
Yi Yao
Rakesh Kumar
Ajay Divakaran
Giedrius Burachas
11
5
0
05 Apr 2019
Assessment of Faster R-CNN in Man-Machine collaborative search
Assessment of Faster R-CNN in Man-Machine collaborative search
Arturo Deza
A. Surana
M. Eckstein
OOD
11
7
0
04 Apr 2019
Information Maximizing Visual Question Generation
Information Maximizing Visual Question Generation
Ranjay Krishna
Michael S. Bernstein
Li Fei-Fei
10
95
0
27 Mar 2019
Periphery-Fovea Multi-Resolution Driving Model guided by Human Attention
Periphery-Fovea Multi-Resolution Driving Model guided by Human Attention
Ye Xia
Jinkyu Kim
John F. Canny
K. Zipser
D. Whitney
13
51
0
24 Mar 2019
Unmasking Clever Hans Predictors and Assessing What Machines Really
  Learn
Unmasking Clever Hans Predictors and Assessing What Machines Really Learn
Sebastian Lapuschkin
S. Wäldchen
Alexander Binder
G. Montavon
Wojciech Samek
K. Müller
6
995
0
26 Feb 2019
GQA: A New Dataset for Real-World Visual Reasoning and Compositional
  Question Answering
GQA: A New Dataset for Real-World Visual Reasoning and Compositional Question Answering
Drew A. Hudson
Christopher D. Manning
CoGe
NAI
19
136
0
25 Feb 2019
Making History Matter: History-Advantage Sequence Training for Visual
  Dialog
Making History Matter: History-Advantage Sequence Training for Visual Dialog
Tianhao Yang
Zhengjun Zha
Hanwang Zhang
OffRL
20
8
0
25 Feb 2019
Probabilistic Neural-symbolic Models for Interpretable Visual Question
  Answering
Probabilistic Neural-symbolic Models for Interpretable Visual Question Answering
Ramakrishna Vedantam
Karan Desai
Stefan Lee
Marcus Rohrbach
Dhruv Batra
Devi Parikh
NAI
BDL
15
84
0
21 Feb 2019
Taking a HINT: Leveraging Explanations to Make Vision and Language
  Models More Grounded
Taking a HINT: Leveraging Explanations to Make Vision and Language Models More Grounded
Ramprasaath R. Selvaraju
Stefan Lee
Yilin Shen
Hongxia Jin
Shalini Ghosh
Larry Heck
Dhruv Batra
Devi Parikh
FAtt
VLM
14
252
0
11 Feb 2019
Deep execution monitor for robot assistive tasks
Deep execution monitor for robot assistive tasks
L. Mauro
Edoardo Alati
Marta Sanzari
Valsamis Ntouskos
Gianluca Massimiani
F. Pirri
23
6
0
07 Feb 2019
Visual search and recognition for robot task execution and monitoring
Visual search and recognition for robot task execution and monitoring
L. Mauro
Francesco Puja
S. Grazioso
Valsamis Ntouskos
Marta Sanzari
Edoardo Alati
F. Pirri
8
9
0
07 Feb 2019
Visual Entailment: A Novel Task for Fine-Grained Image Understanding
Visual Entailment: A Novel Task for Fine-Grained Image Understanding
Ning Xie
Farley Lai
Derek Doran
Asim Kadav
CoGe
31
321
0
20 Jan 2019
Manipulation-skill Assessment from Videos with Spatial Attention Network
Manipulation-skill Assessment from Videos with Spatial Attention Network
Zhenqiang Li
Yifei Huang
Minjie Cai
Yoichi Sato
11
58
0
09 Jan 2019
Personalized explanation in machine learning: A conceptualization
Personalized explanation in machine learning: A conceptualization
J. Schneider
J. Handali
XAI
FAtt
12
17
0
03 Jan 2019
Actor Conditioned Attention Maps for Video Action Detection
Actor Conditioned Attention Maps for Video Action Detection
Oytun Ulutan
S. Rallapalli
M. Srivatsa
Carlos Torres
B. S. Manjunath
6
42
0
30 Dec 2018
Grounded Video Description
Grounded Video Description
Luowei Zhou
Yannis Kalantidis
Xinlei Chen
Jason J. Corso
Marcus Rohrbach
19
190
0
17 Dec 2018
e-SNLI: Natural Language Inference with Natural Language Explanations
e-SNLI: Natural Language Inference with Natural Language Explanations
Oana-Maria Camburu
Tim Rocktaschel
Thomas Lukasiewicz
Phil Blunsom
LRM
255
620
0
04 Dec 2018
A Multidisciplinary Survey and Framework for Design and Evaluation of
  Explainable AI Systems
A Multidisciplinary Survey and Framework for Design and Evaluation of Explainable AI Systems
Sina Mohseni
Niloofar Zarei
Eric D. Ragan
23
102
0
28 Nov 2018
Out of the Box: Reasoning with Graph Convolution Nets for Factual Visual
  Question Answering
Out of the Box: Reasoning with Graph Convolution Nets for Factual Visual Question Answering
Medhini Narasimhan
Svetlana Lazebnik
A. Schwing
NAI
GNN
ReLM
10
11
0
01 Nov 2018
Adversarial TableQA: Attention Supervision for Question Answering on
  Tables
Adversarial TableQA: Attention Supervision for Question Answering on Tables
Minseok Cho
Reinald Kim Amplayo
Seung-won Hwang
Jonghyuck Park
LMTD
OOD
20
22
0
18 Oct 2018
Straight to the Facts: Learning Knowledge Base Retrieval for Factual
  Visual Question Answering
Straight to the Facts: Learning Knowledge Base Retrieval for Factual Visual Question Answering
Medhini Narasimhan
A. Schwing
16
104
0
04 Sep 2018
LUCSS: Language-based User-customized Colourization of Scene Sketches
LUCSS: Language-based User-customized Colourization of Scene Sketches
C. Zou
Haoran Mo
Ruofei Du
Xing Wu
Chengying Gao
Hongbo Fu
22
8
0
30 Aug 2018
Multimodal Differential Network for Visual Question Generation
Multimodal Differential Network for Visual Question Generation
Badri N. Patro
Sandeep Kumar
V. Kurmi
Vinay P. Namboodiri
8
41
0
12 Aug 2018
Interpretable Visual Question Answering by Visual Grounding from
  Attention Supervision Mining
Interpretable Visual Question Answering by Visual Grounding from Attention Supervision Mining
Yundong Zhang
Juan Carlos Niebles
Á. Soto
25
67
0
01 Aug 2018
Deep Imbalanced Attribute Classification using Visual Attention
  Aggregation
Deep Imbalanced Attribute Classification using Visual Attention Aggregation
N. Sarafianos
Xiang Xu
I. Kakadiaris
23
214
0
10 Jul 2018
Understanding Visual Ads by Aligning Symbols and Objects using
  Co-Attention
Understanding Visual Ads by Aligning Symbols and Objects using Co-Attention
Karuna Ahuja
Karan Sikka
Anirban Roy
Ajay Divakaran
21
10
0
04 Jul 2018
Focal Visual-Text Attention for Visual Question Answering
Focal Visual-Text Attention for Visual Question Answering
Junwei Liang
Lu Jiang
Liangliang Cao
Li-Jia Li
Alexander G. Hauptmann
25
110
0
05 Jun 2018
Explaining Explanations: An Overview of Interpretability of Machine
  Learning
Explaining Explanations: An Overview of Interpretability of Machine Learning
Leilani H. Gilpin
David Bau
Ben Z. Yuan
Ayesha Bajwa
Michael A. Specter
Lalana Kagal
XAI
21
1,835
0
31 May 2018
Learning what and where to attend
Learning what and where to attend
Drew Linsley
Dan Scheibler
S. Eberhardt
Thomas Serre
6
32
0
22 May 2018
Towards Interpretable Face Recognition
Towards Interpretable Face Recognition
Bangjie Yin
Luan Tran
Haoxiang Li
Xiaohui Shen
Xiaoming Liu
CVBM
6
82
0
02 May 2018
But Who Protects the Moderators? The Case of Crowdsourced Image
  Moderation
But Who Protects the Moderators? The Case of Crowdsourced Image Moderation
B. Dang
M. J. Riedl
Matthew Lease
8
23
0
29 Apr 2018
Select, Attend, and Transfer: Light, Learnable Skip Connections
Select, Attend, and Transfer: Light, Learnable Skip Connections
Saeid Asgari Taghanaki
A. Bentaieb
Anmol Sharma
S. Kevin Zhou
Yefeng Zheng
...
Puneet Sharma
Sasa Grbic
Zhoubing Xu
D. Comaniciu
Ghassan Hamarneh
20
20
0
14 Apr 2018
Differential Attention for Visual Question Answering
Differential Attention for Visual Question Answering
Badri N. Patro
Vinay P. Namboodiri
AIMat
19
74
0
01 Apr 2018
Two can play this Game: Visual Dialog with Discriminative Question
  Generation and Answering
Two can play this Game: Visual Dialog with Discriminative Question Generation and Answering
Unnat Jain
Svetlana Lazebnik
A. Schwing
MLLM
21
81
0
29 Mar 2018
Neural Baby Talk
Neural Baby Talk
Jiasen Lu
Jianwei Yang
Dhruv Batra
Devi Parikh
VLM
189
434
0
27 Mar 2018
VQA-E: Explaining, Elaborating, and Enhancing Your Answers for Visual
  Questions
VQA-E: Explaining, Elaborating, and Enhancing Your Answers for Visual Questions
Qing Li
Qingyi Tao
Shafiq R. Joty
Jianfei Cai
Jiebo Luo
29
106
0
20 Mar 2018
Inverse Visual Question Answering: A New Benchmark and VQA Diagnosis
  Tool
Inverse Visual Question Answering: A New Benchmark and VQA Diagnosis Tool
Feng Liu
Tao Xiang
Timothy M. Hospedales
Wankou Yang
Changyin Sun
17
29
0
16 Mar 2018
Transparency by Design: Closing the Gap Between Performance and
  Interpretability in Visual Reasoning
Transparency by Design: Closing the Gap Between Performance and Interpretability in Visual Reasoning
David Mascharka
Philip Tran
Ryan Soklaski
Arjun Majumdar
25
207
0
14 Mar 2018
Totally Looks Like - How Humans Compare, Compared to Machines
Totally Looks Like - How Humans Compare, Compared to Machines
Amir Rosenfeld
M. Solbach
John K. Tsotsos
3DH
13
27
0
05 Mar 2018
Multimodal Explanations: Justifying Decisions and Pointing to the
  Evidence
Multimodal Explanations: Justifying Decisions and Pointing to the Evidence
Dong Huk Park
Lisa Anne Hendricks
Zeynep Akata
Anna Rohrbach
Bernt Schiele
Trevor Darrell
Marcus Rohrbach
35
418
0
15 Feb 2018
Explaining First Impressions: Modeling, Recognizing, and Explaining
  Apparent Personality from Videos
Explaining First Impressions: Modeling, Recognizing, and Explaining Apparent Personality from Videos
Hugo Jair Escalante
Heysem Kaya
A. A. Salah
Sergio Escalera
Yağmur Güçlütürk
...
Furkan Gürpinar
Achmadnoer Sukma Wicaksana
Cynthia C. S. Liem
Marcel van Gerven
R. Lier
20
61
0
02 Feb 2018
Object-based reasoning in VQA
Object-based reasoning in VQA
Mikyas T. Desta
Larry Chen
Tomasz Kornuta
27
33
0
29 Jan 2018
Visual Analytics in Deep Learning: An Interrogative Survey for the Next
  Frontiers
Visual Analytics in Deep Learning: An Interrogative Survey for the Next Frontiers
Fred Hohman
Minsuk Kahng
Robert S. Pienta
Duen Horng Chau
OOD
HAI
19
537
0
21 Jan 2018
Visual Explanation by Interpretation: Improving Visual Feedback
  Capabilities of Deep Neural Networks
Visual Explanation by Interpretation: Improving Visual Feedback Capabilities of Deep Neural Networks
José Oramas
Kaili Wang
Tinne Tuytelaars
XAI
FAtt
11
60
0
18 Dec 2017
Previous
12345
Next