ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1612.00837
  4. Cited By
Making the V in VQA Matter: Elevating the Role of Image Understanding in
  Visual Question Answering

Making the V in VQA Matter: Elevating the Role of Image Understanding in Visual Question Answering

2 December 2016
Yash Goyal
Tejas Khot
D. Summers-Stay
Dhruv Batra
Devi Parikh
    CoGe
ArXivPDFHTML

Papers citing "Making the V in VQA Matter: Elevating the Role of Image Understanding in Visual Question Answering"

50 / 1,956 papers shown
Title
Holistic Multi-modal Memory Network for Movie Question Answering
Holistic Multi-modal Memory Network for Movie Question Answering
Anran Wang
Anh Tuan Luu
Chuan-Sheng Foo
Hongyuan Zhu
Yi Tay
V. Chandrasekhar
15
20
0
12 Nov 2018
Shifting the Baseline: Single Modality Performance on Visual Navigation
  & QA
Shifting the Baseline: Single Modality Performance on Visual Navigation & QA
Jesse Thomason
Daniel Gordon
Yonatan Bisk
17
75
0
01 Nov 2018
A Corpus for Reasoning About Natural Language Grounded in Photographs
A Corpus for Reasoning About Natural Language Grounded in Photographs
Alane Suhr
Stephanie Zhou
Ally Zhang
Iris Zhang
Huajun Bai
Yoav Artzi
LRM
11
586
0
01 Nov 2018
TallyQA: Answering Complex Counting Questions
TallyQA: Answering Complex Counting Questions
Manoj Acharya
Kushal Kafle
Christopher Kanan
17
111
0
29 Oct 2018
Do Explanations make VQA Models more Predictable to a Human?
Do Explanations make VQA Models more Predictable to a Human?
Arjun Chandrasekaran
Viraj Prabhu
Deshraj Yadav
Prithvijit Chattopadhyay
Devi Parikh
FAtt
70
96
0
29 Oct 2018
Neural Modular Control for Embodied Question Answering
Neural Modular Control for Embodied Question Answering
Abhishek Das
Georgia Gkioxari
Stefan Lee
Devi Parikh
Dhruv Batra
LM&Ro
120
127
0
26 Oct 2018
Understand, Compose and Respond - Answering Visual Questions by a
  Composition of Abstract Procedures
Understand, Compose and Respond - Answering Visual Questions by a Composition of Abstract Procedures
B. Vatashsky
S. Ullman
CoGe
12
1
0
25 Oct 2018
Knowing Where to Look? Analysis on Attention of Visual Question
  Answering System
Knowing Where to Look? Analysis on Attention of Visual Question Answering System
Wei Li
Zehuan Yuan
Xiangzhong Fang
Changhu Wang
13
8
0
09 Oct 2018
Overcoming Language Priors in Visual Question Answering with Adversarial
  Regularization
Overcoming Language Priors in Visual Question Answering with Adversarial Regularization
S. Ramakrishnan
Aishwarya Agrawal
Stefan Lee
AAML
20
235
0
08 Oct 2018
Neural-Symbolic VQA: Disentangling Reasoning from Vision and Language
  Understanding
Neural-Symbolic VQA: Disentangling Reasoning from Vision and Language Understanding
Kexin Yi
Jiajun Wu
Chuang Gan
Antonio Torralba
Pushmeet Kohli
J. Tenenbaum
NAI
21
595
0
04 Oct 2018
Transfer Learning via Unsupervised Task Discovery for Visual Question
  Answering
Transfer Learning via Unsupervised Task Discovery for Visual Question Answering
Hyeonwoo Noh
Taehoon Kim
Jonghwan Mun
Bohyung Han
23
17
0
03 Oct 2018
The Wisdom of MaSSeS: Majority, Subjectivity, and Semantic Similarity in
  the Evaluation of VQA
The Wisdom of MaSSeS: Majority, Subjectivity, and Semantic Similarity in the Evaluation of VQA
Shailza Jolly
Sandro Pezzelle
T. Klein
Andreas Dengel
Moin Nabi
14
2
0
12 Sep 2018
How clever is the FiLM model, and how clever can it be?
How clever is the FiLM model, and how clever can it be?
A. Kuhnle
Huiyuan Xie
Ann A. Copestake
16
6
0
09 Sep 2018
What If We Simply Swap the Two Text Fragments? A Straightforward yet
  Effective Way to Test the Robustness of Methods to Confounding Signals in
  Nature Language Inference Tasks
What If We Simply Swap the Two Text Fragments? A Straightforward yet Effective Way to Test the Robustness of Methods to Confounding Signals in Nature Language Inference Tasks
Haohan Wang
Da-You Sun
Eric P. Xing
14
42
0
07 Sep 2018
Visual Coreference Resolution in Visual Dialog using Neural Module
  Networks
Visual Coreference Resolution in Visual Dialog using Neural Module Networks
Satwik Kottur
José M. F. Moura
Devi Parikh
Dhruv Batra
Marcus Rohrbach
11
164
0
06 Sep 2018
Interpretable Visual Question Answering by Reasoning on Dependency Trees
Interpretable Visual Question Answering by Reasoning on Dependency Trees
Qingxing Cao
Bailin Li
Xiaodan Liang
Liang Lin
12
55
0
06 Sep 2018
Straight to the Facts: Learning Knowledge Base Retrieval for Factual
  Visual Question Answering
Straight to the Facts: Learning Knowledge Base Retrieval for Factual Visual Question Answering
Medhini Narasimhan
A. Schwing
8
104
0
04 Sep 2018
RecipeQA: A Challenge Dataset for Multimodal Comprehension of Cooking
  Recipes
RecipeQA: A Challenge Dataset for Multimodal Comprehension of Cooking Recipes
Semih Yagcioglu
Aykut Erdem
Erkut Erdem
Nazli Ikizler-Cinbis
CoGe
11
169
0
04 Sep 2018
The Gap of Semantic Parsing: A Survey on Automatic Math Word Problem
  Solvers
The Gap of Semantic Parsing: A Survey on Automatic Math Word Problem Solvers
Dongxiang Zhang
Lei Wang
Nuo Xu
B. Dai
Heng Tao Shen
ReLM
AIMat
29
125
0
22 Aug 2018
SWAG: A Large-Scale Adversarial Dataset for Grounded Commonsense
  Inference
SWAG: A Large-Scale Adversarial Dataset for Grounded Commonsense Inference
Rowan Zellers
Yonatan Bisk
Roy Schwartz
Yejin Choi
6
703
0
16 Aug 2018
How Much Reading Does Reading Comprehension Require? A Critical
  Investigation of Popular Benchmarks
How Much Reading Does Reading Comprehension Require? A Critical Investigation of Popular Benchmarks
Divyansh Kaushik
Zachary Chase Lipton
ELM
17
231
0
14 Aug 2018
Community Regularization of Visually-Grounded Dialog
Community Regularization of Visually-Grounded Dialog
Akshat Agarwal
Swaminathan Gurumurthy
Vasu Sharma
M. Lewis
Katia P. Sycara
13
10
0
10 Aug 2018
A Joint Sequence Fusion Model for Video Question Answering and Retrieval
A Joint Sequence Fusion Model for Video Question Answering and Retrieval
Youngjae Yu
Jongseok Kim
Gunhee Kim
16
339
0
07 Aug 2018
Learning Visual Question Answering by Bootstrapping Hard Attention
Learning Visual Question Answering by Bootstrapping Hard Attention
Mateusz Malinowski
Carl Doersch
Adam Santoro
Peter W. Battaglia
OOD
11
96
0
01 Aug 2018
Interpretable Visual Question Answering by Visual Grounding from
  Attention Supervision Mining
Interpretable Visual Question Answering by Visual Grounding from Attention Supervision Mining
Yundong Zhang
Juan Carlos Niebles
Á. Soto
15
67
0
01 Aug 2018
Pythia v0.1: the Winning Entry to the VQA Challenge 2018
Pythia v0.1: the Winning Entry to the VQA Challenge 2018
Yu Jiang
Vivek Natarajan
Xinlei Chen
Marcus Rohrbach
Dhruv Batra
Devi Parikh
VLM
8
202
0
26 Jul 2018
Explainable Neural Computation via Stack Neural Module Networks
Explainable Neural Computation via Stack Neural Module Networks
Ronghang Hu
Jacob Andreas
Trevor Darrell
Kate Saenko
LRM
OCL
14
197
0
23 Jul 2018
Question Relevance in Visual Question Answering
Question Relevance in Visual Question Answering
Prakruthi Prabhakar
Nitish Kulkarni
Linghao Zhang
11
6
0
23 Jul 2018
Dynamic Multimodal Instance Segmentation guided by natural language
  queries
Dynamic Multimodal Instance Segmentation guided by natural language queries
Edgar Margffoy-Tuay
Juan C. Pérez
Emilio Botero
Pablo Arbelaez
11
170
0
06 Jul 2018
Collaborative Annotation of Semantic Objects in Images with
  Multi-granularity Supervisions
Collaborative Annotation of Semantic Objects in Images with Multi-granularity Supervisions
Lishi Zhang
Chenghan Fu
Jia Li
18
8
0
27 Jun 2018
End-to-End Audio Visual Scene-Aware Dialog using Multimodal
  Attention-Based Video Features
End-to-End Audio Visual Scene-Aware Dialog using Multimodal Attention-Based Video Features
Chiori Hori
Huda AlAmri
Jue Wang
G. Wichern
Takaaki Hori
...
Raphael Gontijo-Lopes
Abhishek Das
Irfan Essa
Dhruv Batra
Devi Parikh
VGen
11
125
0
21 Jun 2018
Learning Conditioned Graph Structures for Interpretable Visual Question
  Answering
Learning Conditioned Graph Structures for Interpretable Visual Question Answering
Will Norcliffe-Brown
Efstathios Vafeias
Sarah Parisot
GNN
9
236
0
19 Jun 2018
Learning Visual Knowledge Memory Networks for Visual Question Answering
Learning Visual Knowledge Memory Networks for Visual Question Answering
Zhou Su
Chen Zhu
Yinpeng Dong
Dongqi Cai
Yurong Chen
Jianguo Li
17
62
0
13 Jun 2018
Cross-Dataset Adaptation for Visual Question Answering
Cross-Dataset Adaptation for Visual Question Answering
Wei-Lun Chao
Hexiang Hu
Fei Sha
OOD
11
49
0
10 Jun 2018
Learning Answer Embeddings for Visual Question Answering
Learning Answer Embeddings for Visual Question Answering
Hexiang Hu
Wei-Lun Chao
Fei Sha
6
33
0
10 Jun 2018
CS-VQA: Visual Question Answering with Compressively Sensed Images
CS-VQA: Visual Question Answering with Compressively Sensed Images
Li-Chi Huang
K. Kulkarni
Anik Jha
Suhas Lohit
Suren Jayasuriya
P. Turaga
CoGe
10
8
0
08 Jun 2018
Visual Reasoning by Progressive Module Networks
Visual Reasoning by Progressive Module Networks
Seung Wook Kim
Makarand Tapaswi
Sanja Fidler
ReLM
LRM
16
12
0
06 Jun 2018
Focal Visual-Text Attention for Visual Question Answering
Focal Visual-Text Attention for Visual Question Answering
Junwei Liang
Lu Jiang
Liangliang Cao
Li-Jia Li
Alexander G. Hauptmann
17
110
0
05 Jun 2018
On the Flip Side: Identifying Counterexamples in Visual Question
  Answering
On the Flip Side: Identifying Counterexamples in Visual Question Answering
Gabriel Grand
Aron Szanto
Yoon Kim
Alexander Rush
9
0
0
03 Jun 2018
Visual Referring Expression Recognition: What Do Systems Actually Learn?
Visual Referring Expression Recognition: What Do Systems Actually Learn?
Volkan Cirik
Louis-Philippe Morency
Taylor Berg-Kirkpatrick
9
63
0
30 May 2018
Joint Image Captioning and Question Answering
Joint Image Captioning and Question Answering
Jialin Wu
Zeyuan Hu
Raymond J. Mooney
15
12
0
22 May 2018
Reproducibility Report for "Learning To Count Objects In Natural Images
  For Visual Question Answering"
Reproducibility Report for "Learning To Count Objects In Natural Images For Visual Question Answering"
Shagun Sodhani
Vardaan Pahuja
6
0
0
21 May 2018
A new dataset and model for learning to understand navigational
  instructions
A new dataset and model for learning to understand navigational instructions
Ozan Arkan Can
Deniz Yuret
13
1
0
21 May 2018
Bilinear Attention Networks
Bilinear Attention Networks
Jin-Hwa Kim
Jaehyun Jun
Byoung-Tak Zhang
AIMat
12
866
0
21 May 2018
Did the Model Understand the Question?
Did the Model Understand the Question?
Pramod Kaushik Mudrakarta
Ankur Taly
Mukund Sundararajan
Kedar Dhamdhere
ELM
OOD
FAtt
20
196
0
14 May 2018
Reciprocal Attention Fusion for Visual Question Answering
Reciprocal Attention Fusion for Visual Question Answering
M. Farazi
Salman H Khan
11
14
0
11 May 2018
Question Type Guided Attention in Visual Question Answering
Question Type Guided Attention in Visual Question Answering
Yang Shi
Tommaso Furlanello
Sheng Zha
Anima Anandkumar
10
46
0
06 Apr 2018
Improved Fusion of Visual and Language Representations by Dense
  Symmetric Co-Attention for Visual Question Answering
Improved Fusion of Visual and Language Representations by Dense Symmetric Co-Attention for Visual Question Answering
Duy-Kien Nguyen
Takayuki Okatani
22
279
0
03 Apr 2018
Differential Attention for Visual Question Answering
Differential Attention for Visual Question Answering
Badri N. Patro
Vinay P. Namboodiri
AIMat
11
74
0
01 Apr 2018
Visual Question Reasoning on General Dependency Tree
Visual Question Reasoning on General Dependency Tree
Qingxing Cao
Xiaodan Liang
Bailin Li
Guanbin Li
Liang Lin
CoGe
15
37
0
31 Mar 2018
Previous
123...37383940
Next