ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1505.00468
  4. Cited By
VQA: Visual Question Answering

VQA: Visual Question Answering

3 May 2015
Aishwarya Agrawal
Jiasen Lu
Stanislaw Antol
Margaret Mitchell
C. L. Zitnick
Dhruv Batra
Devi Parikh
    CoGe
ArXivPDFHTML

Papers citing "VQA: Visual Question Answering"

42 / 792 papers shown
Title
Making the V in VQA Matter: Elevating the Role of Image Understanding in
  Visual Question Answering
Making the V in VQA Matter: Elevating the Role of Image Understanding in Visual Question Answering
Yash Goyal
Tejas Khot
D. Summers-Stay
Dhruv Batra
Devi Parikh
CoGe
99
3,116
0
02 Dec 2016
GuessWhat?! Visual object discovery through multi-modal dialogue
GuessWhat?! Visual object discovery through multi-modal dialogue
H. D. Vries
Florian Strub
A. Chandar
Olivier Pietquin
Hugo Larochelle
Aaron Courville
VLM
19
425
0
23 Nov 2016
Dense Captioning with Joint Inference and Visual Context
Dense Captioning with Joint Inference and Visual Context
L. Yang
K. Tang
Jianchao Yang
Li-Jia Li
VLM
19
169
0
21 Nov 2016
Recurrent Memory Addressing for describing videos
Recurrent Memory Addressing for describing videos
A. Jain
Abhinav Agarwalla
Kumar Krishna Agrawal
Pabitra Mitra
30
10
0
20 Nov 2016
Leveraging Video Descriptions to Learn Video Question Answering
Leveraging Video Descriptions to Learn Video Question Answering
Kuo-Hao Zeng
Tseng-Hung Chen
Ching-Yao Chuang
Yuan-Hong Liao
Juan Carlos Niebles
Min Sun
21
175
0
12 Nov 2016
Dual Attention Networks for Multimodal Reasoning and Matching
Dual Attention Networks for Multimodal Reasoning and Matching
Hyeonseob Nam
Jung-Woo Ha
Jeonghee Kim
34
664
0
02 Nov 2016
Deep Identity-aware Transfer of Facial Attributes
Deep Identity-aware Transfer of Facial Attributes
Mu-Wei Li
W. Zuo
David C. Zhang
CVBM
18
149
0
18 Oct 2016
Grad-CAM: Visual Explanations from Deep Networks via Gradient-based
  Localization
Grad-CAM: Visual Explanations from Deep Networks via Gradient-based Localization
Ramprasaath R. Selvaraju
Michael Cogswell
Abhishek Das
Ramakrishna Vedantam
Devi Parikh
Dhruv Batra
FAtt
18
19,541
0
07 Oct 2016
Learning Language-Visual Embedding for Movie Understanding with
  Natural-Language
Learning Language-Visual Embedding for Movie Understanding with Natural-Language
Atousa Torabi
Niket Tandon
Leonid Sigal
14
97
0
26 Sep 2016
The Color of the Cat is Gray: 1 Million Full-Sentences Visual Question
  Answering (FSVQA)
The Color of the Cat is Gray: 1 Million Full-Sentences Visual Question Answering (FSVQA)
Andrew Shin
Yoshitaka Ushiku
Tatsuya Harada
41
14
0
21 Sep 2016
The ACRV Picking Benchmark (APB): A Robotic Shelf Picking Benchmark to
  Foster Reproducible Research
The ACRV Picking Benchmark (APB): A Robotic Shelf Picking Benchmark to Foster Reproducible Research
Jurgen Leitner
Adam W. Tow
Jake E. Dean
Niko Sünderhauf
Joseph W. Durham
...
James Sergeant
Liao Wu
Fangyi Zhang
B. Upcroft
Peter Corke
16
78
0
17 Sep 2016
Towards Transparent AI Systems: Interpreting Visual Question Answering
  Models
Towards Transparent AI Systems: Interpreting Visual Question Answering Models
Yash Goyal
Akrit Mohapatra
Devi Parikh
Dhruv Batra
16
74
0
31 Aug 2016
Visual Question: Predicting If a Crowd Will Agree on the Answer
Visual Question: Predicting If a Crowd Will Agree on the Answer
Danna Gurari
Kristen Grauman
HAI
21
2
0
29 Aug 2016
Machine Comprehension Using Match-LSTM and Answer Pointer
Machine Comprehension Using Match-LSTM and Answer Pointer
Shuohang Wang
Jing Jiang
11
594
0
29 Aug 2016
Solving Visual Madlibs with Multiple Cues
Solving Visual Madlibs with Multiple Cues
Tatiana Tommasi
Arun Mallya
Bryan A. Plummer
Svetlana Lazebnik
Alexander C. Berg
Tamara L. Berg
29
18
0
11 Aug 2016
Dynamic Neural Turing Machine with Soft and Hard Addressing Schemes
Dynamic Neural Turing Machine with Soft and Hard Addressing Schemes
Çağlar Gülçehre
A. Chandar
Kyunghyun Cho
Yoshua Bengio
10
64
0
30 Jun 2016
Sort Story: Sorting Jumbled Images and Captions into Stories
Sort Story: Sorting Jumbled Images and Captions into Stories
Harsh Agrawal
Arjun Chandrasekaran
Dhruv Batra
Devi Parikh
Mohit Bansal
19
60
0
23 Jun 2016
Semantic Parsing to Probabilistic Programs for Situated Question
  Answering
Semantic Parsing to Probabilistic Programs for Situated Question Answering
Jayant Krishnamurthy
Oyvind Tafjord
Aniruddha Kembhavi
26
24
0
22 Jun 2016
Question Relevance in VQA: Identifying Non-Visual And False-Premise
  Questions
Question Relevance in VQA: Identifying Non-Visual And False-Premise Questions
Arijit Ray
Gordon A. Christie
Mohit Bansal
Dhruv Batra
Devi Parikh
16
56
0
21 Jun 2016
FVQA: Fact-based Visual Question Answering
FVQA: Fact-based Visual Question Answering
Peng Wang
Qi Wu
Chunhua Shen
Anton van den Hengel
A. Dick
CoGe
33
453
0
17 Jun 2016
Human Attention in Visual Question Answering: Do Humans and Deep
  Networks Look at the Same Regions?
Human Attention in Visual Question Answering: Do Humans and Deep Networks Look at the Same Regions?
Abhishek Das
Harsh Agrawal
C. L. Zitnick
Devi Parikh
Dhruv Batra
30
466
0
11 Jun 2016
Adversarial Feature Learning
Adversarial Feature Learning
Jiasen Lu
Philipp Krahenbuhl
Trevor Darrell
GAN
13
1,599
0
31 May 2016
End-to-End Instance Segmentation with Recurrent Attention
End-to-End Instance Segmentation with Recurrent Attention
Mengye Ren
R. Zemel
SSeg
19
61
0
30 May 2016
Learning Models for Actions and Person-Object Interactions with Transfer
  to Question Answering
Learning Models for Actions and Person-Object Interactions with Transfer to Question Answering
Arun Mallya
Svetlana Lazebnik
31
119
0
16 Apr 2016
Visual Storytelling
Visual Storytelling
Ting-Hao 'Kenneth' Huang
Huang
Francis Ferraro
N. Mostafazadeh
Ishan Misra
...
C. L. Zitnick
Devi Parikh
Lucy Vanderwende
Michel Galley
Margaret Mitchell
VGen
11
464
0
13 Apr 2016
TGIF: A New Dataset and Benchmark on Animated GIF Description
TGIF: A New Dataset and Benchmark on Animated GIF Description
Yuncheng Li
Yale Song
Liangliang Cao
Joel R. Tetreault
Larry Goldberg
A. Jaimes
Jiebo Luo
11
269
0
10 Apr 2016
Deep Image Retrieval: Learning global representations for image search
Deep Image Retrieval: Learning global representations for image search
Albert Gordo
Jon Almazán
Jérôme Revaud
Diane Larlus
21
801
0
05 Apr 2016
Unsupervised Visual Sense Disambiguation for Verbs using Multimodal
  Embeddings
Unsupervised Visual Sense Disambiguation for Verbs using Multimodal Embeddings
Spandana Gella
Mirella Lapata
Frank Keller
CoGe
11
52
0
30 Mar 2016
BreakingNews: Article Annotation by Image and Text Processing
BreakingNews: Article Annotation by Image and Text Processing
Arnau Ramisa
F. Yan
Francesc Moreno-Noguer
K. Mikolajczyk
21
105
0
23 Mar 2016
Dynamic Memory Networks for Visual and Textual Question Answering
Dynamic Memory Networks for Visual and Textual Question Answering
Caiming Xiong
Stephen Merity
R. Socher
18
753
0
04 Mar 2016
A Taxonomy of Deep Convolutional Neural Nets for Computer Vision
A Taxonomy of Deep Convolutional Neural Nets for Computer Vision
Suraj Srinivas
Ravi Kiran Sarvadevabhatla
Konda Reddy Mopuri
N. Prabhu
S. Kruthiventi
R. Venkatesh Babu
OOD
20
215
0
25 Jan 2016
Image Question Answering using Convolutional Neural Network with Dynamic
  Parameter Prediction
Image Question Answering using Convolutional Neural Network with Dynamic Parameter Prediction
Hyeonwoo Noh
Paul Hongsuck Seo
Bohyung Han
OOD
14
327
0
18 Nov 2015
Ask, Attend and Answer: Exploring Question-Guided Spatial Attention for
  Visual Question Answering
Ask, Attend and Answer: Exploring Question-Guided Spatial Attention for Visual Question Answering
Huijuan Xu
Kate Saenko
22
760
0
17 Nov 2015
Visual7W: Grounded Question Answering in Images
Visual7W: Grounded Question Answering in Images
Yuke Zhu
Oliver Groth
Michael S. Bernstein
Li Fei-Fei
25
871
0
11 Nov 2015
Explicit Knowledge-based Reasoning for Visual Question Answering
Explicit Knowledge-based Reasoning for Visual Question Answering
Peng Wang
Qi Wu
Chunhua Shen
A. Hengel
A. Dick
27
257
0
09 Nov 2015
VISALOGY: Answering Visual Analogy Questions
VISALOGY: Answering Visual Analogy Questions
Fereshteh Sadeghi
C. L. Zitnick
Ali Farhadi
11
46
0
30 Oct 2015
Describing Common Human Visual Actions in Images
Describing Common Human Visual Actions in Images
M. R. Ronchi
Pietro Perona
33
64
0
07 Jun 2015
Learning to Answer Questions From Image Using Convolutional Neural
  Network
Learning to Answer Questions From Image Using Convolutional Neural Network
Lin Ma
Zhengdong Lu
Hang Li
13
262
0
01 Jun 2015
Are You Talking to a Machine? Dataset and Methods for Multilingual Image
  Question Answering
Are You Talking to a Machine? Dataset and Methods for Multilingual Image Question Answering
Haoyuan Gao
Junhua Mao
Jie Zhou
Zhiheng Huang
Lei Wang
W. Xu
26
497
0
21 May 2015
Ask Your Neurons: A Neural-based Approach to Answering Questions about
  Images
Ask Your Neurons: A Neural-based Approach to Answering Questions about Images
Mateusz Malinowski
Marcus Rohrbach
Mario Fritz
25
596
0
05 May 2015
Learning like a Child: Fast Novel Visual Concept Learning from Sentence
  Descriptions of Images
Learning like a Child: Fast Novel Visual Concept Learning from Sentence Descriptions of Images
Junhua Mao
Xu Wei
Yi Yang
Jiang Wang
Zhiheng Huang
Alan Yuille
25
154
0
25 Apr 2015
Salient Object Detection: A Benchmark
Salient Object Detection: A Benchmark
Ali Borji
Ming-Ming Cheng
Huaizu Jiang
Jia Li
27
1,719
0
05 Jan 2015
Previous
123...141516