Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1705.06676
Cited By
MUTAN: Multimodal Tucker Fusion for Visual Question Answering
18 May 2017
H. Ben-younes
Rémi Cadène
Matthieu Cord
Nicolas Thome
Re-assign community
ArXiv
PDF
HTML
Papers citing
"MUTAN: Multimodal Tucker Fusion for Visual Question Answering"
22 / 272 papers shown
Title
Cross-Dataset Adaptation for Visual Question Answering
Wei-Lun Chao
Hexiang Hu
Fei Sha
OOD
22
49
0
10 Jun 2018
Learning Answer Embeddings for Visual Question Answering
Hexiang Hu
Wei-Lun Chao
Fei Sha
13
33
0
10 Jun 2018
Focal Visual-Text Attention for Visual Question Answering
Junwei Liang
Lu Jiang
Liangliang Cao
Li-Jia Li
Alexander G. Hauptmann
25
110
0
05 Jun 2018
On the Flip Side: Identifying Counterexamples in Visual Question Answering
Gabriel Grand
Aron Szanto
Yoon Kim
Alexander Rush
16
0
0
03 Jun 2018
R-VQA: Learning Visual Relation Facts with Semantic Attention for Visual Question Answering
Pan Lu
Lei Ji
Wei Zhang
Nan Duan
M. Zhou
Jianyong Wang
CoGe
14
79
0
24 May 2018
Bilinear Attention Networks
Jin-Hwa Kim
Jaehyun Jun
Byoung-Tak Zhang
AIMat
19
867
0
21 May 2018
Did the Model Understand the Question?
Pramod Kaushik Mudrakarta
Ankur Taly
Mukund Sundararajan
Kedar Dhamdhere
ELM
OOD
FAtt
27
196
0
14 May 2018
Reciprocal Attention Fusion for Visual Question Answering
M. Farazi
Salman H Khan
18
14
0
11 May 2018
Improved Fusion of Visual and Language Representations by Dense Symmetric Co-Attention for Visual Question Answering
Duy-Kien Nguyen
Takayuki Okatani
22
279
0
03 Apr 2018
Visual Question Reasoning on General Dependency Tree
Qingxing Cao
Xiaodan Liang
Bailin Li
Guanbin Li
Liang Lin
CoGe
25
37
0
31 Mar 2018
Two can play this Game: Visual Dialog with Discriminative Question Generation and Answering
Unnat Jain
Svetlana Lazebnik
A. Schwing
MLLM
27
81
0
29 Mar 2018
Generalized Hadamard-Product Fusion Operators for Visual Question Answering
Brendan Duke
Graham W. Taylor
28
8
0
26 Mar 2018
Dual Recurrent Attention Units for Visual Question Answering
Ahmed Osman
Wojciech Samek
28
30
0
01 Feb 2018
DeepStyle: Multimodal Search Engine for Fashion and Interior Design
Ivona Tautkute
Tomasz Trzciñski
Aleksander P. Skorupa
Łukasz Brocki
K. Marasek
19
54
0
08 Jan 2018
Co-attending Free-form Regions and Detections with Multi-modal Multiplicative Feature Embedding for Visual Question Answering
Pan Lu
Hongsheng Li
Wei Zhang
Jianyong Wang
Xiaogang Wang
12
80
0
18 Nov 2017
Parallel Attention: A Unified Framework for Visual Object Discovery through Dialogs and Queries
Bohan Zhuang
Qi Wu
Chunhua Shen
Ian Reid
A. Hengel
ObjD
16
134
0
17 Nov 2017
A Novel Framework for Robustness Analysis of Visual QA Models
Jia-Hong Huang
Cuong Duc Dao
Modar Alfadly
Bernard Ghanem
AAML
OOD
22
34
0
16 Nov 2017
Visual Question Generation as Dual Task of Visual Question Answering
Yikang Li
Nan Duan
Bolei Zhou
Xiao Chu
Wanli Ouyang
Xiaogang Wang
24
165
0
21 Sep 2017
Robustness Analysis of Visual QA Models by Basic Questions
Jia-Hong Huang
Cuong Duc Dao
Modar Alfadly
C. Huck Yang
Bernard Ghanem
OOD
17
23
0
14 Sep 2017
Tips and Tricks for Visual Question Answering: Learnings from the 2017 Challenge
Damien Teney
Peter Anderson
Xiaodong He
A. Hengel
45
380
0
09 Aug 2017
Modulating early visual processing by language
H. D. Vries
Florian Strub
Jérémie Mary
Hugo Larochelle
Olivier Pietquin
Aaron Courville
31
482
0
02 Jul 2017
Multimodal Compact Bilinear Pooling for Visual Question Answering and Visual Grounding
Akira Fukui
Dong Huk Park
Daylen Yang
Anna Rohrbach
Trevor Darrell
Marcus Rohrbach
147
1,465
0
06 Jun 2016
Previous
1
2
3
4
5
6