ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1705.06676
  4. Cited By
MUTAN: Multimodal Tucker Fusion for Visual Question Answering

MUTAN: Multimodal Tucker Fusion for Visual Question Answering

18 May 2017
H. Ben-younes
Rémi Cadène
Matthieu Cord
Nicolas Thome
ArXivPDFHTML

Papers citing "MUTAN: Multimodal Tucker Fusion for Visual Question Answering"

50 / 272 papers shown
Title
Zero-Shot Grounding of Objects from Natural Language Queries
Zero-Shot Grounding of Objects from Natural Language Queries
Arka Sadhu
Kan Chen
Ram Nevatia
ObjD
28
156
0
20 Aug 2019
Attention on Attention for Image Captioning
Attention on Attention for Image Captioning
Lun Huang
Wenmin Wang
Jie Chen
Xiao-Yong Wei
22
823
0
19 Aug 2019
Multimodal Unified Attention Networks for Vision-and-Language
  Interactions
Multimodal Unified Attention Networks for Vision-and-Language Interactions
Zhou Yu
Yuhao Cui
Jun Yu
Dacheng Tao
Q. Tian
19
38
0
12 Aug 2019
Matching Images and Text with Multi-modal Tensor Fusion and Re-ranking
Matching Images and Text with Multi-modal Tensor Fusion and Re-ranking
Tan Wang
Xing Xu
Yang Yang
Alan Hanjalic
Heng Tao Shen
Jingkuan Song
17
145
0
12 Aug 2019
Multi-modality Latent Interaction Network for Visual Question Answering
Multi-modality Latent Interaction Network for Visual Question Answering
Peng Gao
Haoxuan You
Zhanpeng Zhang
Xiaogang Wang
Hongsheng Li
17
77
0
10 Aug 2019
Question-Agnostic Attention for Visual Question Answering
Question-Agnostic Attention for Visual Question Answering
M. Farazi
Salman H Khan
Nick Barnes
13
10
0
09 Aug 2019
An Empirical Study on Leveraging Scene Graphs for Visual Question
  Answering
An Empirical Study on Leveraging Scene Graphs for Visual Question Answering
Cheng Zhang
Wei-Lun Chao
D. Xuan
23
50
0
28 Jul 2019
The Resale Price Prediction of Secondhand Jewelry Items Using a
  Multi-modal Deep Model with Iterative Co-Attention
The Resale Price Prediction of Secondhand Jewelry Items Using a Multi-modal Deep Model with Iterative Co-Attention
Yusuke Yamaura
Nobuya Kanemaki
Y. Tsuboshita
10
3
0
01 Jul 2019
Deep Modular Co-Attention Networks for Visual Question Answering
Deep Modular Co-Attention Networks for Visual Question Answering
Zhou Yu
Jun Yu
Yuhao Cui
Dacheng Tao
Q. Tian
11
796
0
25 Jun 2019
RUBi: Reducing Unimodal Biases in Visual Question Answering
RUBi: Reducing Unimodal Biases in Visual Question Answering
Rémi Cadène
Corentin Dancette
H. Ben-younes
Matthieu Cord
Devi Parikh
CML
19
368
0
24 Jun 2019
Frontal Low-rank Random Tensors for Fine-grained Action Segmentation
Frontal Low-rank Random Tensors for Fine-grained Action Segmentation
Yan Zhang
Krikamol Muandet
Qianli Ma
Heiko Neumann
Siyu Tang
24
3
0
03 Jun 2019
OK-VQA: A Visual Question Answering Benchmark Requiring External
  Knowledge
OK-VQA: A Visual Question Answering Benchmark Requiring External Knowledge
Kenneth Marino
Mohammad Rastegari
Ali Farhadi
Roozbeh Mottaghi
15
1,013
0
31 May 2019
Gaining Extra Supervision via Multi-task learning for Multi-Modal Video
  Question Answering
Gaining Extra Supervision via Multi-task learning for Multi-Modal Video Question Answering
Junyeong Kim
Minuk Ma
Kyungsu Kim
Sungjin Kim
Chang-Dong Yoo
13
27
0
28 May 2019
Beyond Visual Semantics: Exploring the Role of Scene Text in Image
  Understanding
Beyond Visual Semantics: Exploring the Role of Scene Text in Image Understanding
Arka Ujjal Dey
Suman K. Ghosh
Ernest Valveny
Gaurav Harit
23
23
0
25 May 2019
Self-Critical Reasoning for Robust Visual Question Answering
Self-Critical Reasoning for Robust Visual Question Answering
Jialin Wu
Raymond J. Mooney
OOD
NAI
24
159
0
24 May 2019
Aligning Visual Regions and Textual Concepts for Semantic-Grounded Image
  Representations
Aligning Visual Regions and Textual Concepts for Semantic-Grounded Image Representations
Fenglin Liu
Yuanxin Liu
Xuancheng Ren
Xiaodong He
Xu Sun
VLM
26
81
0
15 May 2019
Deep Local Trajectory Replanning and Control for Robot Navigation
Deep Local Trajectory Replanning and Control for Robot Navigation
Ashwini Pokle
Roberto Martín-Martín
P. Goebel
Vincent Chow
H. Ewald
...
Zhenkai Wang
Amir Sadeghian
Dorsa Sadigh
Silvio Savarese
Marynel Vázquez
6
62
0
13 May 2019
Progressive Attention Memory Network for Movie Story Question Answering
Progressive Attention Memory Network for Movie Story Question Answering
Junyeong Kim
Minuk Ma
Kyungsu Kim
Sungjin Kim
Chang-Dong Yoo
11
75
0
18 Apr 2019
Question Guided Modular Routing Networks for Visual Question Answering
Question Guided Modular Routing Networks for Visual Question Answering
Yanze Wu
Qiang Sun
Jianqi Ma
Bin Li
Yanwei Fu
Yao Peng
Xiangyang Xue
15
1
0
17 Apr 2019
Factor Graph Attention
Factor Graph Attention
Idan Schwartz
Seunghak Yu
Tamir Hazan
A. Schwing
19
110
0
11 Apr 2019
A Simple Baseline for Audio-Visual Scene-Aware Dialog
A Simple Baseline for Audio-Visual Scene-Aware Dialog
Idan Schwartz
A. Schwing
Tamir Hazan
19
69
0
11 Apr 2019
Information Aggregation for Multi-Head Attention with
  Routing-by-Agreement
Information Aggregation for Multi-Head Attention with Routing-by-Agreement
Jian Li
Baosong Yang
Zi-Yi Dou
Xing Wang
Michael R. Lyu
Zhaopeng Tu
12
46
0
05 Apr 2019
Relation-Aware Graph Attention Network for Visual Question Answering
Relation-Aware Graph Attention Network for Visual Question Answering
Linjie Li
Zhe Gan
Yu Cheng
Jingjing Liu
GNN
28
341
0
29 Mar 2019
MFAS: Multimodal Fusion Architecture Search
MFAS: Multimodal Fusion Architecture Search
Juan-Manuel Perez-Rua
Valentin Vielzeuf
S. Pateux
M. Baccouche
F. Jurie
19
178
0
15 Mar 2019
Answer Them All! Toward Universal Visual Question Answering Models
Answer Them All! Toward Universal Visual Question Answering Models
Robik Shrestha
Kushal Kafle
Christopher Kanan
17
82
0
01 Mar 2019
MUREL: Multimodal Relational Reasoning for Visual Question Answering
MUREL: Multimodal Relational Reasoning for Visual Question Answering
Rémi Cadène
H. Ben-younes
Matthieu Cord
Nicolas Thome
LRM
19
271
0
25 Feb 2019
Cycle-Consistency for Robust Visual Question Answering
Cycle-Consistency for Robust Visual Question Answering
Meet Shah
Xinlei Chen
Marcus Rohrbach
Devi Parikh
OOD
14
185
0
15 Feb 2019
VrR-VG: Refocusing Visually-Relevant Relationships
VrR-VG: Refocusing Visually-Relevant Relationships
Yuanzhi Liang
Yalong Bai
Wei Zhang
Xueming Qian
Li Zhu
Tao Mei
3DH
14
8
0
01 Feb 2019
BLOCK: Bilinear Superdiagonal Fusion for Visual Question Answering and
  Visual Relationship Detection
BLOCK: Bilinear Superdiagonal Fusion for Visual Question Answering and Visual Relationship Detection
H. Ben-younes
Rémi Cadène
Nicolas Thome
Matthieu Cord
14
218
0
31 Jan 2019
TuckER: Tensor Factorization for Knowledge Graph Completion
TuckER: Tensor Factorization for Knowledge Graph Completion
Ivana Balazevic
Carl Allen
Timothy M. Hospedales
11
712
0
28 Jan 2019
Fusion Strategies for Learning User Embeddings with Neural Networks
Fusion Strategies for Learning User Embeddings with Neural Networks
Philipp Blandfort
Tushar Karayil
Federico Raue
Jörn Hees
Andreas Dengel
FedML
22
9
0
08 Jan 2019
Dynamic Fusion with Intra- and Inter- Modality Attention Flow for Visual
  Question Answering
Dynamic Fusion with Intra- and Inter- Modality Attention Flow for Visual Question Answering
Peng Gao
Zhengkai Jiang
Haoxuan You
Pan Lu
Steven C. H. Hoi
Xiaogang Wang
Hongsheng Li
AIMat
19
362
0
13 Dec 2018
Learning to Compose Dynamic Tree Structures for Visual Contexts
Learning to Compose Dynamic Tree Structures for Visual Contexts
Kaihua Tang
Hanwang Zhang
Baoyuan Wu
Wenhan Luo
W. Liu
9
490
0
05 Dec 2018
From Known to the Unknown: Transferring Knowledge to Answer Questions
  about Novel Visual and Semantic Concepts
From Known to the Unknown: Transferring Knowledge to Answer Questions about Novel Visual and Semantic Concepts
M. Farazi
Salman H Khan
Nick Barnes
15
13
0
30 Nov 2018
From Recognition to Cognition: Visual Commonsense Reasoning
From Recognition to Cognition: Visual Commonsense Reasoning
Rowan Zellers
Yonatan Bisk
Ali Farhadi
Yejin Choi
LRM
BDL
OCL
ReLM
27
865
0
27 Nov 2018
VQA with no questions-answers training
VQA with no questions-answers training
B. Vatashsky
S. Ullman
33
12
0
20 Nov 2018
Explicit Bias Discovery in Visual Question Answering Models
Explicit Bias Discovery in Visual Question Answering Models
Varun Manjunatha
Nirat Saini
L. Davis
CML
FAtt
19
92
0
19 Nov 2018
Out of the Box: Reasoning with Graph Convolution Nets for Factual Visual
  Question Answering
Out of the Box: Reasoning with Graph Convolution Nets for Factual Visual Question Answering
Medhini Narasimhan
Svetlana Lazebnik
A. Schwing
NAI
GNN
ReLM
15
11
0
01 Nov 2018
TallyQA: Answering Complex Counting Questions
TallyQA: Answering Complex Counting Questions
Manoj Acharya
Kushal Kafle
Christopher Kanan
19
111
0
29 Oct 2018
Understand, Compose and Respond - Answering Visual Questions by a
  Composition of Abstract Procedures
Understand, Compose and Respond - Answering Visual Questions by a Composition of Abstract Procedures
B. Vatashsky
S. Ullman
CoGe
18
1
0
25 Oct 2018
Towards Good Practices for Multi-modal Fusion in Large-scale Video
  Classification
Towards Good Practices for Multi-modal Fusion in Large-scale Video Classification
Jinlai Liu
Zehuan Yuan
Changhu Wang
14
9
0
16 Sep 2018
The Wisdom of MaSSeS: Majority, Subjectivity, and Semantic Similarity in
  the Evaluation of VQA
The Wisdom of MaSSeS: Majority, Subjectivity, and Semantic Similarity in the Evaluation of VQA
Shailza Jolly
Sandro Pezzelle
T. Klein
Andreas Dengel
Moin Nabi
22
2
0
12 Sep 2018
Interpretable Visual Question Answering by Reasoning on Dependency Trees
Interpretable Visual Question Answering by Reasoning on Dependency Trees
Qingxing Cao
Bailin Li
Xiaodan Liang
Liang Lin
20
55
0
06 Sep 2018
Straight to the Facts: Learning Knowledge Base Retrieval for Factual
  Visual Question Answering
Straight to the Facts: Learning Knowledge Base Retrieval for Factual Visual Question Answering
Medhini Narasimhan
A. Schwing
16
104
0
04 Sep 2018
Image Reassembly Combining Deep Learning and Shortest Path Problem
Image Reassembly Combining Deep Learning and Shortest Path Problem
Marie-Morgane Paumard
David Picard
Hedi Tabia
OCL
3DV
11
24
0
04 Sep 2018
Towards a Better Metric for Evaluating Question Generation Systems
Towards a Better Metric for Evaluating Question Generation Systems
Preksha Nema
Mitesh M. Khapra
6
107
0
30 Aug 2018
Question-Guided Hybrid Convolution for Visual Question Answering
Question-Guided Hybrid Convolution for Visual Question Answering
Peng Gao
Pan Lu
Hongsheng Li
Shuang Li
Yikang Li
S. Hoi
Xiaogang Wang
24
68
0
08 Aug 2018
HybridNet: Classification and Reconstruction Cooperation for
  Semi-Supervised Learning
HybridNet: Classification and Reconstruction Cooperation for Semi-Supervised Learning
Thomas Robert
Nicolas Thome
Matthieu Cord
36
39
0
30 Jul 2018
Don't only Feel Read: Using Scene text to understand advertisements
Don't only Feel Read: Using Scene text to understand advertisements
Arka Ujjal Dey
Suman K. Ghosh
Ernest Valveny
DiffM
8
4
0
21 Jun 2018
FigureNet: A Deep Learning model for Question-Answering on Scientific
  Plots
FigureNet: A Deep Learning model for Question-Answering on Scientific Plots
Revanth Reddy Gangi Reddy
Rahul Ramesh
A. Deshpande
Mitesh M. Khapra
AIMat
OOD
GNN
30
22
0
12 Jun 2018
Previous
123456
Next