Learning to Reason: End-to-End Module Networks for Visual Question Answering

18 April 2017

Papers citing "Learning to Reason: End-to-End Module Networks for Visual Question Answering"

23 / 73 papers shown

Title
Visual Entailment: A Novel Task for Fine-Grained Image Understanding Ning Xie Farley Lai Derek Doran Asim Kadav CoGe 31 321 0 20 Jan 2019
Neighbourhood Watch: Referring Expression Comprehension via Language-guided Graph Attention Networks Peng Wang Qi Wu Jiewei Cao Chunhua Shen Lianli Gao A. Hengel ObjD 22 252 0 12 Dec 2018
Counterfactual Critic Multi-Agent Training for Scene Graph Generation Long Chen Hanwang Zhang Jun Xiao Xiangnan He Shiliang Pu Shih-Fu Chang 14 159 0 06 Dec 2018
Explainable and Explicit Visual Reasoning over Scene Graphs Jiaxin Shi Hanwang Zhang Juan-Zi Li OCL 155 230 0 05 Dec 2018
Generating Diverse Programs with Instruction Conditioned Reinforced Adversarial Learning Aishwarya Agrawal Mateusz Malinowski Felix Hill S. M. Ali Eslami Oriol Vinyals Tejas D. Kulkarni 16 4 0 03 Dec 2018
Discovering General-Purpose Active Learning Strategies Ksenia Konyushkova Raphael Sznitman Pascal Fua 15 33 0 09 Oct 2018
Overcoming Language Priors in Visual Question Answering with Adversarial Regularization S. Ramakrishnan Aishwarya Agrawal Stefan Lee AAML 20 235 0 08 Oct 2018
How clever is the FiLM model, and how clever can it be? A. Kuhnle Huiyuan Xie Ann A. Copestake 16 6 0 09 Sep 2018
Interpretable Visual Question Answering by Reasoning on Dependency Trees Qingxing Cao Bailin Li Xiaodan Liang Liang Lin 20 55 0 06 Sep 2018
Context-Aware Visual Policy Network for Sequence-Level Image Captioning Daqing Liu Zhengjun Zha Hanwang Zhang Yongdong Zhang Feng Wu CLIP 26 103 0 16 Aug 2018
Improved Fusion of Visual and Language Representations by Dense Symmetric Co-Attention for Visual Question Answering Duy-Kien Nguyen Takayuki Okatani 22 279 0 03 Apr 2018
HOUDINI: Lifelong Learning as Program Synthesis Lazar Valkov Dipak Chaudhari Akash Srivastava Charles Sutton Swarat Chaudhuri 9 78 0 31 Mar 2018
Motion-Appearance Co-Memory Networks for Video Question Answering J. Gao Runzhou Ge Kan Chen Ram Nevatia 13 240 0 29 Mar 2018
Transparency by Design: Closing the Gap Between Performance and Interpretability in Visual Reasoning David Mascharka Philip Tran Ryan Soklaski Arjun Majumdar 22 207 0 14 Mar 2018
Relational Neural Expectation Maximization: Unsupervised Discovery of Objects and their Interactions Sjoerd van Steenkiste Michael Chang Klaus Greff Jürgen Schmidhuber BDL OCL DRL 22 290 0 28 Feb 2018
Interactive Grounded Language Acquisition and Generalization in a 2D World Haonan Yu Haichao Zhang W. Xu LLMAG LM&Ro 6 77 0 31 Jan 2018
Neural Algebra of Classifiers Rodrigo Santa Cruz Basura Fernando A. Cherian Stephen Gould CoGe OCL 13 11 0 26 Jan 2018
DVQA: Understanding Data Visualizations via Question Answering Kushal Kafle Brian L. Price Scott D. Cohen Christopher Kanan AIMat 33 363 0 24 Jan 2018
FiLM: Visual Reasoning with a General Conditioning Layer Ethan Perez Florian Strub H. D. Vries Vincent Dumoulin Aaron Courville FAtt AIMat OffRL AI4CE 61 2,142 0 22 Sep 2017
Tips and Tricks for Visual Question Answering: Learnings from the 2017 Challenge Damien Teney Peter Anderson Xiaodong He A. Hengel 45 380 0 09 Aug 2017
Making the V in VQA Matter: Elevating the Role of Image Understanding in Visual Question Answering Yash Goyal Tejas Khot D. Summers-Stay Dhruv Batra Devi Parikh CoGe 94 3,115 0 02 Dec 2016
Neural Architecture Search with Reinforcement Learning Barret Zoph Quoc V. Le 264 5,326 0 05 Nov 2016
Multimodal Compact Bilinear Pooling for Visual Question Answering and Visual Grounding Akira Fukui Dong Huk Park Daylen Yang Anna Rohrbach Trevor Darrell Marcus Rohrbach 144 1,464 0 06 Jun 2016