FiLM: Visual Reasoning with a General Conditioning Layer

22 September 2017

Aaron Courville

Papers citing "FiLM: Visual Reasoning with a General Conditioning Layer"

50 / 1,308 papers shown

Title
Guided Image-to-Image Translation with Bi-Directional Feature Transformation Badour Albahar Jia-Bin Huang 20 91 0 24 Oct 2019
HIGhER : Improving instruction following with Hindsight Generation for Experience Replay Geoffrey Cideron Mathieu Seurin Florian Strub Olivier Pietquin 16 37 0 21 Oct 2019
RTFM: Generalising to Novel Environment Dynamics via Reading Victor Zhong Tim Rocktaschel Edward Grefenstette LLMAG OffRL AI4CE 11 53 0 18 Oct 2019
Audio-Conditioned U-Net for Position Estimation in Full Sheet Images Florian Henkel Rainer Kelz Gerhard Widmer 14 4 0 16 Oct 2019
Meta Module Network for Compositional Visual Reasoning Wenhu Chen Zhe Gan Linjie Li Yu Cheng W. Wang Jingjing Liu LRM 17 68 0 08 Oct 2019
Meta-Transfer Learning through Hard Tasks Qianru Sun Yaoyao Liu Zhaozheng Chen Tat-Seng Chua Bernt Schiele 6 98 0 07 Oct 2019
CLEVRER: CoLlision Events for Video REpresentation and Reasoning Kexin Yi Yuta Saito Yunzhu Li Pushmeet Kohli Jiajun Wu Antonio Torralba J. Tenenbaum NAI 24 456 0 03 Oct 2019
CoPhy: Counterfactual Learning of Physical Dynamics Fabien Baradel Natalia Neverova J. Mille Greg Mori Christian Wolf CML AI4CE 17 97 0 26 Sep 2019
Interactive Sketch & Fill: Multiclass Sketch-to-Image Translation Arna Ghosh Richard Y. Zhang P. Dokania Oliver Wang Alexei A. Efros Philip H. S. Torr Eli Shechtman VLM DiffM 12 130 0 24 Sep 2019
Explainable High-order Visual Question Reasoning: A New Benchmark and Knowledge-routed Network Qingxing Cao Bailin Li Xiaodan Liang Liang Lin 17 13 0 23 Sep 2019
Meta-Neighborhoods Siyuan Shan Yang Li Junier Oliva 11 13 0 18 Sep 2019
Temporal FiLM: Capturing Long-Range Sequence Dependencies with Feature-Wise Modulations Sawyer Birnbaum Volodymyr Kuleshov S. Enam Pang Wei Koh Stefano Ermon AI4TS 8 68 0 14 Sep 2019
Hierarchical Scene Coordinate Classification and Regression for Visual Localization Xiaotian Li Shuzhe Wang Yi Zhao Jakob Verbeek Juho Kannala 71 127 0 13 Sep 2019
Finding Generalizable Evidence by Learning to Convince Q&A Models Ethan Perez Siddharth Karamcheti Rob Fergus Jason Weston Douwe Kiela Kyunghyun Cho RALM 17 37 0 12 Sep 2019
Domain-Agnostic Few-Shot Classification by Learning Disparate Modulators Yongseok Choi Junyoung Park Subin Yi D.-Y. Cho OOD 11 0 0 11 Sep 2019
Relationships from Entity Stream Martin Andrews Sam Witteveen AI4TS GNN 11 0 0 07 Sep 2019
Supervised Multimodal Bitransformers for Classifying Images and Text Douwe Kiela Suvrat Bhooshan Hamed Firooz Ethan Perez Davide Testuggine 57 241 0 06 Sep 2019
No Press Diplomacy: Modeling Multi-Agent Gameplay Philip Paquette Yuchen Lu Steven Bocco Max O. Smith Satya Ortiz-Gagné Jonathan K. Kummerfeld Satinder Singh Joelle Pineau Aaron Courville 17 57 0 04 Sep 2019
Meta-Learning with Warped Gradient Descent Sebastian Flennerhag Andrei A. Rusu Razvan Pascanu Francesco Visin Hujun Yin R. Hadsell 6 209 0 30 Aug 2019
Is the Red Square Big? MALeViC: Modeling Adjectives Leveraging Visual Contexts Sandro Pezzelle Raquel Fernández VLM 9 18 0 27 Aug 2019
LXMERT: Learning Cross-Modality Encoder Representations from Transformers Hao Hao Tan Mohit Bansal VLM MLLM 52 2,444 0 20 Aug 2019
Probabilistic Reconstruction Networks for 3D Shape Inference from a Single Image Roman Klokov Jakob Verbeek Edmond Boyer 3DV 17 14 0 20 Aug 2019
What is needed for simple spatial language capabilities in VQA? A. Kuhnle Ann A. Copestake CoGe 13 1 0 17 Aug 2019
PHYRE: A New Benchmark for Physical Reasoning A. Bakhtin L. V. D. van der Maaten Justin Johnson Laura Gustafson Ross B. Girshick LRM 6 121 0 15 Aug 2019
Mastering emergent language: learning to guide in simulated navigation Mathijs Mul Diane Bouchacourt Elia Bruni LLMAG 11 9 0 14 Aug 2019
VideoNavQA: Bridging the Gap between Visual and Embodied Question Answering Cătălina Cangea Eugene Belilovsky Pietro Lió Aaron Courville 14 16 0 14 Aug 2019
Multimodal Unified Attention Networks for Vision-and-Language Interactions Zhou Yu Yuhao Cui Jun Yu Dacheng Tao Q. Tian 11 38 0 12 Aug 2019
Multi-modality Latent Interaction Network for Visual Question Answering Peng Gao Haoxuan You Zhanpeng Zhang Xiaogang Wang Hongsheng Li 9 77 0 10 Aug 2019
Dynamic Scale Inference by Entropy Minimization Dequan Wang Evan Shelhamer Bruno A. Olshausen Trevor Darrell 11 7 0 08 Aug 2019
Answering Questions about Data Visualizations using Efficient Bimodal Fusion Kushal Kafle Robik Shrestha Brian L. Price Scott D. Cohen Christopher Kanan 17 58 0 05 Aug 2019
An Empirical Study of Batch Normalization and Group Normalization in Conditional Computation Vincent Michalski Vikram S. Voleti Samira Ebrahimi Kahou Anthony Ortiz Pascal Vincent C. Pal Doina Precup BDL 8 6 0 31 Jul 2019
Learning Question-Guided Video Representation for Multi-Turn Video Question Answering Guan-Lin Chao Abhinav Rastogi Semih Yavuz Dilek Z. Hakkani-Tür Jindong Chen Ian Lane 6 6 0 31 Jul 2019
Segmenting Objects in Day and Night:Edge-Conditioned CNN for Thermal Image Semantic Segmentation Chenglong Li W. Xia Yan Yan B. Luo Jin Tang 8 118 0 24 Jul 2019
Metalearned Neural Memory Tsendsuren Munkhdalai Alessandro Sordoni Tong Wang Adam Trischler KELM 17 60 0 23 Jul 2019
Switchable Normalization for Learning-to-Normalize Deep Representation Ping Luo Ruimao Zhang Jiamin Ren Zhanglin Peng Jingyu Li 23 73 0 22 Jul 2019
Trends in Integration of Vision and Language Research: A Survey of Tasks, Datasets, and Methods Aditya Mogadala M. Kalimuthu Dietrich Klakow VLM 15 132 0 22 Jul 2019
Neural Drum Machine : An Interactive System for Real-time Synthesis of Drum Sounds Cyran Aouameur P. Esling Gaëtan Hadjeres 8 21 0 04 Jul 2019
Conditioned-U-Net: Introducing a Control Mechanism in the U-Net for Multiple Source Separations Gabriel Meseguer-Brocal Geoffroy Peeters 11 61 0 02 Jul 2019
GNN-FiLM: Graph Neural Networks with Feature-wise Linear Modulation Marc Brockschmidt 7 132 0 28 Jun 2019
Learning Disentangled Representations of Timbre and Pitch for Musical Instrument Sounds Using Gaussian Mixture Variational Autoencoders Yin-Jyun Luo Kat R. Agres Dorien Herremans 6 46 0 19 Jun 2019
Fast and Flexible Multi-Task Classification Using Conditional Neural Adaptive Processes James Requeima Jonathan Gordon J. Bronskill Sebastian Nowozin Richard E. Turner 8 240 0 18 Jun 2019
Language as an Abstraction for Hierarchical Deep Reinforcement Learning Yiding Jiang S. Gu Kevin Patrick Murphy Chelsea Finn OffRL 10 221 0 18 Jun 2019
Task-Aware Feature Generation for Zero-Shot Compositional Learning Xin Wang F. I. F. Richard Yu Trevor Darrell Joseph E. Gonzalez VLM CoGe 11 16 0 11 Jun 2019
Psycholinguistics meets Continual Learning: Measuring Catastrophic Forgetting in Visual Question Answering Claudio Greco Barbara Plank Raquel Fernández Raffaella Bernardi CLL KELM 9 48 0 10 Jun 2019
Human-Machine Collaboration for Fast Land Cover Mapping Caleb Robinson Anthony Ortiz Kolya Malkin Blake Elias Andi Peng Dan Morris B. Dilkina Nebojsa Jojic 24 20 0 10 Jun 2019
Attention-based Conditioning Methods for External Knowledge Integration Katerina Margatina Christos Baziotis Alexandros Potamianos 4 30 0 09 Jun 2019
Two-Stage Peer-Regularized Feature Recombination for Arbitrary Image Style Transfer Jan Svoboda Asha Anoosheh Christian Osendorfer Jonathan Masci GAN 16 79 0 07 Jun 2019
FSPool: Learning Set Representations with Featurewise Sort Pooling Yan Zhang Jonathon S. Hare Adam Prugel-Bennett 9 75 0 06 Jun 2019
Geo-Aware Networks for Fine-Grained Recognition Grace Chu B. Potetz Weijun Wang Andrew G. Howard Yang Song Fernando Brucher Thomas Leung Hartwig Adam ObjD 25 80 0 04 Jun 2019
Adaptive Deep Kernel Learning Prudencio Tossou Basile Dura François Laviolette M. Marchand Alexandre Lacoste 11 29 0 28 May 2019