ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1709.07871
  4. Cited By
FiLM: Visual Reasoning with a General Conditioning Layer

FiLM: Visual Reasoning with a General Conditioning Layer

22 September 2017
Ethan Perez
Florian Strub
H. D. Vries
Vincent Dumoulin
Aaron Courville
    FAtt
    AIMat
    OffRL
    AI4CE
ArXivPDFHTML

Papers citing "FiLM: Visual Reasoning with a General Conditioning Layer"

50 / 1,308 papers shown
Title
Guided Image-to-Image Translation with Bi-Directional Feature
  Transformation
Guided Image-to-Image Translation with Bi-Directional Feature Transformation
Badour Albahar
Jia-Bin Huang
20
91
0
24 Oct 2019
HIGhER : Improving instruction following with Hindsight Generation for
  Experience Replay
HIGhER : Improving instruction following with Hindsight Generation for Experience Replay
Geoffrey Cideron
Mathieu Seurin
Florian Strub
Olivier Pietquin
16
37
0
21 Oct 2019
RTFM: Generalising to Novel Environment Dynamics via Reading
RTFM: Generalising to Novel Environment Dynamics via Reading
Victor Zhong
Tim Rocktaschel
Edward Grefenstette
LLMAG
OffRL
AI4CE
11
53
0
18 Oct 2019
Audio-Conditioned U-Net for Position Estimation in Full Sheet Images
Audio-Conditioned U-Net for Position Estimation in Full Sheet Images
Florian Henkel
Rainer Kelz
Gerhard Widmer
14
4
0
16 Oct 2019
Meta Module Network for Compositional Visual Reasoning
Meta Module Network for Compositional Visual Reasoning
Wenhu Chen
Zhe Gan
Linjie Li
Yu Cheng
W. Wang
Jingjing Liu
LRM
17
68
0
08 Oct 2019
Meta-Transfer Learning through Hard Tasks
Meta-Transfer Learning through Hard Tasks
Qianru Sun
Yaoyao Liu
Zhaozheng Chen
Tat-Seng Chua
Bernt Schiele
6
98
0
07 Oct 2019
CLEVRER: CoLlision Events for Video REpresentation and Reasoning
CLEVRER: CoLlision Events for Video REpresentation and Reasoning
Kexin Yi
Yuta Saito
Yunzhu Li
Pushmeet Kohli
Jiajun Wu
Antonio Torralba
J. Tenenbaum
NAI
24
456
0
03 Oct 2019
CoPhy: Counterfactual Learning of Physical Dynamics
CoPhy: Counterfactual Learning of Physical Dynamics
Fabien Baradel
Natalia Neverova
J. Mille
Greg Mori
Christian Wolf
CML
AI4CE
17
97
0
26 Sep 2019
Interactive Sketch & Fill: Multiclass Sketch-to-Image Translation
Interactive Sketch & Fill: Multiclass Sketch-to-Image Translation
Arna Ghosh
Richard Y. Zhang
P. Dokania
Oliver Wang
Alexei A. Efros
Philip H. S. Torr
Eli Shechtman
VLM
DiffM
12
130
0
24 Sep 2019
Explainable High-order Visual Question Reasoning: A New Benchmark and
  Knowledge-routed Network
Explainable High-order Visual Question Reasoning: A New Benchmark and Knowledge-routed Network
Qingxing Cao
Bailin Li
Xiaodan Liang
Liang Lin
17
13
0
23 Sep 2019
Meta-Neighborhoods
Meta-Neighborhoods
Siyuan Shan
Yang Li
Junier Oliva
11
13
0
18 Sep 2019
Temporal FiLM: Capturing Long-Range Sequence Dependencies with
  Feature-Wise Modulations
Temporal FiLM: Capturing Long-Range Sequence Dependencies with Feature-Wise Modulations
Sawyer Birnbaum
Volodymyr Kuleshov
S. Enam
Pang Wei Koh
Stefano Ermon
AI4TS
8
68
0
14 Sep 2019
Hierarchical Scene Coordinate Classification and Regression for Visual
  Localization
Hierarchical Scene Coordinate Classification and Regression for Visual Localization
Xiaotian Li
Shuzhe Wang
Yi Zhao
Jakob Verbeek
Juho Kannala
71
127
0
13 Sep 2019
Finding Generalizable Evidence by Learning to Convince Q&A Models
Finding Generalizable Evidence by Learning to Convince Q&A Models
Ethan Perez
Siddharth Karamcheti
Rob Fergus
Jason Weston
Douwe Kiela
Kyunghyun Cho
RALM
17
37
0
12 Sep 2019
Domain-Agnostic Few-Shot Classification by Learning Disparate Modulators
Domain-Agnostic Few-Shot Classification by Learning Disparate Modulators
Yongseok Choi
Junyoung Park
Subin Yi
D.-Y. Cho
OOD
11
0
0
11 Sep 2019
Relationships from Entity Stream
Relationships from Entity Stream
Martin Andrews
Sam Witteveen
AI4TS
GNN
11
0
0
07 Sep 2019
Supervised Multimodal Bitransformers for Classifying Images and Text
Supervised Multimodal Bitransformers for Classifying Images and Text
Douwe Kiela
Suvrat Bhooshan
Hamed Firooz
Ethan Perez
Davide Testuggine
57
241
0
06 Sep 2019
No Press Diplomacy: Modeling Multi-Agent Gameplay
No Press Diplomacy: Modeling Multi-Agent Gameplay
Philip Paquette
Yuchen Lu
Steven Bocco
Max O. Smith
Satya Ortiz-Gagné
Jonathan K. Kummerfeld
Satinder Singh
Joelle Pineau
Aaron Courville
17
57
0
04 Sep 2019
Meta-Learning with Warped Gradient Descent
Meta-Learning with Warped Gradient Descent
Sebastian Flennerhag
Andrei A. Rusu
Razvan Pascanu
Francesco Visin
Hujun Yin
R. Hadsell
6
209
0
30 Aug 2019
Is the Red Square Big? MALeViC: Modeling Adjectives Leveraging Visual
  Contexts
Is the Red Square Big? MALeViC: Modeling Adjectives Leveraging Visual Contexts
Sandro Pezzelle
Raquel Fernández
VLM
9
18
0
27 Aug 2019
LXMERT: Learning Cross-Modality Encoder Representations from
  Transformers
LXMERT: Learning Cross-Modality Encoder Representations from Transformers
Hao Hao Tan
Mohit Bansal
VLM
MLLM
52
2,444
0
20 Aug 2019
Probabilistic Reconstruction Networks for 3D Shape Inference from a
  Single Image
Probabilistic Reconstruction Networks for 3D Shape Inference from a Single Image
Roman Klokov
Jakob Verbeek
Edmond Boyer
3DV
17
14
0
20 Aug 2019
What is needed for simple spatial language capabilities in VQA?
What is needed for simple spatial language capabilities in VQA?
A. Kuhnle
Ann A. Copestake
CoGe
13
1
0
17 Aug 2019
PHYRE: A New Benchmark for Physical Reasoning
PHYRE: A New Benchmark for Physical Reasoning
A. Bakhtin
L. V. D. van der Maaten
Justin Johnson
Laura Gustafson
Ross B. Girshick
LRM
6
121
0
15 Aug 2019
Mastering emergent language: learning to guide in simulated navigation
Mastering emergent language: learning to guide in simulated navigation
Mathijs Mul
Diane Bouchacourt
Elia Bruni
LLMAG
11
9
0
14 Aug 2019
VideoNavQA: Bridging the Gap between Visual and Embodied Question
  Answering
VideoNavQA: Bridging the Gap between Visual and Embodied Question Answering
Cătălina Cangea
Eugene Belilovsky
Pietro Lió
Aaron Courville
14
16
0
14 Aug 2019
Multimodal Unified Attention Networks for Vision-and-Language
  Interactions
Multimodal Unified Attention Networks for Vision-and-Language Interactions
Zhou Yu
Yuhao Cui
Jun Yu
Dacheng Tao
Q. Tian
11
38
0
12 Aug 2019
Multi-modality Latent Interaction Network for Visual Question Answering
Multi-modality Latent Interaction Network for Visual Question Answering
Peng Gao
Haoxuan You
Zhanpeng Zhang
Xiaogang Wang
Hongsheng Li
9
77
0
10 Aug 2019
Dynamic Scale Inference by Entropy Minimization
Dynamic Scale Inference by Entropy Minimization
Dequan Wang
Evan Shelhamer
Bruno A. Olshausen
Trevor Darrell
11
7
0
08 Aug 2019
Answering Questions about Data Visualizations using Efficient Bimodal
  Fusion
Answering Questions about Data Visualizations using Efficient Bimodal Fusion
Kushal Kafle
Robik Shrestha
Brian L. Price
Scott D. Cohen
Christopher Kanan
17
58
0
05 Aug 2019
An Empirical Study of Batch Normalization and Group Normalization in
  Conditional Computation
An Empirical Study of Batch Normalization and Group Normalization in Conditional Computation
Vincent Michalski
Vikram S. Voleti
Samira Ebrahimi Kahou
Anthony Ortiz
Pascal Vincent
C. Pal
Doina Precup
BDL
8
6
0
31 Jul 2019
Learning Question-Guided Video Representation for Multi-Turn Video
  Question Answering
Learning Question-Guided Video Representation for Multi-Turn Video Question Answering
Guan-Lin Chao
Abhinav Rastogi
Semih Yavuz
Dilek Z. Hakkani-Tür
Jindong Chen
Ian Lane
6
6
0
31 Jul 2019
Segmenting Objects in Day and Night:Edge-Conditioned CNN for Thermal
  Image Semantic Segmentation
Segmenting Objects in Day and Night:Edge-Conditioned CNN for Thermal Image Semantic Segmentation
Chenglong Li
W. Xia
Yan Yan
B. Luo
Jin Tang
8
118
0
24 Jul 2019
Metalearned Neural Memory
Metalearned Neural Memory
Tsendsuren Munkhdalai
Alessandro Sordoni
Tong Wang
Adam Trischler
KELM
17
60
0
23 Jul 2019
Switchable Normalization for Learning-to-Normalize Deep Representation
Switchable Normalization for Learning-to-Normalize Deep Representation
Ping Luo
Ruimao Zhang
Jiamin Ren
Zhanglin Peng
Jingyu Li
23
73
0
22 Jul 2019
Trends in Integration of Vision and Language Research: A Survey of
  Tasks, Datasets, and Methods
Trends in Integration of Vision and Language Research: A Survey of Tasks, Datasets, and Methods
Aditya Mogadala
M. Kalimuthu
Dietrich Klakow
VLM
15
132
0
22 Jul 2019
Neural Drum Machine : An Interactive System for Real-time Synthesis of
  Drum Sounds
Neural Drum Machine : An Interactive System for Real-time Synthesis of Drum Sounds
Cyran Aouameur
P. Esling
Gaëtan Hadjeres
8
21
0
04 Jul 2019
Conditioned-U-Net: Introducing a Control Mechanism in the U-Net for
  Multiple Source Separations
Conditioned-U-Net: Introducing a Control Mechanism in the U-Net for Multiple Source Separations
Gabriel Meseguer-Brocal
Geoffroy Peeters
11
61
0
02 Jul 2019
GNN-FiLM: Graph Neural Networks with Feature-wise Linear Modulation
GNN-FiLM: Graph Neural Networks with Feature-wise Linear Modulation
Marc Brockschmidt
7
132
0
28 Jun 2019
Learning Disentangled Representations of Timbre and Pitch for Musical
  Instrument Sounds Using Gaussian Mixture Variational Autoencoders
Learning Disentangled Representations of Timbre and Pitch for Musical Instrument Sounds Using Gaussian Mixture Variational Autoencoders
Yin-Jyun Luo
Kat R. Agres
Dorien Herremans
6
46
0
19 Jun 2019
Fast and Flexible Multi-Task Classification Using Conditional Neural
  Adaptive Processes
Fast and Flexible Multi-Task Classification Using Conditional Neural Adaptive Processes
James Requeima
Jonathan Gordon
J. Bronskill
Sebastian Nowozin
Richard E. Turner
8
240
0
18 Jun 2019
Language as an Abstraction for Hierarchical Deep Reinforcement Learning
Language as an Abstraction for Hierarchical Deep Reinforcement Learning
Yiding Jiang
S. Gu
Kevin Patrick Murphy
Chelsea Finn
OffRL
10
221
0
18 Jun 2019
Task-Aware Feature Generation for Zero-Shot Compositional Learning
Task-Aware Feature Generation for Zero-Shot Compositional Learning
Xin Wang
F. I. F. Richard Yu
Trevor Darrell
Joseph E. Gonzalez
VLM
CoGe
11
16
0
11 Jun 2019
Psycholinguistics meets Continual Learning: Measuring Catastrophic
  Forgetting in Visual Question Answering
Psycholinguistics meets Continual Learning: Measuring Catastrophic Forgetting in Visual Question Answering
Claudio Greco
Barbara Plank
Raquel Fernández
Raffaella Bernardi
CLL
KELM
9
48
0
10 Jun 2019
Human-Machine Collaboration for Fast Land Cover Mapping
Human-Machine Collaboration for Fast Land Cover Mapping
Caleb Robinson
Anthony Ortiz
Kolya Malkin
Blake Elias
Andi Peng
Dan Morris
B. Dilkina
Nebojsa Jojic
24
20
0
10 Jun 2019
Attention-based Conditioning Methods for External Knowledge Integration
Attention-based Conditioning Methods for External Knowledge Integration
Katerina Margatina
Christos Baziotis
Alexandros Potamianos
4
30
0
09 Jun 2019
Two-Stage Peer-Regularized Feature Recombination for Arbitrary Image
  Style Transfer
Two-Stage Peer-Regularized Feature Recombination for Arbitrary Image Style Transfer
Jan Svoboda
Asha Anoosheh
Christian Osendorfer
Jonathan Masci
GAN
16
79
0
07 Jun 2019
FSPool: Learning Set Representations with Featurewise Sort Pooling
FSPool: Learning Set Representations with Featurewise Sort Pooling
Yan Zhang
Jonathon S. Hare
Adam Prugel-Bennett
9
75
0
06 Jun 2019
Geo-Aware Networks for Fine-Grained Recognition
Geo-Aware Networks for Fine-Grained Recognition
Grace Chu
B. Potetz
Weijun Wang
Andrew G. Howard
Yang Song
Fernando Brucher
Thomas Leung
Hartwig Adam
ObjD
25
80
0
04 Jun 2019
Adaptive Deep Kernel Learning
Adaptive Deep Kernel Learning
Prudencio Tossou
Basile Dura
François Laviolette
M. Marchand
Alexandre Lacoste
11
29
0
28 May 2019
Previous
123...2324252627
Next