Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1709.07871
Cited By
FiLM: Visual Reasoning with a General Conditioning Layer
22 September 2017
Ethan Perez
Florian Strub
H. D. Vries
Vincent Dumoulin
Aaron Courville
FAtt
AIMat
OffRL
AI4CE
Re-assign community
ArXiv
PDF
HTML
Papers citing
"FiLM: Visual Reasoning with a General Conditioning Layer"
50 / 1,308 papers shown
Title
Effective Noise-aware Data Simulation for Domain-adaptive Speech Enhancement Leveraging Dynamic Stochastic Perturbation
Chien-Chun Wang
Li-Wei Chen
Hung-Shin Lee
Berlin Chen
Hsin-Min Wang
27
1
0
03 Sep 2024
Semantically Controllable Augmentations for Generalizable Robot Learning
Zoey Chen
Zhao Mandi
Homanga Bharadhwaj
Mohit Sharma
Shuran Song
Abhishek Gupta
Vikash Kumar
LM&Ro
29
5
0
02 Sep 2024
Affordance-based Robot Manipulation with Flow Matching
Fan Zhang
Michael Gienger
47
5
0
02 Sep 2024
AdaNAT: Exploring Adaptive Policy for Token-Based Image Generation
Zanlin Ni
Yulin Wang
Renping Zhou
Rui Lu
Jiayi Guo
Jinyi Hu
Zhiyuan Liu
Yuan Yao
Gao Huang
25
7
0
31 Aug 2024
GANs Conditioning Methods: A Survey
Anis Bourou
Valérie Mezger
Auguste Genovesio
EGVM
AI4CE
41
0
0
28 Aug 2024
On latent dynamics learning in nonlinear reduced order modeling
N. Farenga
S. Fresca
Simone Brivio
Andrea Manzoni
AI4CE
34
1
0
27 Aug 2024
Automating Deformable Gasket Assembly
S. Adebola
Tara Sadjadpour
Karim El-Refai
Will Panitch
Zehan Ma
...
Tianshuang Qiu
Shreya Ganti
Charlotte Le
Jaimyn Drake
Ken Goldberg
AI4CE
32
0
0
22 Aug 2024
Multi-Style Facial Sketch Synthesis through Masked Generative Modeling
Bowen Sun
Guo Lu
Shibao Zheng
CVBM
25
0
0
22 Aug 2024
Scalable Autoregressive Image Generation with Mamba
Haopeng Li
Jinyue Yang
Kexin Wang
Xuerui Qiu
Yuhong Chou
Xin Li
Guoqi Li
Mamba
53
12
0
22 Aug 2024
Scaling Cross-Embodied Learning: One Policy for Manipulation, Navigation, Locomotion and Aviation
Ria Doshi
Homer Walke
Oier Mees
Sudeep Dasari
Sergey Levine
37
46
0
21 Aug 2024
DisMix: Disentangling Mixtures of Musical Instruments for Source-level Pitch and Timbre Manipulation
Yin-Jyun Luo
K. Cheuk
Woosung Choi
Toshimitsu Uesaka
Keisuke Toyama
...
Chieh-Hsin Lai
Yuhta Takida
Wei-Hsiang Liao
Simon Dixon
Yuki Mitsufuji
CoGe
36
2
0
20 Aug 2024
Hologram Reasoning for Solving Algebra Problems with Geometry Diagrams
Litian Huang
Xinguo Yu
Feng Xiong
Bin He
Shengbing Tang
Jiawen Fu
16
1
0
20 Aug 2024
Mitigating Degree Bias in Signed Graph Neural Networks
Fang He
Jinhai Deng
Ruizhan Xue
Maojun Wang
Zeyu Zhang
30
3
0
16 Aug 2024
ViMo: Generating Motions from Casual Videos
Liangdong Qiu
Chengxing Yu
Yanran Li
Zhao Wang
Haibin Huang
Chongyang Ma
Di Zhang
Pengfei Wan
Xiaoguang Han
VGen
31
2
0
13 Aug 2024
FoVNet: Configurable Field-of-View Speech Enhancement with Low Computation and Distortion for Smart Glasses
Zhongweiyang Xu
Ali Aroudi
Ke Tan
Ashutosh Pandey
Jung-Suk Lee
Buye Xu
Francesco Nesta
24
1
0
12 Aug 2024
MetMamba: Regional Weather Forecasting with Spatial-Temporal Mamba Model
Haoyu Qin
Yungang Chen
Qianchuan Jiang
Pengchao Sun
Xiancai Ye
Chao Lin
Mamba
AI4CE
31
1
0
12 Aug 2024
Semi-Supervised One-Shot Imitation Learning
Philipp Wu
Kourosh Hakhamaneshi
Yuqing Du
Igor Mordatch
Aravind Rajeswaran
Pieter Abbeel
SSL
20
1
0
09 Aug 2024
Hyper Recurrent Neural Network: Condition Mechanisms for Black-box Audio Effect Modeling
Yen-Tung Yeh
Wen-Yi Hsiao
Yi-Hsuan Yang
24
6
0
09 Aug 2024
Puppet-Master: Scaling Interactive Video Generation as a Motion Prior for Part-Level Dynamics
Ruining Li
Chuanxia Zheng
Christian Rupprecht
Andrea Vedaldi
DiffM
VGen
36
9
0
08 Aug 2024
Achieving Human Level Competitive Robot Table Tennis
David B. DÁmbrosio
Saminda Abeyruwan
L. Graesser
Atil Iscen
H. B. Amor
...
Vikas Sindhwani
Vincent Vanhoucke
Grace Vesom
P. Xu
Pannag R. Sanketi
87
14
0
07 Aug 2024
Tora: Trajectory-oriented Diffusion Transformer for Video Generation
Zhenghao Zhang
Junchao Liao
Menghao Li
Zuozhuo Dai
Bingxue Qiu
Hao Hu
Shaowei Cai
Weizhi Wang
VGen
42
43
0
31 Jul 2024
Efficient Pareto Manifold Learning with Low-Rank Structure
Weiyu Chen
James T. Kwok
28
6
0
30 Jul 2024
PianoMime: Learning a Generalist, Dexterous Piano Player from Internet Demonstrations
Cheng Qian
Julen Urain
Kevin Zakka
Jan Peters
17
4
0
25 Jul 2024
QueST: Self-Supervised Skill Abstractions for Learning Continuous Control
Atharva Mete
Haotian Xue
Albert Wilcox
Yongxin Chen
Animesh Garg
SSL
21
16
0
22 Jul 2024
Overview of Speaker Modeling and Its Applications: From the Lens of Deep Speaker Representation Learning
Shuai Wang
Zheng-Shou Chen
Kong Aik Lee
Yan-min Qian
Haizhou Li
26
4
0
21 Jul 2024
Diff4VS: HIV-inhibiting Molecules Generation with Classifier Guidance Diffusion for Virtual Screening
Jiaqing Lyu
Changjie Chen
Bing Liang
Yijia Zhang
18
0
0
20 Jul 2024
Improved Esophageal Varices Assessment from Non-Contrast CT Scans
Chunli Li
Xiaoming Zhang
Yuan Gao
Xiaoli Yin
Le Lu
Ling Zhang
Ke Yan
Yu Shi
31
0
0
18 Jul 2024
VegeDiff: Latent Diffusion Model for Geospatial Vegetation Forecasting
Sijie Zhao
Hao Chen
Xue-liang Zhang
P. Xiao
Lei Bai
Wanli Ouyang
DiffM
38
0
0
17 Jul 2024
Universal Sound Separation with Self-Supervised Audio Masked Autoencoder
Junqi Zhao
Xubo Liu
Jinzheng Zhao
Yiitan Yuan
Qiuqiang Kong
Mark D. Plumbley
Wenwu Wang
25
3
0
16 Jul 2024
Target conversation extraction: Source separation using turn-taking dynamics
Tuochao Chen
Qirui Wang
Bohan Wu
Malek Itani
Sefik Emre Eskimez
Takuya Yoshioka
Shyamnath Gollakota
20
4
0
15 Jul 2024
Towards zero-shot amplifier modeling: One-to-many amplifier modeling via tone embedding control
Yu-Hua Chen
Yen-Tung Yeh
Yuan-Chiao Cheng
Jui-Te Wu
Yu-Hsiang Ho
J. Jang
Yi-Hsuan Yang
30
5
0
15 Jul 2024
Let Me DeCode You: Decoder Conditioning with Tabular Data
Tomasz Szczepañski
Michal K. Grzeszczyk
Szymon Płotka
Arleta Adamowicz
Piotr Fudalej
Przemysław Korzeniowski
Tomasz Trzciñski
Arkadiusz Sitek
AI4CE
24
1
0
12 Jul 2024
OVExp: Open Vocabulary Exploration for Object-Oriented Navigation
Meng Wei
Tai Wang
Yilun Chen
Hanqing Wang
Jiangmiao Pang
Xihui Liu
VLM
47
3
0
12 Jul 2024
Generative Image as Action Models
Mohit Shridhar
Yat Long Lo
Stephen James
38
9
0
10 Jul 2024
InstructLayout: Instruction-Driven 2D and 3D Layout Synthesis with Semantic Graph Prior
Chenguo Lin
Yuchen Lin
Panwang Pan
Xuanyang Zhang
Yadong Mu
3DV
46
1
0
10 Jul 2024
Knowledge boosting during low-latency inference
Vidya Srinivas
Malek Itani
Tuochao Chen
Sefik Emre Eskimez
Takuya Yoshioka
Shyamnath Gollakota
24
2
0
09 Jul 2024
AutoTask: Task Aware Multi-Faceted Single Model for Multi-Task Ads Relevance
Shouchang Guo
Sonam Damani
Keng-hao Chang
31
0
0
09 Jul 2024
3D Vessel Graph Generation Using Denoising Diffusion
Chinmay Prabhakar
Suprosanna Shit
Fabio Musio
Kaiyuan Yang
Tamaz Amiranashvili
Johannes C. Paetzold
Hongwei Bran Li
Bjoern H. Menze
DiffM
MedIm
47
2
0
08 Jul 2024
Multimodal Classification via Modal-Aware Interactive Enhancement
Qing-Yuan Jiang
Zhouyang Chi
Yang Yang
31
3
0
05 Jul 2024
EquiBot: SIM(3)-Equivariant Diffusion Policy for Generalizable and Data Efficient Learning
Jingyun Yang
Zi-ang Cao
Congyue Deng
Rika Antonova
Shuran Song
Jeannette Bohg
DiffM
51
29
0
01 Jul 2024
Language-Guided Object-Centric Diffusion Policy for Generalizable and Collision-Aware Robotic Manipulation
Hang Li
Qian Feng
Zhi Zheng
Jianxiang Feng
Zhaopeng Chen
Alois Knoll
21
1
0
29 Jun 2024
A Stem-Agnostic Single-Decoder System for Music Source Separation Beyond Four Stems
Karn N. Watcharasupat
Alexander Lerch
21
1
0
26 Jun 2024
Towards diffusion models for large-scale sea-ice modelling
Tobias S. Finn
Charlotte Durand
A. Farchi
Marc Bocquet
J. Brajard
34
2
0
26 Jun 2024
Generative artificial intelligence in ophthalmology: multimodal retinal images for the diagnosis of Alzheimer's disease with convolutional neural networks
I. R. Slootweg
M. Thach
K. Curro-Tafili
F. D. Verbraak
F. H. Bouwman
Y. Pijnenburg
J. F. Boer
J.H.P de Kwisthout
L. Bagheriye
P. J. González
MedIm
DiffM
46
0
0
26 Jun 2024
Diffusion Model-Based Video Editing: A Survey
Wenhao Sun
Rong-Cheng Tu
Jingyi Liao
Dacheng Tao
VGen
58
22
0
26 Jun 2024
Unified Auto-Encoding with Masked Diffusion
Philippe Hansen-Estruch
S. Vishwanath
Amy Zhang
Manan Tomar
DiffM
55
1
0
25 Jun 2024
SpecMaskGIT: Masked Generative Modeling of Audio Spectrograms for Efficient Audio Synthesis and Beyond
Marco Comunità
Zhi-Wei Zhong
Akira Takahashi
Shiqi Yang
Mengjie Zhao
Koichi Saito
Yukara Ikemiya
Takashi Shibuya
Shusuke Takahashi
Yuki Mitsufuji
61
2
0
25 Jun 2024
Towards Efficient and Scalable Training of Differentially Private Deep Learning
Sebastian Rodriguez Beltran
Marlon Tobaben
Niki Loppi
Antti Honkela
19
0
0
25 Jun 2024
F-FOMAML: GNN-Enhanced Meta-Learning for Peak Period Demand Forecasting with Proxy Data
Zexing Xu
Linjun Zhang
Sitan Yang
Rasoul Etesami
Hanghang Tong
Huan Zhang
Jiawei Han
AI4TS
23
2
0
23 Jun 2024
Multimodal Multilabel Classification by CLIP
Yanming Guo
VLM
16
0
0
23 Jun 2024
Previous
1
2
3
4
5
6
...
25
26
27
Next