Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1709.07871
Cited By
FiLM: Visual Reasoning with a General Conditioning Layer
22 September 2017
Ethan Perez
Florian Strub
H. D. Vries
Vincent Dumoulin
Aaron Courville
FAtt
AIMat
OffRL
AI4CE
Re-assign community
ArXiv
PDF
HTML
Papers citing
"FiLM: Visual Reasoning with a General Conditioning Layer"
50 / 1,304 papers shown
Title
LIFT: Latent Implicit Functions for Task- and Data-Agnostic Encoding
A. Kazerouni
Soroush Mehraban
Michael Brudno
Babak Taati
41
0
0
19 Mar 2025
Neuro Symbolic Knowledge Reasoning for Procedural Video Question Answering
Thanh-Son Nguyen
Hong Yang
Tzeh Yuan Neoh
Hao Zhang
Ee Yeo Keat
Basura Fernando
NAI
54
0
0
19 Mar 2025
SALAD: Skeleton-aware Latent Diffusion for Text-driven Motion Generation and Editing
Seokhyeon Hong
Chaelin Kim
Serin Yoon
Junghyun Nam
Sihun Cha
Junyong Noh
DiffM
VGen
68
0
0
18 Mar 2025
Context-Aware Two-Step Training Scheme for Domain Invariant Speech Separation
Wupeng Wang
Zexu Pan
Jingru Lin
Shuai Wang
Haizhou Li
53
0
0
16 Mar 2025
Image-Goal Navigation Using Refined Feature Guidance and Scene Graph Enhancement
Zhicheng Feng
Xieyuanli Chen
Chenghao Shi
Lun Luo
Z. Chen
Yun Liu
Huimin Lu
48
0
0
14 Mar 2025
Cafe-Talk: Generating 3D Talking Face Animation with Multimodal Coarse- and Fine-grained Control
Hejia Chen
Haoxian Zhang
Shoulong Zhang
Xiaoqiang Liu
Sisi Zhuang
Yuan Zhang
Pengfei Wan
Di Zhang
Shuai Li
54
1
0
14 Mar 2025
Learning Control of Neural Sound Effects Synthesis from Physically Inspired Models
Yisu Zong
Joshua Reiss
46
0
0
13 Mar 2025
Spatial-Temporal Graph Diffusion Policy with Kinematic Modeling for Bimanual Robotic Manipulation
Qi Lv
Hao Li
Xiang Deng
Rui Shao
Yinchuan Li
Jianye Hao
Longxiang Gao
Michael Yu Wang
Liqiang Nie
41
0
0
13 Mar 2025
IQPFR: An Image Quality Prior for Blind Face Restoration and Beyond
Peng Hu
Chunming He
Lei Xu
Jingduo Tian
Sina Farsiu
Y. Zhang
Pei Liu
Xiu Li
56
0
0
12 Mar 2025
Temporal Difference Flows
Jesse Farebrother
Matteo Pirotta
Andrea Tirinzoni
Rémi Munos
A. Lazaric
Ahmed Touati
AI4TS
AIFin
50
0
0
12 Mar 2025
DynCIM: Dynamic Curriculum for Imbalanced Multimodal Learning
Chengxuan Qian
Kai Han
J. Wang
Zhenlong Yuan
Rui Qian
Chongwen Lyu
Jun Chen
46
1
0
09 Mar 2025
RA-DP: Rapid Adaptive Diffusion Policy for Training-Free High-frequency Robotics Replanning
Xi Ye
Rui Heng Yang
Jun Jin
Y. K. Li
Amir Rasouli
49
0
0
06 Mar 2025
VLA Model-Expert Collaboration for Bi-directional Manipulation Learning
Tian-Yu Xiang
Ao-Qun Jin
Xiao-Hu Zhou
Mei-Jiang Gui
Xiao-Liang Xie
...
Shuang-Yi Wang
Sheng-Bin Duang
Si-Cheng Wang
Zheng Lei
Z. Hou
58
1
0
06 Mar 2025
OTTER: A Vision-Language-Action Model with Text-Aware Visual Feature Extraction
Huang Huang
Fangchen Liu
Letian Fu
Tingfan Wu
Mustafa Mukadam
Jitendra Malik
Ken Goldberg
Pieter Abbeel
LM&Ro
VLM
77
5
0
05 Mar 2025
ArticuBot: Learning Universal Articulated Object Manipulation Policy via Large Scale Simulation
Yufei Wang
Ziyu Wang
Mino Nakura
Pratik Bhowal
Chia-Liang Kuo
Yi-Ting Chen
Zackory M. Erickson
David Held
59
0
0
04 Mar 2025
Reactive Diffusion Policy: Slow-Fast Visual-Tactile Policy Learning for Contact-Rich Manipulation
Han Xue
Jieji Ren
Wendi Chen
Gu Zhang
Yuan Fang
Guoying Gu
Huazhe Xu
Cewu Lu
44
4
0
04 Mar 2025
Robustness to Geographic Distribution Shift using Location Encoders
Ruth Crasto
OOD
76
0
0
03 Mar 2025
Boolean-aware Attention for Dense Retrieval
Quan Mai
Susan Gauch
Douglas Adams
27
1
0
03 Mar 2025
XIRVIO: Critic-guided Iterative Refinement for Visual-Inertial Odometry with Explainable Adaptive Weighting
Chit Yuen Lam
Ronald Clark
Basaran Bahadir Kocer
VGen
65
0
0
01 Mar 2025
Synthesizing Individualized Aging Brains in Health and Disease with Generative Models and Parallel Transport
Jingru Fu
Yuqi Zheng
Neel Dey
D. Ferreira
R. Moreno
MedIm
24
0
0
28 Feb 2025
Deep Learning of the Evolution Operator Enables Forecasting of Out-of-Training Dynamics in Chaotic Systems
Ira J. S. Shokar
Peter H. Haynes
R. Kerswell
AI4TS
30
1
0
28 Feb 2025
DGFM: Full Body Dance Generation Driven by Music Foundation Models
Xinran Liu
Zhenhua Feng
Diptesh Kanojia
Wenwu Wang
DiffM
62
1
0
27 Feb 2025
On the Interpolation Effect of Score Smoothing
Zhengdao Chen
DiffM
71
0
0
26 Feb 2025
GCDance: Genre-Controlled 3D Full Body Dance Generation Driven By Music
Xinran Liu
Xu Dong
Diptesh Kanojia
Wenwu Wang
Zhenhua Feng
DiffM
60
0
0
25 Feb 2025
Target Speaker Extraction through Comparing Noisy Positive and Negative Audio Enrollments
Shitong Xu
Yiyuan Yang
Niki Trigoni
Andrew Markham
34
0
0
23 Feb 2025
Towards Fusing Point Cloud and Visual Representations for Imitation Learning
Atalay Donat
Xiaogang Jia
Xi Huang
Aleksandar Taranovic
Denis Blessing
Ge Li
Hongyi Zhou
Hanyi Zhang
Rudolf Lioutikov
Gerhard Neumann
3DPC
SSL
68
1
0
20 Feb 2025
MedFuncta: Modality-Agnostic Representations Based on Efficient Neural Fields
Paul Friedrich
Florentin Bieder
P. Cattin
MedIm
57
0
0
20 Feb 2025
UPCMR: A Universal Prompt-guided Model for Random Sampling Cardiac MRI Reconstruction
Donghang Lyu
Chinmay Rao
Marius Staring
M. Osch
M. Doneva
Hildo J. Lamb
Nicola Pezzotti
44
1
0
18 Feb 2025
NExT-Mol: 3D Diffusion Meets 1D Language Modeling for 3D Molecule Generation
Zhiyuan Liu
Yanchen Luo
Han Huang
Enzhi Zhang
Sihang Li
Junfeng Fang
Yaorui Shi
X. Wang
Kenji Kawaguchi
Tat-Seng Chua
100
3
0
18 Feb 2025
Predicate Hierarchies Improve Few-Shot State Classification
Emily Jin
Joy Hsu
Jiajun Wu
OffRL
67
0
0
18 Feb 2025
CoPEFT: Fast Adaptation Framework for Multi-Agent Collaborative Perception with Parameter-Efficient Fine-Tuning
Quanmin Wei
Penglin Dai
Wei Li
Bingyi Liu
Xiao-Jun Wu
41
0
0
15 Feb 2025
History-Guided Video Diffusion
Kiwhan Song
Boyuan Chen
Max Simchowitz
Yilun Du
Russ Tedrake
Vincent Sitzmann
VGen
109
7
0
10 Feb 2025
CDM: Contact Diffusion Model for Multi-Contact Point Localization
Seo Wook Han
Min Jun Kim
DiffM
30
0
0
10 Feb 2025
HOG-Diff: Higher-Order Guided Diffusion for Graph Generation
Yiming Huang
Tolga Birdal
DiffM
76
0
0
06 Feb 2025
Reinforcement Learning of Flexible Policies for Symbolic Instructions with Adjustable Mapping Specifications
Wataru Hatanaka
R. Yamashina
Takamitsu Matsubara
105
0
0
31 Jan 2025
Inductive Biases for Zero-shot Systematic Generalization in Language-informed Reinforcement Learning
Negin Hashemi Dijujin
Seyed Roozbeh Razavi Rohani
Mohammad Samiei
M. Baghshah
53
0
0
28 Jan 2025
UDBE: Unsupervised Diffusion-based Brightness Enhancement in Underwater Images
Tatiana Taís Schein
Gustavo Pereira de Almeira
Stephanie Loi Brião
Rodrigo Andrade de Bem
Felipe Gomes de Oliveira
Paulo L. J. Drews-Jr
51
0
0
28 Jan 2025
Gradient-Based Multi-Objective Deep Learning: Algorithms, Theories, Applications, and Beyond
Weiyu Chen
Xiaoyuan Zhang
Baijiong Lin
Xi Victoria Lin
Han Zhao
Qingfu Zhang
James T. Kwok
73
2
0
19 Jan 2025
Control-ITRA: Controlling the Behavior of a Driving Model
Vasileios Lioutas
Adam Scibior
Matthew Niedoba
Berend Zwartsenberg
Frank D. Wood
87
0
0
17 Jan 2025
Modeling Time-Variant Responses of Optical Compressors with Selective State Space Models
Riccardo Simionato
Stefano Fasciani
68
1
0
17 Jan 2025
Enhanced Multi-Scale Cross-Attention for Person Image Generation
Hao Tang
Ling Shao
N. Sebe
Luc Van Gool
DiffM
65
0
0
15 Jan 2025
Score-based 3D molecule generation with neural fields
Matthieu Kirchmeyer
Pedro H. O. Pinheiro
Saeed Saremi
DiffM
43
0
0
15 Jan 2025
Multi-subject Open-set Personalization in Video Generation
Tsai-Shien Chen
Aliaksandr Siarohin
Willi Menapace
Yuwei Fang
Kwot Sin Lee
Ivan Skorokhodov
Kfir Aberman
Jun-Yan Zhu
Ming Yang
Sergey Tulyakov
DiffM
VGen
69
7
0
10 Jan 2025
Data-Driven Radio Propagation Modeling using Graph Neural Networks
Adrien Bufort
Laurent Lebocq
Stefan Cathabard
GNN
38
3
0
08 Jan 2025
Noise-Robust Target-Speaker Voice Activity Detection Through Self-Supervised Pretraining
H. S. Bovbjerg
Jan Østergaard
Jesper Jensen
Zheng-Hua Tan
36
0
0
06 Jan 2025
S-Diff: An Anisotropic Diffusion Model for Collaborative Filtering in Spectral Domain
Rui Xia
Yanhua Cheng
Yongxiang Tang
Xiaocheng Liu
Xialong Liu
Lisong Wang
Peng Jiang
DiffM
33
0
0
03 Jan 2025
JADE: Joint-aware Latent Diffusion for 3D Human Generative Modeling
Haorui Ji
Rong Wang
Taojun Lin
Hongdong Li
3DH
35
1
0
31 Dec 2024
Simultaneous Music Separation and Generation Using Multi-Track Latent Diffusion Models
Tornike Karchkhadze
M. Izadi
Shlomo Dubnov
DiffM
39
2
0
31 Dec 2024
DrivingGPT: Unifying Driving World Modeling and Planning with Multi-modal Autoregressive Transformers
Yuntao Chen
Yuqi Wang
Zhaoxiang Zhang
101
7
0
24 Dec 2024
Dual Conditioned Motion Diffusion for Pose-Based Video Anomaly Detection
Andi Xu
Hongsong Wang
Pinle Ding
Jie Gui
DiffM
VGen
25
0
0
23 Dec 2024
Previous
1
2
3
4
5
...
25
26
27
Next