Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
All Papers
0 / 0 papers shown
Home
Papers
1809.02587
Cited By
Self-Supervised Generation of Spatial Audio for 360 Video
7 September 2018
Pedro Morgado
Nuno Vasconcelos
Timothy R. Langlois
Oliver Wang
MDE
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Self-Supervised Generation of Spatial Audio for 360 Video"
50 / 118 papers shown
A Comprehensive Survey on Segment Anything Model for Vision and Beyond
Chunhui Zhang
Li Liu
Yawen Cui
Guanjie Huang
Weilin Lin
Yiqian Yang
Yuehong Hu
VLM
392
127
0
14 May 2023
AV-SAM: Segment Anything Model Meets Audio-Visual Localization and Segmentation
Shentong Mo
Yapeng Tian
VLM
197
57
0
03 May 2023
Looking Similar, Sounding Different: Leveraging Counterfactual Cross-Modal Pairs for Audiovisual Representation Learning
Computer Vision and Pattern Recognition (CVPR), 2023
Nikhil Singh
Chih-Wei Wu
Iroro Orife
Mahdi M. Kalayeh
390
3
0
12 Apr 2023
Audio-Visual Grouping Network for Sound Localization from Mixtures
Computer Vision and Pattern Recognition (CVPR), 2023
Shentong Mo
Yapeng Tian
158
63
0
29 Mar 2023
Sound Localization from Motion: Jointly Learning Sound Direction and Camera Rotation
IEEE International Conference on Computer Vision (ICCV), 2023
Ziyang Chen
Shengyi Qian
Andrew Owens
236
19
0
20 Mar 2023
AV-NeRF: Learning Neural Fields for Real-World Audio-Visual Scene Synthesis
Neural Information Processing Systems (NeurIPS), 2023
Susan Liang
Chao Huang
Yapeng Tian
Anurag Kumar
Chenliang Xu
VGen
353
59
0
04 Feb 2023
Novel-View Acoustic Synthesis
Computer Vision and Pattern Recognition (CVPR), 2023
Changan Chen
Alexander Richard
Roman Shapovalov
V. Ithapu
Natalia Neverova
Kristen Grauman
Andrea Vedaldi
210
45
0
20 Jan 2023
iQuery: Instruments as Queries for Audio-Visual Sound Separation
Computer Vision and Pattern Recognition (CVPR), 2022
Jiaben Chen
Renrui Zhang
Dongze Lian
Jiaqi Yang
Ziyao Zeng
Jianbo Shi
279
39
0
07 Dec 2022
MarginNCE: Robust Sound Localization with a Negative Margin
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Sooyoung Park
Arda Senocak
Joon Son Chung
SSL
132
16
0
03 Nov 2022
Learning Audio-Visual Dynamics Using Scene Graphs for Audio Source Separation
Neural Information Processing Systems (NeurIPS), 2022
Moitreya Chatterjee
Narendra Ahuja
A. Cherian
196
15
0
29 Oct 2022
A Closer Look at Weakly-Supervised Audio-Visual Source Localization
Neural Information Processing Systems (NeurIPS), 2022
Shentong Mo
Pedro Morgado
241
79
0
30 Aug 2022
Learning in Audio-visual Context: A Review, Analysis, and New Perspective
Yake Wei
Di Hu
Yapeng Tian
Xuelong Li
292
69
0
20 Aug 2022
End-to-End Binaural Speech Synthesis
Interspeech (Interspeech), 2022
Wen-Chin Huang
Dejan Marković
Alexander Richard
I. D. Gebru
Anjali Menon
145
15
0
08 Jul 2022
Deep Learning for Omnidirectional Vision: A Survey and New Perspectives
Hao Ai
Zidong Cao
Jin Zhu
Haotian Bai
Yucheng Chen
Ling Wang
306
41
0
21 May 2022
Learning Visual Styles from Audio-Visual Associations
European Conference on Computer Vision (ECCV), 2022
Tingle Li
Yichen Liu
Andrew Owens
Hang Zhao
DiffM
180
26
0
10 May 2022
ObjectFolder 2.0: A Multisensory Object Dataset for Sim2Real Transfer
Computer Vision and Pattern Recognition (CVPR), 2022
Ruohan Gao
Zilin Si
Yen-Yu Chang
Samuel Clarke
Jeannette Bohg
Li Fei-Fei
Wenzhen Yuan
Jiajun Wu
163
105
0
05 Apr 2022
Localizing Visual Sounds the Easy Way
European Conference on Computer Vision (ECCV), 2022
Shentong Mo
Pedro Morgado
258
98
0
17 Mar 2022
Visually Supervised Speaker Detection and Localization via Microphone Array
IEEE International Workshop on Multimedia Signal Processing (MMSP), 2021
Davide Berghi
A. Hilton
Philip J. B. Jackson
177
11
0
07 Mar 2022
Audio-Visual Fusion Layers for Event Type Aware Video Recognition
Arda Senocak
Junsik Kim
Tae-Hyun Oh
H. Ryu
Dingzeyu Li
In So Kweon
121
1
0
12 Feb 2022
Learning Sound Localization Better From Semantically Similar Samples
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Arda Senocak
H. Ryu
Junsik Kim
In So Kweon
SSL
133
37
0
07 Feb 2022
Class-aware Sounding Objects Localization via Audiovisual Correspondence
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021
Di Hu
Yake Wei
Rui Qian
Weiyao Lin
Ruihua Song
Ji-Rong Wen
168
47
0
22 Dec 2021
Geometry-Aware Multi-Task Learning for Binaural Audio Generation from Video
British Machine Vision Conference (BMVC), 2021
Rishabh Garg
Ruohan Gao
Kristen Grauman
172
31
0
21 Nov 2021
Beyond Mono to Binaural: Generating Binaural Audio from Mono Audio with Depth and Cross Modal Attention
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2021
Kranti K. Parida
Siddharth Srivastava
Gaurav Sharma
MDE
188
28
0
15 Nov 2021
Structure from Silence: Learning Scene Structure from Ambient Sound
Conference on Robot Learning (CoRL), 2021
Ziyang Chen
Xixi Hu
Andrew Owens
157
30
0
10 Nov 2021
Ego4D: Around the World in 3,000 Hours of Egocentric Video
Kristen Grauman
Andrew Westbury
Eugene Byrne
Zachary Chavis
Antonino Furnari
...
Mike Zheng Shou
Antonio Torralba
Lorenzo Torresani
Mingfei Yan
Jitendra Malik
EgoV
988
1,459
0
13 Oct 2021
Pano-AVQA: Grounded Audio-Visual Question Answering on 360
∘
^\circ
∘
Videos
IEEE International Conference on Computer Vision (ICCV), 2021
Heeseung Yun
Youngjae Yu
Wonsuk Yang
Kangil Lee
Gunhee Kim
295
105
0
11 Oct 2021
Visual Scene Graphs for Audio Source Separation
IEEE International Conference on Computer Vision (ICCV), 2021
Moitreya Chatterjee
Jonathan Le Roux
Narendra Ahuja
A. Cherian
208
41
0
24 Sep 2021
V-SlowFast Network for Efficient Visual Sound Separation
Xiangjie Sui
Esa Rahtu
230
12
0
18 Sep 2021
Binaural Audio Generation via Multi-task Learning
Sijia Li
Shiguang Liu
Tianyi Zhou
135
16
0
02 Sep 2021
ASOD60K: An Audio-Induced Salient Object Detection Dataset for Panoramic Videos
Yi Zhang
260
8
0
24 Jul 2021
FoleyGAN: Visually Guided Generative Adversarial Network-Based Synchronous Sound Generation in Silent Videos
IEEE transactions on multimedia (IEEE Trans. Multimedia), 2021
Sanchita Ghose
John J. Prevost
GAN
155
30
0
20 Jul 2021
Improving Multi-Modal Learning with Uni-Modal Teachers
Chenzhuang Du
Tingle Li
Yichen Liu
Zixin Wen
Tianyu Hua
Yue Wang
Hang Zhao
107
69
0
21 Jun 2021
Learning Audio-Visual Dereverberation
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Changan Chen
Wei-Ju Sun
David Harwath
Kristen Grauman
203
35
0
14 Jun 2021
Exploiting Audio-Visual Consistency with Partial Supervision for Spatial Audio Generation
AAAI Conference on Artificial Intelligence (AAAI), 2021
Yan-Bo Lin
Y. Wang
204
22
0
03 May 2021
Points2Sound: From mono to binaural audio using 3D point cloud scenes
EURASIP Journal on Audio, Speech, and Music Processing (EURASIP J. Audio Speech Music Process), 2021
Francesc Lluís
V. Chatziioannou
A. Hofmann
3DPC
289
7
0
26 Apr 2021
On the Design of Deep Priors for Unsupervised Audio Restoration
Interspeech (Interspeech), 2021
V. Narayanaswamy
Jayaraman J. Thiagarajan
A. Spanias
AI4CE
120
6
0
14 Apr 2021
Visually Informed Binaural Audio Generation without Binaural Audios
Computer Vision and Pattern Recognition (CVPR), 2021
Xudong Xu
Hang Zhou
Ziwei Liu
Bo Dai
Xiaogang Wang
Dahua Lin
DiffM
135
67
0
13 Apr 2021
Unsupervised Sound Localization via Iterative Contrastive Learning
Computer Vision and Image Understanding (CVIU), 2021
Yan-Bo Lin
Hung-Yu Tseng
Hsin-Ying Lee
Yen-Yu Lin
Ming-Hsuan Yang
SSL
170
40
0
01 Apr 2021
Robust Audio-Visual Instance Discrimination
Computer Vision and Pattern Recognition (CVPR), 2021
Pedro Morgado
Ishan Misra
Nuno Vasconcelos
SSL
240
117
0
29 Mar 2021
Beyond Image to Depth: Improving Depth Prediction using Echoes
Computer Vision and Pattern Recognition (CVPR), 2021
Kranti K. Parida
Siddharth Srivastava
Gaurav Sharma
MDE
278
42
0
15 Mar 2021
There is More than Meets the Eye: Self-Supervised Multi-Object Detection and Tracking with Sound by Distilling Multimodal Knowledge
Computer Vision and Pattern Recognition (CVPR), 2021
Francisco Rivera Valverde
Juana Valeria Hurtado
Abhinav Valada
219
83
0
01 Mar 2021
Multimodality in VR: A survey
ACM Computing Surveys (CSUR), 2021
Daniel Martin
Sandra Malpica
Diego F. F. Gutierrez
B. Masiá
Ana Serrano
205
117
0
20 Jan 2021
VisualVoice: Audio-Visual Speech Separation with Cross-Modal Consistency
Computer Vision and Pattern Recognition (CVPR), 2021
Ruohan Gao
Kristen Grauman
CVBM
448
237
0
08 Jan 2021
Sound Synthesis, Propagation, and Rendering: A Survey
Shiguang Liu
Tianyi Zhou
352
28
0
11 Nov 2020
Learning Representations from Audio-Visual Spatial Alignment
Pedro Morgado
Yi Li
Nuno Vasconcelos
SSL
165
138
0
03 Nov 2020
Noisy Agents: Self-supervised Exploration by Predicting Auditory Events
IEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2020
Chuang Gan
Xiaoyu Chen
Phillip Isola
Antonio Torralba
J. Tenenbaum
136
7
0
27 Jul 2020
Foley Music: Learning to Generate Music from Videos
Chuang Gan
Deng Huang
Peihao Chen
J. Tenenbaum
Antonio Torralba
VGen
127
151
0
21 Jul 2020
Sep-Stereo: Visually Guided Stereophonic Audio Generation by Associating Source Separation
European Conference on Computer Vision (ECCV), 2020
Hang Zhou
Xudong Xu
Dahua Lin
Xiaogang Wang
Ziwei Liu
DiffM
191
94
0
20 Jul 2020
Leveraging Category Information for Single-Frame Visual Sound Source Separation
Xiangjie Sui
Esa Rahtu
129
9
0
15 Jul 2020
Do We Need Sound for Sound Source Localization?
Asian Conference on Computer Vision (ACCV), 2020
Takashi Oya
Shohei Iwase
Ryota Natsume
Takahiro Itazuri
Shugo Yamaguchi
Shigeo Morishima
131
25
0
11 Jul 2020
Previous
1
2
3
Next