1912.11474
Cited By
v1
v2
v3 (latest)
SoundSpaces: Audio-Visual Navigation in 3D Environments
24 December 2019
Changan Chen
Unnat Jain
Carl Schissler
S. V. A. Garí
Ziad Al-Halah
V. Ithapu
Philip Robinson
Kristen Grauman
Papers citing
"SoundSpaces: Audio-Visual Navigation in 3D Environments"
22 / 22 papers shown
Title
How Would It Sound? Material-Controlled Multimodal Acoustic Profile Generation for Indoor Scenes
Mahnoor Fatima Saad
Ziad Al-Halah
VGen
76
1
0
04 Aug 2025
Disentangled Acoustic Fields For Multimodal Physical Scene Understanding
Jie Yin
Andrew F. Luo
Yilun Du
A. Cherian
Tim K. Marks
Jonathan Le Roux
Chuang Gan
218
1
0
16 Jul 2024
Language-Guided Audio-Visual Source Separation via Trimodal Consistency
Computer Vision and Pattern Recognition (CVPR), 2023
Reuben Tan
Arijit Ray
Andrea Burns
Bryan A. Plummer
Justin Salamon
Oriol Nieto
Bryan C. Russell
Kate Saenko
200
30
0
28 Mar 2023
Beyond Visual Field of View: Perceiving 3D Environment with Echoes and Vision
Xiangjie Sui
Esa Rahtu
Hang Zhao
MDE
270
7
0
03 Jul 2022
Self-Supervised Domain Adaptation for Visual Navigation with Global Map Consistency
E. Lee
J. Kim
Young Min Kim
TTA
SSL
214
6
0
14 Oct 2021
Ego4D: Around the World in 3,000 Hours of Egocentric Video
Kristen Grauman
Andrew Westbury
Eugene Byrne
Zachary Chavis
Antonino Furnari
...
Mike Zheng Shou
Antonio Torralba
Lorenzo Torresani
Mingfei Yan
Jitendra Malik
EgoV
844
1,437
0
13 Oct 2021
Knowledge-based Embodied Question Answering
Sinan Tan
Mengmeng Ge
Di Guo
Huaping Liu
F. Sun
190
38
0
16 Sep 2021
Vision-Language Navigation: A Survey and Taxonomy
Wansen Wu
Tao Chang
Xinmeng Li
LM&Ro
290
44
0
26 Aug 2021
Touch-based Curiosity for Sparse-Reward Tasks
Sai Rajeswar
Cyril Ibrahim
Nitin Surya
Florian Golemo
David Vazquez
Rameswar Panda
Pedro H. O. Pinheiro
106
6
0
01 Apr 2021
A Survey of Embodied AI: From Simulators to Research Tasks
IEEE Transactions on Emerging Topics in Computational Intelligence (IEEE TETCI), 2021
Jiafei Duan
Samson Yu
Tangyao Li
Huaiyu Zhu
Cheston Tan
LM&Ro
557
409
0
08 Mar 2021
Audio-Visual Floorplan Reconstruction
IEEE International Conference on Computer Vision (ICCV), 2021
Senthil Purushwalkam
S. V. A. Garí
V. Ithapu
Carl Schissler
Philip Robinson
Abhinav Gupta
Kristen Grauman
VGen
3DV
244
43
0
31 Dec 2020
Integrating Egocentric Localization for More Realistic Point-Goal Navigation Agents
Conference on Robot Learning (CoRL), 2020
Samyak Datta
Oleksandr Maksymets
Judy Hoffman
Stefan Lee
Dhruv Batra
Devi Parikh
196
47
0
07 Sep 2020
Noisy Agents: Self-supervised Exploration by Predicting Auditory Events
IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2020
Chuang Gan
Xiaoyu Chen
Phillip Isola
Antonio Torralba
J. Tenenbaum
136
7
0
27 Jul 2020
Bridging the Imitation Gap by Adaptive Insubordination
Neural Information Processing Systems (NeurIPS), 2020
Luca Weihs
Unnat Jain
Iou-Jen Liu
Jordi Salvador
Svetlana Lazebnik
Aniruddha Kembhavi
Alex Schwing
308
40
0
23 Jul 2020
Sep-Stereo: Visually Guided Stereophonic Audio Generation by Associating Source Separation
European Conference on Computer Vision (ECCV), 2020
Hang Zhou
Xudong Xu
Dahua Lin
Xiaogang Wang
Ziwei Liu
DiffM
191
94
0
20 Jul 2020
OtoWorld: Towards Learning to Separate by Learning to Move
Omkar Ranadive
Grant Gasser
David Terpay
Prem Seetharaman
125
1
0
12 Jul 2020
A Cordial Sync: Going Beyond Marginal Policies for Multi-Agent Embodied Tasks
European Conference on Computer Vision (ECCV), 2020
Unnat Jain
Luca Weihs
Eric Kolve
Ali Farhadi
Svetlana Lazebnik
Aniruddha Kembhavi
Alex Schwing
218
62
0
09 Jul 2020
ThreeDWorld: A Platform for Interactive Multi-Modal Physical Simulation
Chuang Gan
Jeremy Schwartz
S. Alter
Damian Mrowca
Martin Schrimpf
...
Antonio Torralba
J. DiCarlo
J. Tenenbaum
Josh H. McDermott
Daniel L. K. Yamins
VGen
396
324
0
09 Jul 2020
See, Hear, Explore: Curiosity via Audio-Visual Association
Neural Information Processing Systems (NeurIPS), 2020
Victoria Dean
Shubham Tulsiani
Abhinav Gupta
228
64
0
07 Jul 2020
Visually Guided Sound Source Separation using Cascaded Opponent Filter Network
Xiangjie Sui
Esa Rahtu
195
25
0
04 Jun 2020
VisualEchoes: Spatial Image Representation Learning through Echolocation
European Conference on Computer Vision (ECCV), 2020
Ruohan Gao
Changan Chen
Ziad Al-Halah
Carl Schissler
Kristen Grauman
MDE
SSL
425
90
0
04 May 2020
Look, Listen, and Act: Towards Audio-Visual Embodied Navigation
IEEE International Conference on Robotics and Automation (ICRA), 2020
Chuang Gan
Yiwei Zhang
Jiajun Wu
Boqing Gong
J. Tenenbaum
175
150
0
25 Dec 2019