SoundSpaces: Audio-Visual Navigation in 3D Environments

24 December 2019
Changan Chen
Unnat Jain
Carl Schissler
S. V. A. Garí
Ziad Al-Halah
V. Ithapu
Philip Robinson
Kristen Grauman

Papers citing "SoundSpaces: Audio-Visual Navigation in 3D Environments"

22 / 22 papers shown
How Would It Sound? Material-Controlled Multimodal Acoustic Profile Generation for Indoor Scenes
Mahnoor Fatima Saad
Ziad Al-Halah
VGen
76
1
0
04 Aug 2025
Disentangled Acoustic Fields For Multimodal Physical Scene Understanding
Jie Yin
Andrew F. Luo
Yilun Du
A. Cherian
Tim K. Marks
Jonathan Le Roux
Chuang Gan
218
1
0
16 Jul 2024
Language-Guided Audio-Visual Source Separation via Trimodal Consistency
Computer Vision and Pattern Recognition (CVPR), 2023
Reuben Tan
Arijit Ray
Andrea Burns
Bryan A. Plummer
Justin Salamon
Oriol Nieto
Bryan C. Russell
Kate Saenko
200
30
0
28 Mar 2023
Beyond Visual Field of View: Perceiving 3D Environment with Echoes and Vision
Xiangjie Sui
Esa Rahtu
Hang Zhao
MDE
270
7
0
03 Jul 2022
Self-Supervised Domain Adaptation for Visual Navigation with Global Map Consistency
E. Lee
J. Kim
Young Min Kim
TTA, SSL
214
6
0
14 Oct 2021
Ego4D: Around the World in 3,000 Hours of Egocentric Video
Kristen Grauman
Andrew Westbury
Eugene Byrne
Zachary Chavis
Antonino Furnari
...
Mike Zheng Shou
Antonio Torralba
Lorenzo Torresani
Mingfei Yan
Jitendra Malik
EgoV
844
1,437
0
13 Oct 2021
Knowledge-based Embodied Question Answering
Sinan Tan
Mengmeng Ge
Di Guo
Huaping Liu
F. Sun
190
38
0
16 Sep 2021
Vision-Language Navigation: A Survey and Taxonomy
Wansen Wu
Tao Chang
Xinmeng Li
LM&Ro
290
44
0
26 Aug 2021
Touch-based Curiosity for Sparse-Reward Tasks
Sai Rajeswar
Cyril Ibrahim
Nitin Surya
Florian Golemo
David Vazquez
Rameswar Panda
Pedro H. O. Pinheiro
106
6
0
01 Apr 2021
A Survey of Embodied AI: From Simulators to Research Tasks
IEEE Transactions on Emerging Topics in Computational Intelligence (IEEE TETCI), 2021
Jiafei Duan
Samson Yu
Tangyao Li
Huaiyu Zhu
Cheston Tan
LM&Ro
557
409
0
08 Mar 2021
Audio-Visual Floorplan Reconstruction
IEEE International Conference on Computer Vision (ICCV), 2020
Senthil Purushwalkam
S. V. A. Garí
V. Ithapu
Carl Schissler
Philip Robinson
Abhinav Gupta
Kristen Grauman
VGen, 3DV
244
43
0
31 Dec 2020
Integrating Egocentric Localization for More Realistic Point-Goal Navigation Agents
Conference on Robot Learning (CoRL), 2020
Samyak Datta
Oleksandr Maksymets
Judy Hoffman
Stefan Lee
Dhruv Batra
Devi Parikh
196
47
0
07 Sep 2020
Noisy Agents: Self-supervised Exploration by Predicting Auditory Events
IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2020
Chuang Gan
Xiaoyu Chen
Phillip Isola
Antonio Torralba
J. Tenenbaum
136
7
0
27 Jul 2020
Bridging the Imitation Gap by Adaptive Insubordination
Neural Information Processing Systems (NeurIPS), 2020
Luca Weihs
Unnat Jain
Iou-Jen Liu
Jordi Salvador
Svetlana Lazebnik
Aniruddha Kembhavi
Alex Schwing
308
40
0
23 Jul 2020
Sep-Stereo: Visually Guided Stereophonic Audio Generation by Associating Source Separation
European Conference on Computer Vision (ECCV), 2020
Hang Zhou
Xudong Xu
Dahua Lin
Xiaogang Wang
Ziwei Liu
DiffM
191
94
0
20 Jul 2020
OtoWorld: Towards Learning to Separate by Learning to Move
Omkar Ranadive
Grant Gasser
David Terpay
Prem Seetharaman
125
1
0
12 Jul 2020
A Cordial Sync: Going Beyond Marginal Policies for Multi-Agent Embodied Tasks
European Conference on Computer Vision (ECCV), 2020
Unnat Jain
Luca Weihs
Eric Kolve
Ali Farhadi
Svetlana Lazebnik
Aniruddha Kembhavi
Alex Schwing
218
62
0
09 Jul 2020
ThreeDWorld: A Platform for Interactive Multi-Modal Physical Simulation
Chuang Gan
Jeremy Schwartz
S. Alter
Damian Mrowca
Martin Schrimpf
...
Antonio Torralba
J. DiCarlo
J. Tenenbaum
Josh H. McDermott
Daniel L. K. Yamins
VGen
396
324
0
09 Jul 2020
See, Hear, Explore: Curiosity via Audio-Visual Association
Neural Information Processing Systems (NeurIPS), 2020
Victoria Dean
Shubham Tulsiani
Abhinav Gupta
228
64
0
07 Jul 2020
Visually Guided Sound Source Separation using Cascaded Opponent Filter Network
Xiangjie Sui
Esa Rahtu
195
25
0
04 Jun 2020
VisualEchoes: Spatial Image Representation Learning through Echolocation
European Conference on Computer Vision (ECCV), 2020
Ruohan Gao
Changan Chen
Ziad Al-Halah
Carl Schissler
Kristen Grauman
MDE, SSL
425
90
0
04 May 2020
Look, Listen, and Act: Towards Audio-Visual Embodied Navigation
IEEE International Conference on Robotics and Automation (ICRA), 2019
Chuang Gan
Yiwei Zhang
Jiajun Wu
Boqing Gong
J. Tenenbaum
175
150
0
25 Dec 2019