Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1912.11684
Cited By
v1
v2 (latest)
Look, Listen, and Act: Towards Audio-Visual Embodied Navigation
IEEE International Conference on Robotics and Automation (ICRA), 2019
25 December 2019
Chuang Gan
Yiwei Zhang
Jiajun Wu
Boqing Gong
J. Tenenbaum
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Look, Listen, and Act: Towards Audio-Visual Embodied Navigation"
27 / 77 papers shown
Learning Audio-Visual Dereverberation
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Changan Chen
Wei-Ju Sun
David Harwath
Kristen Grauman
220
35
0
14 Jun 2021
RobustNav: Towards Benchmarking Robustness in Embodied Navigation
IEEE International Conference on Computer Vision (ICCV), 2021
Prithvijit Chattopadhyay
Judy Hoffman
Roozbeh Mottaghi
Aniruddha Kembhavi
285
68
0
08 Jun 2021
VSGM -- Enhance robot task understanding ability through visual semantic graph
Cheng-Yu Tsai
Mu-Chun Su
232
0
0
19 May 2021
Move2Hear: Active Audio-Visual Source Separation
IEEE International Conference on Computer Vision (ICCV), 2021
Sagnik Majumder
Ziad Al-Halah
Kristen Grauman
220
47
0
15 May 2021
Visually Guided Sound Source Separation and Localization using Self-Supervised Motion Representations
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2021
Xiangjie Sui
Esa Rahtu
154
30
0
17 Apr 2021
Can audio-visual integration strengthen robustness under multimodal attacks?
Computer Vision and Pattern Recognition (CVPR), 2021
Yapeng Tian
Chenliang Xu
AAML
304
41
0
05 Apr 2021
The ThreeDWorld Transport Challenge: A Visually Guided Task-and-Motion Planning Benchmark for Physically Realistic Embodied AI
IEEE International Conference on Robotics and Automation (ICRA), 2021
Chuang Gan
Siyuan Zhou
Jeremy Schwartz
S. Alter
Abhishek Bhandwaldar
...
Daniel L. K. Yamins
J. DiCarlo
Josh H. McDermott
Antonio Torralba
J. Tenenbaum
LM&Ro
250
89
0
25 Mar 2021
A Survey of Embodied AI: From Simulators to Research Tasks
IEEE Transactions on Emerging Topics in Computational Intelligence (IEEE TETCI), 2021
Jiafei Duan
Samson Yu
Tangyao Li
Huaiyu Zhu
Cheston Tan
LM&Ro
623
420
0
08 Mar 2021
Environment Predictive Coding for Embodied Agents
Santhosh Kumar Ramakrishnan
Tushar Nagarajan
Ziad Al-Halah
Kristen Grauman
207
14
0
03 Feb 2021
A Survey on Deep Reinforcement Learning for Audio-Based Applications
Artificial Intelligence Review (AIR), 2021
S. Latif
Heriberto Cuayáhuitl
Farrukh Pervez
Fahad Shamshad
Hafiz Shehbaz Ali
Xiaoshi Zhong
OffRL
338
88
0
01 Jan 2021
Audio-Visual Floorplan Reconstruction
IEEE International Conference on Computer Vision (ICCV), 2020
Senthil Purushwalkam
S. V. A. Garí
V. Ithapu
Carl Schissler
Philip Robinson
Abhinav Gupta
Kristen Grauman
VGen
3DV
262
43
0
31 Dec 2020
Semantic Audio-Visual Navigation
Computer Vision and Pattern Recognition (CVPR), 2020
Changan Chen
Ziad Al-Halah
Kristen Grauman
297
117
0
21 Dec 2020
Learning to Set Waypoints for Audio-Visual Navigation
Changan Chen
Sagnik Majumder
Ziad Al-Halah
Ruohan Gao
Santhosh Kumar Ramakrishnan
Kristen Grauman
SSL
244
5
0
21 Aug 2020
Occupancy Anticipation for Efficient Exploration and Navigation
Santhosh Kumar Ramakrishnan
Ziad Al-Halah
Kristen Grauman
EgoV
3DPC
318
187
0
21 Aug 2020
Noisy Agents: Self-supervised Exploration by Predicting Auditory Events
IEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2020
Chuang Gan
Xiaoyu Chen
Phillip Isola
Antonio Torralba
J. Tenenbaum
149
7
0
27 Jul 2020
Foley Music: Learning to Generate Music from Videos
Chuang Gan
Deng Huang
Peihao Chen
J. Tenenbaum
Antonio Torralba
VGen
136
151
0
21 Jul 2020
Sep-Stereo: Visually Guided Stereophonic Audio Generation by Associating Source Separation
European Conference on Computer Vision (ECCV), 2020
Hang Zhou
Xudong Xu
Dahua Lin
Xiaogang Wang
Ziwei Liu
DiffM
208
95
0
20 Jul 2020
Generating Visually Aligned Sound from Videos
IEEE Transactions on Image Processing (TIP), 2020
Peihao Chen
Yang Zhang
Zhuliang Yu
Hongdong Xiao
Deng Huang
Chuang Gan
VGen
207
113
0
14 Jul 2020
Multiple Sound Sources Localization from Coarse to Fine
European Conference on Computer Vision (ECCV), 2020
Rui Qian
Di Hu
Heinrich Dinkel
Mengyue Wu
N. Xu
Weiyao Lin
269
179
0
13 Jul 2020
OtoWorld: Towards Learning to Separate by Learning to Move
Omkar Ranadive
Grant Gasser
David Terpay
Prem Seetharaman
126
1
0
12 Jul 2020
ThreeDWorld: A Platform for Interactive Multi-Modal Physical Simulation
Chuang Gan
Jeremy Schwartz
S. Alter
Damian Mrowca
Martin Schrimpf
...
Antonio Torralba
J. DiCarlo
J. Tenenbaum
Josh H. McDermott
Daniel L. K. Yamins
VGen
447
351
0
09 Jul 2020
See, Hear, Explore: Curiosity via Audio-Visual Association
Neural Information Processing Systems (NeurIPS), 2020
Victoria Dean
Shubham Tulsiani
Abhinav Gupta
267
64
0
07 Jul 2020
Visually Guided Sound Source Separation using Cascaded Opponent Filter Network
Xiangjie Sui
Esa Rahtu
199
24
0
04 Jun 2020
VisualEchoes: Spatial Image Representation Learning through Echolocation
European Conference on Computer Vision (ECCV), 2020
Ruohan Gao
Changan Chen
Ziad Al-Halah
Carl Schissler
Kristen Grauman
MDE
SSL
467
90
0
04 May 2020
Music Gesture for Visual Sound Separation
Computer Vision and Pattern Recognition (CVPR), 2020
Chuang Gan
Deng Huang
Hang Zhao
J. Tenenbaum
Antonio Torralba
238
214
0
20 Apr 2020
Vision-Dialog Navigation by Exploring Cross-modal Memory
Computer Vision and Pattern Recognition (CVPR), 2020
Yi Zhu
Fengda Zhu
Zhaohuan Zhan
Bingqian Lin
Jianbin Jiao
Xiaojun Chang
Xiaodan Liang
VLM
185
52
0
15 Mar 2020
Robust Robotic Pouring using Audition and Haptics
IEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2020
Hongzhuo Liang
Chuangchuang Zhou
Shuang Li
Xiaojian Ma
Norman Hendrich
Timo Gerkmann
F. Sun
Marcus Stoffel
Jianwei Zhang
387
22
0
29 Feb 2020
Previous
1
2
Page 2 of 2