Few-Shot Audio-Visual Learning of Environment Acoustics

8 June 2022

Sagnik Majumder

Kristen Grauman

Papers citing "Few-Shot Audio-Visual Learning of Environment Acoustics"

Title | Authors | Tags | Counts | Date
Multi-modal and Multi-scale Spatial Environment Understanding for Immersive Visual Text-to-Speech | Rui Liu, Shuwei He, Yifan Hu, H. Li | VLM | 87 1 0 | 16 Dec 2024
Transforming Game Play: A Comparative Study of DCQN and DTQN Architectures in Reinforcement Learning | William A. Stigall | - | 43 0 0 | 14 Oct 2024
A Comprehensive Review of Few-shot Action Recognition | Yuyang Wanyan, Xiaoshan Yang, Weiming Dong, Changsheng Xu | VLM | 56 3 0 | 20 Jul 2024
SOAF: Scene Occlusion-aware Neural Acoustic Field | Huiyu Gao, Jiahao Ma, David Ahmedt-Aristizabal, Chuong H. Nguyen, Miaomiao Liu | - | 26 2 0 | 02 Jul 2024
AV-GS: Learning Material and Geometry Aware Priors for Novel View Acoustic Synthesis | Swapnil Bhosale, Haosen Yang, Diptesh Kanojia, Jiankang Deng, Xiatian Zhu | - | 38 5 0 | 13 Jun 2024
NeRAF: 3D Scene Infused Neural Radiance and Acoustic Fields | Amandine Brunetto, Sascha Hornauer, Fabien Moutarde | - | 39 1 0 | 28 May 2024
Looking Similar, Sounding Different: Leveraging Counterfactual Cross-Modal Pairs for Audiovisual Representation Learning | Nikhil Singh, Chih-Wei Wu, Iroro Orife, Mahdi M. Kalayeh | - | 23 2 0 | 12 Apr 2023
Chat2Map: Efficient Scene Mapping from Multi-Ego Conversations | Sagnik Majumder, Hao Jiang, Pierre Moulon, E. Henderson, P. Calamia, Kristen Grauman, V. Ithapu | EgoV | 13 7 0 | 04 Jan 2023
SoundSpaces 2.0: A Simulation Platform for Visual-Acoustic Learning | Changan Chen, Carl Schissler, Sanchit Garg, Philip Kobernik, Alexander William Clegg, P. Calamia, Dhruv Batra, Philip Robinson, Kristen Grauman | 3DGS | 24 79 0 | 16 Jun 2022
Audio-Visual Floorplan Reconstruction | Senthil Purushwalkam, S. V. A. Garí, V. Ithapu, Carl Schissler, Philip Robinson, Abhinav Gupta, Kristen Grauman | VGen, 3DV | 60 41 0 | 31 Dec 2020
VisualEchoes: Spatial Image Representation Learning through Echolocation | Ruohan Gao, Changan Chen, Ziad Al-Halah, Carl Schissler, Kristen Grauman | MDE, SSL | 156 83 0 | 04 May 2020
Deeply learned face representations are sparse, selective, and robust | Yi Sun, Xiaogang Wang, Xiaoou Tang | CVBM | 239 918 0 | 03 Dec 2014