ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2202.00850
  4. Cited By
Active Audio-Visual Separation of Dynamic Sound Sources

Active Audio-Visual Separation of Dynamic Sound Sources

2 February 2022
Sagnik Majumder
Kristen Grauman
ArXivPDFHTML

Papers citing "Active Audio-Visual Separation of Dynamic Sound Sources"

18 / 18 papers shown
Title
Meerkat: Audio-Visual Large Language Model for Grounding in Space and
  Time
Meerkat: Audio-Visual Large Language Model for Grounding in Space and Time
Sanjoy Chowdhury
Sayan Nag
Subhrajyoti Dasgupta
Jun Chen
Mohamed Elhoseiny
Ruohan Gao
Dinesh Manocha
VLM
MLLM
34
9
0
01 Jul 2024
ActiveRIR: Active Audio-Visual Exploration for Acoustic Environment
  Modeling
ActiveRIR: Active Audio-Visual Exploration for Acoustic Environment Modeling
Arjun Somayazulu
Sagnik Majumder
Changan Chen
Kristen Grauman
24
1
0
24 Apr 2024
Cross Pseudo-Labeling for Semi-Supervised Audio-Visual Source
  Localization
Cross Pseudo-Labeling for Semi-Supervised Audio-Visual Source Localization
Yuxin Guo
Shijie Ma
Yuhao Zhao
Hu Su
Wei Zou
42
4
0
05 Mar 2024
LAVSS: Location-Guided Audio-Visual Spatial Audio Separation
LAVSS: Location-Guided Audio-Visual Spatial Audio Separation
Yuxin Ye
Wenming Yang
Yapeng Tian
26
10
0
31 Oct 2023
Novel-View Acoustic Synthesis from 3D Reconstructed Rooms
Novel-View Acoustic Synthesis from 3D Reconstructed Rooms
Byeongjoo Ahn
Karren D. Yang
Brian Hamilton
Jonathan Sheaffer
Anurag Ranjan
Miguel Sarabia
Oncel Tuzel
Jen-Hao Rick Chang
25
9
0
23 Oct 2023
Separating Invisible Sounds Toward Universal Audiovisual Scene-Aware
  Sound Separation
Separating Invisible Sounds Toward Universal Audiovisual Scene-Aware Sound Separation
Yiyang Su
A. Vosoughi
Shijian Deng
Yapeng Tian
Chenliang Xu
26
4
0
18 Oct 2023
Measuring Acoustics with Collaborative Multiple Agents
Measuring Acoustics with Collaborative Multiple Agents
Yinfeng Yu
Changan Chen
Lele Cao
Fangkai Yang
Fuchun Sun
18
1
0
09 Oct 2023
AdVerb: Visually Guided Audio Dereverberation
AdVerb: Visually Guided Audio Dereverberation
Sanjoy Chowdhury
Sreyan Ghosh
Subhrajyoti Dasgupta
Anton Ratnarajah
Utkarsh Tyagi
Dinesh Manocha
22
11
0
23 Aug 2023
Learning Spatial Features from Audio-Visual Correspondence in Egocentric
  Videos
Learning Spatial Features from Audio-Visual Correspondence in Egocentric Videos
Sagnik Majumder
Ziad Al-Halah
Kristen Grauman
SSL
EgoV
32
4
0
10 Jul 2023
RealImpact: A Dataset of Impact Sound Fields for Real Objects
RealImpact: A Dataset of Impact Sound Fields for Real Objects
Samuel Clarke
Ruohan Gao
Mason Wang
M. Rau
Julia Xu
Jui-Hsien Wang
Doug L. James
Jiajun Wu
27
9
0
16 Jun 2023
Learning Semantic-Agnostic and Spatial-Aware Representation for
  Generalizable Visual-Audio Navigation
Learning Semantic-Agnostic and Spatial-Aware Representation for Generalizable Visual-Audio Navigation
Hongchen Wang
Yuxuan Wang
Fangwei Zhong
Min-Yu Wu
Jianwei Zhang
Yizhou Wang
Hao Dong
21
6
0
21 Apr 2023
Sound Localization from Motion: Jointly Learning Sound Direction and
  Camera Rotation
Sound Localization from Motion: Jointly Learning Sound Direction and Camera Rotation
Ziyang Chen
Shengyi Qian
Andrew Owens
21
12
0
20 Mar 2023
Chat2Map: Efficient Scene Mapping from Multi-Ego Conversations
Chat2Map: Efficient Scene Mapping from Multi-Ego Conversations
Sagnik Majumder
Hao Jiang
Pierre Moulon
E. Henderson
P. Calamia
Kristen Grauman
V. Ithapu
EgoV
21
7
0
04 Jan 2023
iQuery: Instruments as Queries for Audio-Visual Sound Separation
iQuery: Instruments as Queries for Audio-Visual Sound Separation
Jiaben Chen
Renrui Zhang
Dongze Lian
Jiaqi Yang
Ziyao Zeng
Jianbo Shi
16
26
0
07 Dec 2022
Few-Shot Audio-Visual Learning of Environment Acoustics
Few-Shot Audio-Visual Learning of Environment Acoustics
Sagnik Majumder
Changan Chen
Ziad Al-Halah
Kristen Grauman
28
50
0
08 Jun 2022
VisualVoice: Audio-Visual Speech Separation with Cross-Modal Consistency
VisualVoice: Audio-Visual Speech Separation with Cross-Modal Consistency
Ruohan Gao
Kristen Grauman
CVBM
185
198
0
08 Jan 2021
Dual-Path Transformer Network: Direct Context-Aware Modeling for
  End-to-End Monaural Speech Separation
Dual-Path Transformer Network: Direct Context-Aware Modeling for End-to-End Monaural Speech Separation
Jing-jing Chen
Qi-rong Mao
Dong Liu
54
280
0
28 Jul 2020
Deeply learned face representations are sparse, selective, and robust
Deeply learned face representations are sparse, selective, and robust
Yi Sun
Xiaogang Wang
Xiaoou Tang
CVBM
248
921
0
03 Dec 2014
1