1809.10961
Cited By
Variational Bayesian Inference for Audio-Visual Tracking of Multiple Speakers
28 September 2018
Yutong Ban
Xavier Alameda-Pineda
Laurent Girin
Radu Horaud
Papers citing "Variational Bayesian Inference for Audio-Visual Tracking of Multiple Speakers"
8 / 8 papers shown
Title
Audio-Visual Speaker Tracking: Progress, Challenges, and Future Directions
Jinzheng Zhao
Yong-mei Xu
Xinyuan Qian
Davide Berghi
Peipei Wu
Meng Cui
Jianyuan Sun
Philip J. B. Jackson
Wenwu Wang
BDL
37
7
0
23 Oct 2023
Egocentric Auditory Attention Localization in Conversations
Fiona Ryan
Hao Jiang
Abhinav Shukla
James M. Rehg
V. Ithapu
EgoV
29
16
0
28 Mar 2023
Multi-Modal Perception Attention Network with Self-Supervised Learning for Audio-Visual Speaker Tracking
Yidi Li
Hong Liu
Hao Tang
10
20
0
14 Dec 2021
AcousticFusion: Fusing Sound Source Localization to Visual SLAM in Dynamic Environments
Tianwei Zhang
Huayan Zhang
Xiaofei Li
Junfeng Chen
Tin Lun Lam
S. Vijayakumar
14
18
0
03 Aug 2021
Multi-target DoA Estimation with an Audio-visual Fusion Mechanism
Xinyuan Qian
Maulik C. Madhavi
Zexu Pan
Jiadong Wang
Haizhou Li
24
44
0
13 May 2021
TransCenter: Transformers with Dense Representations for Multiple-Object Tracking
Yihong Xu
Yutong Ban
Guillaume Delorme
Chuang Gan
Daniela Rus
Xavier Alameda-Pineda
VOT
25
92
0
28 Mar 2021
Aggregating Long-Term Context for Learning Laparoscopic and Robot-Assisted Surgical Workflows
Yutong Ban
Guy Rosman
Thomas M. Ward
Daniel A. Hashimoto
Taisei Kondo
Hidekazu Iwaki
O. Meireles
Daniela Rus
28
19
0
01 Sep 2020
Mixture of Inference Networks for VAE-based Audio-visual Speech Enhancement
M. Sadeghi
Xavier Alameda-Pineda
13
21
0
23 Dec 2019