ResearchTrend.AI
Variational Bayesian Inference for Audio-Visual Tracking of Multiple Speakers

28 September 2018
Yutong Ban
Xavier Alameda-Pineda
Laurent Girin
Radu Horaud

Papers citing "Variational Bayesian Inference for Audio-Visual Tracking of Multiple Speakers"

8 / 8 papers shown
Audio-Visual Speaker Tracking: Progress, Challenges, and Future Directions
Jinzheng Zhao
Yong-mei Xu
Xinyuan Qian
Davide Berghi
Peipei Wu
Meng Cui
Jianyuan Sun
Philip J. B. Jackson
Wenwu Wang
BDL
37
7
0
23 Oct 2023
Egocentric Auditory Attention Localization in Conversations
Fiona Ryan
Hao Jiang
Abhinav Shukla
James M. Rehg
V. Ithapu
EgoV
29
16
0
28 Mar 2023
Multi-Modal Perception Attention Network with Self-Supervised Learning for Audio-Visual Speaker Tracking
Yidi Li
Hong Liu
Hao Tang
10
20
0
14 Dec 2021
AcousticFusion: Fusing Sound Source Localization to Visual SLAM in Dynamic Environments
Tianwei Zhang
Huayan Zhang
Xiaofei Li
Junfeng Chen
Tin Lun Lam
S. Vijayakumar
14
18
0
03 Aug 2021
Multi-target DoA Estimation with an Audio-visual Fusion Mechanism
Xinyuan Qian
Maulik C. Madhavi
Zexu Pan
Jiadong Wang
Haizhou Li
24
44
0
13 May 2021
TransCenter: Transformers with Dense Representations for Multiple-Object Tracking
Yihong Xu
Yutong Ban
Guillaume Delorme
Chuang Gan
Daniela Rus
Xavier Alameda-Pineda
VOT
25
92
0
28 Mar 2021
Aggregating Long-Term Context for Learning Laparoscopic and Robot-Assisted Surgical Workflows
Yutong Ban
Guy Rosman
Thomas M. Ward
Daniel A. Hashimoto
Taisei Kondo
Hidekazu Iwaki
O. Meireles
Daniela Rus
28
19
0
01 Sep 2020
Mixture of Inference Networks for VAE-based Audio-visual Speech Enhancement
M. Sadeghi
Xavier Alameda-Pineda
13
21
0
23 Dec 2019