Personal VAD 2.0: Optimizing Personal Voice Activity Detection for
On-Device Speech Recognition

Personal VAD 2.0: Optimizing Personal Voice Activity Detection for On-Device Speech Recognition

8 April 2022

Papers citing "Personal VAD 2.0: Optimizing Personal Voice Activity Detection for On-Device Speech Recognition"

15 / 15 papers shown

Title
Noise-Robust Target-Speaker Voice Activity Detection Through Self-Supervised Pretraining H. S. Bovbjerg Jan Østergaard Jesper Jensen Zheng-Hua Tan 36 0 0 06 Jan 2025
Investigation of Speaker Representation for Target-Speaker Speech Processing Takanori Ashihara Takafumi Moriya Shota Horiguchi Junyi Peng Tsubasa Ochiai Marc Delcroix Kohei Matsuura Hiroshi Sato 26 1 0 15 Oct 2024
Overview of Speaker Modeling and Its Applications: From the Lens of Deep Speaker Representation Learning Shuai Wang Zheng-Shou Chen Kong Aik Lee Yan-min Qian Haizhou Li 26 4 0 21 Jul 2024
Comparative Analysis of Personalized Voice Activity Detection Systems: Assessing Real-World Effectiveness Satyam Kumar Sai Srujana Buddi U. Sarawgi Vineet Garg Shivesh Ranjan Ognjen Rudovic Ahmed Hussen Abdelaziz Saurabh N. Adya 53 2 0 12 Jun 2024
DiarizationLM: Speaker Diarization Post-Processing with Large Language Models Quan Wang Yiling Huang Guanlong Zhao Evan Clark Wei Xia Hank Liao AuLLM 15 8 0 07 Jan 2024
Self-supervised Pretraining for Robust Personalized Voice Activity Detection in Adverse Conditions H. S. Bovbjerg Jesper Jensen Jan Østergaard Zheng-Hua Tan VLM 19 3 0 27 Dec 2023
Personalizing Keyword Spotting with Speaker Information Beltrán Labrador Pai Zhu Guanlong Zhao Angelo Scorza Scarpati Quan Wang Alicia Lozano-Diez Alex Park Ignacio López Moreno 16 1 0 06 Nov 2023
In-Ear-Voice: Towards Milli-Watt Audio Enhancement With Bone-Conduction Microphones for In-Ear Sensing Platforms Philipp Schilk Niccolò Polvani Andrea Ronco Milos Cernak Michele Magno 21 12 0 05 Sep 2023
End-to-End Joint Target and Non-Target Speakers ASR Ryo Masumura Naoki Makishima Taiga Yamane Yoshihiko Yamazaki Saki Mizuno ... Akihiko Takashima Satoshi Suzuki Takafumi Moriya Nobukatsu Hojo Atsushi Ando 27 5 0 04 Jun 2023
SVVAD: Personal Voice Activity Detection for Speaker Verification Zuheng Kang Jianzong Wang Junqing Peng Jing Xiao 11 2 0 31 May 2023
Adaptive Endpointing with Deep Contextual Multi-armed Bandits Do June Min A. Stolcke A. Raju Colin Vaz Di He Venkatesh Ravichandran V. Trinh OffRL 27 0 0 23 Mar 2023
Personalized speech enhancement combining band-split RNN and speaker attentive module Xiaohuai Le Li Chen Chao-Peng He Yiqing Guo Cheng Chen Xianjun Xia Jing Lu 13 5 0 20 Feb 2023
BC-VAD: A Robust Bone Conduction Voice Activity Detection Niccolò Polvani Damien Ronssin Milos Cernak 19 0 0 06 Dec 2022
Taxonomic Classification of IoT Smart Home Voice Control M. Hewitt H. Cunningham 11 1 0 24 Oct 2022
Version Control of Speaker Recognition Systems Quan Wang Ignacio López Moreno 11 9 0 23 Jul 2020