2204.03793
Cited By
Personal VAD 2.0: Optimizing Personal Voice Activity Detection for On-Device Speech Recognition
8 April 2022
Shaojin Ding
R. Rikhye
Qiao Liang
Yanzhang He
Quan Wang
A. Narayanan
Tom O'Malley
Ian McGraw
Papers citing
"Personal VAD 2.0: Optimizing Personal Voice Activity Detection for On-Device Speech Recognition"
15 / 15 papers shown
Title
Noise-Robust Target-Speaker Voice Activity Detection Through Self-Supervised Pretraining
H. S. Bovbjerg
Jan Østergaard
Jesper Jensen
Zheng-Hua Tan
36
0
0
06 Jan 2025
Investigation of Speaker Representation for Target-Speaker Speech Processing
Takanori Ashihara
Takafumi Moriya
Shota Horiguchi
Junyi Peng
Tsubasa Ochiai
Marc Delcroix
Kohei Matsuura
Hiroshi Sato
26
1
0
15 Oct 2024
Overview of Speaker Modeling and Its Applications: From the Lens of Deep Speaker Representation Learning
Shuai Wang
Zheng-Shou Chen
Kong Aik Lee
Yan-min Qian
Haizhou Li
26
4
0
21 Jul 2024
Comparative Analysis of Personalized Voice Activity Detection Systems: Assessing Real-World Effectiveness
Satyam Kumar
Sai Srujana Buddi
U. Sarawgi
Vineet Garg
Shivesh Ranjan
Ognjen Rudovic
Ahmed Hussen Abdelaziz
Saurabh N. Adya
53
2
0
12 Jun 2024
DiarizationLM: Speaker Diarization Post-Processing with Large Language Models
Quan Wang
Yiling Huang
Guanlong Zhao
Evan Clark
Wei Xia
Hank Liao
AuLLM
15
8
0
07 Jan 2024
Self-supervised Pretraining for Robust Personalized Voice Activity Detection in Adverse Conditions
H. S. Bovbjerg
Jesper Jensen
Jan Østergaard
Zheng-Hua Tan
VLM
19
3
0
27 Dec 2023
Personalizing Keyword Spotting with Speaker Information
Beltrán Labrador
Pai Zhu
Guanlong Zhao
Angelo Scorza Scarpati
Quan Wang
Alicia Lozano-Diez
Alex Park
Ignacio López Moreno
16
1
0
06 Nov 2023
In-Ear-Voice: Towards Milli-Watt Audio Enhancement With Bone-Conduction Microphones for In-Ear Sensing Platforms
Philipp Schilk
Niccolò Polvani
Andrea Ronco
Milos Cernak
Michele Magno
21
12
0
05 Sep 2023
End-to-End Joint Target and Non-Target Speakers ASR
Ryo Masumura
Naoki Makishima
Taiga Yamane
Yoshihiko Yamazaki
Saki Mizuno
...
Akihiko Takashima
Satoshi Suzuki
Takafumi Moriya
Nobukatsu Hojo
Atsushi Ando
27
5
0
04 Jun 2023
SVVAD: Personal Voice Activity Detection for Speaker Verification
Zuheng Kang
Jianzong Wang
Junqing Peng
Jing Xiao
11
2
0
31 May 2023
Adaptive Endpointing with Deep Contextual Multi-armed Bandits
Do June Min
A. Stolcke
A. Raju
Colin Vaz
Di He
Venkatesh Ravichandran
V. Trinh
OffRL
27
0
0
23 Mar 2023
Personalized speech enhancement combining band-split RNN and speaker attentive module
Xiaohuai Le
Li Chen
Chao-Peng He
Yiqing Guo
Cheng Chen
Xianjun Xia
Jing Lu
13
5
0
20 Feb 2023
BC-VAD: A Robust Bone Conduction Voice Activity Detection
Niccolò Polvani
Damien Ronssin
Milos Cernak
19
0
0
06 Dec 2022
Taxonomic Classification of IoT Smart Home Voice Control
M. Hewitt
H. Cunningham
11
1
0
24 Oct 2022
Version Control of Speaker Recognition Systems
Quan Wang
Ignacio López Moreno
11
9
0
23 Jul 2020