Frame-level speaker embeddings for text-independent speaker recognition and analysis of end-to-end model

12 September 2018

Hao Tang

Papers citing "Frame-level speaker embeddings for text-independent speaker recognition and analysis of end-to-end model"

30 / 30 papers shown

Title
Discrete Unit based Masking for Improving Disentanglement in Voice Conversion Philip H. Lee Ismail Rasim Ulgen Berrak Sisman 35 0 0 17 Sep 2024
Reliable Visualization for Deep Speaker Recognition Pengqi Li Lantian Li A. Hamdulla Dong Wang HAI 40 9 0 08 Apr 2022
Decomposed Temporal Dynamic CNN: Efficient Time-Adaptive Network for Text-Independent Speaker Verification Explained with Speaker Activation Map Seong-Hu Kim Hyeonuk Nam Yong-Hwa Park 25 9 0 29 Mar 2022
Keyword localisation in untranscribed speech using visually grounded speech models Kayode Olaleye Dan Oneaţă Herman Kamper 32 7 0 02 Feb 2022
Unsupervised Speech Segmentation and Variable Rate Representation Learning using Segmental Contrastive Predictive Coding Saurabhchand Bhati Jesús Villalba Piotr Żelasko Laureano Moro-Velazquez Najim Dehak SSL 55 22 0 05 Oct 2021
What do End-to-End Speech Models Learn about Speaker, Language and Channel Information? A Layer-wise and Neuron-level Analysis Shammur A. Chowdhury Nadir Durrani Ahmed M. Ali 44 12 0 01 Jul 2021
QASR: QCRI Aljazeera Speech Resource -- A Large Scale Annotated Arabic Speech Corpus Hamdy Mubarak A. Hussein Shammur A. Chowdhury Ahmed M. Ali 18 44 0 24 Jun 2021
Neural Speaker Embeddings for Ultrasound-based Silent Speech Interfaces Amin Honarmandi Shandiz L. Tóth G. Gosztolya Alexandra Markó Tamás Gábor Csapó 26 6 0 08 Jun 2021
The Accented English Speech Recognition Challenge 2020: Open Datasets, Tracks, Baselines, Results and Methods Xian Shi Fan Yu Yizhou Lu Yuhao Liang Qiangze Feng Daliang Wang Y. Qian Lei Xie 24 66 0 20 Feb 2021
Deep Discriminative Feature Learning for Accent Recognition Wei Wang Chao Zhang Xiao-pei Wu 34 2 0 25 Nov 2020
Few Shot Text-Independent speaker verification using 3D-CNN Prateek Mishra 27 5 0 25 Aug 2020
Disentangled speaker and nuisance attribute embedding for robust speaker verification Woohyun Kang Sung Hwan Mun Min Hyun Han N. Kim 27 17 0 07 Aug 2020
Recognition-Synthesis Based Non-Parallel Voice Conversion with Adversarial Learning Jing-Xuan Zhang Zhenhua Ling Lirong Dai 15 6 0 05 Aug 2020
Singer Identification Using Convolutional Acoustic Motif Embeddings Aitor Arronte Alvarez Francisco Gómez-Martín 12 1 0 01 Aug 2020
Improved RawNet with Feature Map Scaling for Text-independent Speaker Verification using Raw Waveforms Jee-weon Jung Seung-bin Kim Hye-jin Shim Ju-ho Kim Ha-Jin Yu 23 60 0 01 Apr 2020
An empirical analysis of information encoded in disentangled neural speaker representations Raghuveer Peri Haoqi Li Krishna Somandepalli Arindam Jati Shrikanth Narayanan DRL 27 13 0 10 Feb 2020
A study on the role of subsidiary information in replay attack spoofing detection Jee-weon Jung Hye-jin Shim Hee-Soo Heo Ha-Jin Yu 24 3 0 31 Jan 2020
Deep Representation Learning in Speech Processing: Challenges, Recent Advances, and Future Trends S. Latif R. Rana Sara Khalifa Raja Jurdak Junaid Qadir Björn W. Schuller AI4TS 34 81 0 02 Jan 2020
Biometrics Recognition Using Deep Learning: A Survey Shervin Minaee AmirAli Abdolrashidi Hang Su Bennamoun David C. Zhang 26 84 0 30 Nov 2019
A Deep Neural Network for Short-Segment Speaker Recognition Amirhossein Hajavi Ali Etemad 16 74 0 22 Jul 2019
Analyzing Phonetic and Graphemic Representations in End-to-End Automatic Speech Recognition Yonatan Belinkov Ahmed M. Ali James R. Glass 28 32 0 09 Jul 2019
Spatial Pyramid Encoding with Convex Length Normalization for Text-Independent Speaker Verification Youngmoon Jung Younggwan Kim Hyungjun Lim Yeunju Choi Hoirin Kim 21 32 0 19 Jun 2019
RawNet: Advanced end-to-end deep neural network using raw waveforms for text-independent speaker verification Jee-weon Jung Hee-Soo Heo Ju-ho Kim Hye-jin Shim Ha-Jin Yu 17 140 0 17 Apr 2019
MCE 2018: The 1st Multi-target Speaker Detection and Identification Challenge Evaluation Suwon Shon Najim Dehak D. Reynolds James R. Glass 19 26 0 07 Apr 2019
VoiceID Loss: Speech Enhancement for Speaker Verification Suwon Shon Hao Tang James R. Glass VLM 9 87 0 07 Apr 2019
Utterance-level Aggregation For Speaker Recognition In The Wild Weidi Xie Arsha Nagrani Joon Son Chung Andrew Zisserman 19 343 0 26 Feb 2019
Channel adversarial training for cross-channel text-independent speaker recognition Xin Fang Liang Zou Jin Li Lei Sun Zhenhua Ling 16 29 0 25 Feb 2019
Noise-tolerant Audio-visual Online Person Verification using an Attention-based Neural Network Fusion Suwon Shon Tae-Hyun Oh James R. Glass 16 50 0 27 Nov 2018
Unsupervised Representation Learning of Speech for Dialect Identification Suwon Shon Wei-Ning Hsu James R. Glass 16 13 0 12 Sep 2018
End-to-End Training Approaches for Discriminative Segmental Models Hao Tang Weiran Wang Kevin Gimpel Karen Livescu 30 7 0 21 Oct 2016