1809.04437
Cited By
Frame-level speaker embeddings for text-independent speaker recognition and analysis of end-to-end model
12 September 2018
Suwon Shon
Hao Tang
James R. Glass
Papers citing "Frame-level speaker embeddings for text-independent speaker recognition and analysis of end-to-end model"
30 / 30 papers shown
Discrete Unit based Masking for Improving Disentanglement in Voice Conversion
Philip H. Lee
Ismail Rasim Ulgen
Berrak Sisman
35
0
0
17 Sep 2024
Reliable Visualization for Deep Speaker Recognition
Pengqi Li
Lantian Li
A. Hamdulla
Dong Wang
HAI
40
9
0
08 Apr 2022
Decomposed Temporal Dynamic CNN: Efficient Time-Adaptive Network for Text-Independent Speaker Verification Explained with Speaker Activation Map
Seong-Hu Kim
Hyeonuk Nam
Yong-Hwa Park
25
9
0
29 Mar 2022
Keyword localisation in untranscribed speech using visually grounded speech models
Kayode Olaleye
Dan Oneaţă
Herman Kamper
32
7
0
02 Feb 2022
Unsupervised Speech Segmentation and Variable Rate Representation Learning using Segmental Contrastive Predictive Coding
Saurabhchand Bhati
Jesús Villalba
Piotr Żelasko
Laureano Moro-Velazquez
Najim Dehak
SSL
55
22
0
05 Oct 2021
What do End-to-End Speech Models Learn about Speaker, Language and Channel Information? A Layer-wise and Neuron-level Analysis
Shammur A. Chowdhury
Nadir Durrani
Ahmed M. Ali
44
12
0
01 Jul 2021
QASR: QCRI Aljazeera Speech Resource -- A Large Scale Annotated Arabic Speech Corpus
Hamdy Mubarak
A. Hussein
Shammur A. Chowdhury
Ahmed M. Ali
18
44
0
24 Jun 2021
Neural Speaker Embeddings for Ultrasound-based Silent Speech Interfaces
Amin Honarmandi Shandiz
L. Tóth
G. Gosztolya
Alexandra Markó
Tamás Gábor Csapó
26
6
0
08 Jun 2021
The Accented English Speech Recognition Challenge 2020: Open Datasets, Tracks, Baselines, Results and Methods
Xian Shi
Fan Yu
Yizhou Lu
Yuhao Liang
Qiangze Feng
Daliang Wang
Y. Qian
Lei Xie
24
66
0
20 Feb 2021
Deep Discriminative Feature Learning for Accent Recognition
Wei Wang
Chao Zhang
Xiao-pei Wu
34
2
0
25 Nov 2020
Few Shot Text-Independent speaker verification using 3D-CNN
Prateek Mishra
27
5
0
25 Aug 2020
Disentangled speaker and nuisance attribute embedding for robust speaker verification
Woohyun Kang
Sung Hwan Mun
Min Hyun Han
N. Kim
27
17
0
07 Aug 2020
Recognition-Synthesis Based Non-Parallel Voice Conversion with Adversarial Learning
Jing-Xuan Zhang
Zhenhua Ling
Lirong Dai
15
6
0
05 Aug 2020
Singer Identification Using Convolutional Acoustic Motif Embeddings
Aitor Arronte Alvarez
Francisco Gómez-Martín
12
1
0
01 Aug 2020
Improved RawNet with Feature Map Scaling for Text-independent Speaker Verification using Raw Waveforms
Jee-weon Jung
Seung-bin Kim
Hye-jin Shim
Ju-ho Kim
Ha-Jin Yu
23
60
0
01 Apr 2020
An empirical analysis of information encoded in disentangled neural speaker representations
Raghuveer Peri
Haoqi Li
Krishna Somandepalli
Arindam Jati
Shrikanth Narayanan
DRL
27
13
0
10 Feb 2020
A study on the role of subsidiary information in replay attack spoofing detection
Jee-weon Jung
Hye-jin Shim
Hee-Soo Heo
Ha-Jin Yu
24
3
0
31 Jan 2020
Deep Representation Learning in Speech Processing: Challenges, Recent Advances, and Future Trends
S. Latif
R. Rana
Sara Khalifa
Raja Jurdak
Junaid Qadir
Björn W. Schuller
AI4TS
34
81
0
02 Jan 2020
Biometrics Recognition Using Deep Learning: A Survey
Shervin Minaee
AmirAli Abdolrashidi
Hang Su
Bennamoun
David C. Zhang
26
84
0
30 Nov 2019
A Deep Neural Network for Short-Segment Speaker Recognition
Amirhossein Hajavi
Ali Etemad
16
74
0
22 Jul 2019
Analyzing Phonetic and Graphemic Representations in End-to-End Automatic Speech Recognition
Yonatan Belinkov
Ahmed M. Ali
James R. Glass
28
32
0
09 Jul 2019
Spatial Pyramid Encoding with Convex Length Normalization for Text-Independent Speaker Verification
Youngmoon Jung
Younggwan Kim
Hyungjun Lim
Yeunju Choi
Hoirin Kim
21
32
0
19 Jun 2019
RawNet: Advanced end-to-end deep neural network using raw waveforms for text-independent speaker verification
Jee-weon Jung
Hee-Soo Heo
Ju-ho Kim
Hye-jin Shim
Ha-Jin Yu
17
140
0
17 Apr 2019
MCE 2018: The 1st Multi-target Speaker Detection and Identification Challenge Evaluation
Suwon Shon
Najim Dehak
D. Reynolds
James R. Glass
19
26
0
07 Apr 2019
VoiceID Loss: Speech Enhancement for Speaker Verification
Suwon Shon
Hao Tang
James R. Glass
VLM
9
87
0
07 Apr 2019
Utterance-level Aggregation For Speaker Recognition In The Wild
Weidi Xie
Arsha Nagrani
Joon Son Chung
Andrew Zisserman
19
343
0
26 Feb 2019
Channel adversarial training for cross-channel text-independent speaker recognition
Xin Fang
Liang Zou
Jin Li
Lei Sun
Zhenhua Ling
16
29
0
25 Feb 2019
Noise-tolerant Audio-visual Online Person Verification using an Attention-based Neural Network Fusion
Suwon Shon
Tae-Hyun Oh
James R. Glass
16
50
0
27 Nov 2018
Unsupervised Representation Learning of Speech for Dialect Identification
Suwon Shon
Wei-Ning Hsu
James R. Glass
16
13
0
12 Sep 2018
End-to-End Training Approaches for Discriminative Segmental Models
Hao Tang
Weiran Wang
Kevin Gimpel
Karen Livescu
30
7
0
21 Oct 2016