ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1809.04437
  4. Cited By
Frame-level speaker embeddings for text-independent speaker recognition
  and analysis of end-to-end model

Frame-level speaker embeddings for text-independent speaker recognition and analysis of end-to-end model

12 September 2018
Suwon Shon
Hao Tang
James R. Glass
ArXivPDFHTML

Papers citing "Frame-level speaker embeddings for text-independent speaker recognition and analysis of end-to-end model"

30 / 30 papers shown
Title
Discrete Unit based Masking for Improving Disentanglement in Voice
  Conversion
Discrete Unit based Masking for Improving Disentanglement in Voice Conversion
Philip H. Lee
Ismail Rasim Ulgen
Berrak Sisman
35
0
0
17 Sep 2024
Reliable Visualization for Deep Speaker Recognition
Reliable Visualization for Deep Speaker Recognition
Pengqi Li
Lantian Li
A. Hamdulla
Dong Wang
HAI
40
9
0
08 Apr 2022
Decomposed Temporal Dynamic CNN: Efficient Time-Adaptive Network for
  Text-Independent Speaker Verification Explained with Speaker Activation Map
Decomposed Temporal Dynamic CNN: Efficient Time-Adaptive Network for Text-Independent Speaker Verification Explained with Speaker Activation Map
Seong-Hu Kim
Hyeonuk Nam
Yong-Hwa Park
25
9
0
29 Mar 2022
Keyword localisation in untranscribed speech using visually grounded
  speech models
Keyword localisation in untranscribed speech using visually grounded speech models
Kayode Olaleye
Dan Oneaţă
Herman Kamper
32
7
0
02 Feb 2022
Unsupervised Speech Segmentation and Variable Rate Representation
  Learning using Segmental Contrastive Predictive Coding
Unsupervised Speech Segmentation and Variable Rate Representation Learning using Segmental Contrastive Predictive Coding
Saurabhchand Bhati
Jesús Villalba
Piotr Żelasko
Laureano Moro Velázquez
Najim Dehak
SSL
53
22
0
05 Oct 2021
What do End-to-End Speech Models Learn about Speaker, Language and
  Channel Information? A Layer-wise and Neuron-level Analysis
What do End-to-End Speech Models Learn about Speaker, Language and Channel Information? A Layer-wise and Neuron-level Analysis
Shammur A. Chowdhury
Nadir Durrani
Ahmed M. Ali
44
12
0
01 Jul 2021
QASR: QCRI Aljazeera Speech Resource -- A Large Scale Annotated Arabic
  Speech Corpus
QASR: QCRI Aljazeera Speech Resource -- A Large Scale Annotated Arabic Speech Corpus
Hamdy Mubarak
A. Hussein
Shammur A. Chowdhury
Ahmed M. Ali
16
44
0
24 Jun 2021
Neural Speaker Embeddings for Ultrasound-based Silent Speech Interfaces
Neural Speaker Embeddings for Ultrasound-based Silent Speech Interfaces
Amin Honarmandi Shandiz
L. Tóth
G. Gosztolya
Alexandra Markó
Tamás Gábor Csapó
26
6
0
08 Jun 2021
The Accented English Speech Recognition Challenge 2020: Open Datasets,
  Tracks, Baselines, Results and Methods
The Accented English Speech Recognition Challenge 2020: Open Datasets, Tracks, Baselines, Results and Methods
Xian Shi
Fan Yu
Yizhou Lu
Yuhao Liang
Qiangze Feng
Daliang Wang
Y. Qian
Lei Xie
24
66
0
20 Feb 2021
Deep Discriminative Feature Learning for Accent Recognition
Deep Discriminative Feature Learning for Accent Recognition
Wei Wang
Chao Zhang
Xiao-pei Wu
34
2
0
25 Nov 2020
Few Shot Text-Independent speaker verification using 3D-CNN
Few Shot Text-Independent speaker verification using 3D-CNN
Prateek Mishra
24
5
0
25 Aug 2020
Disentangled speaker and nuisance attribute embedding for robust speaker
  verification
Disentangled speaker and nuisance attribute embedding for robust speaker verification
Woohyun Kang
Sung Hwan Mun
Min Hyun Han
N. Kim
25
17
0
07 Aug 2020
Recognition-Synthesis Based Non-Parallel Voice Conversion with
  Adversarial Learning
Recognition-Synthesis Based Non-Parallel Voice Conversion with Adversarial Learning
Jing-Xuan Zhang
Zhenhua Ling
Lirong Dai
15
6
0
05 Aug 2020
Singer Identification Using Convolutional Acoustic Motif Embeddings
Singer Identification Using Convolutional Acoustic Motif Embeddings
Aitor Arronte Alvarez
Francisco Gómez-Martín
10
1
0
01 Aug 2020
Improved RawNet with Feature Map Scaling for Text-independent Speaker
  Verification using Raw Waveforms
Improved RawNet with Feature Map Scaling for Text-independent Speaker Verification using Raw Waveforms
Jee-weon Jung
Seung-bin Kim
Hye-jin Shim
Ju-ho Kim
Ha-Jin Yu
20
60
0
01 Apr 2020
An empirical analysis of information encoded in disentangled neural
  speaker representations
An empirical analysis of information encoded in disentangled neural speaker representations
Raghuveer Peri
Haoqi Li
Krishna Somandepalli
Arindam Jati
Shrikanth Narayanan
DRL
27
13
0
10 Feb 2020
A study on the role of subsidiary information in replay attack spoofing
  detection
A study on the role of subsidiary information in replay attack spoofing detection
Jee-weon Jung
Hye-jin Shim
Hee-Soo Heo
Ha-Jin Yu
24
3
0
31 Jan 2020
Deep Representation Learning in Speech Processing: Challenges, Recent
  Advances, and Future Trends
Deep Representation Learning in Speech Processing: Challenges, Recent Advances, and Future Trends
S. Latif
R. Rana
Sara Khalifa
Raja Jurdak
Junaid Qadir
Björn W. Schuller
AI4TS
34
81
0
02 Jan 2020
Biometrics Recognition Using Deep Learning: A Survey
Biometrics Recognition Using Deep Learning: A Survey
Shervin Minaee
AmirAli Abdolrashidi
Hang Su
Bennamoun
David C. Zhang
26
84
0
30 Nov 2019
A Deep Neural Network for Short-Segment Speaker Recognition
A Deep Neural Network for Short-Segment Speaker Recognition
Amirhossein Hajavi
Ali Etemad
16
74
0
22 Jul 2019
Analyzing Phonetic and Graphemic Representations in End-to-End Automatic
  Speech Recognition
Analyzing Phonetic and Graphemic Representations in End-to-End Automatic Speech Recognition
Yonatan Belinkov
Ahmed M. Ali
James R. Glass
28
32
0
09 Jul 2019
Spatial Pyramid Encoding with Convex Length Normalization for
  Text-Independent Speaker Verification
Spatial Pyramid Encoding with Convex Length Normalization for Text-Independent Speaker Verification
Youngmoon Jung
Younggwan Kim
Hyungjun Lim
Yeunju Choi
Hoirin Kim
21
32
0
19 Jun 2019
RawNet: Advanced end-to-end deep neural network using raw waveforms for
  text-independent speaker verification
RawNet: Advanced end-to-end deep neural network using raw waveforms for text-independent speaker verification
Jee-weon Jung
Hee-Soo Heo
Ju-ho Kim
Hye-jin Shim
Ha-Jin Yu
17
140
0
17 Apr 2019
MCE 2018: The 1st Multi-target Speaker Detection and Identification
  Challenge Evaluation
MCE 2018: The 1st Multi-target Speaker Detection and Identification Challenge Evaluation
Suwon Shon
Najim Dehak
D. Reynolds
James R. Glass
19
26
0
07 Apr 2019
VoiceID Loss: Speech Enhancement for Speaker Verification
VoiceID Loss: Speech Enhancement for Speaker Verification
Suwon Shon
Hao Tang
James R. Glass
VLM
9
85
0
07 Apr 2019
Utterance-level Aggregation For Speaker Recognition In The Wild
Utterance-level Aggregation For Speaker Recognition In The Wild
Weidi Xie
Arsha Nagrani
Joon Son Chung
Andrew Zisserman
19
343
0
26 Feb 2019
Channel adversarial training for cross-channel text-independent speaker
  recognition
Channel adversarial training for cross-channel text-independent speaker recognition
Xin Fang
Liang Zou
Jin Li
Lei Sun
Zhenhua Ling
14
29
0
25 Feb 2019
Noise-tolerant Audio-visual Online Person Verification using an
  Attention-based Neural Network Fusion
Noise-tolerant Audio-visual Online Person Verification using an Attention-based Neural Network Fusion
Suwon Shon
Tae-Hyun Oh
James R. Glass
11
50
0
27 Nov 2018
Unsupervised Representation Learning of Speech for Dialect
  Identification
Unsupervised Representation Learning of Speech for Dialect Identification
Suwon Shon
Wei-Ning Hsu
James R. Glass
12
13
0
12 Sep 2018
End-to-End Training Approaches for Discriminative Segmental Models
End-to-End Training Approaches for Discriminative Segmental Models
Hao Tang
Weiran Wang
Kevin Gimpel
Karen Livescu
30
7
0
21 Oct 2016
1