ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1803.10963
  4. Cited By
Attentive Statistics Pooling for Deep Speaker Embedding

Attentive Statistics Pooling for Deep Speaker Embedding

29 March 2018
K. Okabe
Takafumi Koshinaka
K. Shinoda
ArXivPDFHTML

Papers citing "Attentive Statistics Pooling for Deep Speaker Embedding"

50 / 231 papers shown
Title
Speaker Retrieval in the Wild: Challenges, Effectiveness and Robustness
Speaker Retrieval in the Wild: Challenges, Effectiveness and Robustness
Erfan Loweimi
Mengjie Qian
Kate Knill
Mark J. F. Gales
46
0
0
26 Apr 2025
SoCov: Semi-Orthogonal Parametric Pooling of Covariance Matrix for Speaker Recognition
SoCov: Semi-Orthogonal Parametric Pooling of Covariance Matrix for Speaker Recognition
Rongjin Li
Weibin Zhang
Dongpeng Chen
Jintao Kang
Xiaofen Xing
22
0
0
23 Apr 2025
Temporal Attention Pooling for Frequency Dynamic Convolution in Sound Event Detection
Temporal Attention Pooling for Frequency Dynamic Convolution in Sound Event Detection
Hyeonuk Nam
Yong-Hwa Park
31
0
0
17 Apr 2025
Nes2Net: A Lightweight Nested Architecture for Foundation Model Driven Speech Anti-spoofing
Nes2Net: A Lightweight Nested Architecture for Foundation Model Driven Speech Anti-spoofing
Tianchi Liu
Duc-Tuan Truong
Rohan Kumar Das
K. Lee
Haizhou Li
31
0
0
08 Apr 2025
Privacy-Preserving Biometric Verification with Handwritten Random Digit String
Privacy-Preserving Biometric Verification with Handwritten Random Digit String
Peirong Zhang
Y. Liu
Songxuan Lai
Hongliang Li
Lianwen Jin
63
2
0
17 Mar 2025
JiTTER: Jigsaw Temporal Transformer for Event Reconstruction for Self-Supervised Sound Event Detection
JiTTER: Jigsaw Temporal Transformer for Event Reconstruction for Self-Supervised Sound Event Detection
Hyeonuk Nam
Yong-Hwa Park
40
1
0
28 Feb 2025
Comprehensive Layer-wise Analysis of SSL Models for Audio Deepfake Detection
Comprehensive Layer-wise Analysis of SSL Models for Audio Deepfake Detection
Yassine El Kheir
Youness Samih
Suraj Maharjan
Tim Polzehl
Sebastian Möller
67
1
0
05 Feb 2025
Reducing the Gap Between Pretrained Speech Enhancement and Recognition Models Using a Real Speech-Trained Bridging Module
Zhongjian Cui
Chenrui Cui
Tianrui Wang
Mengnan He
Hao Shi
Meng Ge
Caixia Gong
Longbiao Wang
J. Dang
31
0
0
05 Jan 2025
On the Robustness of Cover Version Identification Models: A Study Using Cover Versions from YouTube
Simon Hachmeier
Robert Jäschke
AAML
38
0
0
03 Jan 2025
Memory-Efficient Training for Deep Speaker Embedding Learning in Speaker
  Verification
Memory-Efficient Training for Deep Speaker Embedding Learning in Speaker Verification
Bei Liu
Yanmin Qian
69
0
0
02 Dec 2024
JOOCI: a Framework for Learning Comprehensive Speech Representations
JOOCI: a Framework for Learning Comprehensive Speech Representations
Hemant Yadav
R. Shah
Sunayana Sitaram
23
0
0
14 Oct 2024
Improving Speaker Representations Using Contrastive Losses on
  Multi-scale Features
Improving Speaker Representations Using Contrastive Losses on Multi-scale Features
Satvik Dixit
Massa Baali
Rita Singh
Bhiksha Raj
24
0
0
07 Oct 2024
Speaker-IPL: Unsupervised Learning of Speaker Characteristics with i-Vector based Pseudo-Labels
Speaker-IPL: Unsupervised Learning of Speaker Characteristics with i-Vector based Pseudo-Labels
Zakaria Aldeneh
Takuya Higuchi
Jee-weon Jung
Li-Wei Chen
Stephen Shum
Ahmed Hussen Abdelaziz
Shinji Watanabe
Tatiana Likhomanenko
B. Theobald
VLM
SSL
44
0
0
16 Sep 2024
Exploring SSL Discrete Speech Features for Zipformer-based Contextual
  ASR
Exploring SSL Discrete Speech Features for Zipformer-based Contextual ASR
Mingyu Cui
Yifan Yang
Jiajun Deng
Jiawen Kang
Shujie Hu
Tianzi Wang
Zhaoqing Li
Shiliang Zhang
Xie Chen
Xunying Liu
23
1
0
13 Sep 2024
Universal Pooling Method of Multi-layer Features from Pretrained Models
  for Speaker Verification
Universal Pooling Method of Multi-layer Features from Pretrained Models for Speaker Verification
Jin Sob Kim
Hyun Joon Park
Wooseok Shin
Sung Won Han
SLR
48
0
0
12 Sep 2024
PDAF: A Phonetic Debiasing Attention Framework For Speaker Verification
PDAF: A Phonetic Debiasing Attention Framework For Speaker Verification
Massa Baali
Abdulhamid Aldoobi
Hira Dhamyal
Rita Singh
Bhiksha Raj
26
0
0
09 Sep 2024
The VoxCeleb Speaker Recognition Challenge: A Retrospective
The VoxCeleb Speaker Recognition Challenge: A Retrospective
Jaesung Huh
Joon Son Chung
Arsha Nagrani
A. Brown
Jee-weon Jung
Daniel Garcia-Romero
Andrew Zisserman
36
3
0
27 Aug 2024
Query-by-Example Keyword Spotting Using Spectral-Temporal Graph
  Attentive Pooling and Multi-Task Learning
Query-by-Example Keyword Spotting Using Spectral-Temporal Graph Attentive Pooling and Multi-Task Learning
Zhenyu Wang
Shuyu Kong
Li Wan
Biqiao Zhang
Yiteng Huang
Mumin Jin
Ming Sun
Xin Lei
Zhaojun Yang
31
0
0
27 Aug 2024
Toward Improving Synthetic Audio Spoofing Detection Robustness via
  Meta-Learning and Disentangled Training With Adversarial Examples
Toward Improving Synthetic Audio Spoofing Detection Robustness via Meta-Learning and Disentangled Training With Adversarial Examples
Zhenyu Wang
John H. L. Hansen
AAML
30
1
0
23 Aug 2024
Enhancing Partially Spoofed Audio Localization with Boundary-aware
  Attention Mechanism
Enhancing Partially Spoofed Audio Localization with Boundary-aware Attention Mechanism
Jiafeng Zhong
Bin Li
Jiangyan Yi
27
1
0
31 Jul 2024
One-Class Learning with Adaptive Centroid Shift for Audio Deepfake
  Detection
One-Class Learning with Adaptive Centroid Shift for Audio Deepfake Detection
Hyun Myung Kim
Kangwook Jang
Hoirin Kim
29
5
0
24 Jun 2024
Disentangled Representation Learning for Environment-agnostic Speaker
  Recognition
Disentangled Representation Learning for Environment-agnostic Speaker Recognition
Kihyun Nam
Hee-Soo Heo
Jee-weon Jung
Joon Son Chung
42
0
0
20 Jun 2024
MR-RawNet: Speaker verification system with multiple temporal
  resolutions for variable duration utterances using raw waveforms
MR-RawNet: Speaker verification system with multiple temporal resolutions for variable duration utterances using raw waveforms
Seung-bin Kim
Chan-yeong Lim
Jungwoo Heo
Ju-ho Kim
Hyun-Seo Shin
Kyo-Won Koo
Ha-Jin Yu
44
0
0
11 Jun 2024
Source -Free Domain Adaptation for Speaker Verification in Data-Scarce
  Languages and Noisy Channels
Source -Free Domain Adaptation for Speaker Verification in Data-Scarce Languages and Noisy Channels
Shlomo Salo Elia
Aviad Malachi
V. Aharonson
Gadi Pinkas
21
0
0
09 Jun 2024
Towards Lightweight Speaker Verification via Adaptive Neural Network
  Quantization
Towards Lightweight Speaker Verification via Adaptive Neural Network Quantization
Bei Liu
Haoyu Wang
Yanmin Qian
MQ
28
0
0
08 Jun 2024
Adapting WavLM for Speech Emotion Recognition
Adapting WavLM for Speech Emotion Recognition
Daria Diatlova
Anton Udalov
Vitalii Shutov
Egor Spirin
33
4
0
07 May 2024
USAT: A Universal Speaker-Adaptive Text-to-Speech Approach
USAT: A Universal Speaker-Adaptive Text-to-Speech Approach
Wenbin Wang
Yang Song
Sanjay Jha
32
10
0
28 Apr 2024
A Comparison of Differential Performance Metrics for the Evaluation of
  Automatic Speaker Verification Fairness
A Comparison of Differential Performance Metrics for the Evaluation of Automatic Speaker Verification Fairness
Oubaïda Chouchane
Christoph Busch
Chiara Galdi
Nicholas W. D. Evans
Massimiliano Todisco
21
1
0
27 Apr 2024
Audio Anti-Spoofing Detection: A Survey
Audio Anti-Spoofing Detection: A Survey
Menglu Li
Yasaman Ahmadiadli
Xiao-Ping Zhang
39
17
0
22 Apr 2024
A Large-Scale Evaluation of Speech Foundation Models
A Large-Scale Evaluation of Speech Foundation Models
Shu-Wen Yang
Heng-Jui Chang
Zili Huang
Andy T. Liu
Cheng-I Jeff Lai
...
Kushal Lakhotia
Shang-Wen Li
Abdelrahman Mohamed
Shinji Watanabe
Hung-yi Lee
38
19
0
15 Apr 2024
Exploring Pathological Speech Quality Assessment with ASR-Powered
  Wav2Vec2 in Data-Scarce Context
Exploring Pathological Speech Quality Assessment with ASR-Powered Wav2Vec2 in Data-Scarce Context
Tuan Nguyen
C. Fredouille
A. Ghio
M. Balaguer
Virginie Woisard
19
1
0
29 Mar 2024
KunquDB: An Attempt for Speaker Verification in the Chinese Opera
  Scenario
KunquDB: An Attempt for Speaker Verification in the Chinese Opera Scenario
Huali Zhou
Yuke Lin
Dongxi Liu
Ming Li
29
0
0
20 Mar 2024
Cosine Scoring with Uncertainty for Neural Speaker Embedding
Cosine Scoring with Uncertainty for Neural Speaker Embedding
Qiongqiong Wang
Kong Aik Lee
20
1
0
11 Mar 2024
Dynamic Cross Attention for Audio-Visual Person Verification
Dynamic Cross Attention for Audio-Visual Person Verification
R Gnana Praveen
Jahangir Alam
38
1
0
07 Mar 2024
Audio-Visual Person Verification based on Recursive Fusion of Joint
  Cross-Attention
Audio-Visual Person Verification based on Recursive Fusion of Joint Cross-Attention
R Gnana Praveen
Jahangir Alam
41
2
0
07 Mar 2024
Boosting Graph Pooling with Persistent Homology
Boosting Graph Pooling with Persistent Homology
Chaolong Ying
Xinjian Zhao
Tianshu Yu
GNN
AI4CE
29
3
0
26 Feb 2024
Adversarial Data Augmentation for Robust Speaker Verification
Adversarial Data Augmentation for Robust Speaker Verification
Zhenyu Zhou
Junhui Chen
Namin Wang
Lantian Li
Dong Wang
14
2
0
05 Feb 2024
Can you Remove the Downstream Model for Speaker Recognition with
  Self-Supervised Speech Features?
Can you Remove the Downstream Model for Speaker Recognition with Self-Supervised Speech Features?
Zakaria Aldeneh
Takuya Higuchi
Jee-weon Jung
Skyler Seto
Tatiana Likhomanenko
Stephen Shum
Ahmed Hussen Abdelaziz
Shinji Watanabe
B. Theobald
SSL
34
2
0
01 Feb 2024
ESPnet-SPK: full pipeline speaker embedding toolkit with reproducible
  recipes, self-supervised front-ends, and off-the-shelf models
ESPnet-SPK: full pipeline speaker embedding toolkit with reproducible recipes, self-supervised front-ends, and off-the-shelf models
Jee-weon Jung
Wangyou Zhang
Jiatong Shi
Zakaria Aldeneh
Takuya Higuchi
B. Theobald
Ahmed Hussen Abdelaziz
Shinji Watanabe
71
21
0
30 Jan 2024
DIFFRENT: A Diffusion Model for Recording Environment Transfer of Speech
DIFFRENT: A Diffusion Model for Recording Environment Transfer of Speech
Jae-Yeol Im
Juhan Nam
DiffM
18
3
0
16 Jan 2024
NeXt-TDNN: Modernizing Multi-Scale Temporal Convolution Backbone for
  Speaker Verification
NeXt-TDNN: Modernizing Multi-Scale Temporal Convolution Backbone for Speaker Verification
Hyunjun Heo
U.H Shin
Ran Lee
YoungJu Cheon
Hyung-Min Park
26
9
0
14 Dec 2023
Golden Gemini is All You Need: Finding the Sweet Spots for Speaker
  Verification
Golden Gemini is All You Need: Finding the Sweet Spots for Speaker Verification
Tianchi Liu
Kong Aik Lee
Qiongqiong Wang
Haizhou Li
VLM
68
13
0
06 Dec 2023
End-to-end Online Speaker Diarization with Target Speaker Tracking
End-to-end Online Speaker Diarization with Target Speaker Tracking
Weiqing Wang
Ming Li
28
5
0
12 Oct 2023
Multi-objective Progressive Clustering for Semi-supervised Domain
  Adaptation in Speaker Verification
Multi-objective Progressive Clustering for Semi-supervised Domain Adaptation in Speaker Verification
Ze Li
Yuke Lin
Ning Jiang
Xiaoyi Qin
Guoqing Zhao
Haiying Wu
Ming Li
VLM
36
1
0
07 Oct 2023
Disentangling Voice and Content with Self-Supervision for Speaker
  Recognition
Disentangling Voice and Content with Self-Supervision for Speaker Recognition
Tianchi Liu
Kong Aik Lee
Qiongqiong Wang
Haizhou Li
BDL
DRL
27
30
0
02 Oct 2023
Audio-Visual Speaker Verification via Joint Cross-Attention
Audio-Visual Speaker Verification via Joint Cross-Attention
R Gnana Praveen
Jahangir Alam
26
6
0
28 Sep 2023
An Investigation of Distribution Alignment in Multi-Genre Speaker
  Recognition
An Investigation of Distribution Alignment in Multi-Genre Speaker Recognition
Zhenyu Zhou
Junhui Chen
Namin Wang
Lantian Li
D. Wang
13
2
0
25 Sep 2023
Generalizable Zero-Shot Speaker Adaptive Speech Synthesis with
  Disentangled Representations
Generalizable Zero-Shot Speaker Adaptive Speech Synthesis with Disentangled Representations
Wen Wang
Yang Song
S. Jha
29
8
0
24 Aug 2023
The DKU-MSXF Speaker Verification System for the VoxCeleb Speaker
  Recognition Challenge 2023
The DKU-MSXF Speaker Verification System for the VoxCeleb Speaker Recognition Challenge 2023
Ze Li
Yuke Lin
Xiaoyi Qin
Ning Jiang
Guoqing Zhao
Ming Li
38
6
0
17 Aug 2023
Speaker Recognition Using Isomorphic Graph Attention Network Based
  Pooling on Self-Supervised Representation
Speaker Recognition Using Isomorphic Graph Attention Network Based Pooling on Self-Supervised Representation
Zirui Ge
Xinzhou Xu
Haiyan Guo
Tingting Wang
Zhen Yang
SSL
19
1
0
09 Aug 2023
12345
Next