ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2008.01077
  4. Cited By
Self-attention encoding and pooling for speaker recognition

Self-attention encoding and pooling for speaker recognition

3 August 2020
Pooyan Safari
Miquel India
Javier Hernando
    ViT
ArXivPDFHTML

Papers citing "Self-attention encoding and pooling for speaker recognition"

13 / 13 papers shown
Title
Overview of Speaker Modeling and Its Applications: From the Lens of Deep
  Speaker Representation Learning
Overview of Speaker Modeling and Its Applications: From the Lens of Deep Speaker Representation Learning
Shuai Wang
Zheng-Shou Chen
Kong Aik Lee
Yan-min Qian
Haizhou Li
37
4
0
21 Jul 2024
Keyword-driven Retrieval-Augmented Large Language Models for Cold-start User Recommendations
Keyword-driven Retrieval-Augmented Large Language Models for Cold-start User Recommendations
Hai-Dang Kieu
Minh-Duc Nguyen
T. Nguyen
Dung D. Le
RALM
LRM
79
2
0
30 May 2024
L-TUNING: Synchronized Label Tuning for Prompt and Prefix in LLMs
L-TUNING: Synchronized Label Tuning for Prompt and Prefix in LLMs
Md. Kowsher
Md. Shohanur Islam Sobuj
Asif Mahmud
Nusrat Jahan Prottasha
Prakash Bhat
16
3
0
21 Dec 2023
An Effective Transformer-based Contextual Model and Temporal Gate
  Pooling for Speaker Identification
An Effective Transformer-based Contextual Model and Temporal Gate Pooling for Speaker Identification
Harunori Kawano
Sota Shimizu
30
1
0
22 Aug 2023
Automatic Modulation Classification with Deep Neural Networks
Automatic Modulation Classification with Deep Neural Networks
Clayton A. Harper
Mitchell A. Thornton
Eric C. Larson
11
5
0
27 Jan 2023
Towards A Unified Conformer Structure: from ASR to ASV Task
Towards A Unified Conformer Structure: from ASR to ASV Task
Dexin Liao
Tao Jiang
Feng Wang
Lin Li
Q. Hong
24
10
0
14 Nov 2022
Bootstrapping meaning through listening: Unsupervised learning of spoken
  sentence embeddings
Bootstrapping meaning through listening: Unsupervised learning of spoken sentence embeddings
Jian Zhu
Zuoyu Tian
Yadong Liu
Cong Zhang
Chia-wen Lo
SSL
30
2
0
23 Oct 2022
SAMU-XLSR: Semantically-Aligned Multimodal Utterance-level Cross-Lingual
  Speech Representation
SAMU-XLSR: Semantically-Aligned Multimodal Utterance-level Cross-Lingual Speech Representation
Sameer Khurana
Antoine Laurent
James R. Glass
25
36
0
17 May 2022
Serialized Multi-Layer Multi-Head Attention for Neural Speaker Embedding
Serialized Multi-Layer Multi-Head Attention for Neural Speaker Embedding
Hongning Zhu
Kong Aik Lee
Haizhou Li
33
15
0
14 Jul 2021
S2VC: A Framework for Any-to-Any Voice Conversion with Self-Supervised
  Pretrained Representations
S2VC: A Framework for Any-to-Any Voice Conversion with Self-Supervised Pretrained Representations
Jheng-hao Lin
Yist Y. Lin
C. Chien
Hung-yi Lee
20
56
0
07 Apr 2021
Double Multi-Head Attention for Speaker Verification
Double Multi-Head Attention for Speaker Verification
Miquel India
Pooyan Safari
Javier Hernando
28
18
0
26 Jul 2020
VoxCeleb2: Deep Speaker Recognition
VoxCeleb2: Deep Speaker Recognition
Joon Son Chung
Arsha Nagrani
Andrew Zisserman
224
2,234
0
14 Jun 2018
Effective Approaches to Attention-based Neural Machine Translation
Effective Approaches to Attention-based Neural Machine Translation
Thang Luong
Hieu H. Pham
Christopher D. Manning
218
7,923
0
17 Aug 2015
1