ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1909.06247
  4. Cited By
End-to-End Neural Speaker Diarization with Self-attention

End-to-End Neural Speaker Diarization with Self-attention

13 September 2019
Yusuke Fujita
Naoyuki Kanda
Shota Horiguchi
Yawen Xue
Kenji Nagamatsu
Shinji Watanabe
ArXivPDFHTML

Papers citing "End-to-End Neural Speaker Diarization with Self-attention"

17 / 17 papers shown
Title
Speaker Diarization for Low-Resource Languages Through Wav2vec Fine-Tuning
Speaker Diarization for Low-Resource Languages Through Wav2vec Fine-Tuning
Abdulhady Abas Abdullah
S. H. Karim
Sara Azad Ahmed
Kanar R. Tariq
Tarik Ahmed Rashid
55
0
0
23 Apr 2025
Overview of Speaker Modeling and Its Applications: From the Lens of Deep
  Speaker Representation Learning
Overview of Speaker Modeling and Its Applications: From the Lens of Deep Speaker Representation Learning
Shuai Wang
Zheng-Shou Chen
Kong Aik Lee
Yan-min Qian
Haizhou Li
24
4
0
21 Jul 2024
LLM-based speaker diarization correction: A generalizable approach
LLM-based speaker diarization correction: A generalizable approach
Georgios Efstathiadis
Vijay Yadav
Anzar Abbas
34
3
0
07 Jun 2024
ASoBO: Attentive Beamformer Selection for Distant Speaker Diarization in
  Meetings
ASoBO: Attentive Beamformer Selection for Distant Speaker Diarization in Meetings
Théo Mariotte
Anthony Larcher
Silvio Montrésor
Jean-Hugh Thomas
18
0
0
05 Jun 2024
End-to-end Online Speaker Diarization with Target Speaker Tracking
End-to-end Online Speaker Diarization with Target Speaker Tracking
Weiqing Wang
Ming Li
19
5
0
12 Oct 2023
An Experimental Review of Speaker Diarization methods with application
  to Two-Speaker Conversational Telephone Speech recordings
An Experimental Review of Speaker Diarization methods with application to Two-Speaker Conversational Telephone Speech recordings
L. Serafini
Samuele Cornell
Giovanni Morrone
Enrico Zovato
A. Brutti
S. Squartini
21
9
0
29 May 2023
Unified Modeling of Multi-Talker Overlapped Speech Recognition and
  Diarization with a Sidecar Separator
Unified Modeling of Multi-Talker Overlapped Speech Recognition and Diarization with a Sidecar Separator
Lingwei Meng
Jiawen Kang
Mingyu Cui
Haibin Wu
Xixin Wu
Helen M. Meng
16
10
0
25 May 2023
Exploring Speaker-Related Information in Spoken Language Understanding
  for Better Speaker Diarization
Exploring Speaker-Related Information in Spoken Language Understanding for Better Speaker Diarization
Luyao Cheng
Siqi Zheng
Zhang Qinglin
Haibo Wang
Yafeng Chen
Qian Chen
20
4
0
22 May 2023
Attention-based Encoder-Decoder Network for End-to-End Neural Speaker
  Diarization with Target Speaker Attractor
Attention-based Encoder-Decoder Network for End-to-End Neural Speaker Diarization with Target Speaker Attractor
Zhengyang Chen
Bing Han
Shuai Wang
Yan-min Qian
12
15
0
18 May 2023
Neural Diarization with Non-autoregressive Intermediate Attractors
Neural Diarization with Non-autoregressive Intermediate Attractors
Yusuke Fujita
Tatsuya Komatsu
Robin Scheibler
Yusuke Kida
Tetsuji Ogawa
20
11
0
13 Mar 2023
Supervised Hierarchical Clustering using Graph Neural Networks for
  Speaker Diarization
Supervised Hierarchical Clustering using Graph Neural Networks for Speaker Diarization
Prachi Singh
Amrit Kaul
Sriram Ganapathy
BDL
12
8
0
24 Feb 2023
Utterance-by-utterance overlap-aware neural diarization with Graph-PIT
Utterance-by-utterance overlap-aware neural diarization with Graph-PIT
K. Kinoshita
Thilo von Neumann
Marc Delcroix
Christoph Boeddeker
Reinhold Haeb-Umbach
18
4
0
28 Jul 2022
The USTC-Ximalaya system for the ICASSP 2022 multi-channel multi-party
  meeting transcription (M2MeT) challenge
The USTC-Ximalaya system for the ICASSP 2022 multi-channel multi-party meeting transcription (M2MeT) challenge
Maokui He
Xiang Lv
Weilin Zhou
Jingjing Yin
Xiaoqi Zhang
...
Shutong Niu
Yuhang Cao
Heng Lu
Jun Du
Chin-Hui Lee
43
7
0
10 Feb 2022
Self-Supervised Metric Learning With Graph Clustering For Speaker
  Diarization
Self-Supervised Metric Learning With Graph Clustering For Speaker Diarization
Prachi Singh
Sriram Ganapathy
SSL
11
7
0
14 Sep 2021
End-to-End Diarization for Variable Number of Speakers with Local-Global
  Networks and Discriminative Speaker Embeddings
End-to-End Diarization for Variable Number of Speakers with Local-Global Networks and Discriminative Speaker Embeddings
Soumi Maiti
Hakan Erdogan
K. Wilson
Scott Wisdom
Shinji Watanabe
J. Hershey
19
21
0
05 May 2021
Dyadic Speech-based Affect Recognition using DAMI-P2C Parent-child
  Multimodal Interaction Dataset
Dyadic Speech-based Affect Recognition using DAMI-P2C Parent-child Multimodal Interaction Dataset
Huili Chen
Yue Zhang
F. Weninger
Rosalind W. Picard
C. Breazeal
Hae Won Park
CVBM
12
14
0
20 Aug 2020
End-to-End Neural Speaker Diarization with Permutation-Free Objectives
End-to-End Neural Speaker Diarization with Permutation-Free Objectives
Yusuke Fujita
Naoyuki Kanda
Shota Horiguchi
Kenji Nagamatsu
Shinji Watanabe
148
242
0
12 Sep 2019
1