ResearchTrend.AI
Encoder-Decoder Based Attractors for End-to-End Neural Diarization

20 June 2021
Shota Horiguchi
Yusuke Fujita
Shinji Watanabe
Yawen Xue
Leibny Paola García-Perera
arXiv: 2106.10654

Papers citing "Encoder-Decoder Based Attractors for End-to-End Neural Diarization"

38 / 38 papers shown
Quality-Aware End-to-End Audio-Visual Neural Speaker Diarization
Mao-Kui He
Jun Du
Shu-Tong Niu
Qing-Feng Liu
Chin-Hui Lee
19
0
0
15 Oct 2024
LS-EEND: Long-Form Streaming End-to-End Neural Diarization with Online Attractor Extraction
Di Liang
Xiaofei Li
19
0
0
09 Oct 2024
Sortformer: Seamless Integration of Speaker Diarization and ASR by Bridging Timestamps and Tokens
Taejin Park
Ivan Medennikov
Kunal Dhawan
Weiqing Wang
He Huang
Nithin Rao Koluguri
Krishna C. Puvvada
Jagadeesh Balam
Boris Ginsburg
21
2
0
10 Sep 2024
Flow-TSVAD: Target-Speaker Voice Activity Detection via Latent Flow Matching
Zhengyang Chen
Bing Han
Shuai Wang
Yidi Jiang
Yanmin Qian
35
0
0
07 Sep 2024
NEST: Self-supervised Fast Conformer as All-purpose Seasoning to Speech Processing Tasks
He Huang
Taejin Park
Kunal Dhawan
Ivan Medennikov
Krishna C. Puvvada
Nithin Rao Koluguri
Weiqing Wang
Jagadeesh Balam
Boris Ginsburg
SSL
AI4TS
16
1
0
23 Aug 2024
Overview of Speaker Modeling and Its Applications: From the Lens of Deep Speaker Representation Learning
Shuai Wang
Zheng-Shou Chen
Kong Aik Lee
Yan-min Qian
Haizhou Li
24
4
0
21 Jul 2024
Leveraging Speaker Embeddings in End-to-End Neural Diarization for Two-Speaker Scenarios
Juan Ignacio Alvarez-Trejos
Beltrán Labrador
Alicia Lozano-Diez
25
1
0
01 Jul 2024
ASoBO: Attentive Beamformer Selection for Distant Speaker Diarization in Meetings
Théo Mariotte
Anthony Larcher
Silvio Montrésor
Jean-Hugh Thomas
20
0
0
05 Jun 2024
Speech-Aware Neural Diarization with Encoder-Decoder Attractor Guided by Attention Constraints
PeiYing Lee
HauYun Guo
Berlin Chen
16
0
0
21 Mar 2024
Do End-to-End Neural Diarization Attractors Need to Encode Speaker Characteristic Information?
Lin Zhang
Themos Stafylakis
Federico Landini
Mireia Díez
Anna Silnova
Lukáš Burget
16
1
0
29 Feb 2024
Spatial-Temporal Activity-Informed Diarization and Separation
Yicheng Hsu
Ssuhan Chen
Mingsian R. Bai
11
0
0
30 Jan 2024
Online speaker diarization of meetings guided by speech separation
Elio Gruttadauria
Mathieu Fontaine
S. Essid
14
3
0
30 Jan 2024
EEND-M2F: Masked-attention mask transformers for speaker diarization
Marc Härkönen
Samuel J. Broughton
Lahiru Samarakoon
14
7
0
23 Jan 2024
Robust End-to-End Diarization with Domain Adaptive Training and Multi-Task Learning
Ivan Fung
Lahiru Samarakoon
Samuel J. Broughton
OOD
19
2
0
12 Dec 2023
Transformer Attractors for Robust and Efficient End-to-End Neural Diarization
Lahiru Samarakoon
Samuel J. Broughton
Marc Härkönen
Ivan Fung
16
6
0
11 Dec 2023
EEND-DEMUX: End-to-End Neural Speaker Diarization via Demultiplexed Speaker Embeddings
Sung Hwan Mun
Mingrui Han
Canyeong Moon
Nam Soo Kim
15
1
0
11 Dec 2023
DiaPer: End-to-End Neural Diarization with Perceiver-Based Attractors
Federico Landini
Mireia Díez
Themos Stafylakis
Lukáš Burget
25
11
0
07 Dec 2023
Multi-channel Conversational Speaker Separation via Neural Diarization
H. Taherian
DeLiang Wang
BDL
18
15
0
15 Nov 2023
A Single Speech Enhancement Model Unifying Dereverberation, Denoising, Speaker Counting, Separation, and Extraction
Kohei Saijo
Wangyou Zhang
Zhong-Qiu Wang
Shinji Watanabe
Tetsunori Kobayashi
Tetsuji Ogawa
VLM
13
6
0
12 Oct 2023
Profile-Error-Tolerant Target-Speaker Voice Activity Detection
Dongmei Wang
Xiong Xiao
Naoyuki Kanda
Midia Yousefi
Takuya Yoshioka
Jian Wu
11
3
0
21 Sep 2023
Attention-based Encoder-Decoder End-to-End Neural Diarization with Embedding Enhancer
Zhengyang Chen
Bing Han
Shuai Wang
Yan-min Qian
16
18
0
13 Sep 2023
An Experimental Review of Speaker Diarization methods with application to Two-Speaker Conversational Telephone Speech recordings
L. Serafini
Samuele Cornell
Giovanni Morrone
Enrico Zovato
A. Brutti
S. Squartini
27
9
0
29 May 2023
Multi-Stream Extension of Variational Bayesian HMM Clustering (MS-VBx) for Combined End-to-End and Vector Clustering-based Diarization
Marc Delcroix
Naohiro Tawara
Mireia Díez
Federico Landini
Anna Silnova
A. Ogawa
Tomohiro Nakatani
L. Burget
S. Araki
19
4
0
23 May 2023
Attention-based Encoder-Decoder Network for End-to-End Neural Speaker Diarization with Target Speaker Attractor
Zhengyang Chen
Bing Han
Shuai Wang
Yan-min Qian
19
15
0
18 May 2023
Neural Diarization with Non-autoregressive Intermediate Attractors
Yusuke Fujita
Tatsuya Komatsu
Robin Scheibler
Yusuke Kida
Tetsuji Ogawa
22
11
0
13 Mar 2023
Late Audio-Visual Fusion for In-The-Wild Speaker Diarization
Zexu Pan
G. Wichern
François G. Germain
Aswin Shanmugam Subramanian
Jonathan Le Roux
VGen
19
1
0
02 Nov 2022
Mutual Learning of Single- and Multi-Channel End-to-End Neural Diarization
Shota Horiguchi
Yuki Takashima
Shinji Watanabe
Leibny Paola García-Perera
23
2
0
07 Oct 2022
Target Speaker Voice Activity Detection with Transformers and Its Integration with End-to-End Neural Diarization
Dongmei Wang
Xiong Xiao
Naoyuki Kanda
Takuya Yoshioka
Jian Wu
15
25
0
27 Aug 2022
Online Neural Diarization of Unlimited Numbers of Speakers Using Global and Local Attractors
Shota Horiguchi
Shinji Watanabe
Leibny Paola García-Perera
Yuki Takashima
Y. Kawaguchi
32
23
0
06 Jun 2022
Improving the Naturalness of Simulated Conversations for End-to-End Neural Diarization
Natsuo Yamashita
Shota Horiguchi
Takeshi Homma
13
16
0
24 Apr 2022
Joint speaker diarisation and tracking in switching state-space model
J. H. M. Wong
Yifan Gong
17
0
0
23 Sep 2021
Diarisation using location tracking with agglomerative clustering
J. H. M. Wong
Igor Abramovski
Xiong Xiao
Yifan Gong
8
1
0
22 Sep 2021
A Review of Speaker Diarization: Recent Advances with Deep Learning
Tae Jin Park
Naoyuki Kanda
Dimitrios Dimitriadis
Kyu Jeong Han
Shinji Watanabe
Shrikanth Narayanan
VLM
269
323
0
24 Jan 2021
Speaker activity driven neural speech extraction
Marc Delcroix
Kateřina Žmolíková
Tsubasa Ochiai
K. Kinoshita
Tomohiro Nakatani
23
32
0
14 Jan 2021
Bayesian HMM clustering of x-vector sequences (VBx) in speaker diarization: theory, implementation and analysis on standard tasks
Federico Landini
Jan Profant
Mireia Díez
L. Burget
208
198
0
29 Dec 2020
Auto-Tuning Spectral Clustering for Speaker Diarization Using Normalized Maximum Eigengap
Tae Jin Park
Kyu Jeong Han
Manoj Kumar
Shrikanth Narayanan
122
114
0
05 Mar 2020
End-to-End Neural Speaker Diarization with Self-attention
Yusuke Fujita
Naoyuki Kanda
Shota Horiguchi
Yawen Xue
Kenji Nagamatsu
Shinji Watanabe
166
237
0
13 Sep 2019
End-to-End Neural Speaker Diarization with Permutation-Free Objectives
Yusuke Fujita
Naoyuki Kanda
Shota Horiguchi
Kenji Nagamatsu
Shinji Watanabe
148
242
0
12 Sep 2019