ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2011.09624
  4. Cited By
Multi-stage Speaker Extraction with Utterance and Frame-Level Reference
  Signals
v1v2 (latest)

Multi-stage Speaker Extraction with Utterance and Frame-Level Reference Signals

19 November 2020
Meng Ge
Chenglin Xu
Longbiao Wang
Chng Eng Siong
Jianwu Dang
Haizhou Li
ArXiv (abs)PDFHTML

Papers citing "Multi-stage Speaker Extraction with Utterance and Frame-Level Reference Signals"

22 / 22 papers shown
Title
Unified Architecture and Unsupervised Speech Disentanglement for Speaker Embedding-Free Enrollment in Personalized Speech Enhancement
Unified Architecture and Unsupervised Speech Disentanglement for Speaker Embedding-Free Enrollment in Personalized Speech Enhancement
Ziling Huang
Haixin Guan
Yanhua Long
96
0
0
18 May 2025
Listen to Extract: Onset-Prompted Target Speaker Extraction
Listen to Extract: Onset-Prompted Target Speaker Extraction
Pengjie Shen
Kangrui Chen
Shulin He
Pengru Chen
Shuqi Yuan
He Kong
Xueliang Zhang
Zehao Wang
96
0
0
08 May 2025
Geometry-Constrained EEG Channel Selection for Brain-Assisted Speech
  Enhancement
Geometry-Constrained EEG Channel Selection for Brain-Assisted Speech Enhancement
Keying Zuo
Qingtian Xu
Jie Zhang
Zhenhua Ling
93
0
0
19 Sep 2024
USEF-TSE: Universal Speaker Embedding Free Target Speaker Extraction
USEF-TSE: Universal Speaker Embedding Free Target Speaker Extraction
Bang Zeng
Ming Li
107
5
0
04 Sep 2024
A lightweight dual-stage framework for personalized speech enhancement
  based on DeepFilterNet2
A lightweight dual-stage framework for personalized speech enhancement based on DeepFilterNet2
Thomas Serre
Mathieu Fontaine
Éric Benhaim
Geoffroy Dutour
S. Essid
39
0
0
11 Apr 2024
Continuous Target Speech Extraction: Enhancing Personalized Diarization
  and Extraction on Complex Recordings
Continuous Target Speech Extraction: Enhancing Personalized Diarization and Extraction on Complex Recordings
He Zhao
Hangting Chen
Jianwei Yu
Yuehai Wang
78
1
0
29 Jan 2024
Focus on the Sound around You: Monaural Target Speaker Extraction via
  Distance and Speaker Information
Focus on the Sound around You: Monaural Target Speaker Extraction via Distance and Speaker Information
Jiuxin Lin
Peng Wang
Heinrich Dinkel
Jun Chen
Zhiyong Wu
Zhiyong Yan
Yongqing Wang
Junbo Zhang
Yujun Wang
67
9
0
28 Jun 2023
AV-SepFormer: Cross-Attention SepFormer for Audio-Visual Target Speaker
  Extraction
AV-SepFormer: Cross-Attention SepFormer for Audio-Visual Target Speaker Extraction
Jiuxin Lin
X. Cai
Heinrich Dinkel
Jun Chen
Zhiyong Yan
Yongqing Wang
Junbo Zhang
Zhiyong Wu
Yujun Wang
Helen M. Meng
77
27
0
25 Jun 2023
Audio-Visual Speech Enhancement With Selective Off-Screen Speech
  Extraction
Audio-Visual Speech Enhancement With Selective Off-Screen Speech Extraction
Tomoya Yoshinaga
Keitaro Tanaka
Shigeo Morishima
63
0
0
10 Jun 2023
Rethinking the visual cues in audio-visual speaker extraction
Rethinking the visual cues in audio-visual speaker extraction
Junjie Li
Meng Ge
Zexu Pan
Rui Cao
Longbiao Wang
Jianwu Dang
Shiliang Zhang
72
9
0
05 Jun 2023
A Framework for Unified Real-time Personalized and Non-Personalized
  Speech Enhancement
A Framework for Unified Real-time Personalized and Non-Personalized Speech Enhancement
Zhepei Wang
Ritwik Giri
Devansh P. Shah
J. Valin
Mike Goodwin
Paris Smaragdis
67
9
0
23 Feb 2023
token2vec: A Joint Self-Supervised Pre-training Framework Using Unpaired
  Speech and Text
token2vec: A Joint Self-Supervised Pre-training Framework Using Unpaired Speech and Text
Xianghu Yue
Junyi Ao
Xiaoxue Gao
Haizhou Li
SSL
60
8
0
30 Oct 2022
Speech Enhancement with Intelligent Neural Homomorphic Synthesis
Speech Enhancement with Intelligent Neural Homomorphic Synthesis
Shulin He
Wei Rao
Jinjiang Liu
Jun Chen
Yukai Ju
Xueliang Zhang
Yannan Wang
Shidong Shang
46
6
0
28 Oct 2022
Semi-supervised Time Domain Target Speaker Extraction with Attention
Semi-supervised Time Domain Target Speaker Extraction with Attention
Zhepei Wang
Ritwik Giri
Shrikant Venkataramani
Umut Isik
J. Valin
Paris Smaragdis
Mike Goodwin
A. Krishnaswamy
59
7
0
18 Jun 2022
Simultaneous Speech Extraction for Multiple Target Speakers under the
  Meeting Scenarios
Simultaneous Speech Extraction for Multiple Target Speakers under the Meeting Scenarios
Bang Zeng
Weiqing Wang
Yuanyuan Bao
Ming Li
57
0
0
17 Jun 2022
RaDur: A Reference-aware and Duration-robust Network for Target Sound
  Detection
RaDur: A Reference-aware and Duration-robust Network for Target Sound Detection
Dongchao Yang
Helin Wang
Zhongjie Ye
Yuexian Zou
Wenwu Wang
57
0
0
05 Apr 2022
Target Confusion in End-to-end Speaker Extraction: Analysis and
  Approaches
Target Confusion in End-to-end Speaker Extraction: Analysis and Approaches
Zifeng Zhao
Dongchao Yang
Rongzhi Gu
Haoran Zhang
Yuexian Zou
44
19
0
04 Apr 2022
Detect what you want: Target Sound Detection
Detect what you want: Target Sound Detection
Dongchao Yang
Helin Wang
Yuexian Zou
Fan Cui
Chao Weng
95
7
0
19 Dec 2021
LiMuSE: Lightweight Multi-modal Speaker Extraction
LiMuSE: Lightweight Multi-modal Speaker Extraction
Qinghua Liu
Yating Huang
Yunzhe Hao
Jiaming Xu
Bo Xu
67
6
0
07 Nov 2021
Target Speech Extraction: Independent Vector Extraction Guided by
  Supervised Speaker Identification
Target Speech Extraction: Independent Vector Extraction Guided by Supervised Speaker Identification
J. Málek
Jakub Janský
Zbyněk Koldovský
Tomás Kounovský
Jaroslav Cmejla
J. Zdánský
50
10
0
05 Nov 2021
USEV: Universal Speaker Extraction with Visual Cue
USEV: Universal Speaker Extraction with Visual Cue
Zexu Pan
Meng Ge
Haizhou Li
70
44
0
30 Sep 2021
Lightweight Dual-channel Target Speaker Separation for Mobile Voice
  Communication
Lightweight Dual-channel Target Speaker Separation for Mobile Voice Communication
Yuanyuan Bao
Yanze Xu
Na Xu
Wenjing Yang
Hongfeng Li
Shicong Li
Y. Jia
Fei Xiang
Jincheng He
Ming Li
87
1
0
05 Jun 2021
1