Improving speaker discrimination of target speech extraction with time-domain SpeakerBeam

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020

23 January 2020

Papers citing "Improving speaker discrimination of target speech extraction with time-domain SpeakerBeam"

50 / 71 papers shown

Binaural Target Speaker Extraction using Individualized HRTF

Yoav Ellinson

Sharon Gannot

290

25 Jul 2025

Two-stage Audio-Visual Target Speaker Extraction System for Real-Time Processing On Edge Device

230

28 May 2025

Unified Architecture and Unsupervised Speech Disentanglement for Speaker Embedding-Free Enrollment in Personalized Speech Enhancement

Ziling Huang

Haixin Guan

Yanhua Long

267

18 May 2025

TS-SUPERB: A Target Speech Processing Benchmark for Speech Self-Supervised Learning ModelsIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2025

327

10 May 2025

Listen to Extract: Onset-Prompted Target Speaker Extraction

395

08 May 2025

Contextual Speech Extraction: Leveraging Textual History as an Implicit Cue for Target Speech ExtractionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2025

347

13 Mar 2025

End-to-End Multi-Microphone Speaker Extraction Using Relative Transfer Functions

Aviad Eisenberg

Sharon Gannot

Shlomo E. Chazan

249

10 Feb 2025

SEF-PNet: Speaker Encoder-Free Personalized Speech Enhancement with Local and Global Contexts AggregationIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2025

172

20 Jan 2025

Investigation of Speaker Representation for Target-Speaker Speech ProcessingSpoken Language Technology Workshop (SLT), 2024

267

15 Oct 2024

Two-stage Framework for Robust Speech Emotion Recognition Using Target Speaker Extraction in Human Speech Noise ConditionsAsia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), 2024

Jinyi Mi

Tomoki Toda

242

29 Sep 2024

Generative Speech Foundation Model Pretraining for High-Quality Speech Extraction and Restoration

329

24 Sep 2024

WeSep: A Scalable and Flexible Toolkit Towards Generalizable Target Speaker ExtractionInterspeech (Interspeech), 2024

Shuai Wang

Ke Zhang

Shaoxiong Lin

Junjie Li

Xuefei Wang

Meng Ge

Jianwei Yu

Yanmin Qian

Haizhou Li

235

24 Sep 2024

On the effectiveness of enrollment speech augmentation for Target Speaker ExtractionSpoken Language Technology Workshop (SLT), 2024

Junjie Li

Ke Zhang

Shuai Wang

Haizhou Li

Man-Wai Mak

Kong Aik Lee

179

15 Sep 2024

DENSE: Dynamic Embedding Causal Target Speech Extraction

Yiwen Wang

Zeyu Yuan

Xihong Wu

234

10 Sep 2024

USEF-TSE: Universal Speaker Embedding Free Target Speaker ExtractionIEEE Transactions on Audio, Speech, and Language Processing (TASLP), 2024

Bang Zeng

Ming Li

492

04 Sep 2024

SpeakerBeam-SS: Real-time Target Speaker Extraction with Lightweight Conv-TasNet and State Space Modeling

Tsubasa Ochiai

Marc Delcroix

255

01 Jul 2024

Target Speech Extraction with Pre-trained Self-supervised Learning Models

284

17 Feb 2024

Probing Self-supervised Learning Models with Target Speech Extraction

307

17 Feb 2024

ESPnet-SPK: full pipeline speaker embedding toolkit with reproducible recipes, self-supervised front-ends, and off-the-shelf models

Jee-weon Jung

Wangyou Zhang

Jiatong Shi

Zakaria Aldeneh

Takuya Higuchi

B. Theobald

Ahmed Hussen Abdelaziz

Shinji Watanabe

505

30 Jan 2024

Spatial-Temporal Activity-Informed Diarization and Separation

Yicheng Hsu

Ssuhan Chen

Mingsian R. Bai

259

30 Jan 2024

3S-TSE: Efficient Three-Stage Target Speaker Extraction for Real-Time and Low-Resource Applications

Fei Chen

Xueliang Zhang

296

18 Dec 2023

Typing to Listen at the Cocktail Party: Text-Guided Target Speaker ExtractionIEEE Transactions on Cognitive and Developmental Systems (IEEE TCDS), 2023

Kay Chen Tan

441

11 Oct 2023

The second multi-channel multi-party meeting transcription challenge (M2MeT) 2.0): A benchmark for speaker-attributed ASRAutomatic Speech Recognition & Understanding (ASRU), 2023

...

Kong Aik Lee

Hui Bu

317

24 Sep 2023

Target Speech Extraction with Conditional Diffusion ModelInterspeech (Interspeech), 2023

281

08 Aug 2023

MC-SpEx: Towards Effective Speaker Extraction with Multi-Scale Interfusion and Conditional Speaker ModulationInterspeech (Interspeech), 2023

Jun Chen

Zhiyong Wu

296

28 Jun 2023

Beamformer-Guided Target Speaker ExtractionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

Mohamed Elminshawi

Srikanth Raj Chetupalli

Emanuel Habets

179

15 Mar 2023

Target Sound Extraction with Variable Cross-modality CluesIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

205

15 Mar 2023

Online Binaural Speech Separation of Moving Speakers With a Wavesplit NetworkIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

Cong Han

N. Mesgarani

177

13 Mar 2023

A two-stage speaker extraction algorithm under adverse acoustic conditions using a single-microphoneEuropean Signal Processing Conference (EUSIPCO), 2023

Aviad Eisenberg

Sharon Gannot

Shlomo E. Chazan

392

13 Mar 2023

X-SepFormer: End-to-end Speaker Extraction Network with Explicit Optimization on Speaker ConfusionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

296

09 Mar 2023

A Framework for Unified Real-time Personalized and Non-Personalized Speech EnhancementIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

183

23 Feb 2023

Improving Target Speaker Extraction with Sparse LDA-transformed Speaker Embeddings

179

16 Jan 2023

Array Configuration-Agnostic Personalized Speech Enhancement using Long-Short-Term Spatial CoherenceJournal of the Acoustical Society of America (JASA), 2022

Yicheng Hsu

Yonghan Lee

M. Bai

270

16 Nov 2022

Breaking the trade-off in personalized speech enhancement with cross-task knowledge distillationIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

H. Taherian

Sefik Emre Eskimez

Takuya Yoshioka

193

05 Nov 2022

Real-Time Joint Personalized Speech Enhancement and Acoustic Echo CancellationInterspeech (Interspeech), 2022

277

04 Nov 2022

Hierarchical speaker representation for target speaker extractionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

Xueliang Zhang

358

28 Oct 2022

Deformable Temporal Convolutional Networks for Monaural Noisy Reverberant Speech SeparationIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

William Ravenscroft

Stefan Goetze

Thomas Hain

404

27 Oct 2022

Quantitative Evidence on Overlooked Aspects of Enrollment Speaker Embeddings for Target Speaker SeparationIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

Xiaoyu Liu

Xu Li

Joan Serrà

224

23 Oct 2022

Streaming Target-Speaker ASR with Neural TransducerInterspeech (Interspeech), 2022

382

09 Sep 2022

Analysis of impact of emotions on target speech extraction and speech separationInternational Workshop on Acoustic Signal Enhancement (IWAENC), 2022

203

15 Aug 2022

Multi-channel target speech enhancement based on ERB-scaled spatial coherence features

Yicheng Hsu

Yonghan Lee

M. Bai

175

17 Jul 2022

Semi-supervised Time Domain Target Speaker Extraction with Attention

Zhepei Wang

Ritwik Giri

Shrikant Venkataramani

191

18 Jun 2022

Strategies to Improve Robustness of Target Speech Extraction to Enrollment VariationsInterspeech (Interspeech), 2022

149

16 Jun 2022

Personalized Acoustic Echo Cancellation for Full-duplex CommunicationsInterspeech (Interspeech), 2022

302

30 May 2022

Speaker Reinforcement Using Target Source Extraction for Robust Automatic Speech RecognitionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

Catalin Zorila

R. Doddipatla

256

09 May 2022

Speaker-Aware Mixture of Mixtures Training for Weakly Supervised Speaker ExtractionInterspeech (Interspeech), 2022

Dongchao Yang

173

15 Apr 2022

Listen only to me! How well can target speech extraction handle false alarms?Interspeech (Interspeech), 2022

228

11 Apr 2022

SoundBeam: Target sound extraction conditioned on sound-class labels and enrollment clues for increased performance and continuous learningIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022

Marc Delcroix

Jorge Bennasar Vázquez

402

08 Apr 2022

Target Confusion in End-to-end Speaker Extraction: Analysis and ApproachesInterspeech (Interspeech), 2022

Dongchao Yang

215

04 Apr 2022

A Hybrid Continuity Loss to Reduce Over-Suppression for Time-domain Target Speaker ExtractionInterspeech (Interspeech), 2022

Zexu Pan

Meng Ge

Haizhou Li

316

31 Mar 2022