v1v2 (latest)

TasNet: time-domain audio separation network for real-time, single-channel speech separation

1 November 2017

Yi Luo

N. Mesgarani

ArXiv (abs)PDF HTML

Papers citing "TasNet: time-domain audio separation network for real-time, single-channel speech separation"

50 / 283 papers shown

Probing Self-supervised Learning Models with Target Speech Extraction

266

17 Feb 2024

Unrestricted Global Phase Bias-Aware Single-channel Speech Enhancement with Conformer-based Metric GAN

214

13 Feb 2024

Boosting Unknown-number Speaker Separation with Transformer Decoder-based AttractorIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024

Shinji Watanabe

155

23 Jan 2024

Consistent and Relevant: Rethink the Query Embedding in General Sound Separation

Dongchao Yang

Zhiyong Wu

172

24 Dec 2023

MossFormer2: Combining Transformer and RNN-Free Recurrent Network for Enhanced Time-Domain Monaural Speech Separation

281

19 Dec 2023

Amphion: An Open-Source Audio, Music and Speech Generation ToolkitSpoken Language Technology Workshop (SLT), 2023

Xueyao Zhang

Liumeng Xue

Yicheng Gu

Yuancheng Wang

Haorui He

...

Haizhou Li

262

15 Dec 2023

LSTM-CNN Network for Audio Signature Analysis in Noisy Environments

Praveen Damacharla

Hamid Rajabalipanah

M. Fakheri

102

12 Dec 2023

Subspace Hybrid MVDR Beamforming for Augmented Hearing

117

30 Nov 2023

Improving Label Assignments Learning by Dynamic Sample Dropout Combined with Layer-wise Optimization in Speech SeparationInterspeech (Interspeech), 2023

Chenyu Gao

Yue Gu

I. Marsic

292

20 Nov 2023

LAVSS: Location-Guided Audio-Visual Spatial Audio SeparationIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023

Yuxin Ye

Wenming Yang

Yapeng Tian

217

31 Oct 2023

Audio-Visual Speaker Tracking: Progress, Challenges, and Future Directions

642

23 Oct 2023

Real-time Speech Enhancement and Separation with a Unified Deep Neural Network for Single/Dual Talker Scenarios

Kashyap Patel

A. Kovalyov

Issa Panahi

214

16 Oct 2023

On Time Domain Conformer Models for Monaural Speech Separation in Noisy Reverberant Acoustic EnvironmentsAutomatic Speech Recognition & Understanding (ASRU), 2023

William Ravenscroft

Stefan Goetze

Thomas Hain

177

09 Oct 2023

An Exploration of Task-decoupling on Two-stage Neural Post Filter for Real-time Personalized Acoustic Echo CancellationAutomatic Speech Recognition & Understanding (ASRU), 2023

Ziqian Wang

Lei Xie

154

07 Oct 2023

Unravel Anomalies: An End-to-end Seasonal-Trend Decomposition Approach for Time Series Anomaly DetectionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

247

30 Sep 2023

RTFS-Net: Recurrent Time-Frequency Modelling for Efficient Audio-Visual Speech SeparationInternational Conference on Learning Representations (ICLR), 2023

Samuel Pegg

Kai Li

Xiaolin Hu

418

29 Sep 2023

Unifying Robustness and Fidelity: A Comprehensive Study of Pretrained Generative Methods for Speech Enhancement in Adverse Conditions

Dong Yu

225

16 Sep 2023

A Generalized Bandsplit Neural Network for Cinematic Audio Source SeparationIEEE Open Journal of Signal Processing (IEEE Open J. Signal Process.), 2023

Karn N. Watcharasupat

Aaron J. Hipple. Phillip A. Williams

Scott Kramer

Alexander Lerch

W. Wolcott

297

05 Sep 2023

Blind Source Separation of Single-Channel Mixtures via Multi-Encoder Autoencoders

Matthew B. Webster

Joonnyong Lee

328

31 Aug 2023

Sparks of Large Audio Models: A Survey and Outlook

...

Björn W. Schuller

676

24 Aug 2023

Convoifilter: A case study of doing cocktail party speech recognition

Thai-Binh Nguyen

A. Waibel

235

22 Aug 2023

Audio-visual video-to-speech synthesis with synthesized input audio

Triantafyllos Kefalas

Yannis Panagakis

Maja Pantic

VGen DiffM

281

31 Jul 2023

Monaural Multi-Speaker Speech Separation Using Efficient Transformer Model

211

29 Jul 2023

An Efficient Speech Separation Network Based on Recurrent Fusion Dilated Convolution and Channel AttentionInterspeech (Interspeech), 2023

Junyu Wang

110

09 Jun 2023

Towards Solving Cocktail-Party: The First Method to Build a Realistic Dataset with Ground Truths for Speech Separation

Rawad Melhem

Assef Jafar

Oumayma Al Dakkak

161

25 May 2023

Speech Separation based on Contrastive Learning and Deep Modularization

Peter Ochieng

SSL

271

18 May 2023

BASEN: Time-Domain Brain-Assisted Speech Enhancement Network with Convolutional Cross Attention in Multi-talker ConditionsInterspeech (Interspeech), 2023

166

17 May 2023

ForkNet: Simultaneous Time and Time-Frequency Domain Modeling for Speech Enhancement

Feng Dang

Qi Hu

Pengyuan Zhang

Yonghong Yan

187

15 May 2023

Universal Source Separation with Weakly Labelled Data

Taylor Berg-Kirkpatrick

Shlomo Dubnov

Mark D. Plumbley

184

11 May 2023

Adversarial Generative NMF for Single Channel Source Separation

Martin Ludvigsen

M. Grasmair

GAN

134

24 Apr 2023

Wav2code: Restore Clean Speech Representations via Codebook Lookup for Noise-Robust ASRIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2023

Yuchen Hu

Cheng Chen

Qiu-shi Zhu

Eng Siong Chng

298

11 Apr 2023

Pac-HuBERT: Self-Supervised Music Source Separation via Primitive Auditory Clustering and Hidden-Unit BERT

169

04 Apr 2023

End-to-End Integration of Speech Separation and Voice Activity Detection for Low-Latency Diarization of Telephone ConversationsSpeech Communication (Speech Commun.), 2023

320

21 Mar 2023

Multi-Channel Masking with Learnable Filterbank for Sound Source Separation

Wang Dai

Archontis Politis

Maria Sandsten

133

14 Mar 2023

Multi-Microphone Speaker Separation by Spatial RegionsIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

Julian Wechsler

Srikanth Raj Chetupalli

Wolfgang Mack

Emanuel Habets

157

13 Mar 2023

Improving the Intent Classification accuracy in Noisy Environment

Mohamed Nabih Ali

Alessio Brutti

Daniele Falavigna

119

12 Mar 2023

Distribution Preserving Source Separation With Time Frequency Predictive ModelsEuropean Signal Processing Conference (EUSIPCO), 2023

Pedro J. Villasana T

J. Klejsa

Lars Villemoes

P. Hedelin

165

10 Mar 2023

MossFormer: Pushing the Performance Limit of Monaural Speech Separation using Gated Single-Head Transformer with Convolution-Augmented Joint Self-AttentionsIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

Shengkui Zhao

Bin Ma

213

23 Feb 2023

DasFormer: Deep Alternating Spectrogram Transformer for Multi/Single-Channel Speech SeparationIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

Shuo Wang

151

21 Feb 2023

A Sidecar Separator Can Convert a Single-Talker Speech Recognition System to a Multi-Talker OneIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

Lingwei Meng

185

20 Feb 2023

Hypernetworks build Implicit Neural Representations of Sounds

509

09 Feb 2023

Separate And Diffuse: Using a Pretrained Diffusion Model for Improving Source Separation

Shahar Lutati

Eliya Nachmani

Lior Wolf

DiffM

211

25 Jan 2023

Perceive and predict: self-supervised speech representation based loss functions for speech enhancementIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

Thomas Hain

198

11 Jan 2023

ReVISE: Self-Supervised Speech Resynthesis with Visual Input for Universal and Generalized Speech Enhancement

Yossi Adi

233

21 Dec 2022

DeFT-AN: Dense Frequency-Time Attentive Network for Multichannel Speech EnhancementIEEE Signal Processing Letters (SPL), 2022

Dongheon Lee

Jung-Woo Choi

321

15 Dec 2022

Multi-Scale Feature Fusion Transformer Network for End-to-End Single Channel Speech Separation

150

14 Dec 2022

NBC2: Multichannel Speech Separation with Revised Narrow-band Conformer

Changsheng Quan

Xiaofei Li

155

05 Dec 2022

Deep neural network techniques for monaural speech enhancement: state of the art analysisArtificial Intelligence Review (Artif Intell Rev), 2022

P. Ochieng

269

01 Dec 2022

TF-GridNet: Integrating Full- and Sub-Band Modeling for Speech SeparationIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022

297

201

22 Nov 2022

Handling Trade-Offs in Speech Separation with Sparsely-Gated Mixture of Experts

169

11 Nov 2022