v1v2 (latest)

TasNet: time-domain audio separation network for real-time, single-channel speech separation

1 November 2017

Yi Luo

N. Mesgarani

ArXiv (abs)PDF HTML

Papers citing "TasNet: time-domain audio separation network for real-time, single-channel speech separation"

50 / 283 papers shown

SA-SDR: A novel loss function for separation of meeting style dataIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021

197

29 Oct 2021

Closing the Gap Between Time-Domain Multi-Channel Speech Enhancement on Real and Simulation Conditions

Wangyou Zhang

147

27 Oct 2021

Progressive Learning for Stabilizing Label Selection in Speech Separation with Mapping-based Method

Chenyang Gao

Yue Gu

I. Marsic

248

20 Oct 2021

Singer separation for karaoke content generation

Hsuan-Yu Chen

Xuan-Bo Chen

J. Jang

130

13 Oct 2021

SDR -- Medium Rare with Fast Computations

Robin Scheibler

237

13 Oct 2021

All-neural beamformer for continuous speech separation

243

13 Oct 2021

Stepwise-Refining Speech Separation Network via Fine-Grained Encoding in High-order Latent DomainIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2021

Zengwei Yao

194

10 Oct 2021

Mean absorption estimation from room impulse responses using virtually supervised learningJournal of the Acoustical Society of America (JASA), 2021

C´edric Foy

Antoine Deleforge

Diego Di Carlo

125

01 Sep 2021

Learning Sparse Analytic Filters for Piano Transcription

Frank Cwitkowitz

M. Heydari

Z. Duan

288

23 Aug 2021

Graph-PIT: Generalized permutation invariant training for continuous separation of arbitrary numbers of speakersInterspeech (Interspeech), 2021

107

30 Jul 2021

Speeding Up Permutation Invariant Training for Source SeparationITG Conference on Speech Communication (ITG), 2021

150

30 Jul 2021

Multi-Task Audio Source Separation

135

14 Jul 2021

Towards Neural Diarization for Unlimited Numbers of Speakers Using Global and Local Attractors

Shota Horiguchi

Shinji Watanabe

Leibny Paola García-Perera

Yawen Xue

Yuki Takashima

Yohei Kawaguchi

193

04 Jul 2021

Audiovisual Singing Voice SeparationTransactions of the International Society for Music Information Retrieval (TISMIR), 2021

Bochen Li

Yuxuan Wang

Z. Duan

161

01 Jul 2021

Online Self-Attentive Gated RNNs for Real-Time Speaker Separation

Yossi Adi

109

25 Jun 2021

Basis-MelGAN: Efficient Neural Vocoder Based on Audio Decomposition

Zhengxi Liu

Y. Qian

DRL

108

25 Jun 2021

Deep neural network Based Low-latency Speech Separation with Asymmetric analysis-Synthesis Window PairEuropean Signal Processing Conference (EUSIPCO), 2021

113

22 Jun 2021

Multi-accent Speech Separation with One Shot Learning

Kuan-Po Huang

Yuan-Kuei Wu

Hung-yi Lee

194

22 Jun 2021

Encoder-Decoder Based Attractors for End-to-End Neural DiarizationIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2021

Leibny Paola García-Perera

213

20 Jun 2021

Independent Deeply Learned Tensor Analysis for Determined Audio Source SeparationEuropean Signal Processing Conference (EUSIPCO), 2021

Hiroshi Saruwatari

10 Jun 2021

Lightweight Dual-channel Target Speaker Separation for Mobile Voice Communication

199

05 Jun 2021

Many-Speakers Single Channel Speech Separation with Optimal Permutation TrainingInterspeech (Interspeech), 2021

366

18 Apr 2021

Time-domain Speech Enhancement with Generative Adversarial Learning

245

30 Mar 2021

On TasNet for Low-Latency Single-Speaker Speech Enhancement

176

27 Mar 2021

Looking into Your Speech: Learning Cross-modal Affinity for Audio-visual Speech SeparationComputer Vision and Pattern Recognition (CVPR), 2021

173

25 Mar 2021

Blind Speech Separation and Dereverberation using Neural BeamformingSpeech Communication (Speech Commun.), 2021

Lukas Pfeifenberger

Franz Pernkopf

140

24 Mar 2021

Tune-In: Training Under Negative Environments with Interference for Attention Networks Simulating Cocktail Party EffectAAAI Conference on Artificial Intelligence (AAAI), 2021

126

02 Mar 2021

Sandglasset: A Light Multi-Granularity Self-attentive Network For Time-Domain Speech SeparationIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021

189

01 Mar 2021

Dual-Path Modeling for Long Recording Speech Separation in MeetingsIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021

Cong Han

123

23 Feb 2021

TransMask: A Compact and Fast Speech Separation Model Based on TransformerIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021

Zining Zhang

Bingsheng He

Zhenjie Zhang

136

19 Feb 2021

Multichannel-based learning for audio object extractionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021

Daniel Arteaga

Jordi Pons

DiffM

263

11 Feb 2021

Multimodal Attention Fusion for Target Speaker ExtractionSpoken Language Technology Workshop (SLT), 2021

101

02 Feb 2021

Effective Low-Cost Time-Domain Audio Separation Using Globally Attentive Locally Recurrent NetworksSpoken Language Technology Workshop (SLT), 2021

231

13 Jan 2021

Neural Network-based Virtual Microphone EstimatorIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021

109

12 Jan 2021

Multi-channel Multi-frame ADL-MVDR for Target Speech SeparationIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2020

159

24 Dec 2020

The 2020 ESPnet update: new features, broadened applications, performance improvements, and future plans

...

Aswin Shanmugam Subramanian

Wangyou Zhang

VLM

198

23 Dec 2020

Group Communication with Context Codec for Lightweight Source SeparationIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2020

Yi Luo

Cong Han

N. Mesgarani

222

14 Dec 2020

Convolutive Transfer Function Invariant SDR training criteria for Multi-Channel Reverberant Speech SeparationIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020

Wangyou Zhang

183

30 Nov 2020

A comparison of handcrafted, parameterized, and learnable features for speech separationAsia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), 2020

189

29 Nov 2020

Streaming end-to-end multi-talker speech recognitionIEEE Signal Processing Letters (IEEE SPL), 2020

241

26 Nov 2020

Multi-Decoder DPRNN: High Accuracy Source Counting and Separation

Junzhe Zhu

Raymond A. Yeh

M. Hasegawa-Johnson

101

24 Nov 2020

Streaming Multi-speaker ASR with RNN-TIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020

Ilya Sklyar

A. Piunova

Yulan Liu

206

23 Nov 2020

Rethinking the Separation Layers in Speech Separation NetworksIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020

Cong Han

108

17 Nov 2020

Surrogate Source Model Learning for Determined Source Separation

Robin Scheibler

M. Togami

169

11 Nov 2020

Informed Source Extraction With Application to Acoustic Echo Reduction

Mohamed Elminshawi

Wolfgang Mack

Emanuel Habets

228

09 Nov 2020

ESPnet-se: end-to-end speech enhancement and separation toolkit designed for asr integration

Chenda Li

Jing Shi

Wangyou Zhang

Aswin Shanmugam Subramanian

...

210

07 Nov 2020

Phase Aware Speech Enhancement using Realisation of Complex-valued LSTM

Raktim Gautam Goswami

Sivaganesh Andhavarapu

Rama Murty

174

27 Oct 2020

Attention is All You Need in Speech SeparationIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020

Mirco Ravanelli

301

703

25 Oct 2020

Training Noisy Single-Channel Speech Separation With Noisy Oracle Sources: A Large Gap and A Small StepIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020

Matthew Maciejewski

Jing Shi

Shinji Watanabe

Sanjeev Khudanpur

168

23 Oct 2020

Towards Listening to 10 People Simultaneously: An Efficient Permutation Invariant Training of Audio Source Separation Using Sinkhorn's Algorithm

Hideyuki Tachibana

255

22 Oct 2020