v1v2 (latest)

TasNet: time-domain audio separation network for real-time, single-channel speech separation

1 November 2017

Yi Luo

N. Mesgarani

ArXiv (abs)PDF HTML

Papers citing "TasNet: time-domain audio separation network for real-time, single-channel speech separation"

50 / 283 papers shown

HyperSound: Generating Implicit Neural Representations of Audio Signals with Hypernetworks

400

03 Nov 2022

Audio-Visual Speech Enhancement and Separation by Utilizing Multi-Modal Self-Supervised Embeddings

Yu Tsao

280

31 Oct 2022

UX-NET: Filter-and-Process-based Improved U-Net for Real-time Time-domain Audio SeparationIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

Kashyap Patel

A. Kovalyov

Issa Panahi

139

28 Oct 2022

CasNet: Investigating Channel Robustness for Speech Separation

Fan Wang

Yao-Fei Cheng

Hung-Shin Lee

Yu Tsao

Hsin-Min Wang

113

27 Oct 2022

Deformable Temporal Convolutional Networks for Monaural Noisy Reverberant Speech SeparationIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

William Ravenscroft

Stefan Goetze

Thomas Hain

300

27 Oct 2022

Individualized Conditioning and Negative Distances for Speaker SeparationInternational Conference on Machine Learning and Applications (ICMLA), 2022

155

12 Oct 2022

Speech Enhancement with Perceptually-motivated Optimization and Dual TransformationsAsia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), 2022

113

24 Sep 2022

Streaming Target-Speaker ASR with Neural TransducerInterspeech (Interspeech), 2022

320

09 Sep 2022

TF-GridNet: Making Time-Frequency Domain Models Great Again for Monaural Speaker SeparationIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

310

155

08 Sep 2022

Music Separation Enhancement with Generative ModelingInternational Society for Music Information Retrieval Conference (ISMIR), 2022

213

26 Aug 2022

Conv-NILM-Net, a causal and multi-appliance model for energy source separation

Mohamed Alami Chehboune

Jérémie Decock

Rim Kaddah

Jesse Read

133

03 Aug 2022

Spatial Aware Multi-Task Learning Based Speech SeparationIEEE International Conference on Mobile Adhoc and Sensor Systems (MASS), 2022

Wei Sun

Mei Wang

L. Qiu

110

20 Jul 2022

ESPnet-SE++: Speech Enhancement for Robust Speech Recognition, Translation, and UnderstandingInterspeech (Interspeech), 2022

Wangyou Zhang

...

214

19 Jul 2022

An Evaluation of Three-Stage Voice Conversion Framework for Noisy and Reverberant ConditionsInterspeech (Interspeech), 2022

155

30 Jun 2022

Tiny-Sepformer: A Tiny Time-Domain Transformer Network for Speech SeparationInterspeech (Interspeech), 2022

152

28 Jun 2022

An Empirical Analysis on the Vulnerabilities of End-to-End Speech Segregation ModelsInterspeech (Interspeech), 2022

111

20 Jun 2022

Resource-Efficient Separation TransformerIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

Mirco Ravanelli

178

19 Jun 2022

Simultaneous Speech Extraction for Multiple Target Speakers under the Meeting Scenarios

Bang Zeng

Weiqing Wang

Yuanyuan Bao

Ming Li

133

17 Jun 2022

Strategies to Improve Robustness of Target Speech Extraction to Enrollment VariationsInterspeech (Interspeech), 2022

103

16 Jun 2022

On the Design and Training Strategies for RNN-based Online Neural Speech Separation SystemsIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

Kai Li

Yi Luo

214

15 Jun 2022

Joint Training of Speech Enhancement and Self-supervised Model for Noise-robust ASR

192

26 May 2022

SepIt: Approaching a Single Channel Speech Separation BoundInterspeech (Interspeech), 2022

333

24 May 2022

A Conformer-based Waveform-domain Neural Acoustic Echo Canceller Optimized for ASR AccuracyInterspeech (Interspeech), 2022

158

06 May 2022

Mask scalar prediction for improving robust automatic speech recognition

185

26 Apr 2022

Receptive Field Analysis of Temporal Convolutional Networks for Monaural Speech DereverberationEuropean Signal Processing Conference (EUSIPCO), 2022

William Ravenscroft

Stefan Goetze

Thomas Hain

138

13 Apr 2022

The Rise and Fall of Robotic World (A case study of WALL-E)

Faisal Ghaffar

08 Apr 2022

tPLCnet: Real-time Deep Packet Loss Concealment in the Time Domain Using a Short Temporal ContextInterspeech (Interspeech), 2022

Nils L. Westhausen

B. Meyer

165

04 Apr 2022

Improving Target Sound Extraction with Timestamp InformationInterspeech (Interspeech), 2022

Helin Wang

Dongchao Yang

Chao Weng

Jianwei Yu

Yuexian Zou

200

02 Apr 2022

End-to-End Integration of Speech Recognition, Speech Enhancement, and Self-Supervised Learning RepresentationInterspeech (Interspeech), 2022

266

01 Apr 2022

Disentangling the Impacts of Language and Channel Variability on Speech Separation NetworksInterspeech (Interspeech), 2022

Fan Wang

Hung-Shin Lee

Yu Tsao

Hsin-Min Wang

139

30 Mar 2022

Embedding Recurrent Layers with Dual-Path Strategy in a Variant of Convolutional Network for Speaker-Independent Speech SeparationInterspeech (Interspeech), 2022

Xue Yang

C. Bao

130

25 Mar 2022

Harmonicity Plays a Critical Role in DNN Based Versus in Biologically-Inspired Monaural Speech Segregation SystemsIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

Rahil Parikh

Ilya Kavalerov

C. Espy-Wilson

Shihab Shamma Institute for Systems Research

103

08 Mar 2022

Deep Impulse Responses: Estimating and Parameterizing Filters with Deep NetworksIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

Alexander Richard

Peter Dodds

V. Ithapu

174

07 Feb 2022

Exploring Self-Attention Mechanisms for Speech SeparationIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022

Mirco Ravanelli

234

06 Feb 2022

New Insights on Target Speaker Extraction

Mohamed Elminshawi

Wolfgang Mack

Srikanth Raj Chetupalli

Soumitro Chakrabarty

Emanuel Habets

258

01 Feb 2022

SkiM: Skipping Memory LSTM for Low-Latency Real-Time Continuous Speech SeparationIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

315

26 Jan 2022

Learning to Enhance or Not: Neural Network-Based Switching of Enhanced and Observed Signals for Overlapping Speech RecognitionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

155

11 Jan 2022

Discretization and Re-synthesis: an alternative method to solve the Cocktail Party Problem

Bo Xu

221

17 Dec 2021

Zero-shot Audio Source Separation through Query-based Learning from Weakly-labeled Data

Ke Chen

Xingjian Du

Bilei Zhu

Zejun Ma

Taylor Berg-Kirkpatrick

Shlomo Dubnov

327

15 Dec 2021

Hybrid Neural Networks for On-device Directional Hearing

176

11 Dec 2021

A Time-domain Real-valued Generalized Wiener Filter for Multi-channel Neural Separation Systems

Yi Luo

244

07 Dec 2021

Speech Separation Using an Asynchronous Fully Recurrent Convolutional Neural NetworkNeural Information Processing Systems (NeurIPS), 2021

157

04 Dec 2021

BLOOM-Net: Blockwise Optimization for Masking Networks Toward Scalable and Efficient Speech Enhancement

Sunwoo Kim

Minje Kim

602

17 Nov 2021

MoRe-Fi: Motion-robust and Fine-grained Respiration Monitoring via Deep-Learning UWB RadarACM International Conference on Embedded Networked Sensor Systems (SenSys), 2021

265

126

16 Nov 2021

Monaural source separation: From anechoic to reverberant environmentsInternational Workshop on Acoustic Signal Enhancement (IWAENC), 2021

162

15 Nov 2021

Inter-channel Conv-TasNet for multichannel speech enhancement

Dongheon Lee

Seongrae Kim

Jung-Woo Choi

126

08 Nov 2021

Weight, Block or Unit? Exploring Sparsity Tradeoffs for Speech Enhancement on Tiny Neural Accelerators

178

03 Nov 2021

Reduction of Subjective Listening Effort for TV Broadcast Signals with Recurrent Neural NetworksIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2021

133

02 Nov 2021

Real-time Speaker counting in a cocktail party scenario using Attention-guided Convolutional Neural NetworkInterspeech (Interspeech), 2021

Midia Yousefi

John H. L. Hansen

109

30 Oct 2021

Cross-attention conformer for context modeling in speech enhancement for ASRAutomatic Speech Recognition & Understanding (ASRU), 2021

185

30 Oct 2021