ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1711.00541
  4. Cited By
TasNet: time-domain audio separation network for real-time,
  single-channel speech separation
v1v2 (latest)

TasNet: time-domain audio separation network for real-time, single-channel speech separation

1 November 2017
Yi Luo
N. Mesgarani
ArXiv (abs)PDFHTML

Papers citing "TasNet: time-domain audio separation network for real-time, single-channel speech separation"

50 / 283 papers shown
DBNET: DOA-driven beamforming network for end-to-end farfield sound
  source separation
DBNET: DOA-driven beamforming network for end-to-end farfield sound source separation
Ali Aroudi
Sebastian Braun
84
9
0
22 Oct 2020
BERT for Joint Multichannel Speech Dereverberation with Spatial-aware
  Tasks
BERT for Joint Multichannel Speech Dereverberation with Spatial-aware Tasks
Yang Jiao
104
0
0
21 Oct 2020
The Cone of Silence: Speech Separation by Localization
The Cone of Silence: Speech Separation by Localization
Teerapat Jenrungrot
V. Jayaram
S. M. Seitz
Ira Kemelmacher-Shlizerman
189
64
0
12 Oct 2020
Toward Speech Separation in The Pre-Cocktail Party Problem with TasTas
Toward Speech Separation in The Pre-Cocktail Party Problem with TasTas
Ziqiang Shi
Jiqing Han
108
0
0
07 Sep 2020
SEANet: A Multi-modal Speech Enhancement Network
SEANet: A Multi-modal Speech Enhancement NetworkInterspeech (Interspeech), 2020
Marco Tagliasacchi
Yunpeng Li
Karolis Misiunas
Dominik Roblek
319
99
0
04 Sep 2020
Improved Lite Audio-Visual Speech Enhancement
Improved Lite Audio-Visual Speech EnhancementIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2020
Shang-Yi Chuang
Hsin-Min Wang
Yu Tsao
296
43
0
30 Aug 2020
Efficient Low-Latency Speech Enhancement with Mobile Audio Streaming
  Networks
Efficient Low-Latency Speech Enhancement with Mobile Audio Streaming Networks
Michal Romaniuk
Piotr Masztalski
K. Piaskowski
M. Matuszewski
78
5
0
17 Aug 2020
Textual Echo Cancellation
Textual Echo Cancellation
Shaojin Ding
Ye Jia
Ke Hu
Quan Wang
292
8
0
13 Aug 2020
Speech Separation Based on Multi-Stage Elaborated Dual-Path Deep BiLSTM
  with Auxiliary Identity Loss
Speech Separation Based on Multi-Stage Elaborated Dual-Path Deep BiLSTM with Auxiliary Identity LossInterspeech (Interspeech), 2020
Ziqiang Shi
Rujie Liu
Jiqing Han
148
7
0
06 Aug 2020
Dual-Path Transformer Network: Direct Context-Aware Modeling for
  End-to-End Monaural Speech Separation
Dual-Path Transformer Network: Direct Context-Aware Modeling for End-to-End Monaural Speech SeparationInterspeech (Interspeech), 2020
Jing-jing Chen
Qi-rong Mao
Dong Liu
368
330
0
28 Jul 2020
Sequence to Multi-Sequence Learning via Conditional Chain Mapping for
  Mixture Signals
Sequence to Multi-Sequence Learning via Conditional Chain Mapping for Mixture SignalsNeural Information Processing Systems (NeurIPS), 2020
Jing Shi
Xuankai Chang
Pengcheng Guo
Shinji Watanabe
Yusuke Fujita
Jiaming Xu
Bo Xu
Lei Xie
170
25
0
25 Jun 2020
Speaker-Conditional Chain Model for Speech Separation and Extraction
Speaker-Conditional Chain Model for Speech Separation and ExtractionInterspeech (Interspeech), 2020
Jing Shi
Jiaming Xu
Yusuke Fujita
Shinji Watanabe
Bo Xu
BDL
144
23
0
25 Jun 2020
Multi-talker ASR for an unknown number of sources: Joint training of
  source counting, separation and ASR
Multi-talker ASR for an unknown number of sources: Joint training of source counting, separation and ASR
Thilo von Neumann
Christoph Boeddeker
Lukas Drude
K. Kinoshita
Marc Delcroix
Tomohiro Nakatani
Reinhold Haeb-Umbach
235
45
0
04 Jun 2020
Efficient Integration of Multi-channel Information for
  Speaker-independent Speech Separation
Efficient Integration of Multi-channel Information for Speaker-independent Speech Separation
Yuichiro Koyama
Oluwafemi Azeez
Bhiksha Raj
98
4
0
23 May 2020
Exploring the Best Loss Function for DNN-Based Low-latency Speech
  Enhancement with Temporal Convolutional Networks
Exploring the Best Loss Function for DNN-Based Low-latency Speech Enhancement with Temporal Convolutional Networks
Yuichiro Koyama
Tyler Vuong
Stefan Uhlich
Bhiksha Raj
246
45
0
23 May 2020
Atss-Net: Target Speaker Separation via Attention-based Neural Network
Atss-Net: Target Speaker Separation via Attention-based Neural Network
Tingle Li
Qingjian Lin
Yuanyuan Bao
Ming Li
94
41
0
19 May 2020
Multimodal Target Speech Separation with Voice and Face References
Multimodal Target Speech Separation with Voice and Face References
Leyuan Qu
C. Weber
S. Wermter
CVBM
186
21
0
17 May 2020
Dual-Signal Transformation LSTM Network for Real-Time Noise Suppression
Dual-Signal Transformation LSTM Network for Real-Time Noise Suppression
Nils L. Westhausen
B. Meyer
218
121
0
15 May 2020
FaceFilter: Audio-visual speech separation using still images
FaceFilter: Audio-visual speech separation using still images
Soo-Whan Chung
Soyeon Choe
Joon Son Chung
Hong-Goo Kang
CVBM
167
75
0
14 May 2020
Cognitive-driven convolutional beamforming using EEG-based auditory
  attention decoding
Cognitive-driven convolutional beamforming using EEG-based auditory attention decoding
Ali Aroudi
Marc Delcroix
Tomohiro Nakatani
K. Kinoshita
S. Araki
Simon Doclo
83
6
0
10 May 2020
Asteroid: the PyTorch-based audio source separation toolkit for
  researchers
Asteroid: the PyTorch-based audio source separation toolkit for researchers
Manuel Pariente
Samuele Cornell
Joris Cosentino
S. Sivasankaran
Efthymios Tzinis
...
Juan M. Martín-Donas
David Ditter
Ariel Frank
Antoine Deleforge
Emmanuel Vincent
250
170
0
08 May 2020
Time-domain speaker extraction network
Time-domain speaker extraction networkAutomatic Speech Recognition & Understanding (ASRU), 2019
Chenglin Xu
Wei Rao
Chng Eng Siong
Haizhou Li
100
58
0
29 Apr 2020
SpEx: Multi-Scale Time Domain Speaker Extraction Network
SpEx: Multi-Scale Time Domain Speaker Extraction NetworkIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2020
Chenglin Xu
Wei Rao
Eng Siong Chng
Haizhou Li
151
202
0
17 Apr 2020
Two-stage model and optimal SI-SNR for monaural multi-speaker speech separation in noisy environment
Chao Ma
Dongmei Li
Xupeng Jia
139
5
0
14 Apr 2020
Conditioned Source Separation for Music Instrument Performances
Conditioned Source Separation for Music Instrument PerformancesIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2020
Olga Slizovskaia
G. Haro
E. Gómez
245
43
0
08 Apr 2020
Separating Varying Numbers of Sources with Auxiliary Autoencoding Loss
Separating Varying Numbers of Sources with Auxiliary Autoencoding LossInterspeech (Interspeech), 2020
Yi Luo
N. Mesgarani
186
30
0
27 Mar 2020
Deep Attention Fusion Feature for Speech Separation with End-to-End
  Post-filter Method
Deep Attention Fusion Feature for Speech Separation with End-to-End Post-filter Method
Cunhang Fan
Jianhua Tao
B. Liu
Jiangyan Yi
Zhengqi Wen
Zhengqi Wen
131
9
0
17 Mar 2020
Robust Robotic Pouring using Audition and Haptics
Robust Robotic Pouring using Audition and HapticsIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2020
Hongzhuo Liang
Chuangchuang Zhou
Shuang Li
Xiaojian Ma
Norman Hendrich
Timo Gerkmann
F. Sun
Marcus Stoffel
Jianwei Zhang
380
22
0
29 Feb 2020
Voice Separation with an Unknown Number of Multiple Speakers
Voice Separation with an Unknown Number of Multiple SpeakersInternational Conference on Machine Learning (ICML), 2020
Eliya Nachmani
Yossi Adi
Lior Wolf
386
183
0
29 Feb 2020
Wavesplit: End-to-End Speech Separation by Speaker Clustering
Wavesplit: End-to-End Speech Separation by Speaker ClusteringIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2020
Neil Zeghidour
David Grangier
VLM
344
283
0
20 Feb 2020
Efficient Trainable Front-Ends for Neural Speech Enhancement
Efficient Trainable Front-Ends for Neural Speech EnhancementIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
Jonah Casebeer
Umut Isik
Shrikant Venkataramani
A. Krishnaswamy
AI4TS
83
3
0
20 Feb 2020
Content Based Singing Voice Extraction From a Musical Mixture
Content Based Singing Voice Extraction From a Musical MixtureIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
Pritish Chandna
Merlijn Blaauw
J. Bonada
E. Gómez
207
16
0
12 Feb 2020
WaveTTS: Tacotron-based TTS with Joint Time-Frequency Domain Loss
WaveTTS: Tacotron-based TTS with Joint Time-Frequency Domain LossThe Speaker and Language Recognition Workshop (Odyssey), 2020
Rui Liu
Berrak Sisman
F. Bao
Guanglai Gao
Haizhou Li
368
14
0
02 Feb 2020
Continuous speech separation: dataset and analysis
Continuous speech separation: dataset and analysisIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
Zhuo Chen
Takuya Yoshioka
Liang Lu
Tianyan Zhou
Zhong Meng
Yi Luo
Jian Wu
Xiong Xiao
Jinyu Li
266
240
0
30 Jan 2020
LaFurca: Iterative Refined Speech Separation Based on Context-Aware
  Dual-Path Parallel Bi-LSTM
LaFurca: Iterative Refined Speech Separation Based on Context-Aware Dual-Path Parallel Bi-LSTM
Ziqiang Shi
Rujie Liu
Jiqing Han
179
4
0
23 Jan 2020
Temporal-Spatial Neural Filter: Direction Informed End-to-End
  Multi-channel Target Speech Separation
Temporal-Spatial Neural Filter: Direction Informed End-to-End Multi-channel Target Speech Separation
Rongzhi Gu
Yuexian Zou
138
20
0
02 Jan 2020
Utterance-level Permutation Invariant Training with Latency-controlled
  BLSTM for Single-channel Multi-talker Speech Separation
Utterance-level Permutation Invariant Training with Latency-controlled BLSTM for Single-channel Multi-talker Speech SeparationAsia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), 2019
Lu Huang
Gaofeng Cheng
Pengyuan Zhang
Yi Yang
Shumin Xu
Jiasong Sun
110
8
0
25 Dec 2019
CNN-LSTM models for Multi-Speaker Source Separation using Bayesian Hyper
  Parameter Optimization
CNN-LSTM models for Multi-Speaker Source Separation using Bayesian Hyper Parameter OptimizationInterspeech (Interspeech), 2019
Jeroen Zegers
Hugo Van hamme
BDL
82
8
0
19 Dec 2019
End-to-end training of time domain audio separation and recognition
End-to-end training of time domain audio separation and recognitionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019
Thilo von Neumann
K. Kinoshita
Lukas Drude
Christoph Boeddeker
Marc Delcroix
Tomohiro Nakatani
Reinhold Haeb-Umbach
203
35
0
18 Dec 2019
A Unified Framework for Speech Separation
A Unified Framework for Speech Separation
F. Bahmaninezhad
Shi-Xiong Zhang
Yong-mei Xu
Meng Yu
John H. L. Hansen
Dong Yu
139
4
0
17 Dec 2019
MITAS: A Compressed Time-Domain Audio Separation Network with Parameter
  Sharing
MITAS: A Compressed Time-Domain Audio Separation Network with Parameter Sharing
Chao-I Tuan
Yuan-Kuei Wu
Hung-yi Lee
Yu Tsao
82
2
0
09 Dec 2019
Audio-Visual Target Speaker Enhancement on Multi-Talker Environment
  using Event-Driven Cameras
Audio-Visual Target Speaker Enhancement on Multi-Talker Environment using Event-Driven Cameras
A. Arriandiaga
Giovanni Morrone
Luca Pasa
Leonardo Badino
Chiara Bartolozzi
149
1
0
05 Dec 2019
Music Source Separation in the Waveform Domain
Music Source Separation in the Waveform Domain
Alexandre Défossez
Nicolas Usunier
Léon Bottou
Francis R. Bach
343
302
0
27 Nov 2019
Online Spectrogram Inversion for Low-Latency Audio Source Separation
Online Spectrogram Inversion for Low-Latency Audio Source SeparationIEEE Signal Processing Letters (SPL), 2019
P. Magron
Maria Sandsten
288
14
0
08 Nov 2019
Closing the Training/Inference Gap for Deep Attractor Networks
Closing the Training/Inference Gap for Deep Attractor Networks
C. Cadoux
Stefan Uhlich
Marc Ferras
Yuki Mitsufuji
99
2
0
05 Nov 2019
Onssen: an open-source speech separation and enhancement library
Onssen: an open-source speech separation and enhancement library
Zhaoheng Ni
Michael I. Mandel
VLM
229
8
0
03 Nov 2019
End-to-end Non-Negative Autoencoders for Sound Source Separation
End-to-end Non-Negative Autoencoders for Sound Source SeparationIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019
Shrikant Venkataramani
Efthymios Tzinis
Paris Smaragdis
263
5
0
31 Oct 2019
Interrupted and cascaded permutation invariant training for speech
  separation
Interrupted and cascaded permutation invariant training for speech separationIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019
Gene-Ping Yang
Szu-Lin Wu
Yao-Wen Mao
Hung-yi Lee
Lin-Shan Lee
96
14
0
28 Oct 2019
A Multi-Phase Gammatone Filterbank for Speech Separation via TasNet
A Multi-Phase Gammatone Filterbank for Speech Separation via TasNetIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019
David Ditter
Timo Gerkmann
212
62
0
25 Oct 2019
Multi-channel Speech Separation Using Deep Embedding Model with
  Multilayer Bootstrap Networks
Multi-channel Speech Separation Using Deep Embedding Model with Multilayer Bootstrap Networks
Ziye Yang
Xiao-Lei Zhang
119
1
0
24 Oct 2019
Previous
123456
Next