TasNet: time-domain audio separation network for real-time, single-channel speech separation

1 November 2017
Yi Luo
N. Mesgarani

Papers citing "TasNet: time-domain audio separation network for real-time, single-channel speech separation"

50 / 283 papers shown
HyperSound: Generating Implicit Neural Representations of Audio Signals with Hypernetworks
Filip Szatkowski
Karol J. Piczak
Przemysław Spurek
Jacek Tabor
Tomasz Trzciński
400
18
0
03 Nov 2022
Audio-Visual Speech Enhancement and Separation by Utilizing Multi-Modal Self-Supervised Embeddings
Ethan Chern
Kuo-Hsuan Hung
Yi-Ting Chen
Tassadaq Hussain
M. Gogate
Amir Hussain
Yu Tsao
Jen-Cheng Hou
SSL
280
18
0
31 Oct 2022
UX-NET: Filter-and-Process-based Improved U-Net for Real-time Time-domain Audio Separation
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Kashyap Patel
A. Kovalyov
Issa Panahi
139
7
0
28 Oct 2022
CasNet: Investigating Channel Robustness for Speech Separation
Fan Wang
Yao-Fei Cheng
Hung-Shin Lee
Yu Tsao
Hsin-Min Wang
113
2
0
27 Oct 2022
Deformable Temporal Convolutional Networks for Monaural Noisy Reverberant Speech Separation
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
William Ravenscroft
Stefan Goetze
Thomas Hain
300
13
0
27 Oct 2022
Individualized Conditioning and Negative Distances for Speaker Separation
International Conference on Machine Learning and Applications (ICMLA), 2022
Tao Sun
Nidal Abuhajar
Shuyu Gong
Zhewei Wang
Charles D. Smith
Xianhui Wang
Li Xu
Jundong Liu
VLM
155
1
0
12 Oct 2022
Speech Enhancement with Perceptually-motivated Optimization and Dual Transformations
Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), 2022
Xucheng Wan
Kai Liu
Z.C. Du
Huan Zhou
113
0
0
24 Sep 2022
Streaming Target-Speaker ASR with Neural Transducer
Interspeech (Interspeech), 2022
Takafumi Moriya
Hiroshi Sato
Tsubasa Ochiai
Marc Delcroix
T. Shinozaki
320
25
0
09 Sep 2022
TF-GridNet: Making Time-Frequency Domain Models Great Again for Monaural Speaker Separation
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Zhong-Qiu Wang
Samuele Cornell
Shukjae Choi
Younglo Lee
Byeonghak Kim
Shinji Watanabe
310
155
0
08 Sep 2022
Music Separation Enhancement with Generative Modeling
International Society for Music Information Retrieval Conference (ISMIR), 2022
N. Schaffer
Boaz Cogan
Ethan Manilow
Max Morrison
Prem Seetharaman
Bryan Pardo
213
11
0
26 Aug 2022
Conv-NILM-Net, a causal and multi-appliance model for energy source separation
Mohamed Alami Chehboune
Jérémie Decock
Rim Kaddah
Jesse Read
133
2
0
03 Aug 2022
Spatial Aware Multi-Task Learning Based Speech Separation
IEEE International Conference on Mobile Adhoc and Sensor Systems (MASS), 2022
Wei Sun
Mei Wang
L. Qiu
110
4
0
20 Jul 2022
ESPnet-SE++: Speech Enhancement for Robust Speech Recognition, Translation, and Understanding
Interspeech (Interspeech), 2022
Yen-Ju Lu
Xuankai Chang
Chenda Li
Wangyou Zhang
Samuele Cornell
...
Robin Scheibler
Zhong-Qiu Wang
Yu Tsao
Y. Qian
Shinji Watanabe
VLM
214
35
0
19 Jul 2022
An Evaluation of Three-Stage Voice Conversion Framework for Noisy and Reverberant Conditions
Interspeech (Interspeech), 2022
Yeonjong Choi
Chao Xie
Tomoki Toda
DiffM
155
4
0
30 Jun 2022
Tiny-Sepformer: A Tiny Time-Domain Transformer Network for Speech Separation
Interspeech (Interspeech), 2022
Jian Luo
Jianzong Wang
Ning Cheng
Edward Xiao
Xulong Zhang
Jing Xiao
ViT
152
13
0
28 Jun 2022
An Empirical Analysis on the Vulnerabilities of End-to-End Speech Segregation Models
Interspeech (Interspeech), 2022
Rahil Parikh
G. Rochette
C. Espy-Wilson
S. Shamma
UQCV
111
0
0
20 Jun 2022
Resource-Efficient Separation Transformer
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Luca Della Libera
Cem Subakan
Mirco Ravanelli
Samuele Cornell
Frédéric Lepoutre
François Grondin
VLM
178
25
0
19 Jun 2022
Simultaneous Speech Extraction for Multiple Target Speakers under the Meeting Scenarios
Bang Zeng
Weiqing Wang
Yuanyuan Bao
Ming Li
133
0
0
17 Jun 2022
Strategies to Improve Robustness of Target Speech Extraction to Enrollment Variations
Interspeech (Interspeech), 2022
Hiroshi Sato
Tsubasa Ochiai
Marc Delcroix
K. Kinoshita
Takafumi Moriya
Naoki Makishima
Mana Ihori
Tomohiro Tanaka
Ryo Masumura
103
6
0
16 Jun 2022
On the Design and Training Strategies for RNN-based Online Neural Speech Separation Systems
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Kai Li
Yi Luo
214
16
0
15 Jun 2022
Joint Training of Speech Enhancement and Self-supervised Model for Noise-robust ASR
Qiu-shi Zhu
Jie Zhang
Zitian Zhang
Lirong Dai
192
18
0
26 May 2022
SepIt: Approaching a Single Channel Speech Separation Bound
Interspeech (Interspeech), 2022
Shahar Lutati
Eliya Nachmani
Lior Wolf
VLM
333
32
0
24 May 2022
A Conformer-based Waveform-domain Neural Acoustic Echo Canceller Optimized for ASR Accuracy
Interspeech (Interspeech), 2022
S. Panchapagesan
A. Narayanan
T. Shabestary
Shuai Shao
N. Howard
Alex Park
James Walker
A. Gruenstein
158
9
0
06 May 2022
Mask scalar prediction for improving robust automatic speech recognition
A. Narayanan
James Walker
S. Panchapagesan
N. Howard
Yuma Koizumi
185
4
0
26 Apr 2022
Receptive Field Analysis of Temporal Convolutional Networks for Monaural Speech Dereverberation
European Signal Processing Conference (EUSIPCO), 2022
William Ravenscroft
Stefan Goetze
Thomas Hain
138
9
0
13 Apr 2022
The Rise and Fall of Robotic World (A case study of WALL-E)
Faisal Ghaffar
51
0
0
08 Apr 2022
tPLCnet: Real-time Deep Packet Loss Concealment in the Time Domain Using a Short Temporal Context
Interspeech (Interspeech), 2022
Nils L. Westhausen
B. Meyer
165
9
0
04 Apr 2022
Improving Target Sound Extraction with Timestamp Information
Interspeech (Interspeech), 2022
Helin Wang
Dongchao Yang
Chao Weng
Jianwei Yu
Yuexian Zou
200
13
0
02 Apr 2022
End-to-End Integration of Speech Recognition, Speech Enhancement, and Self-Supervised Learning Representation
Interspeech (Interspeech), 2022
Xuankai Chang
Takashi Maekaku
Yuya Fujita
Shinji Watanabe
VLM
266
58
0
01 Apr 2022
Disentangling the Impacts of Language and Channel Variability on Speech Separation Networks
Interspeech (Interspeech), 2022
Fan Wang
Hung-Shin Lee
Yu Tsao
Hsin-Min Wang
139
8
0
30 Mar 2022
Embedding Recurrent Layers with Dual-Path Strategy in a Variant of Convolutional Network for Speaker-Independent Speech Separation
Interspeech (Interspeech), 2022
Xue Yang
C. Bao
130
3
0
25 Mar 2022
Harmonicity Plays a Critical Role in DNN Based Versus in Biologically-Inspired Monaural Speech Segregation Systems
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Rahil Parikh
Ilya Kavalerov
C. Espy-Wilson
Shihab Shamma
103
3
0
08 Mar 2022
Deep Impulse Responses: Estimating and Parameterizing Filters with Deep Networks
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Alexander Richard
Peter Dodds
V. Ithapu
174
42
0
07 Feb 2022
Exploring Self-Attention Mechanisms for Speech Separation
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022
Cem Subakan
Mirco Ravanelli
Samuele Cornell
François Grondin
Mirko Bronzi
234
37
0
06 Feb 2022
New Insights on Target Speaker Extraction
Mohamed Elminshawi
Wolfgang Mack
Srikanth Raj Chetupalli
Soumitro Chakrabarty
Emanuel Habets
258
23
0
01 Feb 2022
SkiM: Skipping Memory LSTM for Low-Latency Real-Time Continuous Speech Separation
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Chenda Li
Lei Yang
Weiqin Wang
Y. Qian
315
34
0
26 Jan 2022
Learning to Enhance or Not: Neural Network-Based Switching of Enhanced and Observed Signals for Overlapping Speech Recognition
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Hiroshi Sato
Tsubasa Ochiai
Marc Delcroix
K. Kinoshita
Naoyuki Kamo
Takafumi Moriya
155
30
0
11 Jan 2022
Discretization and Re-synthesis: an alternative method to solve the Cocktail Party Problem
Jing Shi
Xuankai Chang
Tomoki Hayashi
Yen-Ju Lu
Shinji Watanabe
Bo Xu
221
19
0
17 Dec 2021
Zero-shot Audio Source Separation through Query-based Learning from Weakly-labeled Data
Ke Chen
Xingjian Du
Bilei Zhu
Zejun Ma
Taylor Berg-Kirkpatrick
Shlomo Dubnov
327
56
0
15 Dec 2021
Hybrid Neural Networks for On-device Directional Hearing
Anran Wang
Maruchi Kim
Hao Zhang
Shyamnath Gollakota
176
18
0
11 Dec 2021
A Time-domain Real-valued Generalized Wiener Filter for Multi-channel Neural Separation Systems
Yi Luo
244
17
0
07 Dec 2021
Speech Separation Using an Asynchronous Fully Recurrent Convolutional Neural Network
Neural Information Processing Systems (NeurIPS), 2021
Xiaolin Hu
Kai Li
Weiyi Zhang
Yi Luo
Jean-Marie Lemercier
Timo Gerkmann
157
61
0
04 Dec 2021
BLOOM-Net: Blockwise Optimization for Masking Networks Toward Scalable and Efficient Speech Enhancement
Sunwoo Kim
Minje Kim
602
8
0
17 Nov 2021
MoRe-Fi: Motion-robust and Fine-grained Respiration Monitoring via Deep-Learning UWB Radar
ACM International Conference on Embedded Networked Sensor Systems (SenSys), 2021
Tianyue Zheng
Zhe Chen
Shujie Zhang
Chao Cai
Jun Luo
265
126
0
16 Nov 2021
Monaural source separation: From anechoic to reverberant environments
International Workshop on Acoustic Signal Enhancement (IWAENC), 2021
Tobias Cord-Landwehr
Christoph Boeddeker
Thilo von Neumann
Catalin Zorila
R. Doddipatla
Reinhold Haeb-Umbach
162
32
0
15 Nov 2021
Inter-channel Conv-TasNet for multichannel speech enhancement
Dongheon Lee
Seongrae Kim
Jung-Woo Choi
126
15
0
08 Nov 2021
Weight, Block or Unit? Exploring Sparsity Tradeoffs for Speech Enhancement on Tiny Neural Accelerators
Marko Stamenovic
Nils L. Westhausen
Li-Chia Yang
Carl R. Jensen
Alex Pawlicki
178
13
0
03 Nov 2021
Reduction of Subjective Listening Effort for TV Broadcast Signals with Recurrent Neural Networks
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2021
Nils L. Westhausen
R. Huber
Hannah Baumgartner
Ragini Sinha
J. Rennies
B. Meyer
133
10
0
02 Nov 2021
Real-time Speaker counting in a cocktail party scenario using Attention-guided Convolutional Neural Network
Interspeech (Interspeech), 2021
Midia Yousefi
John H. L. Hansen
109
10
0
30 Oct 2021
Cross-attention conformer for context modeling in speech enhancement for ASR
Automatic Speech Recognition & Understanding (ASRU), 2021
A. Narayanan
Chung-Cheng Chiu
Tom O'Malley
Quan Wang
Yanzhang He
185
16
0
30 Oct 2021