Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1711.00541
Cited By
v1
v2 (latest)
TasNet: time-domain audio separation network for real-time, single-channel speech separation
1 November 2017
Yi Luo
N. Mesgarani
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"TasNet: time-domain audio separation network for real-time, single-channel speech separation"
50 / 283 papers shown
SA-SDR: A novel loss function for separation of meeting style data
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Thilo von Neumann
K. Kinoshita
Christoph Boeddeker
Marc Delcroix
Reinhold Haeb-Umbach
197
29
0
29 Oct 2021
Closing the Gap Between Time-Domain Multi-Channel Speech Enhancement on Real and Simulation Conditions
Wangyou Zhang
Jing Shi
Chenda Li
Shinji Watanabe
Y. Qian
147
24
0
27 Oct 2021
Progressive Learning for Stabilizing Label Selection in Speech Separation with Mapping-based Method
Chenyang Gao
Yue Gu
I. Marsic
248
0
0
20 Oct 2021
Singer separation for karaoke content generation
Hsuan-Yu Chen
Xuan-Bo Chen
J. Jang
130
0
0
13 Oct 2021
SDR -- Medium Rare with Fast Computations
Robin Scheibler
237
23
0
13 Oct 2021
All-neural beamformer for continuous speech separation
Zhuohuang Zhang
Takuya Yoshioka
Naoyuki Kanda
Zhuo Chen
Xiaofei Wang
Dongmei Wang
Sefik Emre Eskimez
243
19
0
13 Oct 2021
Stepwise-Refining Speech Separation Network via Fine-Grained Encoding in High-order Latent Domain
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2021
Zengwei Yao
Wenjie Pei
Fanglin Chen
Guangming Lu
David C. Zhang
194
14
0
10 Oct 2021
Mean absorption estimation from room impulse responses using virtually supervised learning
Journal of the Acoustical Society of America (JASA), 2021
C´edric Foy
Antoine Deleforge
Diego Di Carlo
125
19
0
01 Sep 2021
Learning Sparse Analytic Filters for Piano Transcription
Frank Cwitkowitz
M. Heydari
Z. Duan
288
2
0
23 Aug 2021
Graph-PIT: Generalized permutation invariant training for continuous separation of arbitrary numbers of speakers
Interspeech (Interspeech), 2021
Thilo von Neumann
K. Kinoshita
Christoph Boeddeker
Marc Delcroix
Reinhold Haeb-Umbach
107
24
0
30 Jul 2021
Speeding Up Permutation Invariant Training for Source Separation
ITG Conference on Speech Communication (ITG), 2021
Thilo von Neumann
Christoph Boeddeker
K. Kinoshita
Marc Delcroix
Reinhold Haeb-Umbach
150
6
0
30 Jul 2021
Multi-Task Audio Source Separation
Lu Zhang
Chenxing Li
Feng Deng
Xiaorui Wang
135
13
0
14 Jul 2021
Towards Neural Diarization for Unlimited Numbers of Speakers Using Global and Local Attractors
Shota Horiguchi
Shinji Watanabe
Leibny Paola García-Perera
Yawen Xue
Yuki Takashima
Yohei Kawaguchi
193
45
0
04 Jul 2021
Audiovisual Singing Voice Separation
Transactions of the International Society for Music Information Retrieval (TISMIR), 2021
Bochen Li
Yuxuan Wang
Z. Duan
161
7
0
01 Jul 2021
Online Self-Attentive Gated RNNs for Real-Time Speaker Separation
Ori Kabeli
Yossi Adi
Zhenyu Tang
Buye Xu
Anurag Kumar
109
2
0
25 Jun 2021
Basis-MelGAN: Efficient Neural Vocoder Based on Audio Decomposition
Zhengxi Liu
Y. Qian
DRL
108
12
0
25 Jun 2021
Deep neural network Based Low-latency Speech Separation with Asymmetric analysis-Synthesis Window Pair
European Signal Processing Conference (EUSIPCO), 2021
Shanshan Wang
Gaurav Naithani
Archontis Politis
Maria Sandsten
113
12
0
22 Jun 2021
Multi-accent Speech Separation with One Shot Learning
Kuan-Po Huang
Yuan-Kuei Wu
Hung-yi Lee
194
4
0
22 Jun 2021
Encoder-Decoder Based Attractors for End-to-End Neural Diarization
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2021
Shota Horiguchi
Yusuke Fujita
Shinji Watanabe
Yawen Xue
Leibny Paola García-Perera
213
83
0
20 Jun 2021
Independent Deeply Learned Tensor Analysis for Determined Audio Source Separation
European Signal Processing Conference (EUSIPCO), 2021
Naoki Narisawa
Rintaro Ikeshita
Norihiro Takamune
Daichi Kitamura
Tomohiko Nakamura
Hiroshi Saruwatari
Tomohiro Nakatani
88
1
0
10 Jun 2021
Lightweight Dual-channel Target Speaker Separation for Mobile Voice Communication
Yuanyuan Bao
Yanze Xu
Na Xu
Wenjing Yang
Hongfeng Li
Shicong Li
Y. Jia
Fei Xiang
Jincheng He
Ming Li
199
1
0
05 Jun 2021
Many-Speakers Single Channel Speech Separation with Optimal Permutation Training
Interspeech (Interspeech), 2021
Shaked Dovrat
Eliya Nachmani
Lior Wolf
VLM
366
25
0
18 Apr 2021
Time-domain Speech Enhancement with Generative Adversarial Learning
Feiyang Xiao
Jian Guan
Qiuqiang Kong
Wenwu Wang
GAN
245
9
0
30 Mar 2021
On TasNet for Low-Latency Single-Speaker Speech Enhancement
Morten Kolbæk
Zheng-Hua Tan
S. H. Jensen
Jesper Jensen
176
2
0
27 Mar 2021
Looking into Your Speech: Learning Cross-modal Affinity for Audio-visual Speech Separation
Computer Vision and Pattern Recognition (CVPR), 2021
Jiyoung Lee
Soo-Whan Chung
Sunok Kim
Hong-Goo Kang
Kwanghoon Sohn
173
59
0
25 Mar 2021
Blind Speech Separation and Dereverberation using Neural Beamforming
Speech Communication (Speech Commun.), 2021
Lukas Pfeifenberger
Franz Pernkopf
140
5
0
24 Mar 2021
Tune-In: Training Under Negative Environments with Interference for Attention Networks Simulating Cocktail Party Effect
AAAI Conference on Artificial Intelligence (AAAI), 2021
Jun Wang
Max W. Y. Lam
Jane Polak Scowcroft
Dong Yu
126
7
0
02 Mar 2021
Sandglasset: A Light Multi-Granularity Self-attentive Network For Time-Domain Speech Separation
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Max W. Y. Lam
Jun Wang
Jane Polak Scowcroft
Dong Yu
AI4TS
189
56
0
01 Mar 2021
Dual-Path Modeling for Long Recording Speech Separation in Meetings
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Chenda Li
Zhuo Chen
Yi Luo
Cong Han
Tianyan Zhou
K. Kinoshita
Marc Delcroix
Shinji Watanabe
Y. Qian
123
11
0
23 Feb 2021
TransMask: A Compact and Fast Speech Separation Model Based on Transformer
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Zining Zhang
Bingsheng He
Zhenjie Zhang
136
23
0
19 Feb 2021
Multichannel-based learning for audio object extraction
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Daniel Arteaga
Jordi Pons
DiffM
263
3
0
11 Feb 2021
Multimodal Attention Fusion for Target Speaker Extraction
Spoken Language Technology Workshop (SLT), 2021
Hiroshi Sato
Tsubasa Ochiai
K. Kinoshita
Marc Delcroix
Tomohiro Nakatani
S. Araki
101
32
0
02 Feb 2021
Effective Low-Cost Time-Domain Audio Separation Using Globally Attentive Locally Recurrent Networks
Spoken Language Technology Workshop (SLT), 2021
Max W. Y. Lam
Jun Wang
Jane Polak Scowcroft
Dong Yu
231
32
0
13 Jan 2021
Neural Network-based Virtual Microphone Estimator
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Tsubasa Ochiai
Marc Delcroix
Tomohiro Nakatani
Rintaro Ikeshita
K. Kinoshita
S. Araki
109
12
0
12 Jan 2021
Multi-channel Multi-frame ADL-MVDR for Target Speech Separation
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2020
Z. Zhang
Yong-mei Xu
Meng Yu
Shi-Xiong Zhang
Lianwu Chen
Donald Williamson
Dong Yu
159
34
0
24 Dec 2020
The 2020 ESPnet update: new features, broadened applications, performance improvements, and future plans
Shinji Watanabe
Florian Boyer
Xuankai Chang
Pengcheng Guo
Tomoki Hayashi
...
Shigeki Karita
Chenda Li
Jing Shi
Aswin Shanmugam Subramanian
Wangyou Zhang
VLM
198
39
0
23 Dec 2020
Group Communication with Context Codec for Lightweight Source Separation
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2020
Yi Luo
Cong Han
N. Mesgarani
222
21
0
14 Dec 2020
Convolutive Transfer Function Invariant SDR training criteria for Multi-Channel Reverberant Speech Separation
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
Christoph Boeddeker
Wangyou Zhang
Tomohiro Nakatani
K. Kinoshita
Tsubasa Ochiai
Marc Delcroix
Naoyuki Kamo
Y. Qian
Reinhold Haeb-Umbach
183
33
0
30 Nov 2020
A comparison of handcrafted, parameterized, and learnable features for speech separation
Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), 2020
Wenbo Zhu
Mou Wang
Xiao-Lei Zhang
S. Rahardja
189
5
0
29 Nov 2020
Streaming end-to-end multi-talker speech recognition
IEEE Signal Processing Letters (IEEE SPL), 2020
Liang Lu
Naoyuki Kanda
Jinyu Li
Jiawei Liu
241
53
0
26 Nov 2020
Multi-Decoder DPRNN: High Accuracy Source Counting and Separation
Junzhe Zhu
Raymond A. Yeh
M. Hasegawa-Johnson
101
4
0
24 Nov 2020
Streaming Multi-speaker ASR with RNN-T
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
Ilya Sklyar
A. Piunova
Yulan Liu
206
39
0
23 Nov 2020
Rethinking the Separation Layers in Speech Separation Networks
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
Yi Luo
Zhuo Chen
Cong Han
Chenda Li
Tianyan Zhou
N. Mesgarani
108
11
0
17 Nov 2020
Surrogate Source Model Learning for Determined Source Separation
Robin Scheibler
M. Togami
169
27
0
11 Nov 2020
Informed Source Extraction With Application to Acoustic Echo Reduction
Mohamed Elminshawi
Wolfgang Mack
Emanuel Habets
228
2
0
09 Nov 2020
ESPnet-se: end-to-end speech enhancement and separation toolkit designed for asr integration
Chenda Li
Jing Shi
Wangyou Zhang
Aswin Shanmugam Subramanian
Xuankai Chang
...
Moto Hira
Tomoki Hayashi
Christoph Boeddeker
Zhuo Chen
Shinji Watanabe
VLM
210
90
0
07 Nov 2020
Phase Aware Speech Enhancement using Realisation of Complex-valued LSTM
Raktim Gautam Goswami
Sivaganesh Andhavarapu
Rama Murty
174
2
0
27 Oct 2020
Attention is All You Need in Speech Separation
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
Cem Subakan
Mirco Ravanelli
Samuele Cornell
Mirko Bronzi
Jianyuan Zhong
301
703
0
25 Oct 2020
Training Noisy Single-Channel Speech Separation With Noisy Oracle Sources: A Large Gap and A Small Step
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
Matthew Maciejewski
Jing Shi
Shinji Watanabe
Sanjeev Khudanpur
168
11
0
23 Oct 2020
Towards Listening to 10 People Simultaneously: An Efficient Permutation Invariant Training of Audio Source Separation Using Sinkhorn's Algorithm
Hideyuki Tachibana
255
15
0
22 Oct 2020
Previous
1
2
3
4
5
6
Next