Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1711.00541
Cited By
v1
v2 (latest)
TasNet: time-domain audio separation network for real-time, single-channel speech separation
1 November 2017
Yi Luo
N. Mesgarani
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"TasNet: time-domain audio separation network for real-time, single-channel speech separation"
50 / 283 papers shown
Probing Self-supervised Learning Models with Target Speech Extraction
Junyi Peng
Marc Delcroix
Tsubasa Ochiai
Oldrich Plchot
Takanori Ashihara
Shoko Araki
J. Černocký
266
6
0
17 Feb 2024
Unrestricted Global Phase Bias-Aware Single-channel Speech Enhancement with Conformer-based Metric GAN
Shiqi Zhang
Zheng Qiu
Daiki Takeuchi
Noboru Harada
Shoji Makino
214
9
0
13 Feb 2024
Boosting Unknown-number Speaker Separation with Transformer Decoder-based Attractor
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024
Younglo Lee
Shukjae Choi
Byeonghak Kim
Zhong-Qiu Wang
Shinji Watanabe
MoE
155
19
0
23 Jan 2024
Consistent and Relevant: Rethink the Query Embedding in General Sound Separation
Yuanyuan Wang
Hangting Chen
Dongchao Yang
Jianwei Yu
Chao Weng
Zhiyong Wu
Helen M. Meng
172
7
0
24 Dec 2023
MossFormer2: Combining Transformer and RNN-Free Recurrent Network for Enhanced Time-Domain Monaural Speech Separation
Shengkui Zhao
Yukun Ma
Chongjia Ni
Chong Zhang
Hao Wang
Trung Hieu Nguyen
Kun Zhou
J. Yip
Dianwen Ng
Bin Ma
281
61
0
19 Dec 2023
Amphion: An Open-Source Audio, Music and Speech Generation Toolkit
Spoken Language Technology Workshop (SLT), 2023
Xueyao Zhang
Liumeng Xue
Yicheng Gu
Yuancheng Wang
Haorui He
...
Mingxuan Wang
Jun Han
Kai Chen
Haizhou Li
Zhizheng Wu
262
58
0
15 Dec 2023
LSTM-CNN Network for Audio Signature Analysis in Noisy Environments
Praveen Damacharla
Hamid Rajabalipanah
M. Fakheri
102
2
0
12 Dec 2023
Subspace Hybrid MVDR Beamforming for Augmented Hearing
S. Hafezi
Alastair H. Moore
Pierre Guiraud
Patrick A. Naylor
Jacob Donley
V. Tourbabin
Thomas Lunner
117
1
0
30 Nov 2023
Improving Label Assignments Learning by Dynamic Sample Dropout Combined with Layer-wise Optimization in Speech Separation
Interspeech (Interspeech), 2023
Chenyu Gao
Yue Gu
I. Marsic
292
0
0
20 Nov 2023
LAVSS: Location-Guided Audio-Visual Spatial Audio Separation
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Yuxin Ye
Wenming Yang
Yapeng Tian
217
12
0
31 Oct 2023
Audio-Visual Speaker Tracking: Progress, Challenges, and Future Directions
Jinzheng Zhao
Yong-mei Xu
Xinyuan Qian
Davide Berghi
Peipei Wu
Meng Cui
Jianyuan Sun
Philip J. B. Jackson
Wenwu Wang
BDL
642
9
0
23 Oct 2023
Real-time Speech Enhancement and Separation with a Unified Deep Neural Network for Single/Dual Talker Scenarios
Kashyap Patel
A. Kovalyov
Issa Panahi
214
1
0
16 Oct 2023
On Time Domain Conformer Models for Monaural Speech Separation in Noisy Reverberant Acoustic Environments
Automatic Speech Recognition & Understanding (ASRU), 2023
William Ravenscroft
Stefan Goetze
Thomas Hain
177
7
0
09 Oct 2023
An Exploration of Task-decoupling on Two-stage Neural Post Filter for Real-time Personalized Acoustic Echo Cancellation
Automatic Speech Recognition & Understanding (ASRU), 2023
Zihan Zhang
Jiayao Sun
Xianjun Xia
Ziqian Wang
Xiaopeng Yan
Yijian Xiao
Lei Xie
154
0
0
07 Oct 2023
Unravel Anomalies: An End-to-end Seasonal-Trend Decomposition Approach for Time Series Anomaly Detection
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Zhenwei Zhang
Ruiqi Wang
Ran Ding
Yuantao Gu
247
7
0
30 Sep 2023
RTFS-Net: Recurrent Time-Frequency Modelling for Efficient Audio-Visual Speech Separation
International Conference on Learning Representations (ICLR), 2023
Samuel Pegg
Kai Li
Xiaolin Hu
418
8
0
29 Sep 2023
Unifying Robustness and Fidelity: A Comprehensive Study of Pretrained Generative Methods for Speech Enhancement in Adverse Conditions
Heming Wang
Meng Yu
Huatian Zhang
Chunlei Zhang
Zhongweiyang Xu
Muqiao Yang
Yixuan Zhang
Dong Yu
225
3
0
16 Sep 2023
A Generalized Bandsplit Neural Network for Cinematic Audio Source Separation
IEEE Open Journal of Signal Processing (IEEE Open J. Signal Process.), 2023
Karn N. Watcharasupat
Chih-Wei Wu
Yiwei Ding
Iroro Orife
Aaron J. Hipple
Aaron J. Hipple. Phillip A. Williams
Scott Kramer
Alexander Lerch
W. Wolcott
297
7
0
05 Sep 2023
Blind Source Separation of Single-Channel Mixtures via Multi-Encoder Autoencoders
Matthew B. Webster
Joonnyong Lee
328
1
0
31 Aug 2023
Sparks of Large Audio Models: A Survey and Outlook
S. Latif
Moazzam Shoukat
Fahad Shamshad
Muhammad Usama
Yi Ren
...
Wenwu Wang
Xulong Zhang
Roberto Togneri
Xiaoshi Zhong
Björn W. Schuller
LM&MA
AuLLM
676
52
0
24 Aug 2023
Convoifilter: A case study of doing cocktail party speech recognition
Thai-Binh Nguyen
A. Waibel
235
2
0
22 Aug 2023
Audio-visual video-to-speech synthesis with synthesized input audio
Triantafyllos Kefalas
Yannis Panagakis
Maja Pantic
VGen
DiffM
281
1
0
31 Jul 2023
Monaural Multi-Speaker Speech Separation Using Efficient Transformer Model
Sankalpa Rijal
Rajan Neupane
Saroj Prasad Mainali
Shishir K. Regmi
Shanta Maharjan
211
0
0
29 Jul 2023
An Efficient Speech Separation Network Based on Recurrent Fusion Dilated Convolution and Channel Attention
Interspeech (Interspeech), 2023
Junyu Wang
110
6
0
09 Jun 2023
Towards Solving Cocktail-Party: The First Method to Build a Realistic Dataset with Ground Truths for Speech Separation
Rawad Melhem
Assef Jafar
Oumayma Al Dakkak
161
1
0
25 May 2023
Speech Separation based on Contrastive Learning and Deep Modularization
Peter Ochieng
SSL
271
0
0
18 May 2023
BASEN: Time-Domain Brain-Assisted Speech Enhancement Network with Convolutional Cross Attention in Multi-talker Conditions
Interspeech (Interspeech), 2023
Jie Zhang
Qingquan Xu
Qiu-shi Zhu
Zhenhua Ling
166
19
0
17 May 2023
ForkNet: Simultaneous Time and Time-Frequency Domain Modeling for Speech Enhancement
Feng Dang
Qi Hu
Pengyuan Zhang
Yonghong Yan
187
4
0
15 May 2023
Universal Source Separation with Weakly Labelled Data
Qiuqiang Kong
Kai Chen
Haohe Liu
Xingjian Du
Taylor Berg-Kirkpatrick
Shlomo Dubnov
Mark D. Plumbley
184
35
0
11 May 2023
Adversarial Generative NMF for Single Channel Source Separation
Martin Ludvigsen
M. Grasmair
GAN
134
0
0
24 Apr 2023
Wav2code: Restore Clean Speech Representations via Codebook Lookup for Noise-Robust ASR
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2023
Yuchen Hu
Cheng Chen
Qiu-shi Zhu
Eng Siong Chng
298
18
0
11 Apr 2023
Pac-HuBERT: Self-Supervised Music Source Separation via Primitive Auditory Clustering and Hidden-Unit BERT
Kai Chen
Gordon Wichern
Franccois G. Germain
Jonathan Le Roux
AI4TS
169
0
0
04 Apr 2023
End-to-End Integration of Speech Separation and Voice Activity Detection for Low-Latency Diarization of Telephone Conversations
Speech Communication (Speech Commun.), 2023
Giovanni Morrone
Samuele Cornell
L. Serafini
Enrico Zovato
Alessio Brutti
S. Squartini
320
5
0
21 Mar 2023
Multi-Channel Masking with Learnable Filterbank for Sound Source Separation
Wang Dai
Archontis Politis
Maria Sandsten
133
0
0
14 Mar 2023
Multi-Microphone Speaker Separation by Spatial Regions
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Julian Wechsler
Srikanth Raj Chetupalli
Wolfgang Mack
Emanuel Habets
157
13
0
13 Mar 2023
Improving the Intent Classification accuracy in Noisy Environment
Mohamed Nabih Ali
Alessio Brutti
Daniele Falavigna
119
1
0
12 Mar 2023
Distribution Preserving Source Separation With Time Frequency Predictive Models
European Signal Processing Conference (EUSIPCO), 2023
Pedro J. Villasana T
J. Klejsa
Lars Villemoes
P. Hedelin
165
2
0
10 Mar 2023
MossFormer: Pushing the Performance Limit of Monaural Speech Separation using Gated Single-Head Transformer with Convolution-Augmented Joint Self-Attentions
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Shengkui Zhao
Bin Ma
213
74
0
23 Feb 2023
DasFormer: Deep Alternating Spectrogram Transformer for Multi/Single-Channel Speech Separation
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Shuo Wang
Xiangyu Kong
Xiulian Peng
H. Movassagh
Vinod Prakash
Yan Lu
151
14
0
21 Feb 2023
A Sidecar Separator Can Convert a Single-Talker Speech Recognition System to a Multi-Talker One
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Lingwei Meng
Jiawen Kang
Mingyu Cui
Yuejiao Wang
Xixin Wu
Helen M. Meng
185
21
0
20 Feb 2023
Hypernetworks build Implicit Neural Representations of Sounds
Filip Szatkowski
Karol J. Piczak
Przemtslaw Spurek
Jacek Tabor
Tomasz Trzciñski
509
16
0
09 Feb 2023
Separate And Diffuse: Using a Pretrained Diffusion Model for Improving Source Separation
Shahar Lutati
Eliya Nachmani
Lior Wolf
DiffM
211
18
0
25 Jan 2023
Perceive and predict: self-supervised speech representation based loss functions for speech enhancement
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
George Close
William Ravenscroft
Thomas Hain
Stefan Goetze
SSL
198
16
0
11 Jan 2023
ReVISE: Self-Supervised Speech Resynthesis with Visual Input for Universal and Generalized Speech Enhancement
Wei-Ning Hsu
Tal Remez
Bowen Shi
Jacob Donley
Yossi Adi
DiffM
233
14
0
21 Dec 2022
DeFT-AN: Dense Frequency-Time Attentive Network for Multichannel Speech Enhancement
IEEE Signal Processing Letters (SPL), 2022
Dongheon Lee
Jung-Woo Choi
321
42
0
15 Dec 2022
Multi-Scale Feature Fusion Transformer Network for End-to-End Single Channel Speech Separation
Yinhao Xu
Jian Zhou
L. Tao
H. Kwan
150
0
0
14 Dec 2022
NBC2: Multichannel Speech Separation with Revised Narrow-band Conformer
Changsheng Quan
Xiaofei Li
155
5
0
05 Dec 2022
Deep neural network techniques for monaural speech enhancement: state of the art analysis
Artificial Intelligence Review (Artif Intell Rev), 2022
P. Ochieng
269
35
0
01 Dec 2022
TF-GridNet: Integrating Full- and Sub-Band Modeling for Speech Separation
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022
Zhongqiu Wang
Samuele Cornell
Shukjae Choi
Younglo Lee
Byeonghak Kim
Shinji Watanabe
297
201
0
22 Nov 2022
Handling Trade-Offs in Speech Separation with Sparsely-Gated Mixture of Experts
Xiaofei Wang
Zhuo Chen
Yu Shi
Jian Wu
Naoyuki Kanda
Takuya Yoshioka
MoE
169
2
0
11 Nov 2022
Previous
1
2
3
4
5
6
Next