1711.00541
Cited By
v1
v2 (latest)
TasNet: time-domain audio separation network for real-time, single-channel speech separation
1 November 2017
Yi Luo
N. Mesgarani
Papers citing
"TasNet: time-domain audio separation network for real-time, single-channel speech separation"
50 / 283 papers shown
Continual Learning for Singing Voice Separation with Human in the Loop Adaptation
Ankur Gupta
Anshul Rai
Archit Bansal
Vipul Arora
VLM
283
0
0
02 Dec 2025
Evaluating Objective Speech Quality Metrics for Neural Audio Codecs
Luca A. Lanzendörfer
Florian Grötschla
69
0
0
24 Nov 2025
Towards Practical Real-Time Low-Latency Music Source Separation
IEEE International Conference on Multimedia and Expo (ICME), 2025
Junyu Wu
Jie Liu
Tianrui Pan
J. Tang
Gangshan Wu
111
0
0
17 Nov 2025
SAO-Instruct: Free-form Audio Editing using Natural Language Instructions
Michael Ungersböck
Florian Grötschla
Luca A. Lanzendörfer
June Young Yi
Changho Choi
Roger Wattenhofer
AuLLM
161
1
0
26 Oct 2025
ReFESS-QI: Reference-Free Evaluation For Speech Separation With Joint Quality And Intelligibility Scoring
Ari Frummer
Helin Wang
Tianyu Cao
Adi Arbel
Yuval Sieradzki
Oren Gal
Jesus Villalba
Thomas Thebaud
Najim Dehak
121
0
0
23 Oct 2025
MARS-Sep: Multimodal-Aligned Reinforced Sound Separation
Zihan Zhang
Xize Cheng
Zhennan Jiang
Dongjie Fu
Jingyuan Chen
Zhou Zhao
Tao Jin
96
0
0
12 Oct 2025
Multi-bit Audio Watermarking
Luca A. Lanzendörfer
Kyle Fearne
Florian Grötschla
Roger Wattenhofer
113
0
0
02 Oct 2025
Unsupervised Single-Channel Speech Separation with a Diffusion Prior under Speaker-Embedding Guidance
Runwu Shi
Kai Li
Chang Li
Jiang Wang
Sihan Tan
Kazuhiro Nakadai
DiffM
76
0
0
29 Sep 2025
Neural Speech Separation with Parallel Amplitude and Phase Spectrum Estimation
Fei Liu
Yang Ai
Zhen-Hua Ling
117
0
0
17 Sep 2025
A Lightweight Architecture for Multi-instrument Transcription with Practical Optimizations
Ruigang Li
Yongxu Zhu
73
0
0
16 Sep 2025
A Study of the Scale Invariant Signal to Distortion Ratio in Speech Separation with Noisy References
Simon Dahl Jepsen
M. G. Christensen
Jesper Rindom Jensen
104
1
0
20 Aug 2025
Advances in Speech Separation: Techniques, Challenges, and Future Trends
Kai Li
Guo Chen
Wendi Sang
Yi Luo
Zhuo Chen
...
Shulin He
Zhong-Qiu Wang
Andong Li
Z. Wu
Xiaolin Hu
AI4TS
119
4
0
14 Aug 2025
Nonlinear Framework for Speech Bandwidth Extension
Tarikul Islam Tamiti
Nursad Mamun
Anomadarshi Barua
176
0
0
21 Jul 2025
Knowing When to Quit: Probabilistic Early Exits for Speech Separation
Kenny Falkær Olsen
Mads Østergaard
Karl Ulbæk
S. F. V. Nielsen
Rasmus Malik Høegh Lindrup
Bjørn Sand Jensen
Morten Mørup
UQCV
248
0
0
13 Jul 2025
EDNet: A Distortion-Agnostic Speech Enhancement Framework with Gating Mamba Mechanism and Phase Shift-Invariant Training
Doyeop Kwak
Youngjoon Jang
Seongyu Kim
Joon Son Chung
123
2
0
19 Jun 2025
SpeechRefiner: Towards Perceptual Quality Refinement for Front-End Algorithms
Sirui Li
Shuai Wang
Zhijun Liu
Zhongjie Jiang
Yannan Wang
Haizhou Li
152
1
0
16 Jun 2025
Local Equivariance Error-Based Metrics for Evaluating Sampling-Frequency-Independent Property of Neural Network
Kanami Imamura
Tomohiko Nakamura
Norihiro Takamune
Kohei Yatabe
Hiroshi Saruwatari
153
0
0
04 Jun 2025
How Far Are We from Generating Missing Modalities with Foundation Models?
Guanzhou Ke
Yi Xie
Xiaoli Wang
Guoqing Chao
Bo Wang
VLM
303
0
0
04 Jun 2025
Uni-VERSA: Versatile Speech Assessment with a Unified Network
Jiatong Shi
Hye-jin Shim
Shinji Watanabe
217
3
0
27 May 2025
Time-Frequency-Based Attention Cache Memory Model for Real-Time Speech Separation
Guo Chen
Kai Li
Runxuan Yang
Xiaolin Hu
AI4TS
239
1
0
19 May 2025
Survey of End-to-End Multi-Speaker Automatic Speech Recognition for Monaural Audio
Xinlu He
Jacob Whitehill
214
4
0
16 May 2025
Knowledge Distillation for Speech Denoising by Latent Representation Alignment with Cosine Distance
Diep Luong
Mikko Heikkinen
Konstantinos Drossos
Maria Sandsten
360
0
0
06 May 2025
A Comparative Study on Positional Encoding for Time-frequency Domain Dual-path Transformer-based Source Separation Models
Kohei Saijo
Tetsuji Ogawa
295
3
0
28 Apr 2025
Passive Underwater Acoustic Signal Separation based on Feature Decoupling Dual-path Network
Yucheng Liu
Longyu Jiang
237
1
0
11 Apr 2025
DeepExtractor: Time-domain reconstruction of signals and glitches in gravitational wave data with deep learning
Tom Dooney
Harsh Narola
Stefano Bromuri
R. L. Curier
C. Broeck
Sarah Caudill
D. Tan
384
2
0
30 Jan 2025
EDSep: An Effective Diffusion-Based Method for Speech Source Separation
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2025
Jinwei Dong
Xinsheng Wang
Qirong Mao
323
5
0
28 Jan 2025
SoundSpring: Loss-Resilient Audio Transceiver with Dual-Functional Masked Language Modeling
IEEE Journal on Selected Areas in Communications (JSAC), 2025
Shengshi Yao
Jincheng Dai
Xiaoqi Qin
Sixian Wang
Siye Wang
K. Niu
Ping Zhang
333
2
0
22 Jan 2025
Gen-A: Generalizing Ambisonics Neural Encoding to Unseen Microphone Arrays
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2025
Mikko Heikkinen
Archontis Politis
Konstantinos Drossos
Maria Sandsten
AI4CE
116
1
0
14 Jan 2025
Evaluating the Impact of Discriminative and Generative E2E Speech Enhancement Models on Syllable Stress Preservation
Rangavajjala Sankara Bharadwaj
Jhansi Mallela
Sai Harshitha Aluru
Chiranjeevi Yarra
188
1
0
11 Dec 2024
Modulating State Space Model with SlowFast Framework for Compute-Efficient Ultra Low-Latency Speech Enhancement
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024
Longbiao Cheng
Ashutosh Pandey
Buye Xu
T. Delbruck
V. Ithapu
Shih-Chii Liu
209
4
0
04 Nov 2024
SepMamba: State-space models for speaker separation using Mamba
Thor Højhus Avenstrup
Boldizsár Elek
István László Mádi
András Bence Schin
Morten Mørup
Bjørn Sand Jensen
Kenny Falkær Olsen
Mamba
185
4
0
28 Oct 2024
OpenSep: Leveraging Large Language Models with Textual Inversion for Open World Audio Separation
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Tanvir Mahmud
Diana Marculescu
VLM
206
3
0
28 Sep 2024
DeWinder: Single-Channel Wind Noise Reduction using Ultrasound Sensing
Interspeech, 2024
Kuang Yuan
Shuo Han
Swarun Kumar
Bhiksha Raj
137
3
0
10 Sep 2024
USEF-TSE: Universal Speaker Embedding Free Target Speaker Extraction
IEEE Transactions on Audio, Speech, and Language Processing (TASLP), 2024
Bang Zeng
Ming Li
423
14
0
04 Sep 2024
Spectron: Target Speaker Extraction using Conditional Transformer with Adversarial Refinement
Tathagata Bandyopadhyay
ViT
251
0
0
02 Sep 2024
DPSNN: Spiking Neural Network for Low-Latency Streaming Speech Enhancement
Tao Sun
Sander Bohté
184
8
0
14 Aug 2024
Advancing Spatio-Temporal Processing in Spiking Neural Networks through Adaptation
Nature Communications (Nat. Commun.), 2024
Maximilian Baronig
Romain Ferrand
Silvester Sabathiel
Robert Legenstein
364
14
0
14 Aug 2024
A Reference-free Metric for Language-Queried Audio Source Separation using Contrastive Language-Audio Pretraining
Feiyang Xiao
Jian Guan
Qiaoxi Zhu
Xubo Liu
Wenbo Wang
Shuhan Qi
Kejia Zhang
Jianyuan Sun
Wenwu Wang
323
11
0
06 Jul 2024
Codec-ASR: Training Performant Automatic Speech Recognition Systems with Discrete Speech Representations
Kunal Dhawan
Nithin Rao Koluguri
Ante Jukić
Ryan Langman
Jagadeesh Balam
Boris Ginsburg
222
13
0
03 Jul 2024
Papez: Resource-Efficient Speech Separation with Auditory Working Memory
Hyunseok Oh
Juheon Yi
Youngki Lee
188
4
0
01 Jul 2024
Transcription-Free Fine-Tuning of Speech Separation Models for Noisy and Reverberant Multi-Speaker Automatic Speech Recognition
William Ravenscroft
George Close
Stefan Goetze
Thomas Hain
Mohammad Soleymanpour
Anurag Chowdhury
Mark C. Fuhs
263
1
0
13 Jun 2024
Spectral Mapping of Singing Voices: U-Net-Assisted Vocal Segmentation
Adam Sorrenti
96
0
0
30 May 2024
Look Once to Hear: Target Speech Hearing with Noisy Examples
International Conference on Human Factors in Computing Systems (CHI), 2024
Bandhav Veluri
Malek Itani
Tuochao Chen
Takuya Yoshioka
Shyamnath Gollakota
326
32
0
10 May 2024
TRAMBA: A Hybrid Transformer and Mamba Architecture for Practical Audio and Bone Conduction Speech Super Resolution and Enhancement on Mobile and Wearable Platforms
Proceedings of the ACM on Interactive Mobile Wearable and Ubiquitous Technologies (IMWUT), 2024
Yueyuan Sui
Minghui Zhao
Junxi Xia
Xiaofan Jiang
S. Xia
Mamba
242
17
0
02 May 2024
PEAVS: Perceptual Evaluation of Audio-Visual Synchrony Grounded in Viewers' Opinion Scores
European Conference on Computer Vision (ECCV), 2024
Lucas Goncalves
Prashant Mathur
Chandrashekhar Lavania
Metehan Cekic
Marcello Federico
Kyu J. Han
170
9
0
10 Apr 2024
Weakly-supervised Audio Separation via Bi-modal Semantic Similarity
International Conference on Learning Representations (ICLR), 2024
Tanvir Mahmud
Saeed Amizadeh
K. Koishida
Diana Marculescu
AI4TS
231
5
0
02 Apr 2024
MambaMixer: Efficient Selective State Space Models with Dual Token and Channel Selection
Ali Behrouz
Michele Santacatterina
Ramin Zabih
441
46
0
29 Mar 2024
Dual-path Mamba: Short and Long-term Bidirectional Selective Structured State Space Models for Speech Separation
Xilin Jiang
Cong Han
N. Mesgarani
Mamba
244
74
0
27 Mar 2024
Real-time Low-latency Music Source Separation using Hybrid Spectrogram-TasNet
Satvik Venkatesh
Arthur Benilov
Philip Coleman
Frederic Roskam
302
11
0
27 Feb 2024
Target Speech Extraction with Pre-trained Self-supervised Learning Models
Junyi Peng
Marc Delcroix
Tsubasa Ochiai
Oldrich Plchot
Shoko Araki
J. Černocký
221
17
0
17 Feb 2024