v1v2 (latest)

TasNet: time-domain audio separation network for real-time, single-channel speech separation

1 November 2017

Yi Luo

N. Mesgarani

ArXiv (abs)PDF HTML

Papers citing "TasNet: time-domain audio separation network for real-time, single-channel speech separation"

50 / 283 papers shown

Continual Learning for Singing Voice Separation with Human in the Loop Adaptation

283

02 Dec 2025

Evaluating Objective Speech Quality Metrics for Neural Audio Codecs

Luca A. Lanzendörfer

Florian Grötschla

24 Nov 2025

Towards Practical Real-Time Low-Latency Music Source SeparationIEEE International Conference on Multimedia and Expo (ICME), 2025

111

17 Nov 2025

SAO-Instruct: Free-form Audio Editing using Natural Language Instructions

161

26 Oct 2025

ReFESS-QI: Reference-Free Evaluation For Speech Separation With Joint Quality And Intelligibility Scoring

121

23 Oct 2025

MARS-Sep: Multimodal-Aligned Reinforced Sound Separation

12 Oct 2025

Multi-bit Audio Watermarking

113

02 Oct 2025

Unsupervised Single-Channel Speech Separation with a Diffusion Prior under Speaker-Embedding Guidance

29 Sep 2025

Neural Speech Separation with Parallel Amplitude and Phase Spectrum Estimation

Fei Liu

Yang Ai

Zhen-Hua Ling

117

17 Sep 2025

A Lightweight Architecture for Multi-instrument Transcription with Practical Optimizations

Ruigang Li

Yongxu Zhu

16 Sep 2025

A Study of the Scale Invariant Signal to Distortion Ratio in Speech Separation with Noisy References

Simon Dahl Jepsen

M. G. Christensen

Jesper Rindom Jensen

104

20 Aug 2025

Advances in Speech Separation: Techniques, Challenges, and Future Trends

...

119

14 Aug 2025

Nonlinear Framework for Speech Bandwidth Extension

Tarikul Islam Tamiti

Nursad Mamun

Anomadarshi Barua

176

21 Jul 2025

Knowing When to Quit: Probabilistic Early Exits for Speech Separation

Rasmus Malik Høegh Lindrup

Bjørn Sand Jensen

Morten Mørup

UQCV

248

13 Jul 2025

EDNet: A Distortion-Agnostic Speech Enhancement Framework with Gating Mamba Mechanism and Phase Shift-Invariant Training

123

19 Jun 2025

SpeechRefiner: Towards Perceptual Quality Refinement for Front-End Algorithms

152

16 Jun 2025

Local Equivariance Error-Based Metrics for Evaluating Sampling-Frequency-Independent Property of Neural Network

153

04 Jun 2025

How Far Are We from Generating Missing Modalities with Foundation Models?

303

04 Jun 2025

Uni-VERSA: Versatile Speech Assessment with a Unified Network

Jiatong Shi

Hye-jin Shim

Shinji Watanabe

217

27 May 2025

Time-Frequency-Based Attention Cache Memory Model for Real-Time Speech Separation

239

19 May 2025

Survey of End-to-End Multi-Speaker Automatic Speech Recognition for Monaural Audio

Xinlu He

Jacob Whitehill

214

16 May 2025

Knowledge Distillation for Speech Denoising by Latent Representation Alignment with Cosine Distance

360

06 May 2025

A Comparative Study on Positional Encoding for Time-frequency Domain Dual-path Transformer-based Source Separation Models

Kohei Saijo

Tetsuji Ogawa

295

28 Apr 2025

Passive Underwater Acoustic Signal Separation based on Feature Decoupling Dual-path Network

Yucheng Liu

Longyu Jiang

237

11 Apr 2025

DeepExtractor: Time-domain reconstruction of signals and glitches in gravitational wave data with deep learning

384

30 Jan 2025

EDSep: An Effective Diffusion-Based Method for Speech Source SeparationIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2025

Jinwei Dong

Xinsheng Wang

Qirong Mao

323

28 Jan 2025

SoundSpring: Loss-Resilient Audio Transceiver with Dual-Functional Masked Language ModelingIEEE Journal on Selected Areas in Communications (JSAC), 2025

333

22 Jan 2025

Gen-A: Generalizing Ambisonics Neural Encoding to Unseen Microphone ArraysIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2025

116

14 Jan 2025

Evaluating the Impact of Discriminative and Generative E2E Speech Enhancement Models on Syllable Stress Preservation

Rangavajjala Sankara Bharadwaj

Jhansi Mallela

Sai Harshitha Aluru

Chiranjeevi Yarra

188

11 Dec 2024

Modulating State Space Model with SlowFast Framework for Compute-Efficient Ultra Low-Latency Speech EnhancementIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024

209

04 Nov 2024

SepMamba: State-space models for speaker separation using Mamba

Thor Højhus Avenstrup

185

28 Oct 2024

OpenSep: Leveraging Large Language Models with Textual Inversion for Open World Audio SeparationConference on Empirical Methods in Natural Language Processing (EMNLP), 2024

Tanvir Mahmud

Diana Marculescu

VLM

206

28 Sep 2024

DeWinder: Single-Channel Wind Noise Reduction using Ultrasound SensingInterspeech (Interspeech), 2024

137

10 Sep 2024

USEF-TSE: Universal Speaker Embedding Free Target Speaker ExtractionIEEE Transactions on Audio, Speech, and Language Processing (TASLP), 2024

Bang Zeng

Ming Li

423

04 Sep 2024

Spectron: Target Speaker Extraction using Conditional Transformer with Adversarial Refinement

Tathagata Bandyopadhyay

ViT

251

02 Sep 2024

DPSNN: Spiking Neural Network for Low-Latency Streaming Speech Enhancement

Tao Sun

Sander Bohté

184

14 Aug 2024

Advancing Spatio-Temporal Processing in Spiking Neural Networks through AdaptationNature Communications (Nat. Commun.), 2024

364

14 Aug 2024

A Reference-free Metric for Language-Queried Audio Source Separation using Contrastive Language-Audio Pretraining

Wenwu Wang

323

06 Jul 2024

Codec-ASR: Training Performant Automatic Speech Recognition Systems with Discrete Speech Representations

Kunal Dhawan

Nithin Rao Koluguri

Ante Jukić

Ryan Langman

Jagadeesh Balam

Boris Ginsburg

222

03 Jul 2024

Papez: Resource-Efficient Speech Separation with Auditory Working Memory

Hyunseok Oh

Juheon Yi

Youngki Lee

188

01 Jul 2024

Transcription-Free Fine-Tuning of Speech Separation Models for Noisy and Reverberant Multi-Speaker Automatic Speech Recognition

Mohammad Soleymanpour

Anurag Chowdhury

Mark C. Fuhs

263

13 Jun 2024

Spectral Mapping of Singing Voices: U-Net-Assisted Vocal Segmentation

Adam Sorrenti

30 May 2024

Look Once to Hear: Target Speech Hearing with Noisy ExamplesInternational Conference on Human Factors in Computing Systems (CHI), 2024

326

10 May 2024

TRAMBA: A Hybrid Transformer and Mamba Architecture for Practical Audio and Bone Conduction Speech Super Resolution and Enhancement on Mobile and Wearable PlatformsProceedings of the ACM on Interactive Mobile Wearable and Ubiquitous Technologies (IMWUT), 2024

242

02 May 2024

PEAVS: Perceptual Evaluation of Audio-Visual Synchrony Grounded in Viewers' Opinion ScoresEuropean Conference on Computer Vision (ECCV), 2024

Lucas Goncalves

Prashant Mathur

Chandrashekhar Lavania

Metehan Cekic

Marcello Federico

Kyu J. Han

170

10 Apr 2024

Weakly-supervised Audio Separation via Bi-modal Semantic SimilarityInternational Conference on Learning Representations (ICLR), 2024

231

02 Apr 2024

MambaMixer: Efficient Selective State Space Models with Dual Token and Channel Selection

Ali Behrouz

Michele Santacatterina

Ramin Zabih

441

29 Mar 2024

Dual-path Mamba: Short and Long-term Bidirectional Selective Structured State Space Models for Speech Separation

Xilin Jiang

Cong Han

N. Mesgarani

Mamba

244

27 Mar 2024

Real-time Low-latency Music Source Separation using Hybrid Spectrogram-TasNet

302

27 Feb 2024

Target Speech Extraction with Pre-trained Self-supervised Learning Models

221

17 Feb 2024