Training-Free Multi-Step Audio Source Separation

26 May 2025

Papers citing "Training-Free Multi-Step Audio Source Separation"

40 / 40 papers shown

FlowSep: Language-Queried Sound Separation with Rectified Flow MatchingIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024

409

10 Jan 2025

FastVoiceGrad: One-step Diffusion-Based Voice Conversion with Adversarial Conditional Diffusion DistillationInterspeech (Interspeech), 2024

241

03 Sep 2024

Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters

658

1,290

06 Aug 2024

URGENT Challenge: Universality, Robustness, and Generalizability For Speech EnhancementInterspeech (Interspeech), 2024

Wangyou Zhang

...

Shinji Watanabe

Yanmin Qian

218

07 Jun 2024

Beyond Performance Plateaus: A Comprehensive Study on Scalability in Speech EnhancementInterspeech (Interspeech), 2024

Wangyou Zhang

Kohei Saijo

Jee-weon Jung

Chenda Li

Shinji Watanabe

Yanmin Qian

181

06 Jun 2024

Improve Mathematical Reasoning in Language Models by Automated Process Supervision

Liangchen Luo

...

308

311

05 Jun 2024

Denoising Diffusion Bridge ModelsInternational Conference on Learning Representations (ICLR), 2023

406

125

29 Sep 2023

Music Source Separation Based on a Lightweight Deep Learning Framework (DTTNET: DUAL-PATH TFC-TDF UNET)IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

Junyu Chen

Susmitha Vekkot

Pancham Shukla

236

15 Sep 2023

SingFake: Singing Voice Deepfake DetectionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

355

14 Sep 2023

Music Source Separation with Band-Split RoPE TransformerIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

Wei-Tsung Lu

Ju-Chiang Wang

Qiuqiang Kong

Yun-Ning Hung

204

05 Sep 2023

$The Sound Demixing Challenge 2023 $\unicode{x2013}$ Music Demixing Track$

The Sound Demixing Challenge 2023

\unicode{x2013}

Music Demixing TrackTransactions of the International Society for Music Information Retrieval (TISMIR), 2023

Marco A. Martínez-Ramírez

...

390

14 Aug 2023

Let's Verify Step by StepInternational Conference on Learning Representations (ICLR), 2023

1.2K

2,214

31 May 2023

Multi-Source Diffusion Models for Simultaneous Music Generation and SeparationInternational Conference on Learning Representations (ICLR), 2023

565

04 Feb 2023

Diffusion-based Generative Speech Source SeparationIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

367

31 Oct 2022

Scaling Laws for Reward Model OveroptimizationInternational Conference on Machine Learning (ICML), 2022

376

772

19 Oct 2022

Flow Matching for Generative ModelingInternational Conference on Learning Representations (ICLR), 2022

1.1K

2,869

06 Oct 2022

Music Source Separation with Band-split RNNIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022

Yi Luo

Jianwei Yu

232

178

30 Sep 2022

Flow Straight and Fast: Learning to Generate and Transfer Data with Rectified FlowInternational Conference on Learning Representations (ICLR), 2022

1.1K

1,983

07 Sep 2022

Speech Enhancement and Dereverberation with Diffusion-based Generative ModelsIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022

377

320

11 Aug 2022

UTMOS: UTokyo-SaruLab System for VoiceMOS Challenge 2022Interspeech (Interspeech), 2022

Hiroshi Saruwatari

318

412

05 Apr 2022

Improving Source Separation by Explicitly Modeling Dependencies Between SourcesIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

175

28 Mar 2022

Self-Consistency Improves Chain of Thought Reasoning in Language ModelsInternational Conference on Learning Representations (ICLR), 2022

2.7K

5,537

21 Mar 2022

Chain-of-Thought Prompting Elicits Reasoning in Large Language ModelsNeural Information Processing Systems (NeurIPS), 2022

2.3K

14,449

28 Jan 2022

Zero-shot Audio Source Separation through Query-based Learning from Weakly-labeled Data

Ke Chen

Xingjian Du

Bilei Zhu

Zejun Ma

Taylor Berg-Kirkpatrick

Shlomo Dubnov

324

15 Dec 2021

DDS: A new device-degraded speech dataset for speech enhancement

Haoyu Li

Junichi Yamagishi

219

16 Sep 2021

Decoupling Magnitude and Phase Estimation with Deep ResUNet for Music Source Separation

Yuxuan Wang

333

108

12 Sep 2021

DNSMOS: A Non-Intrusive Perceptual Objective Speech Quality metric to evaluate Noise SuppressorsIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020

Chandan K. A. Reddy

Vishak Gopal

Ross Cutler

328

436

28 Oct 2020

Attention is All You Need in Speech SeparationIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020

Mirco Ravanelli

283

699

25 Oct 2020

Denoising Diffusion Probabilistic Models

Jonathan Ho

Ajay Jain

Pieter Abbeel

DiffM

5.0K

25,697

19 Jun 2020

The INTERSPEECH 2020 Deep Noise Suppression Challenge: Datasets, Subjective Testing Framework, and Challenge Results

...

348

401

16 May 2020

Scaling Laws for Neural Language Models

1.8K

6,650

23 Jan 2020

Music Source Separation in the Waveform Domain

335

302

27 Nov 2019

WHAMR!: Noisy and Reverberant Single-Channel Speech SeparationIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019

195

208

22 Oct 2019

A scalable noisy speech dataset and online subjective test frameworkInterspeech (Interspeech), 2019

164

170

17 Sep 2019

End-to-End Multi-Task Denoising for joint SDR and PESQ Optimization

Jaeyoung Kim

Mostafa El-Khamy

Jungwon Lee

201

26 Jan 2019

Scaling Speech Enhancement in Unseen Environments with Noise Embeddings

Gil Keren

Jing Han

Björn Schuller

108

26 Oct 2018

Wave-U-Net: A Multi-Scale Neural Network for End-to-End Audio Source Separation

342

659

08 Jun 2018

Regularisation of Neural Networks by Enforcing Lipschitz Continuity

544

552

12 Apr 2018

Spectral Normalization for Generative Adversarial Networks

492

4,772

16 Feb 2018

Deep Unsupervised Learning using Nonequilibrium Thermodynamics

Jascha Narain Sohl-Dickstein

1.5K

8,825

12 Mar 2015