Training-Free Multi-Step Audio Source Separation

26 May 2025
Yongyi Zang
Jingyi Li
Qiuqiang Kong

Papers cited by "Training-Free Multi-Step Audio Source Separation"

40 / 40 papers shown
FlowSep: Language-Queried Sound Separation with Rectified Flow Matching
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024
Yi Yuan
Xubo Liu
Haohe Liu
Mark D. Plumbley
Wenwu Wang
387
22
0
10 Jan 2025
FastVoiceGrad: One-step Diffusion-Based Voice Conversion with Adversarial Conditional Diffusion Distillation
Interspeech, 2024
Takuhiro Kaneko
Hirokazu Kameoka
Kou Tanaka
Yuto Kondo
DiffM
240
6
0
03 Sep 2024
Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters
Charlie Snell
Jaehoon Lee
Kelvin Xu
Aviral Kumar
LRM
632
1,284
0
06 Aug 2024
URGENT Challenge: Universality, Robustness, and Generalizability For Speech Enhancement
Interspeech, 2024
Wangyou Zhang
Robin Scheibler
Kohei Saijo
Samuele Cornell
Chenda Li
...
Jan Pirklbauer
Marvin Sach
Shinji Watanabe
Tim Fingscheidt
Yanmin Qian
VLM
207
45
0
07 Jun 2024
Beyond Performance Plateaus: A Comprehensive Study on Scalability in Speech Enhancement
Interspeech, 2024
Wangyou Zhang
Kohei Saijo
Jee-weon Jung
Chenda Li
Shinji Watanabe
Yanmin Qian
169
17
0
06 Jun 2024
Improve Mathematical Reasoning in Language Models by Automated Process Supervision
Liangchen Luo
Yinxiao Liu
Rosanne Liu
Samrat Phatale
Harsh Lara
...
Lei Shu
Yun Zhu
Lei Meng
Jiao Sun
Abhinav Rastogi
LRM
296
309
0
05 Jun 2024
Denoising Diffusion Bridge Models
International Conference on Learning Representations (ICLR), 2023
Linqi Zhou
Aaron Lou
Samar Khanna
Stefano Ermon
DiffM
402
125
0
29 Sep 2023
Music Source Separation Based on a Lightweight Deep Learning Framework (DTTNET: DUAL-PATH TFC-TDF UNET)
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Junyu Chen
Susmitha Vekkot
Pancham Shukla
221
15
0
15 Sep 2023
SingFake: Singing Voice Deepfake Detection
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Yongyi Zang
You Zhang
Mojtaba Heydari
Zhiyao Duan
340
51
0
14 Sep 2023
Music Source Separation with Band-Split RoPE Transformer
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Wei-Tsung Lu
Ju-Chiang Wang
Qiuqiang Kong
Yun-Ning Hung
192
60
0
05 Sep 2023
The Sound Demixing Challenge 2023 – Music Demixing Track
Transactions of the International Society for Music Information Retrieval (TISMIR), 2023
Giorgio Fabbro
Stefan Uhlich
Chieh-Hsin Lai
Woosung Choi
Marco A. Martínez-Ramírez
...
Jun Hyung Lee
Yuanliang Dong
Xinran Zhang
Jiafeng Liu
Yuki Mitsufuji
368
37
0
14 Aug 2023
Let's Verify Step by Step
International Conference on Learning Representations (ICLR), 2023
Hunter Lightman
V. Kosaraju
Yura Burda
Harrison Edwards
Bowen Baker
Teddy Lee
Jan Leike
John Schulman
Ilya Sutskever
K. Cobbe
ALM OffRL LRM
1.1K
2,183
0
31 May 2023
Multi-Source Diffusion Models for Simultaneous Music Generation and Separation
International Conference on Learning Representations (ICLR), 2023
Giorgio Mariani
Irene Tallini
Emilian Postolache
Michele Mancusi
Luca Cosmo
Emanuele Rodolà
DiffM
531
65
0
04 Feb 2023
Diffusion-based Generative Speech Source Separation
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Robin Scheibler
Youna Ji
Soo-Whan Chung
J. Byun
Soyeon Choe
Min-Seok Choi
DiffM
350
61
0
31 Oct 2022
Scaling Laws for Reward Model Overoptimization
International Conference on Machine Learning (ICML), 2022
Leo Gao
John Schulman
Jacob Hilton
ALM
365
766
0
19 Oct 2022
Flow Matching for Generative Modeling
International Conference on Learning Representations (ICLR), 2022
Y. Lipman
Ricky T. Q. Chen
Heli Ben-Hamu
Maximilian Nickel
Matt Le
OOD
1.1K
2,834
0
06 Oct 2022
Music Source Separation with Band-split RNN
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022
Yi Luo
Jianwei Yu
224
178
0
30 Sep 2022
Flow Straight and Fast: Learning to Generate and Transfer Data with Rectified Flow
International Conference on Learning Representations (ICLR), 2022
Xingchao Liu
Chengyue Gong
Qiang Liu
OOD
1.0K
1,964
0
07 Sep 2022
Speech Enhancement and Dereverberation with Diffusion-based Generative Models
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022
Julius Richter
Simon Welker
Jean-Marie Lemercier
Bunlong Lay
Timo Gerkmann
DiffM
349
317
0
11 Aug 2022
UTMOS: UTokyo-SaruLab System for VoiceMOS Challenge 2022
Interspeech, 2022
Takaaki Saeki
Detai Xin
Wataru Nakata
Tomoki Koriyama
Shinnosuke Takamichi
Hiroshi Saruwatari
294
410
0
05 Apr 2022
Improving Source Separation by Explicitly Modeling Dependencies Between Sources
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Ethan Manilow
Curtis Hawthorne
Cheng-Zhi Anna Huang
Bryan Pardo
Jesse Engel
BDL
168
10
0
28 Mar 2022
Self-Consistency Improves Chain of Thought Reasoning in Language Models
International Conference on Learning Representations (ICLR), 2022
Xuezhi Wang
Jason W. Wei
Dale Schuurmans
Quoc Le
Ed H. Chi
Sharan Narang
Aakanksha Chowdhery
Denny Zhou
ReLM BDL LRM AI4CE
2.4K
5,444
0
21 Mar 2022
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Neural Information Processing Systems (NeurIPS), 2022
Jason W. Wei
Xuezhi Wang
Dale Schuurmans
Maarten Bosma
Brian Ichter
F. Xia
Ed H. Chi
Quoc Le
Denny Zhou
LM&Ro LRM AI4CE ReLM
2.3K
14,365
0
28 Jan 2022
Zero-shot Audio Source Separation through Query-based Learning from Weakly-labeled Data
Ke Chen
Xingjian Du
Bilei Zhu
Zejun Ma
Taylor Berg-Kirkpatrick
Shlomo Dubnov
283
56
0
15 Dec 2021
DDS: A new device-degraded speech dataset for speech enhancement
Haoyu Li
Junichi Yamagishi
211
10
0
16 Sep 2021
Decoupling Magnitude and Phase Estimation with Deep ResUNet for Music Source Separation
Qiuqiang Kong
Yin Cao
Haohe Liu
Keunwoo Choi
Yuxuan Wang
333
108
0
12 Sep 2021
DNSMOS: A Non-Intrusive Perceptual Objective Speech Quality metric to evaluate Noise Suppressors
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
Chandan K. A. Reddy
Vishak Gopal
Ross Cutler
308
436
0
28 Oct 2020
Attention is All You Need in Speech Separation
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
Cem Subakan
Mirco Ravanelli
Samuele Cornell
Mirko Bronzi
Jianyuan Zhong
270
689
0
25 Oct 2020
Denoising Diffusion Probabilistic Models
Jonathan Ho
Ajay Jain
Pieter Abbeel
DiffM
4.8K
25,499
0
19 Jun 2020
The INTERSPEECH 2020 Deep Noise Suppression Challenge: Datasets, Subjective Testing Framework, and Challenge Results
Chandan K. A. Reddy
Vishak Gopal
Ross Cutler
Ebrahim Beyrami
R. Cheng
...
A. Aazami
Sebastian Braun
Puneet Rana
Sriram Srinivasan
J. Gehrke
344
401
0
16 May 2020
Scaling Laws for Neural Language Models
Jared Kaplan
Sam McCandlish
T. Henighan
Tom B. Brown
B. Chess
R. Child
Scott Gray
Alec Radford
Jeff Wu
Dario Amodei
1.8K
6,606
0
23 Jan 2020
Music Source Separation in the Waveform Domain
Alexandre Défossez
Nicolas Usunier
Léon Bottou
Francis R. Bach
316
302
0
27 Nov 2019
WHAMR!: Noisy and Reverberant Single-Channel Speech Separation
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019
Matthew Maciejewski
Gordon Wichern
E. McQuinn
Jonathan Le Roux
184
207
0
22 Oct 2019
A scalable noisy speech dataset and online subjective test framework
Interspeech, 2019
Chandan K. A. Reddy
Ebrahim Beyrami
Jamie Pool
Ross Cutler
Sriram Srinivasan
J. Gehrke
164
170
0
17 Sep 2019
End-to-End Multi-Task Denoising for joint SDR and PESQ Optimization
Jaeyoung Kim
Mostafa El-Khamy
Jungwon Lee
201
31
0
26 Jan 2019
Scaling Speech Enhancement in Unseen Environments with Noise Embeddings
Gil Keren
Jing Han
Björn Schuller
108
17
0
26 Oct 2018
Wave-U-Net: A Multi-Scale Neural Network for End-to-End Audio Source Separation
Daniel Stoller
Sebastian Ewert
S. Dixon
AI4TS
314
656
0
08 Jun 2018
Regularisation of Neural Networks by Enforcing Lipschitz Continuity
Henry Gouk
E. Frank
Bernhard Pfahringer
M. Cree
530
551
0
12 Apr 2018
Spectral Normalization for Generative Adversarial Networks
Takeru Miyato
Toshiki Kataoka
Masanori Koyama
Yuichi Yoshida
ODL
445
4,748
0
16 Feb 2018
Deep Unsupervised Learning using Nonequilibrium Thermodynamics
Jascha Narain Sohl-Dickstein
Eric A. Weiss
Niru Maheswaranathan
Surya Ganguli
SyDa DiffM
1.5K
8,785
0
12 Mar 2015