Toward Universal Speech Enhancement for Diverse Input Conditions

Toward Universal Speech Enhancement for Diverse Input Conditions

29 September 2023

Wangyou Zhang

Shinji Watanabe

Papers citing "Toward Universal Speech Enhancement for Diverse Input Conditions"

16 / 16 papers shown

Title
A Comparative Study on Positional Encoding for Time-frequency Domain Dual-path Transformer-based Source Separation Models Kohei Saijo Tetsuji Ogawa 45 1 0 28 Apr 2025
AnyEnhance: A Unified Generative Model with Prompt-Guidance and Self-Critic for Voice Enhancement Junan Zhang Jing Yang Zihao Fang Y. Wang Zehua Zhang Zhuo Wang Fan Fan Z. Wu 39 2 0 26 Jan 2025
Task-Aware Unified Source Separation Kohei Saijo Janek Ebbers François G. Germain G. Wichern Jonathan Le Roux 29 1 0 31 Oct 2024
Prototype and Instance Contrastive Learning for Unsupervised Domain Adaptation in Speaker Verification Wen Huang Bing Han Zhengyang Chen Shuai Wang Yanmin Qian VLM SSL 13 0 0 22 Oct 2024
Extract and Diffuse: Latent Integration for Improved Diffusion-based Speech and Vocal Enhancement Yudong Yang Zhan Liu Wenyi Yu Guangzhi Sun Qiuqiang Kong Chao Zhang DiffM 44 0 0 15 Sep 2024
TF-Locoformer: Transformer with Local Modeling by Convolution for Speech Separation and Enhancement Kohei Saijo G. Wichern François G. Germain Zexu Pan Jonathan Le Roux 26 6 0 06 Aug 2024
ctPuLSE: Close-Talk, and Pseudo-Label Based Far-Field, Speech Enhancement Zhong-Qiu Wang 20 1 0 28 Jul 2024
Improving Real-Time Music Accompaniment Separation with MMDenseNet Chun-Hsiang Wang Chung-Che Wang J. Wang Jyh-Shing Roger Jang Yen-Hsun Chu 31 0 0 30 Jun 2024
URGENT Challenge: Universality, Robustness, and Generalizability For Speech Enhancement Wangyou Zhang Robin Scheibler Kohei Saijo Samuele Cornell Chenda Li ... Jan Pirklbauer Marvin Sach Shinji Watanabe Tim Fingscheidt Yanmin Qian VLM 32 6 0 07 Jun 2024
Beyond Performance Plateaus: A Comprehensive Study on Scalability in Speech Enhancement Wangyou Zhang Kohei Saijo Jee-weon Jung Chenda Li Shinji Watanabe Yanmin Qian 30 4 0 06 Jun 2024
SpeechComposer: Unifying Multiple Speech Tasks with Prompt Composition Yihan Wu Soumi Maiti Yifan Peng Wangyou Zhang Chenda Li Yuyue Wang Xihua Wang Shinji Watanabe Ruihua Song 25 3 0 31 Jan 2024
Improving Design of Input Condition Invariant Speech Enhancement Wangyou Zhang Jee-weon Jung Shinji Watanabe Yanmin Qian AAML 26 2 0 25 Jan 2024
UNSSOR: Unsupervised Neural Speech Separation by Leveraging Over-determined Training Mixtures Zhong-Qiu Wang Shinji Watanabe 11 10 0 31 May 2023
TF-GridNet: Making Time-Frequency Domain Models Great Again for Monaural Speaker Separation Zhong-Qiu Wang Samuele Cornell Shukjae Choi Younglo Lee Byeonghak Kim Shinji Watanabe 66 95 0 08 Sep 2022
Learning Filterbanks for End-to-End Acoustic Beamforming Samuele Cornell Manuel Pariente François Grondin S. Squartini 19 7 0 08 Nov 2021
Dual-Path Transformer Network: Direct Context-Aware Modeling for End-to-End Monaural Speech Separation Jing-jing Chen Qi-rong Mao Dong Liu 54 279 0 28 Jul 2020