ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2309.17384
  4. Cited By
Toward Universal Speech Enhancement for Diverse Input Conditions

Toward Universal Speech Enhancement for Diverse Input Conditions

29 September 2023
Wangyou Zhang
Kohei Saijo
Zhong-Qiu Wang
Shinji Watanabe
Yanmin Qian
    VLM
ArXivPDFHTML

Papers citing "Toward Universal Speech Enhancement for Diverse Input Conditions"

16 / 16 papers shown
Title
A Comparative Study on Positional Encoding for Time-frequency Domain Dual-path Transformer-based Source Separation Models
A Comparative Study on Positional Encoding for Time-frequency Domain Dual-path Transformer-based Source Separation Models
Kohei Saijo
Tetsuji Ogawa
45
1
0
28 Apr 2025
AnyEnhance: A Unified Generative Model with Prompt-Guidance and Self-Critic for Voice Enhancement
AnyEnhance: A Unified Generative Model with Prompt-Guidance and Self-Critic for Voice Enhancement
Junan Zhang
Jing Yang
Zihao Fang
Y. Wang
Zehua Zhang
Zhuo Wang
Fan Fan
Z. Wu
39
2
0
26 Jan 2025
Task-Aware Unified Source Separation
Task-Aware Unified Source Separation
Kohei Saijo
Janek Ebbers
François G. Germain
G. Wichern
Jonathan Le Roux
29
1
0
31 Oct 2024
Prototype and Instance Contrastive Learning for Unsupervised Domain
  Adaptation in Speaker Verification
Prototype and Instance Contrastive Learning for Unsupervised Domain Adaptation in Speaker Verification
Wen Huang
Bing Han
Zhengyang Chen
Shuai Wang
Yanmin Qian
VLM
SSL
13
0
0
22 Oct 2024
Extract and Diffuse: Latent Integration for Improved Diffusion-based
  Speech and Vocal Enhancement
Extract and Diffuse: Latent Integration for Improved Diffusion-based Speech and Vocal Enhancement
Yudong Yang
Zhan Liu
Wenyi Yu
Guangzhi Sun
Qiuqiang Kong
Chao Zhang
DiffM
44
0
0
15 Sep 2024
TF-Locoformer: Transformer with Local Modeling by Convolution for Speech
  Separation and Enhancement
TF-Locoformer: Transformer with Local Modeling by Convolution for Speech Separation and Enhancement
Kohei Saijo
G. Wichern
François G. Germain
Zexu Pan
Jonathan Le Roux
26
6
0
06 Aug 2024
ctPuLSE: Close-Talk, and Pseudo-Label Based Far-Field, Speech
  Enhancement
ctPuLSE: Close-Talk, and Pseudo-Label Based Far-Field, Speech Enhancement
Zhong-Qiu Wang
20
1
0
28 Jul 2024
Improving Real-Time Music Accompaniment Separation with MMDenseNet
Improving Real-Time Music Accompaniment Separation with MMDenseNet
Chun-Hsiang Wang
Chung-Che Wang
J. Wang
Jyh-Shing Roger Jang
Yen-Hsun Chu
31
0
0
30 Jun 2024
URGENT Challenge: Universality, Robustness, and Generalizability For
  Speech Enhancement
URGENT Challenge: Universality, Robustness, and Generalizability For Speech Enhancement
Wangyou Zhang
Robin Scheibler
Kohei Saijo
Samuele Cornell
Chenda Li
...
Jan Pirklbauer
Marvin Sach
Shinji Watanabe
Tim Fingscheidt
Yanmin Qian
VLM
32
6
0
07 Jun 2024
Beyond Performance Plateaus: A Comprehensive Study on Scalability in
  Speech Enhancement
Beyond Performance Plateaus: A Comprehensive Study on Scalability in Speech Enhancement
Wangyou Zhang
Kohei Saijo
Jee-weon Jung
Chenda Li
Shinji Watanabe
Yanmin Qian
30
4
0
06 Jun 2024
SpeechComposer: Unifying Multiple Speech Tasks with Prompt Composition
SpeechComposer: Unifying Multiple Speech Tasks with Prompt Composition
Yihan Wu
Soumi Maiti
Yifan Peng
Wangyou Zhang
Chenda Li
Yuyue Wang
Xihua Wang
Shinji Watanabe
Ruihua Song
25
3
0
31 Jan 2024
Improving Design of Input Condition Invariant Speech Enhancement
Improving Design of Input Condition Invariant Speech Enhancement
Wangyou Zhang
Jee-weon Jung
Shinji Watanabe
Yanmin Qian
AAML
26
2
0
25 Jan 2024
UNSSOR: Unsupervised Neural Speech Separation by Leveraging
  Over-determined Training Mixtures
UNSSOR: Unsupervised Neural Speech Separation by Leveraging Over-determined Training Mixtures
Zhong-Qiu Wang
Shinji Watanabe
11
10
0
31 May 2023
TF-GridNet: Making Time-Frequency Domain Models Great Again for Monaural
  Speaker Separation
TF-GridNet: Making Time-Frequency Domain Models Great Again for Monaural Speaker Separation
Zhong-Qiu Wang
Samuele Cornell
Shukjae Choi
Younglo Lee
Byeonghak Kim
Shinji Watanabe
66
95
0
08 Sep 2022
Learning Filterbanks for End-to-End Acoustic Beamforming
Learning Filterbanks for End-to-End Acoustic Beamforming
Samuele Cornell
Manuel Pariente
François Grondin
S. Squartini
19
7
0
08 Nov 2021
Dual-Path Transformer Network: Direct Context-Aware Modeling for
  End-to-End Monaural Speech Separation
Dual-Path Transformer Network: Direct Context-Aware Modeling for End-to-End Monaural Speech Separation
Jing-jing Chen
Qi-rong Mao
Dong Liu
54
279
0
28 Jul 2020
1