Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2203.15149
Cited By
CMGAN: Conformer-based Metric GAN for Speech Enhancement
28 March 2022
Ru Cao
Sherif Abdulatif
Bin Yang
Re-assign community
ArXiv
PDF
HTML
Papers citing
"CMGAN: Conformer-based Metric GAN for Speech Enhancement"
47 / 47 papers shown
Title
Collective Learning Mechanism based Optimal Transport Generative Adversarial Network for Non-parallel Voice Conversion
Sandipan Dhar
Md. Tousin Akhter
N. D. Jana
Swagatam Das
35
1
0
18 Apr 2025
PrimeK-Net: Multi-scale Spectral Learning via Group Prime-Kernel Convolutional Neural Networks for Single Channel Speech Enhancement
Zizhen Lin
Junyu Wang
Ruili Li
Fei Shen
Xi Xuan
64
0
0
27 Feb 2025
Speech Enhancement with Overlapped-Frame Information Fusion and Causal Self-Attention
Yuewei Zhang
Huanbin Zou
Jie Zhu
39
0
0
21 Jan 2025
Single-Channel Distance-Based Source Separation for Mobile GPU in Outdoor and Indoor Environments
Hanbin Bae
Byungjun Kang
Jiwon Kim
Jaeyong Hwang
Hosang Sung
Hoon-Young Cho
3DV
28
0
0
06 Jan 2025
GAN-Based Speech Enhancement for Low SNR Using Latent Feature Conditioning
Shrishti Saha Shetu
Emanuël A. P. Habets
Andreas Brendel
26
1
0
17 Oct 2024
GALD-SE: Guided Anisotropic Lightweight Diffusion for Efficient Speech Enhancement
Chengzhong Wang
Jianjun Gu
Dingding Yao
Junfeng Li
Yonghong Yan
DiffM
125
0
0
23 Sep 2024
LiSenNet: Lightweight Sub-band and Dual-Path Modeling for Real-Time Speech Enhancement
Haoyin Yan
Jie M. Zhang
Cunhang Fan
Yeping Zhou
Peiqi Liu
40
1
0
20 Sep 2024
Dense-TSNet: Dense Connected Two-Stage Structure for Ultra-Lightweight Speech Enhancement
Zizhen Lin
Yuanle Li
Junyu Wang
Ruili Li
34
0
0
18 Sep 2024
A Study on Zero-shot Non-intrusive Speech Assessment using Large Language Models
Ryandhimas E. Zezario
Sabato Marco Siniscalchi
Hsin-Min Wang
Yu Tsao
26
2
0
16 Sep 2024
The VoiceMOS Challenge 2024: Beyond Speech Quality Prediction
Wen-Chin Huang
Szu-Wei Fu
Erica Cooper
Ryandhimas E. Zezario
T. Toda
Hsin-Min Wang
Junichi Yamagishi
Yu Tsao
32
5
0
11 Sep 2024
Spectral oversubtraction? An approach for speech enhancement after robot ego speech filtering in semi-real-time
Yue Li
Koen V. Hindriks
Florian A. Kunneman
26
0
0
10 Sep 2024
Effective Noise-aware Data Simulation for Domain-adaptive Speech Enhancement Leveraging Dynamic Stochastic Perturbation
Chien-Chun Wang
Li-Wei Chen
Hung-Shin Lee
Berlin Chen
Hsin-Min Wang
27
1
0
03 Sep 2024
Exploiting Consistency-Preserving Loss and Perceptual Contrast Stretching to Boost SSL-based Speech Enhancement
Muhammad Salman Khan
Moreno La Quatra
Kuo-Hsuan Hung
Szu-Wei Fu
Sabato Marco Siniscalchi
Yu Tsao
23
2
0
08 Aug 2024
Improving Speech Enhancement by Integrating Inter-Channel and Band Features with Dual-branch Conformer
Jizhen Li
Xinmeng Xu
Weiping Tu
Yuhong Yang
Rong Zhu
24
1
0
09 Jul 2024
SNR-Progressive Model with Harmonic Compensation for Low-SNR Speech Enhancement
Zhongshu Hou
Tong Lei
Qinwen Hu
Zhanzhong Cao
Ming Tang
Jing Lu
32
0
0
24 Jun 2024
Complex Image-Generative Diffusion Transformer for Audio Denoising
Junhui Li
Pu Wang
Jialu Li
Youshan Zhang
DiffM
16
1
0
13 Jun 2024
Diffusion Gaussian Mixture Audio Denoise
Pu Wang
Junhui Li
Jialu Li
Liangdong Guo
Youshan Zhang
DiffM
29
0
0
13 Jun 2024
MUSE: Flexible Voiceprint Receptive Fields and Multi-Path Fusion Enhanced Taylor Transformer for U-Net-based Speech Enhancement
Zizhen Lin
Xiaoting Chen
Junyu Wang
32
2
0
07 Jun 2024
The PESQetarian: On the Relevance of Goodhart's Law for Speech Enhancement
Danilo de Oliveira
Simon Welker
Julius Richter
Timo Gerkmann
36
5
0
05 Jun 2024
TRNet: Two-level Refinement Network leveraging Speech Enhancement for Noise Robust Speech Emotion Recognition
Chengxin Chen
Pengyuan Zhang
27
0
0
19 Apr 2024
Unrestricted Global Phase Bias-Aware Single-channel Speech Enhancement with Conformer-based Metric GAN
Shiqi Zhang
Zheng Qiu
Daiki Takeuchi
Noboru Harada
Shoji Makino
11
3
0
13 Feb 2024
Objective and subjective evaluation of speech enhancement methods in the UDASE task of the 7th CHiME challenge
Simon Leglaive
Matthieu Fraticelli
Hend ElGhazaly
Léonie Borne
Mostafa Sadeghi
Scott Wisdom
Manuel Pariente
J. Hershey
Daniel Pressnitzer
Jon P. Barker
16
8
0
02 Feb 2024
Multi-CMGAN+/+: Leveraging Multi-Objective Speech Quality Metric Prediction for Speech Enhancement
George Close
William Ravenscroft
Thomas Hain
Stefan Goetze
30
2
0
14 Dec 2023
DCHT: Deep Complex Hybrid Transformer for Speech Enhancement
Jialu Li
Junhui Li
Pu Wang
Youshan Zhang
18
4
0
30 Oct 2023
DPATD: Dual-Phase Audio Transformer for Denoising
Junhui Li
Pu Wang
Jialu Li
Xinzhe Wang
Youshan Zhang
13
4
0
30 Oct 2023
Music Augmentation and Denoising For Peak-Based Audio Fingerprinting
Kamil Akesbi
Dorian Desblancs
Benjamin Martin
34
0
0
20 Oct 2023
LauraGPT: Listen, Attend, Understand, and Regenerate Audio with GPT
Zhihao Du
Jiaming Wang
Qian Chen
Yunfei Chu
Zhifu Gao
...
Wen Wang
Siqi Zheng
Chang Zhou
Zhijie Yan
Shiliang Zhang
LLMAG
VLM
AuLLM
LM&MA
34
80
0
07 Oct 2023
The second multi-channel multi-party meeting transcription challenge (M2MeT) 2.0): A benchmark for speaker-attributed ASR
Yuhao Liang
Mohan Shi
Fan Yu
Yangze Li
Shiliang Zhang
...
Jian Wu
Zhuo Chen
Kong Aik Lee
Zhijie Yan
Hui Bu
26
5
0
24 Sep 2023
A Study on Incorporating Whisper for Robust Speech Assessment
Ryandhimas E. Zezario
Yu-Wen Chen
Szu-Wei Fu
Yu Tsao
H. Wang
C. Fuh
27
10
0
22 Sep 2023
Noise robust speech emotion recognition with signal-to-noise ratio adapting speech enhancement
Yu-Wen Chen
Julia Hirschberg
Yu Tsao
19
5
0
03 Sep 2023
Deep learning-based denoising streamed from mobile phones improves speech-in-noise understanding for hearing aid users
P. U. Diehl
Hannes Zilly
Felix Sattler
Y. Singer
Kevin Kepp
...
Paul Meyer-Rachner
A. Pudszuhn
V. Hofmann
M. Vormann
Elias Sprengel
29
3
0
22 Aug 2023
Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement
Ye-Xin Lu
Yang Ai
Zhenhua Ling
25
7
0
17 Aug 2023
Separate Anything You Describe
Xubo Liu
Qiuqiang Kong
Yan Zhao
Haohe Liu
Yiitan Yuan
Yuzhuo Liu
Rui Xia
Yuxuan Wang
Mark D. Plumbley
Wenwu Wang
VLM
25
43
0
09 Aug 2023
Large-scale unsupervised audio pre-training for video-to-speech synthesis
Triantafyllos Kefalas
Yannis Panagakis
M. Pantic
VGen
32
3
0
27 Jun 2023
Multi-Dimensional and Multi-Scale Modeling for Speech Separation Optimized by Discriminative Learning
Zhaoxi Mu
Xinyu Yang
Wenjing Zhu
14
5
0
07 Mar 2023
D2Former: A Fully Complex Dual-Path Dual-Decoder Conformer Network using Joint Complex Masking and Complex Spectral Mapping for Monaural Speech Enhancement
Shengkui Zhao
Bin Ma
27
16
0
23 Feb 2023
Local spectral attention for full-band speech enhancement
Zhongshu Hou
Qi Hu
Kai-Jyun Chen
Jing Lu
28
0
0
11 Feb 2023
Audio Denoising for Robust Audio Fingerprinting
Kamil Akesbi
21
3
0
21 Dec 2022
Deep neural network techniques for monaural speech enhancement: state of the art analysis
P. Ochieng
28
21
0
01 Dec 2022
Inference and Denoise: Causal Inference-based Neural Speech Enhancement
Tsun-An Hsieh
Chao-Han Huck Yang
Pin-Yu Chen
Sabato Marco Siniscalchi
Yu Tsao
CML
50
2
0
02 Nov 2022
A Training and Inference Strategy Using Noisy and Enhanced Speech as Target for Speech Enhancement without Clean Speech
Li-Wei Chen
Yao-Fei Cheng
Hung-Shin Lee
Yu Tsao
Hsin-Min Wang
20
3
0
27 Oct 2022
SCP-GAN: Self-Correcting Discriminator Optimization for Training Consistency Preserving Metric GAN on Speech Enhancement Tasks
Vasily Zadorozhnyy
Qian Ye
K. Koishida
13
8
0
26 Oct 2022
CMGAN: Conformer-Based Metric-GAN for Monaural Speech Enhancement
Sherif Abdulatif
Ru Cao
Bin Yang
21
61
0
22 Sep 2022
Speech Enhancement and Dereverberation with Diffusion-based Generative Models
Julius Richter
Simon Welker
Jean-Marie Lemercier
Bunlong Lay
Timo Gerkmann
DiffM
11
178
0
11 Aug 2022
DPT-FSNet: Dual-path Transformer Based Full-band and Sub-band Fusion Network for Speech Enhancement
Feng Dang
Hangting Chen
Pengyuan Zhang
68
94
0
27 Apr 2021
Dual-Path Transformer Network: Direct Context-Aware Modeling for End-to-End Monaural Speech Separation
Jing-jing Chen
Qi-rong Mao
Dong Liu
59
280
0
28 Jul 2020
Image-to-Image Translation with Conditional Adversarial Networks
Phillip Isola
Jun-Yan Zhu
Tinghui Zhou
Alexei A. Efros
SSeg
212
19,447
0
21 Nov 2016
1