Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2206.08545
Cited By
v1
v2 (latest)
NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates
Interspeech (Interspeech), 2022
17 June 2022
Seungu Han
Junhyeok Lee
DiffM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates"
37 / 37 papers shown
Title
Harmonic-Percussive Disentangled Neural Audio Codec for Bandwidth Extension
Benoît Giniès
Xiaoyu Bie
Olivier Fercoq
Gaël Richard
116
0
0
26 Nov 2025
UniverSR: Unified and Versatile Audio Super-Resolution via Vocoder-Free Flow Matching
Woongjib Choi
Sangmin Lee
Hyungseob Lim
Hong-Goo Kang
DiffM
SupR
145
0
0
01 Oct 2025
VoiceBridge: Designing Latent Bridge Models for General Speech Restoration at Scale
Chi Zhang
Zehua Chen
Kaiwen Zheng
Jun Zhu
AuLLM
150
0
0
28 Sep 2025
Audio Super-Resolution with Latent Bridge Models
Chang Li
Zehua Chen
Liyuan Wang
Jun Zhu
248
3
0
22 Sep 2025
Inference-time Scaling for Diffusion-based Audio Super-resolution
Yizhu Jin
Zhen Ye
Zeyue Tian
Haohe Liu
Qiuqiang Kong
Wenhan Luo
Wei Xue
DiffM
119
1
0
04 Aug 2025
Enhancing Spectrogram Realism in Singing Voice Synthesis via Explicit Bandwidth Extension Prior to Vocoder
Runxuan Yang
Kai Li
Guo Chen
Xiaolin Hu
81
0
0
03 Aug 2025
Neural Spectral Band Generation for Audio Coding
Woongjib Choi
Byeong Hyeon Kim
Hyungseob Lim
Inseon Jang
Hong-Goo Kang
140
0
0
07 Jun 2025
FLowHigh: Towards Efficient and High-Quality Audio Super-Resolution with Single-Step Flow Matching
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2025
Jun-Hak Yun
Seung-Bin Kim
Seong-Whan Lee
DiffM
97
7
0
10 Jan 2025
Vector Quantized Diffusion Model Based Speech Bandwidth Extension
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024
Yuan Fang
Jinglin Bai
Jiajie Wang
Xueliang Zhang
157
4
0
09 Sep 2024
PeriodWave: Multi-Period Flow Matching for High-Fidelity Waveform Generation
International Conference on Learning Representations (ICLR), 2024
Sang-Hoon Lee
Ha-Yeong Choi
Seong-Whan Lee
OOD
DiffM
AI4TS
217
11
0
14 Aug 2024
DSP-informed bandwidth extension using locally-conditioned excitation and linear time-varying filter subnetworks
S. Nercessian
Alexey Lukin
Johannes Imort
165
1
0
22 Jul 2024
Vibravox: A Dataset of French Speech Captured with Body-conduction Audio Sensors
J. Hauret
Malo Olivier
Thomas Joubaud
C. Langrenne
Sarah Poirée
V. Zimpfer
Éric Bavu
452
14
0
16 Jul 2024
Multi-Stage Speech Bandwidth Extension with Flexible Sampling Rate Control
Ye-Xin Lu
Yang Ai
Zheng-Yan Sheng
Zhen-Hua Ling
106
8
0
04 Jun 2024
Improving Diffusion Models's Data-Corruption Resistance using Scheduled Pseudo-Huber Loss
Artem Khrapov
Vadim Popov
Tasnima Sadekova
Assel Yermekova
Mikhail Kudinov
DiffM
171
3
0
25 Mar 2024
MusicHiFi: Fast High-Fidelity Stereo Vocoding
Ge Zhu
Juan-Pablo Caceres
Zhiyao Duan
Nicholas J. Bryan
DiffM
190
8
0
15 Mar 2024
Combined Generative and Predictive Modeling for Speech Super-resolution
Computer Speech and Language (CSL), 2024
Heming Wang
Eric W. Healy
DeLiang Wang
DiffM
188
3
0
25 Jan 2024
Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction
IEEE Transactions on Audio, Speech, and Language Processing (IEEE TASLP), 2024
Ye-Xin Lu
Yang Ai
Hui-Peng Du
Zhenhua Ling
165
25
0
12 Jan 2024
BAE-Net: A Low complexity and high fidelity Bandwidth-Adaptive neural network for speech super-resolution
Guochen Yu
Xiguang Zheng
Nan Li
Runqiang Han
C. Zheng
Chen Zhang
Chao Zhou
Qi Huang
Bin Yu
215
12
0
21 Dec 2023
HierSpeech++: Bridging the Gap between Semantic and Acoustic Representation of Speech by Hierarchical Variational Inference for Zero-shot Speech Synthesis
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2023
Sang-Hoon Lee
Haram Choi
Seung-Bin Kim
Seong-Whan Lee
BDL
293
57
0
21 Nov 2023
Super Denoise Net: Speech Super Resolution with Noise Cancellation in Low Sampling Rate Noisy Environments
Junkang Yang
Hongqing Liu
Lu Gan
Yi Zhou
201
1
0
09 Oct 2023
AudioSR: Versatile Audio Super-resolution at Scale
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Haohe Liu
Ke Chen
Qiao Tian
Wenwu Wang
Mark D. Plumbley
DiffM
114
58
0
13 Sep 2023
Progressive distillation diffusion for raw music generation
Svetlana Pavlova
DiffM
156
0
0
20 Jul 2023
Edge Storage Management Recipe with Zero-Shot Data Compression for Road Anomaly Detection
Information and Communication Technology Convergence (ICTC), 2023
Yeonghyeon Park
U. Gim
Myung Jin Kim
155
1
0
10 Jul 2023
Blind Audio Bandwidth Extension: A Diffusion-Based Zero-Shot Approach
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2023
Eloi Moliner
Filip Elvander
Vesa Valimaki
DiffM
199
20
0
02 Jun 2023
DDDM-VC: Decoupled Denoising Diffusion Models with Disentangled Representation and Prior Mixup for Verified Robust Voice Conversion
AAAI Conference on Artificial Intelligence (AAAI), 2023
Haram Choi
Sang-Hoon Lee
Seong-Whan Lee
DiffM
116
56
0
25 May 2023
mdctGAN: Taming transformer-based GAN for speech super-resolution with Modified DCT spectra
Interspeech (Interspeech), 2023
Chenhao Shuai
Chaohua Shi
Lu Gan
Hongqing Liu
156
15
0
18 May 2023
A Survey on Audio Diffusion Models: Text To Speech Synthesis and Enhancement in Generative AI
Chenshuang Zhang
Chaoning Zhang
Sheng Zheng
Mengchun Zhang
Maryam Qamar
Sung-Ho Bae
In So Kweon
DiffM
MedIm
212
104
0
23 Mar 2023
TRACT: Denoising Diffusion Models with Transitive Closure Time-Distillation
David Berthelot
Arnaud Autef
Jierui Lin
Dian Ang Yap
Shuangfei Zhai
Siyuan Hu
Daniel Zheng
Walter Talbot
Eric Gu
DiffM
221
114
0
07 Mar 2023
StoRM: A Diffusion-based Stochastic Regeneration Model for Speech Enhancement and Dereverberation
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022
Jean-Marie Lemercier
Julius Richter
Simon Welker
Timo Gerkmann
DiffM
403
156
0
22 Dec 2022
AERO: Audio Super Resolution in the Spectral Domain
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Moshe Mandel
Or Tal
Yossi Adi
145
45
0
22 Nov 2022
PhaseAug: A Differentiable Augmentation for Speech Synthesis to Simulate One-to-Many Mapping
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Junhyeok Lee
Seungu Han
Hyunjae Cho
Wonbin Jung
113
13
0
08 Nov 2022
Analysing Diffusion-based Generative Approaches versus Discriminative Approaches for Speech Restoration
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Jean-Marie Lemercier
Julius Richter
Simon Welker
Timo Gerkmann
DiffM
253
48
0
04 Nov 2022
Diffusion-based Generative Speech Source Separation
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Robin Scheibler
Youna Ji
Soo-Whan Chung
J. Byun
Soyeon Choe
Min-Seok Choi
DiffM
318
60
0
31 Oct 2022
Nonparallel High-Quality Audio Super Resolution with Domain Adaptation and Resampling CycleGANs
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Reo Yoneyama
Ryuichi Yamamoto
Kentaro Tachibana
118
9
0
28 Oct 2022
Conditioning and Sampling in Variational Diffusion Models for Speech Super-Resolution
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Chin-Yun Yu
Sung-Lin Yeh
Gyorgy Fazekas
Hao Tang
DiffM
119
31
0
27 Oct 2022
Solving Audio Inverse Problems with a Diffusion Model
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Eloi Moliner
J. Lehtinen
Vesa Valimaki
DiffM
283
73
0
27 Oct 2022
Avocodo: Generative Adversarial Network for Artifact-free Vocoder
AAAI Conference on Artificial Intelligence (AAAI), 2022
Taejun Bak
Junmo Lee
Hanbin Bae
Jinhyeok Yang
Jaesung Bae
Young-Sun Joo
209
41
0
27 Jun 2022
1