NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates

Interspeech (Interspeech), 2022
17 June 2022
Seungu Han
Junhyeok Lee
    DiffM

Papers citing "NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates"

37 / 37 papers shown
Harmonic-Percussive Disentangled Neural Audio Codec for Bandwidth Extension
Benoît Giniès
Xiaoyu Bie
Olivier Fercoq
Gaël Richard
116
0
0
26 Nov 2025
UniverSR: Unified and Versatile Audio Super-Resolution via Vocoder-Free Flow Matching
Woongjib Choi
Sangmin Lee
Hyungseob Lim
Hong-Goo Kang
DiffM
SupR
145
0
0
01 Oct 2025
VoiceBridge: Designing Latent Bridge Models for General Speech Restoration at Scale
Chi Zhang
Zehua Chen
Kaiwen Zheng
Jun Zhu
AuLLM
150
0
0
28 Sep 2025
Audio Super-Resolution with Latent Bridge Models
Chang Li
Zehua Chen
Liyuan Wang
Jun Zhu
248
3
0
22 Sep 2025
Inference-time Scaling for Diffusion-based Audio Super-resolution
Yizhu Jin
Zhen Ye
Zeyue Tian
Haohe Liu
Qiuqiang Kong
Wenhan Luo
Wei Xue
DiffM
119
1
0
04 Aug 2025
Enhancing Spectrogram Realism in Singing Voice Synthesis via Explicit Bandwidth Extension Prior to Vocoder
Runxuan Yang
Kai Li
Guo Chen
Xiaolin Hu
81
0
0
03 Aug 2025
Neural Spectral Band Generation for Audio Coding
Woongjib Choi
Byeong Hyeon Kim
Hyungseob Lim
Inseon Jang
Hong-Goo Kang
140
0
0
07 Jun 2025
FLowHigh: Towards Efficient and High-Quality Audio Super-Resolution with Single-Step Flow Matching
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2025
Jun-Hak Yun
Seung-Bin Kim
Seong-Whan Lee
DiffM
97
7
0
10 Jan 2025
Vector Quantized Diffusion Model Based Speech Bandwidth Extension
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024
Yuan Fang
Jinglin Bai
Jiajie Wang
Xueliang Zhang
157
4
0
09 Sep 2024
PeriodWave: Multi-Period Flow Matching for High-Fidelity Waveform Generation
International Conference on Learning Representations (ICLR), 2024
Sang-Hoon Lee
Ha-Yeong Choi
Seong-Whan Lee
OOD
DiffM
AI4TS
217
11
0
14 Aug 2024
DSP-informed bandwidth extension using locally-conditioned excitation and linear time-varying filter subnetworks
S. Nercessian
Alexey Lukin
Johannes Imort
165
1
0
22 Jul 2024
Vibravox: A Dataset of French Speech Captured with Body-conduction Audio Sensors
J. Hauret
Malo Olivier
Thomas Joubaud
C. Langrenne
Sarah Poirée
V. Zimpfer
Éric Bavu
452
14
0
16 Jul 2024
Multi-Stage Speech Bandwidth Extension with Flexible Sampling Rate Control
Ye-Xin Lu
Yang Ai
Zheng-Yan Sheng
Zhen-Hua Ling
106
8
0
04 Jun 2024
Improving Diffusion Models's Data-Corruption Resistance using Scheduled Pseudo-Huber Loss
Artem Khrapov
Vadim Popov
Tasnima Sadekova
Assel Yermekova
Mikhail Kudinov
DiffM
171
3
0
25 Mar 2024
MusicHiFi: Fast High-Fidelity Stereo Vocoding
Ge Zhu
Juan-Pablo Caceres
Zhiyao Duan
Nicholas J. Bryan
DiffM
190
8
0
15 Mar 2024
Combined Generative and Predictive Modeling for Speech Super-resolution
Computer Speech and Language (CSL), 2024
Heming Wang
Eric W. Healy
DeLiang Wang
DiffM
188
3
0
25 Jan 2024
Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction
IEEE Transactions on Audio, Speech, and Language Processing (IEEE TASLP), 2024
Ye-Xin Lu
Yang Ai
Hui-Peng Du
Zhenhua Ling
165
25
0
12 Jan 2024
BAE-Net: A Low complexity and high fidelity Bandwidth-Adaptive neural network for speech super-resolution
Guochen Yu
Xiguang Zheng
Nan Li
Runqiang Han
C. Zheng
Chen Zhang
Chao Zhou
Qi Huang
Bin Yu
215
12
0
21 Dec 2023
HierSpeech++: Bridging the Gap between Semantic and Acoustic Representation of Speech by Hierarchical Variational Inference for Zero-shot Speech Synthesis
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2023
Sang-Hoon Lee
Haram Choi
Seung-Bin Kim
Seong-Whan Lee
BDL
293
57
0
21 Nov 2023
Super Denoise Net: Speech Super Resolution with Noise Cancellation in Low Sampling Rate Noisy Environments
Junkang Yang
Hongqing Liu
Lu Gan
Yi Zhou
201
1
0
09 Oct 2023
AudioSR: Versatile Audio Super-resolution at Scale
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Haohe Liu
Ke Chen
Qiao Tian
Wenwu Wang
Mark D. Plumbley
DiffM
114
58
0
13 Sep 2023
Progressive distillation diffusion for raw music generation
Svetlana Pavlova
DiffM
156
0
0
20 Jul 2023
Edge Storage Management Recipe with Zero-Shot Data Compression for Road Anomaly Detection
Information and Communication Technology Convergence (ICTC), 2023
Yeonghyeon Park
U. Gim
Myung Jin Kim
155
1
0
10 Jul 2023
Blind Audio Bandwidth Extension: A Diffusion-Based Zero-Shot Approach
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2023
Eloi Moliner
Filip Elvander
Vesa Valimaki
DiffM
199
20
0
02 Jun 2023
DDDM-VC: Decoupled Denoising Diffusion Models with Disentangled Representation and Prior Mixup for Verified Robust Voice Conversion
AAAI Conference on Artificial Intelligence (AAAI), 2023
Haram Choi
Sang-Hoon Lee
Seong-Whan Lee
DiffM
116
56
0
25 May 2023
mdctGAN: Taming transformer-based GAN for speech super-resolution with Modified DCT spectra
Interspeech (Interspeech), 2023
Chenhao Shuai
Chaohua Shi
Lu Gan
Hongqing Liu
156
15
0
18 May 2023
A Survey on Audio Diffusion Models: Text To Speech Synthesis and Enhancement in Generative AI
Chenshuang Zhang
Chaoning Zhang
Sheng Zheng
Mengchun Zhang
Maryam Qamar
Sung-Ho Bae
In So Kweon
DiffM
MedIm
212
104
0
23 Mar 2023
TRACT: Denoising Diffusion Models with Transitive Closure Time-Distillation
David Berthelot
Arnaud Autef
Jierui Lin
Dian Ang Yap
Shuangfei Zhai
Siyuan Hu
Daniel Zheng
Walter Talbot
Eric Gu
DiffM
221
114
0
07 Mar 2023
StoRM: A Diffusion-based Stochastic Regeneration Model for Speech Enhancement and Dereverberation
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022
Jean-Marie Lemercier
Julius Richter
Simon Welker
Timo Gerkmann
DiffM
403
156
0
22 Dec 2022
AERO: Audio Super Resolution in the Spectral Domain
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Moshe Mandel
Or Tal
Yossi Adi
145
45
0
22 Nov 2022
PhaseAug: A Differentiable Augmentation for Speech Synthesis to Simulate One-to-Many Mapping
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Junhyeok Lee
Seungu Han
Hyunjae Cho
Wonbin Jung
113
13
0
08 Nov 2022
Analysing Diffusion-based Generative Approaches versus Discriminative Approaches for Speech Restoration
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Jean-Marie Lemercier
Julius Richter
Simon Welker
Timo Gerkmann
DiffM
253
48
0
04 Nov 2022
Diffusion-based Generative Speech Source Separation
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Robin Scheibler
Youna Ji
Soo-Whan Chung
J. Byun
Soyeon Choe
Min-Seok Choi
DiffM
318
60
0
31 Oct 2022
Nonparallel High-Quality Audio Super Resolution with Domain Adaptation and Resampling CycleGANs
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Reo Yoneyama
Ryuichi Yamamoto
Kentaro Tachibana
118
9
0
28 Oct 2022
Conditioning and Sampling in Variational Diffusion Models for Speech Super-Resolution
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Chin-Yun Yu
Sung-Lin Yeh
Gyorgy Fazekas
Hao Tang
DiffM
119
31
0
27 Oct 2022
Solving Audio Inverse Problems with a Diffusion Model
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Eloi Moliner
J. Lehtinen
Vesa Valimaki
DiffM
283
73
0
27 Oct 2022
Avocodo: Generative Adversarial Network for Artifact-free Vocoder
AAAI Conference on Artificial Intelligence (AAAI), 2022
Taejun Bak
Junmo Lee
Hanbin Bae
Jinhyeok Yang
Jaesung Bae
Young-Sun Joo
209
41
0
27 Jun 2022