CRASH: Raw Audio Score-based Generative Modeling for Controllable High-resolution Drum Sound Synthesis

International Society for Music Information Retrieval Conference (ISMIR), 2021
14 June 2021
Simon Rouard
Gaëtan Hadjeres
    DiffM

Papers citing "CRASH: Raw Audio Score-based Generative Modeling for Controllable High-resolution Drum Sound Synthesis"

35 / 35 papers shown
An Octave-based Multi-Resolution CQT Architecture for Diffusion-based Audio Generation
Maurício do V. M. da Costa
Eloi Moliner
DiffM
170
1
0
20 Sep 2025
Audio Generation Through Score-Based Generative Modeling: Design Principles and Implementation
Ge Zhu
Yutong Wen
Zhiyao Duan
DiffM
MedIm
241
3
0
10 Jun 2025
The Inverse Drum Machine: Source Separation Through Joint Transcription and Analysis-by-Synthesis
Bernardo Torres
Geoffroy Peeters
G. Richard
286
0
0
06 May 2025
DOSE : Drum One-Shot Extraction from Music Mixture
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2025
Suntae Hwang
Seonghyeon Kang
Kyungsu Kim
Semin Ahn
K. Lee
210
1
0
25 Apr 2025
Scaling Transformers for Low-Bitrate High-Quality Speech Coding
Julian Parker
Anton Smirnov
Jordi Pons
CJ Carr
Zack Zukowski
Zach Evans
Xubo Liu
308
54
0
29 Nov 2024
Improving Musical Accompaniment Co-creation via Diffusion Transformers
J. Nistal
Marco Pasini
Stefan Lattner
150
10
0
30 Oct 2024
MambaFoley: Foley Sound Generation using Selective State-Space Models
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024
Marco Furio Colombo
Francesca Ronchini
Luca Comanducci
Fabio Antonacci
Mamba
369
5
0
13 Sep 2024
Audio Conditioning for Music Generation via Discrete Bottleneck Features
Simon Rouard
Yossi Adi
Jade Copet
Axel Roebel
Alexandre Défossez
MGen
319
6
0
17 Jul 2024
Pictures Of MIDI: Controlled Music Generation via Graphical Prompts for Image-Based Diffusion Inpainting
Scott H. Hawley
232
2
0
01 Jul 2024
Diff-A-Riff: Musical Accompaniment Co-creation via Latent Diffusion Models
J. Nistal
Marco Pasini
Cyran Aouameur
M. Grachten
Stefan Lattner
DiffM
316
39
0
12 Jun 2024
SYMPLEX: Controllable Symbolic Music Generation using Simplex Diffusion with Vocabulary Priors
Nicolas Jonason
Luca Casini
Bob L. T. Sturm
210
1
0
21 May 2024
Detecting Out-Of-Distribution Earth Observation Images with Diffusion Models
Georges Le Bellier
Nicolas Audebert
272
11
0
19 Apr 2024
Long-form music generation with latent diffusion
Zach Evans
Julian Parker
CJ Carr
Zack Zukowski
Josiah Taylor
Jordi Pons
MGen
DiffM
308
86
0
16 Apr 2024
Fast Timing-Conditioned Latent Audio Diffusion
Zach Evans
CJ Carr
Josiah Taylor
Scott H. Hawley
Jordi Pons
DiffM
514
192
0
07 Feb 2024
T-FOLEY: A Controllable Waveform-Domain Diffusion Model for Temporal-Event-Guided Foley Sound Synthesis
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024
Yoonjin Chung
Junwon Lee
Juhan Nam
170
22
0
17 Jan 2024
Zero-Shot Duet Singing Voices Separation with Diffusion Models
Chin-Yun Yu
Emilian Postolache
Emanuele Rodolà
Gyorgy Fazekas
DiffM
182
6
0
13 Nov 2023
Controllable Music Production with Diffusion Models and Guidance Gradients
Mark Levy
Bruno Di Giorgi
Floris Weers
Angelos Katharopoulos
Tom Nickson
DiffM
237
33
0
01 Nov 2023
Differentiable Modelling of Percussive Audio with Transient and Spectral Synthesis
Jordie Shier
Franco Caspe
Andrew Robertson
Mark Sandler
C. Saitis
Andrew Mcpherson
201
4
0
13 Sep 2023
A Review of Differentiable Digital Signal Processing for Music & Speech Synthesis
Frontiers in Signal Processing (FSP), 2023
B. Hayes
Jordie Shier
Gyorgy Fazekas
Andrew Mcpherson
C. Saitis
240
41
0
29 Aug 2023
The Ethical Implications of Generative Audio Models: A Systematic Literature Review
AAAI/ACM Conference on AI, Ethics, and Society (AIES), 2023
J. Barnett
252
48
0
07 Jul 2023
UnDiff: Unsupervised Voice Restoration with Unconditional Diffusion Model
Interspeech, 2023
A. Iashchenko
Pavel Andreev
Ivan Shchekotov
Nicholas Babaev
Dmitry Vetrov
DiffM
335
6
0
01 Jun 2023
A Survey on Audio Diffusion Models: Text To Speech Synthesis and Enhancement in Generative AI
Chenshuang Zhang
Chaoning Zhang
Sheng Zheng
Mengchun Zhang
Maryam Qamar
Sung-Ho Bae
In So Kweon
DiffM
MedIm
268
105
0
23 Mar 2023
A Complete Survey on Generative AI (AIGC): Is ChatGPT from GPT-4 to GPT-5 All You Need?
Chaoning Zhang
Chenshuang Zhang
Sheng Zheng
Yu Qiao
Chenghao Li
...
Lik-Hang Lee
Yang Yang
Heng Tao Shen
In So Kweon
Choong Seon Hong
303
199
0
21 Mar 2023
Distribution Preserving Source Separation With Time Frequency Predictive Models
European Signal Processing Conference (EUSIPCO), 2023
Pedro J. Villasana T
J. Klejsa
Lars Villemoes
P. Hedelin
165
2
0
10 Mar 2023
Multi-Source Diffusion Models for Simultaneous Music Generation and Separation
International Conference on Learning Representations (ICLR), 2023
Giorgio Mariani
Irene Tallini
Emilian Postolache
Michele Mancusi
Luca Cosmo
Emanuele Rodolà
DiffM
574
65
0
04 Feb 2023
Msanii: High Fidelity Music Synthesis on a Shoestring Budget
Kinyugo Maina
185
9
0
16 Jan 2023
Protein Language Models and Structure Prediction: Connection and Progression
Bozhen Hu
Jun Xia
Jiangbin Zheng
Cheng Tan
Yufei Huang
Yongjie Xu
Stan Z. Li
210
45
0
30 Nov 2022
Solving Audio Inverse Problems with a Diffusion Model
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Eloi Moliner
J. Lehtinen
Vesa Valimaki
DiffM
380
74
0
27 Oct 2022
Full-band General Audio Synthesis with Score-based Diffusion
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Santiago Pascual
Gautam Bhattacharya
Chunghsin Yeh
Jordi Pons
Joan Serrà
DiffM
220
39
0
26 Oct 2022
Protein structure generation via folding diffusion
Nature Communications (Nat Commun), 2022
Kevin E. Wu
Kevin Kaichuang Yang
Rianne van den Berg
James Zou
Alex X. Lu
Ava P. Amini
DiffM
388
259
0
30 Sep 2022
Evaluating generative audio systems and their metrics
International Society for Music Information Retrieval Conference (ISMIR), 2022
Ashvala Vinay
Alexander Lerch
273
28
0
31 Aug 2022
DrumGAN VST: A Plugin for Drum Sound Analysis/Synthesis With Autoencoding Generative Adversarial Networks
J. Nistal
Cyran Aouameur
Ithan Velarde
Stefan Lattner
GAN
199
7
0
29 Jun 2022
Multi-instrument Music Synthesis with Spectrogram Diffusion
International Society for Music Information Retrieval Conference (ISMIR), 2022
Curtis Hawthorne
Ian Simon
Adam Roberts
Neil Zeghidour
Josh Gardner
Ethan Manilow
Jesse Engel
DiffM
238
56
0
11 Jun 2022
Universal Speech Enhancement with Score-based Diffusion
Joan Serrà
Santiago Pascual
Jordi Pons
R. O. Araz
D. Scaini
DiffM
366
129
0
07 Jun 2022
Differentiable Digital Signal Processing Mixture Model for Synthesis Parameter Extraction from Mixture of Harmonic Sounds
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Masaya Kawamura
Tomohiko Nakamura
Daichi Kitamura
Hiroshi Saruwatari
Yu Takahashi
Kazunobu Kondo
211
15
0
01 Feb 2022