ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2005.00341
  4. Cited By
Jukebox: A Generative Model for Music

Jukebox: A Generative Model for Music

30 April 2020
Prafulla Dhariwal
Heewoo Jun
Christine Payne
Jong Wook Kim
Alec Radford
Ilya Sutskever
    VLM
ArXivPDFHTML

Papers citing "Jukebox: A Generative Model for Music"

50 / 131 papers shown
Title
Generative AI for Cyber Threat-Hunting in 6G-enabled IoT Networks
Generative AI for Cyber Threat-Hunting in 6G-enabled IoT Networks
M. Ferrag
Merouane Debbah
Muna Al-Hawawreh
13
33
0
21 Mar 2023
DiffusionRet: Generative Text-Video Retrieval with Diffusion Model
DiffusionRet: Generative Text-Video Retrieval with Diffusion Model
Peng Jin
Hao Li
Ze-Long Cheng
Kehan Li
Xiang Ji
Chang-rui Liu
Li-ming Yuan
Jie Chen
DiffM
VGen
24
53
0
17 Mar 2023
A Comprehensive Survey of AI-Generated Content (AIGC): A History of
  Generative AI from GAN to ChatGPT
A Comprehensive Survey of AI-Generated Content (AIGC): A History of Generative AI from GAN to ChatGPT
Yihan Cao
Siyu Li
Yixin Liu
Zhiling Yan
Yutong Dai
Philip S. Yu
Lichao Sun
24
504
0
07 Mar 2023
A General Framework for Learning Procedural Audio Models of
  Environmental Sounds
A General Framework for Learning Procedural Audio Models of Environmental Sounds
Danzel Serrano
M. Cartwright
DiffM
DRL
17
1
0
04 Mar 2023
CHeart: A Conditional Spatio-Temporal Generative Model for Cardiac
  Anatomy
CHeart: A Conditional Spatio-Temporal Generative Model for Cardiac Anatomy
Mengyun Qiao
Shuo Wang
Huaqi Qiu
A. de Marvao
D. O’Regan
Daniel Rueckert
Wenjia Bai
MedIm
21
14
0
30 Jan 2023
SingSong: Generating musical accompaniments from singing
SingSong: Generating musical accompaniments from singing
Chris Donahue
Antoine Caillon
Adam Roberts
Ethan Manilow
P. Esling
...
Mauro Verzetti
Ian Simon
Olivier Pietquin
Neil Zeghidour
Jesse Engel
32
52
0
30 Jan 2023
Self-Supervised Learning for Data Scarcity in a Fatigue Damage
  Prognostic Problem
Self-Supervised Learning for Data Scarcity in a Fatigue Damage Prognostic Problem
A. Akrim
C. Gogu
R. Vingerhoeds
M. Salaün
AI4CE
15
23
0
20 Jan 2023
Msanii: High Fidelity Music Synthesis on a Shoestring Budget
Msanii: High Fidelity Music Synthesis on a Shoestring Budget
Kinyugo Maina
19
5
0
16 Jan 2023
Rock Guitar Tablature Generation via Natural Language Processing
Rock Guitar Tablature Generation via Natural Language Processing
Josue Casco-Rodriguez
18
1
0
12 Jan 2023
Traditional Classification Neural Networks are Good Generators: They are
  Competitive with DDPMs and GANs
Traditional Classification Neural Networks are Good Generators: They are Competitive with DDPMs and GANs
Guangrun Wang
Philip H. S. Torr
28
8
0
27 Nov 2022
Homology-constrained vector quantization entropy regularizer
Homology-constrained vector quantization entropy regularizer
Ivan O. Volkov
22
2
0
25 Nov 2022
Exploring the Efficacy of Pre-trained Checkpoints in Text-to-Music
  Generation Task
Exploring the Efficacy of Pre-trained Checkpoints in Text-to-Music Generation Task
Shangda Wu
Maosong Sun
14
20
0
21 Nov 2022
A Review of Intelligent Music Generation Systems
A Review of Intelligent Music Generation Systems
Lei Wang
Ziyi Zhao
Han Liu
Junwei Pang
Yi-qiang Qin
Qidi Wu
MGen
21
31
0
16 Nov 2022
I Hear Your True Colors: Image Guided Audio Generation
I Hear Your True Colors: Image Guided Audio Generation
Roy Sheffer
Yossi Adi
VLM
16
73
0
06 Nov 2022
Low-Resource Music Genre Classification with Cross-Modal Neural Model
  Reprogramming
Low-Resource Music Genre Classification with Cross-Modal Neural Model Reprogramming
Yun-Ning Hung
Chao-Han Huck Yang
Pin-Yu Chen
Alexander Lerch
21
17
0
02 Nov 2022
Audio Language Modeling using Perceptually-Guided Discrete
  Representations
Audio Language Modeling using Perceptually-Guided Discrete Representations
Felix Kreuk
Yaniv Taigman
Adam Polyak
Jade Copet
Gabriel Synnaeve
Alexandre Défossez
Yossi Adi
27
4
0
02 Nov 2022
Full-band General Audio Synthesis with Score-based Diffusion
Full-band General Audio Synthesis with Score-based Diffusion
Santiago Pascual
Gautam Bhattacharya
Chunghsin Yeh
Jordi Pons
Joan Serra
DiffM
19
33
0
26 Oct 2022
A Survey on Artificial Intelligence for Music Generation: Agents,
  Domains and Perspectives
A Survey on Artificial Intelligence for Music Generation: Agents, Domains and Perspectives
Carlos Hernandez-Olivan
Javier Hernandez-Olivan
J. R. Beltrán
MGen
32
6
0
25 Oct 2022
Modeling Animal Vocalizations through Synthesizers
Modeling Animal Vocalizations through Synthesizers
Masato Hagiwara
M. Cusimano
Jen-Yu Liu
25
4
0
19 Oct 2022
JukeDrummer: Conditional Beat-aware Audio-domain Drum Accompaniment
  Generation via Transformer VQ-VAE
JukeDrummer: Conditional Beat-aware Audio-domain Drum Accompaniment Generation via Transformer VQ-VAE
Yueh-Kao Wu
Ching-Yu Chiu
Yi-Hsuan Yang
ViT
19
14
0
12 Oct 2022
Rhythmic Gesticulator: Rhythm-Aware Co-Speech Gesture Synthesis with
  Hierarchical Neural Embeddings
Rhythmic Gesticulator: Rhythm-Aware Co-Speech Gesture Synthesis with Hierarchical Neural Embeddings
Tenglong Ao
Qingzhe Gao
Yuke Lou
Baoquan Chen
Libin Liu
SLR
25
59
0
04 Oct 2022
Deep Generative Multimedia Children's Literature
Deep Generative Multimedia Children's Literature
Matthew Lyle Olson
11
0
0
27 Sep 2022
Learning to Learn with Generative Models of Neural Network Checkpoints
Learning to Learn with Generative Models of Neural Network Checkpoints
William S. Peebles
Ilija Radosavovic
Tim Brooks
Alexei A. Efros
Jitendra Malik
UQCV
73
64
0
26 Sep 2022
ControlVC: Zero-Shot Voice Conversion with Time-Varying Controls on
  Pitch and Speed
ControlVC: Zero-Shot Voice Conversion with Time-Varying Controls on Pitch and Speed
Mei-Shuo Chen
Z. Duan
22
10
0
23 Sep 2022
AudioLM: a Language Modeling Approach to Audio Generation
AudioLM: a Language Modeling Approach to Audio Generation
Zalan Borsos
Raphaël Marinier
Damien Vincent
Eugene Kharitonov
Olivier Pietquin
...
Dominik Roblek
O. Teboul
David Grangier
Marco Tagliasacchi
Neil Zeghidour
AuLLM
28
566
0
07 Sep 2022
Equivariant Self-Supervision for Musical Tempo Estimation
Equivariant Self-Supervision for Musical Tempo Estimation
Elio Quinton
30
9
0
03 Sep 2022
GAFX: A General Audio Feature eXtractor
GAFX: A General Audio Feature eXtractor
Zhaoyang Bu
Han Zhang
Xiaohu Zhu
28
0
0
19 Jul 2022
Stochastic Restoration of Heavily Compressed Musical Audio using
  Generative Adversarial Networks
Stochastic Restoration of Heavily Compressed Musical Audio using Generative Adversarial Networks
Stefan Lattner
J. Nistal
27
11
0
04 Jul 2022
Co-creation and ownership for AI radio
Co-creation and ownership for AI radio
Skylar Gordon
Robert Mahari
Manaswi Mishra
Ziv Epstein
16
4
0
01 Jun 2022
Deep Learning and Synthetic Media
Deep Learning and Synthetic Media
Raphaël Millière
18
18
0
11 May 2022
Long Video Generation with Time-Agnostic VQGAN and Time-Sensitive
  Transformer
Long Video Generation with Time-Agnostic VQGAN and Time-Sensitive Transformer
Songwei Ge
Thomas Hayes
Harry Yang
Xiaoyue Yin
Guan Pang
David Jacobs
Jia-Bin Huang
Devi Parikh
ViT
43
214
0
07 Apr 2022
Forensic Analysis and Localization of Multiply Compressed MP3 Audio
  Using Transformers
Forensic Analysis and Localization of Multiply Compressed MP3 Audio Using Transformers
Ziyue Xiang
Paolo Bestagini
Stefano Tubaro
Edward J. Delp
15
10
0
30 Mar 2022
Bailando: 3D Dance Generation by Actor-Critic GPT with Choreographic
  Memory
Bailando: 3D Dance Generation by Actor-Critic GPT with Choreographic Memory
Lian Siyao
Weijiang Yu
Tianpei Gu
Chunze Lin
Quan Wang
Chao Qian
Chen Change Loy
Ziwei Liu
SLR
26
183
0
24 Mar 2022
Autoregressive Image Generation using Residual Quantization
Autoregressive Image Generation using Residual Quantization
Doyup Lee
Chiheon Kim
Saehoon Kim
Minsu Cho
Wook-Shin Han
VGen
170
325
0
03 Mar 2022
Generating Videos with Dynamics-aware Implicit Generative Adversarial
  Networks
Generating Videos with Dynamics-aware Implicit Generative Adversarial Networks
Sihyun Yu
Jihoon Tack
Sangwoo Mo
Hyunsu Kim
Junho Kim
Jung-Woo Ha
Jinwoo Shin
DiffM
VGen
18
199
0
21 Feb 2022
General-purpose, long-context autoregressive modeling with Perceiver AR
General-purpose, long-context autoregressive modeling with Perceiver AR
Curtis Hawthorne
Andrew Jaegle
Cătălina Cangea
Sebastian Borgeaud
C. Nash
...
Hannah R. Sheahan
Neil Zeghidour
Jean-Baptiste Alayrac
João Carreira
Jesse Engel
35
65
0
15 Feb 2022
Dual Learning Music Composition and Dance Choreography
Dual Learning Music Composition and Dance Choreography
Shuang Wu
Zhenguang Liu
Shijian Lu
Li Cheng
16
8
0
28 Jan 2022
CM3: A Causal Masked Multimodal Model of the Internet
CM3: A Causal Masked Multimodal Model of the Internet
Armen Aghajanyan
Po-Yao (Bernie) Huang
Candace Ross
Vladimir Karpukhin
Hu Xu
...
Dmytro Okhonko
Mandar Joshi
Gargi Ghosh
M. Lewis
Luke Zettlemoyer
15
154
0
19 Jan 2022
Efficient Large Scale Language Modeling with Mixtures of Experts
Efficient Large Scale Language Modeling with Mixtures of Experts
Mikel Artetxe
Shruti Bhosale
Naman Goyal
Todor Mihaylov
Myle Ott
...
Jeff Wang
Luke Zettlemoyer
Mona T. Diab
Zornitsa Kozareva
Ves Stoyanov
MoE
50
188
0
20 Dec 2021
Soundify: Matching Sound Effects to Video
Soundify: Matching Sound Effects to Video
David Chuan-En Lin
Anastasis Germanidis
Cristobal Valenzuela
Yining Shi
Nikolas Martelaro
25
16
0
17 Dec 2021
Unsupervised Source Separation By Steering Pretrained Music Models
Unsupervised Source Separation By Steering Pretrained Music Models
Ethan Manilow
P. O'Reilly
Prem Seetharaman
Bryan Pardo
8
2
0
25 Oct 2021
Discrete Acoustic Space for an Efficient Sampling in Neural
  Text-To-Speech
Discrete Acoustic Space for an Efficient Sampling in Neural Text-To-Speech
Mu-Wei Li
Jonas Rohnke
A. Bonafonte
Mateusz Lajszczak
Trevor Wood
DRL
17
2
0
24 Oct 2021
Deep Generative Models in Engineering Design: A Review
Deep Generative Models in Engineering Design: A Review
Lyle Regenwetter
A. Nobari
Faez Ahmed
3DV
AI4CE
24
175
0
21 Oct 2021
Taming Visually Guided Sound Generation
Taming Visually Guided Sound Generation
Vladimir E. Iashin
Esa Rahtu
VLM
28
120
0
17 Oct 2021
KaraSinger: Score-Free Singing Voice Synthesis with VQ-VAE using
  Mel-spectrograms
KaraSinger: Score-Free Singing Voice Synthesis with VQ-VAE using Mel-spectrograms
Chien-Feng Liao
Jen-Yu Liu
Yi-Hsuan Yang
19
5
0
08 Oct 2021
ATISS: Autoregressive Transformers for Indoor Scene Synthesis
ATISS: Autoregressive Transformers for Indoor Scene Synthesis
Despoina Paschalidou
Amlan Kar
Maria Shugrina
Karsten Kreis
Andreas Geiger
Sanja Fidler
3DV
ViT
29
148
0
07 Oct 2021
Attention is All You Need? Good Embeddings with Statistics are
  enough:Large Scale Audio Understanding without Transformers/ Convolutions/
  BERTs/ Mixers/ Attention/ RNNs or ....
Attention is All You Need? Good Embeddings with Statistics are enough:Large Scale Audio Understanding without Transformers/ Convolutions/ BERTs/ Mixers/ Attention/ RNNs or ....
Prateek Verma
AI4TS
24
2
0
07 Oct 2021
Style Equalization: Unsupervised Learning of Controllable Generative
  Sequence Models
Style Equalization: Unsupervised Learning of Controllable Generative Sequence Models
Jen-Hao Rick Chang
A. Shrivastava
H. Koppula
Xiaoshuai Zhang
Oncel Tuzel
DiffM
48
16
0
06 Oct 2021
Controllable deep melody generation via hierarchical music structure
  representation
Controllable deep melody generation via hierarchical music structure representation
Shuqi Dai
Zeyu Jin
Celso Gomes
Roger B. Dannenberg
MGen
14
51
0
02 Sep 2021
AccoMontage: Accompaniment Arrangement via Phrase Selection and Style
  Transfer
AccoMontage: Accompaniment Arrangement via Phrase Selection and Style Transfer
Jingwei Zhao
Gus Xia
16
26
0
25 Aug 2021
Previous
123
Next