Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1802.06182
Cited By
CREPE: A Convolutional Representation for Pitch Estimation
17 February 2018
Jong Wook Kim
Justin Salamon
P. Li
J. P. Bello
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"CREPE: A Convolutional Representation for Pitch Estimation"
50 / 160 papers shown
Title
Music Source Separation Based on a Lightweight Deep Learning Framework (DTTNET: DUAL-PATH TFC-TDF UNET)
Junyu Chen
Susmitha Vekkot
Pancham Shukla
75
7
0
15 Sep 2023
DDSP-SFX: Acoustically-guided sound effects generation with differentiable digital signal processing
Yunyi Liu
Craig Jin
David Gunawan
26
4
0
14 Sep 2023
DDSP-based Neural Waveform Synthesis of Polyphonic Guitar Performance from String-wise MIDI Input
Nicolas Jonason
Xin Eric Wang
Erica Cooper
Lauri Juvela
Bob L. T. Sturm
Junichi Yamagishi
71
1
0
14 Sep 2023
EnCodecMAE: Leveraging neural codecs for universal audio representation learning
L. Pepino
Pablo Riera
Luciana Ferrer
76
5
0
14 Sep 2023
PESTO: Pitch Estimation with Self-supervised Transposition-equivariant Objective
Alain Riou
Stefan Lattner
Gaëtan Hadjeres
Geoffroy Peeters
65
18
0
05 Sep 2023
A Review of Differentiable Digital Signal Processing for Music & Speech Synthesis
B. Hayes
Jordie Shier
Gyorgy Fazekas
Andrew Mcpherson
C. Saitis
83
25
0
29 Aug 2023
Real-time Detection of AI-Generated Speech for DeepFake Voice Conversion
Jordan J. Bird
Ahmad Lotfi
55
19
0
24 Aug 2023
Towards Improving Harmonic Sensitivity and Prediction Stability for Singing Melody Extraction
Keren Shao
Kai Chen
Taylor Berg-Kirkpatrick
Shlomo Dubnov
61
2
0
04 Aug 2023
Finding Tori: Self-supervised Learning for Analyzing Korean Folk Song
Danbinaerin Han
Rafael Caro Repetto
Dasaem Jeong
60
4
0
04 Aug 2023
Interpretable Timbre Synthesis using Variational Autoencoders Regularized on Timbre Descriptors
Anastasia Natsiou
Luca Longo
Seán O'Leary
104
0
0
18 Jul 2023
RMVPE: A Robust Model for Vocal Pitch Estimation in Polyphonic Music
Haojie Wei
Xueke Cao
Tangpeng Dan
Yueguo Chen
90
24
0
27 Jun 2023
PrimaDNN': A Characteristics-aware DNN Customization for Singing Technique Detection
Yuya Yamamoto
Juhan Nam
Hiroko Terasawa
27
1
0
25 Jun 2023
Speaker Embeddings as Individuality Proxy for Voice Stress Detection
Zihan Wu
Neil Scheidwasser
Karl El Hajal
Milos Cernak
68
3
0
09 Jun 2023
JEPOO: Highly Accurate Joint Estimation of Pitch, Onset and Offset for Music Information Retrieval
Haojie Wei
Jun Yuan
Rui Zhang
Yueguo Chen
Gang Wang
86
4
0
02 Jun 2023
Harmonic enhancement using learnable comb filter for light-weight full-band speech enhancement model
Xiaohuai Le
Tong Lei
Li Chen
Yiqing Guo
Chao-Peng He
...
Hua-Jing Gao
Yijian Xiao
Piao Ding
Shenyi Song
Jing Lu
58
5
0
01 Jun 2023
Diverse and Expressive Speech Prosody Prediction with Denoising Diffusion Probabilistic Model
Xiang Li
Songxiang Liu
Max W. Y. Lam
Zhiyong Wu
Chao Weng
Helen Meng
DiffM
134
5
0
26 May 2023
NAS-FM: Neural Architecture Search for Tunable and Interpretable Sound Synthesis based on Frequency Modulation
Zhe Ye
Wei Xue
Xuejiao Tan
Qi-fei Liu
Yi-Ting Guo
79
2
0
22 May 2023
Learn to Sing by Listening: Building Controllable Virtual Singer by Unsupervised Learning from Voice Recordings
Wei Xue
Yiwen Wang
Qi-fei Liu
Yi-Ting Guo
77
1
0
09 May 2023
Pitch Estimation by Denoising Preprocessor and Hybrid Estimation Model
Yung-Cheng Hung
Ping-Hung Chen
Jian-Jiun Ding
23
0
0
06 May 2023
A multimodal dynamical variational autoencoder for audiovisual speech representation learning
Samir Sadok
Simon Leglaive
Laurent Girin
Xavier Alameda-Pineda
Renaud Séguier
102
13
0
05 May 2023
Deep Audio-Visual Singing Voice Transcription based on Self-Supervised Learning Models
Xiangming Gu
Weizhen Zeng
Jianan Zhang
Longshen Ou
Ye Wang
100
6
0
24 Apr 2023
Looking Similar, Sounding Different: Leveraging Counterfactual Cross-Modal Pairs for Audiovisual Representation Learning
Nikhil Singh
Chih-Wei Wu
Iroro Orife
Mahdi M. Kalayeh
114
2
0
12 Apr 2023
Automatic Detection of Reactions to Music via Earable Sensing
Euihyoek Lee
Chulhong Min
Jeaseung Lee
Jin Yu
Seungwoo Kang
23
0
0
06 Apr 2023
Cross-domain Neural Pitch and Periodicity Estimation
Max Morrison
Caedon Hsieh
Nathan Pruyne
Bryan Pardo
84
18
0
28 Jan 2023
FretNet: Continuous-Valued Pitch Contour Streaming for Polyphonic Guitar Tablature Transcription
Frank Cwitkowitz
T. Hirvonen
Anssi Klapuri
74
3
0
06 Dec 2022
Learning General Audio Representations with Large-Scale Training of Patchout Audio Transformers
Khaled Koutini
Shahed Masoudian
Florian Schmid
Hamid Eghbalzadeh
Jan Schluter
Gerhard Widmer
144
6
0
25 Nov 2022
Can Knowledge of End-to-End Text-to-Speech Models Improve Neural MIDI-to-Audio Synthesis Systems?
Xuan Shi
Erica Cooper
Xin Wang
Junichi Yamagishi
Shrikanth Narayanan
69
1
0
25 Nov 2022
NANSY++: Unified Voice Synthesis with Neural Analysis and Synthesis
Hyeong-Seok Choi
Jinhyeok Yang
Juheon Lee
Hyeongju Kim
85
46
0
17 Nov 2022
A unified one-shot prosody and speaker conversion system with self-supervised discrete speech units
Li-Wei Chen
Shinji Watanabe
Alexander I. Rudnicky
77
7
0
12 Nov 2022
PhaseAug: A Differentiable Augmentation for Speech Synthesis to Simulate One-to-Many Mapping
Junhyeok Lee
Seungu Han
Hyunjae Cho
Wonbin Jung
51
12
0
08 Nov 2022
Understanding Acoustic Patterns of Human Teachers Demonstrating Manipulation Tasks to Robots
Akanksha Saran
K. Desai
M. L. Chang
Rudolf Lioutikov
A. Thomaz
S. Niekum
49
3
0
01 Nov 2022
Analysis and Detection of Singing Techniques in Repertoires of J-POP Solo Singers
Yuya Yamamoto
Juhan Nam
Hiroko Terasawa
29
5
0
31 Oct 2022
Learning Music Representations with wav2vec 2.0
Alessandro Ragano
Emmanouil Benetos
Andrew Hines
77
9
0
27 Oct 2022
A Fast and Accurate Pitch Estimation Algorithm Based on the Pseudo Wigner-Ville Distribution
Yisi Liu
Peter Wu
A. Black
Gopala K. Anumanchipalli
37
4
0
27 Oct 2022
Audio Barlow Twins: Self-Supervised Audio Representation Learning
Jonah Anton
H. Coppock
Pancham Shukla
Bjorn W. Schuller
BDL
SSL
83
8
0
28 Sep 2022
The Efficacy of Self-Supervised Speech Models for Audio Representations
Tung-Yu Wu
Chen-An Li
Tzu-Han Lin
Tsung-Yuan Hsu
Hung-yi Lee
66
5
0
26 Sep 2022
Deep Speech Synthesis from Articulatory Representations
Peter Wu
Shinji Watanabe
Louis Goldstein
A. Black
Gopala K. Anumanchipalli
78
26
0
13 Sep 2022
Leveraging Symmetrical Convolutional Transformer Networks for Speech to Singing Voice Style Transfer
Shrutina Agarwal
Sriram Ganapathy
Naoya Takahashi
36
4
0
26 Aug 2022
"Melatonin": A Case Study on AI-induced Musical Style
Emmanuel Deruty
M. Grachten
78
4
0
18 Aug 2022
DDX7: Differentiable FM Synthesis of Musical Instrument Sounds
Franco Caspe
Andrew Mcpherson
Mark Sandler
72
30
0
12 Aug 2022
DDSP-based Singing Vocoders: A New Subtractive-based Synthesizer and A Comprehensive Evaluation
Da-Yi Wu
Wen-Yi Hsiao
Fu-Rong Yang
Oscar D. Friedman
Warren Jackson
Scott Bruzenak
Yi-Wen Liu
Yi-Hsuan Yang
DiffM
115
24
0
09 Aug 2022
Comparing Conventional Pitch Detection Algorithms with a Neural Network Approach
Anja Kroon
43
1
0
29 Jun 2022
BYOL-S: Learning Self-supervised Speech Representations by Bootstrapping
Gasser Elbanna
Neil Scheidwasser
M. Kegler
P. Beckmann
Karl El Hajal
Milos Cernak
SSL
94
23
0
24 Jun 2022
Universal Speech Enhancement with Score-based Diffusion
Joan Serrà
Santiago Pascual
Jordi Pons
R. O. Araz
D. Scaini
DiffM
114
105
0
07 Jun 2022
Towards Robust Unsupervised Disentanglement of Sequential Data -- A Case Study Using Music Audio
Yin-Jyun Luo
Sebastian Ewert
S. Dixon
CoGe
106
12
0
12 May 2022
HarmoF0: Logarithmic Scale Dilated Convolution For Pitch Estimation
Weixing Wei
P. Li
Yi Yu
Wei Li
51
17
0
02 May 2022
Masked Spectrogram Modeling using Masked Autoencoders for Learning General-purpose Audio Representation
Daisuke Niizumi
Daiki Takeuchi
Yasunori Ohishi
Noboru Harada
K. Kashino
94
69
0
26 Apr 2022
Learning and controlling the source-filter representation of speech with a variational autoencoder
Samir Sadok
Simon Leglaive
Laurent Girin
Xavier Alameda-Pineda
Renaud Séguier
SSL
DRL
BDL
119
14
0
14 Apr 2022
A Computational Analysis of Pitch Drift in Unaccompanied Solo Singing using DBSCAN Clustering
Sepideh Shafiei
S. Hakam
22
0
0
03 Apr 2022
Measuring pitch extractors' response to frequency-modulated multi-component signals
Hideki Kawahara
Kohei Yatabe
Ken-Ichi Sakakibara
Tatsuya Kitamura
Hideki Banno
Masanori Morise
29
0
0
02 Apr 2022
Previous
1
2
3
4
Next