ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1802.06182
  4. Cited By
CREPE: A Convolutional Representation for Pitch Estimation

CREPE: A Convolutional Representation for Pitch Estimation

17 February 2018
Jong Wook Kim
Justin Salamon
P. Li
J. P. Bello
ArXiv (abs)PDFHTML

Papers citing "CREPE: A Convolutional Representation for Pitch Estimation"

50 / 160 papers shown
Title
Evolving music theory for emerging musical languages
Evolving music theory for emerging musical languages
Emmanuel Deruty
34
0
0
17 Jun 2025
A Variational Framework for Improving Naturalness in Generative Spoken Language Models
A Variational Framework for Improving Naturalness in Generative Spoken Language Models
Li-Wei Chen
Takuya Higuchi
Zakaria Aldeneh
Ahmed Hussen Abdelaziz
Alexander I. Rudnicky
36
0
0
17 Jun 2025
Methods for pitch analysis in contemporary popular music: multiple pitches from harmonic tones in Vitalic's music
Methods for pitch analysis in contemporary popular music: multiple pitches from harmonic tones in Vitalic's music
Emmanuel Deruty
David Meredith
M. Grachten
Pascal Arbez-Nicolas
Andreas Hasselholt Jørgensen
Oliver Søndermølle Hansen
Magnus Stensli
Christian Nørkær Petersen
34
1
0
14 Jun 2025
RT-VC: Real-Time Zero-Shot Voice Conversion with Speech Articulatory Coding
RT-VC: Real-Time Zero-Shot Voice Conversion with Speech Articulatory Coding
Yisi Liu
Chenyang Wang
Hanjo Kim
Raniya Khan
Gopala Anumanchipalli
107
0
0
12 Jun 2025
Methods for pitch analysis in contemporary popular music: Vitalic's use of tones that do not operate on the principle of acoustic resonance
Methods for pitch analysis in contemporary popular music: Vitalic's use of tones that do not operate on the principle of acoustic resonance
Emmanuel Deruty
Pascal Arbez-Nicolas
David Meredith
21
2
0
08 Jun 2025
When Humans Growl and Birds Speak: High-Fidelity Voice Conversion from Human to Animal and Designed Sounds
When Humans Growl and Birds Speak: High-Fidelity Voice Conversion from Human to Animal and Designed Sounds
Minsu Kang
Seolhee Lee
Choonghyeon Lee
Namhyun Cho
VLM
28
0
0
30 May 2025
Self-supervised learning method using multiple sampling strategies for general-purpose audio representation
Self-supervised learning method using multiple sampling strategies for general-purpose audio representation
Ibuki Kuroyanagi
Tatsuya Komatsu
SSL
24
2
0
25 May 2025
Neurodyne: Neural Pitch Manipulation with Representation Learning and Cycle-Consistency GAN
Neurodyne: Neural Pitch Manipulation with Representation Learning and Cycle-Consistency GAN
Yicheng Gu
Chaoren Wang
Zhizheng Wu
Lauri Juvela
121
1
0
21 May 2025
ELGAR: Expressive Cello Performance Motion Generation for Audio Rendition
ELGAR: Expressive Cello Performance Motion Generation for Audio Rendition
Zhiping Qiu
Yitong Jin
Yijiao Wang
Yi Shi
Changbo Wang
Chao Tan
Xiaobing Li
Feng Yu
Tao Yu
Qionghai Dai
68
0
0
07 May 2025
Real-Time Pitch/F0 Detection Using Spectrogram Images and Convolutional Neural Networks
Real-Time Pitch/F0 Detection Using Spectrogram Images and Convolutional Neural Networks
Xufang Zhao
Omer Tsimhoni
66
0
0
08 Apr 2025
SupertonicTTS: Towards Highly Scalable and Efficient Text-to-Speech System
SupertonicTTS: Towards Highly Scalable and Efficient Text-to-Speech System
Hyeongju Kim
Jinhyeok Yang
Yechan Yu
Seunghun Ji
Jacob Morton
Frederik Bous
Joon Byun
Juheon Lee
149
0
0
29 Mar 2025
Pitch Contour Exploration Across Audio Domains: A Vision-Based Transfer Learning Approach
Pitch Contour Exploration Across Audio Domains: A Vision-Based Transfer Learning Approach
J. Abeßer
Siyang Song
Meinard Muller
70
0
0
24 Mar 2025
Designing Neural Synthesizers for Low-Latency Interaction
Designing Neural Synthesizers for Low-Latency Interaction
Franco Caspe
Jordie Shier
Mark Sandler
C. Saitis
Andrew Mcpherson
437
0
0
14 Mar 2025
ReelWave: Multi-Agentic Movie Sound Generation through Multimodal LLM Conversation
ReelWave: Multi-Agentic Movie Sound Generation through Multimodal LLM Conversation
Zixuan Wang
Chi-Keung Tang
Yu-Wing Tai
VGenDiffM
133
0
0
10 Mar 2025
AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder
AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder
Samir Sadok
Simon Leglaive
Laurent Girin
Gaël Richard
Xavier Alameda-Pineda
129
3
0
10 Jan 2025
A System for Melodic Harmonization using Schoenberg Regions, Giant Steps, and Church Modes
Frederick Fernandes
45
0
0
05 Jan 2025
The Sound of Water: Inferring Physical Properties from Pouring Liquids
Piyush Bagad
Makarand Tapaswi
Cees G. M. Snoek
Andrew Zisserman
189
0
0
18 Nov 2024
The Concatenator: A Bayesian Approach To Real Time Concatenative
  Musaicing
The Concatenator: A Bayesian Approach To Real Time Concatenative Musaicing
Christopher Tralie
Ben Cantil
41
0
0
07 Nov 2024
Automatic Estimation of Singing Voice Musical Dynamics
Automatic Estimation of Singing Voice Musical Dynamics
Jyoti Narang
Nazif Can Tamer
Viviana De La Vega
Xavier Serra
41
0
0
27 Oct 2024
Sound Check: Auditing Audio Datasets
Sound Check: Auditing Audio Datasets
William Agnew
Julia Barnett
Annie Chu
Rachel Hong
Michael Feffer
Robin Netzorg
Harry H. Jiang
Ezra Awumey
Sauvik Das
127
1
0
17 Oct 2024
Towards Computational Analysis of Pansori Singing
Towards Computational Analysis of Pansori Singing
Sangheon Park
Danbinaerin Han
Dasaem Jeong
30
0
0
16 Oct 2024
SiFiSinger: A High-Fidelity End-to-End Singing Voice Synthesizer based
  on Source-filter Model
SiFiSinger: A High-Fidelity End-to-End Singing Voice Synthesizer based on Source-filter Model
Jianwei Cui
Yu Gu
Chao Weng
Jie Zhang
Liping Chen
Lirong Dai
90
4
0
16 Oct 2024
Exploring synthetic data for cross-speaker style transfer in style
  representation based TTS
Exploring synthetic data for cross-speaker style transfer in style representation based TTS
Lucas Ueda
Leonardo B. de M. M. Marques
Flávio O. Simões
Mário Uliani Neto
Fernando Runstein
Bianca Dal Bó
Paula D. P. Costa
93
0
0
25 Sep 2024
Hierarchical Generative Modeling of Melodic Vocal Contours in Hindustani
  Classical Music
Hierarchical Generative Modeling of Melodic Vocal Contours in Hindustani Classical Music
N. Shikarpur
Krishna Maneesha Dendukuri
Yusong Wu
Antoine Caillon
Cheng-Zhi Anna Huang
37
1
0
22 Aug 2024
LCM-SVC: Latent Diffusion Model Based Singing Voice Conversion with
  Inference Acceleration via Latent Consistency Distillation
LCM-SVC: Latent Diffusion Model Based Singing Voice Conversion with Inference Acceleration via Latent Consistency Distillation
Shihao Chen
Yu Gu
Jianwei Cui
Jie Zhang
Rilin Chen
Lirong Dai
77
2
0
22 Aug 2024
Video-Foley: Two-Stage Video-To-Sound Generation via Temporal Event Condition For Foley Sound
Video-Foley: Two-Stage Video-To-Sound Generation via Temporal Event Condition For Foley Sound
Junwon Lee
Jaekwon Im
Dabin Kim
Juhan Nam
VGen
140
10
0
21 Aug 2024
DisMix: Disentangling Mixtures of Musical Instruments for Source-level
  Pitch and Timbre Manipulation
DisMix: Disentangling Mixtures of Musical Instruments for Source-level Pitch and Timbre Manipulation
Yin-Jyun Luo
K. Cheuk
Woosung Choi
Toshimitsu Uesaka
Keisuke Toyama
...
Chieh-Hsin Lai
Yuhta Takida
Wei-Hsiang Liao
Simon Dixon
Yuki Mitsufuji
CoGe
106
2
0
20 Aug 2024
MaskAnyone Toolkit: Offering Strategies for Minimizing Privacy Risks and
  Maximizing Utility in Audio-Visual Data Archiving
MaskAnyone Toolkit: Offering Strategies for Minimizing Privacy Risks and Maximizing Utility in Audio-Visual Data Archiving
B. Owoyele
Martin Schilling
Rohan Sawahn
Niklas Kaemer
Pavel Zherebenkov
Bhuvanesh Verma
Wim Pouw
Gerard de Melo
103
0
0
06 Aug 2024
Differentiable Modal Synthesis for Physical Modeling of Planar String
  Sound and Motion Simulation
Differentiable Modal Synthesis for Physical Modeling of Planar String Sound and Motion Simulation
J. Lee
Jaehyun Park
Min Jun Choi
Kyogu Lee
81
2
0
07 Jul 2024
YourMT3+: Multi-instrument Music Transcription with Enhanced Transformer
  Architectures and Cross-dataset Stem Augmentation
YourMT3+: Multi-instrument Music Transcription with Enhanced Transformer Architectures and Cross-dataset Stem Augmentation
Sungkyun Chang
Emmanouil Benetos
Holger Kirchhoff
Simon Dixon
92
3
0
05 Jul 2024
Who Finds This Voice Attractive? A Large-Scale Experiment Using
  In-the-Wild Data
Who Finds This Voice Attractive? A Large-Scale Experiment Using In-the-Wild Data
Hitoshi Suda
Aya Watanabe
Shinnosuke Takamichi
65
0
0
05 Jul 2024
Machine Learning Techniques in Automatic Music Transcription: A
  Systematic Survey
Machine Learning Techniques in Automatic Music Transcription: A Systematic Survey
Fatemeh Jamshidi
Gary Pike
Amit Das
Richard Chapman
58
4
0
20 Jun 2024
Articulatory Encodec: Coding Speech through Vocal Tract Kinematics
Articulatory Encodec: Coding Speech through Vocal Tract Kinematics
Cheol Jun Cho
Peter Wu
Tejas S. Prabhune
Dhruv Agarwal
Gopala K. Anumanchipalli
110
8
0
18 Jun 2024
TSE-PI: Target Sound Extraction under Reverberant Environments with
  Pitch Information
TSE-PI: Target Sound Extraction under Reverberant Environments with Pitch Information
Yiwen Wang
Xihong Wu
75
2
0
13 Jun 2024
LDM-SVC: Latent Diffusion Model Based Zero-Shot Any-to-Any Singing Voice
  Conversion with Singer Guidance
LDM-SVC: Latent Diffusion Model Based Zero-Shot Any-to-Any Singing Voice Conversion with Singer Guidance
Shihao Chen
Yu Gu
Jie Zhang
Na Li
Rilin Chen
Liping Chen
Lirong Dai
DiffM
79
6
0
08 Jun 2024
STraDa: A Singer Traits Dataset
STraDa: A Singer Traits Dataset
Yuexuan Kong
V. Tran
Romain Hennequin
54
2
0
06 Jun 2024
An Investigation of Time-Frequency Representation Discriminators for
  High-Fidelity Vocoder
An Investigation of Time-Frequency Representation Discriminators for High-Fidelity Vocoder
Yicheng Gu
Xueyao Zhang
Liumeng Xue
Haizhou Li
Zhizheng Wu
55
3
0
26 Apr 2024
ATFNet: Adaptive Time-Frequency Ensembled Network for Long-term Time
  Series Forecasting
ATFNet: Adaptive Time-Frequency Ensembled Network for Long-term Time Series Forecasting
Hengyu Ye
Jiadong Chen
Shijin Gong
Fuxin Jiang
Tieying Zhang
Jianjun Chen
Xiaofeng Gao
AI4TS
63
4
0
08 Apr 2024
Toward Fully Self-Supervised Multi-Pitch Estimation
Toward Fully Self-Supervised Multi-Pitch Estimation
Frank Cwitkowitz
Zhiyao Duan
78
4
0
23 Feb 2024
Cacophony: An Improved Contrastive Audio-Text Model
Cacophony: An Improved Contrastive Audio-Text Model
Ge Zhu
Jordan Darefsky
Zhiyao Duan
AuLLM
94
12
0
10 Feb 2024
DiffMoog: a Differentiable Modular Synthesizer for Sound Matching
DiffMoog: a Differentiable Modular Synthesizer for Sound Matching
Noy Uzrad
Oren Barkan
Almog Elharar
Shlomi Shvartzman
Moshe Laufer
Lior Wolf
Noam Koenigstein
64
6
0
23 Jan 2024
DJCM: A Deep Joint Cascade Model for Singing Voice Separation and Vocal
  Pitch Estimation
DJCM: A Deep Joint Cascade Model for Singing Voice Separation and Vocal Pitch Estimation
Haojie Wei
Xueke Cao
Wenbo Xu
Tangpeng Dan
Yueguo Chen
VLM
52
2
0
08 Jan 2024
Leveraging Laryngograph Data for Robust Voicing Detection in Speech
Leveraging Laryngograph Data for Robust Voicing Detection in Speech
Yixuan Zhang
Heming Wang
DeLiang Wang
49
0
0
05 Dec 2023
A Semi-Supervised Deep Learning Approach to Dataset Collection for
  Query-By-Humming Task
A Semi-Supervised Deep Learning Approach to Dataset Collection for Query-By-Humming Task
Amantur Amatov
Dmitry Lamanov
Maksim Titov
Ivan Vovk
Ilya Makarov
Mikhail Kudinov
61
0
0
02 Dec 2023
String Sound Synthesizer on GPU-accelerated Finite Difference Scheme
String Sound Synthesizer on GPU-accelerated Finite Difference Scheme
J. Lee
Min Jun Choi
Kyogu Lee
38
2
0
30 Nov 2023
Reimagining Speech: A Scoping Review of Deep Learning-Powered Voice
  Conversion
Reimagining Speech: A Scoping Review of Deep Learning-Powered Voice Conversion
A. R. Bargum
Stefania Serafin
Cumhur Erkut
73
4
0
14 Nov 2023
Efficient bandwidth extension of musical signals using a differentiable
  harmonic plus noise model
Efficient bandwidth extension of musical signals using a differentiable harmonic plus noise model
Pierre-Amaury Grumiaux
Mathieu Lagrange
68
3
0
13 Nov 2023
A cry for help: Early detection of brain injury in newborns
A cry for help: Early detection of brain injury in newborns
Charles C. Onu
Samantha Latremouille
Arsenii Gorin
Junhao Wang
Innocent Udeogu
...
O. Kehinde
Muhammad A. Salisu
Datonye Briggs
Yoshua Bengio
Doina Precup
124
2
0
12 Oct 2023
F0 analysis of Ghanaian pop singing reveals progressive alignment with
  equal temperament over the past three decades: a case study
F0 analysis of Ghanaian pop singing reveals progressive alignment with equal temperament over the past three decades: a case study
Irán R. Román
Daniel Faronbi
Isabelle Burger-Weiser
Leila Adu-Gilmore
33
2
0
02 Oct 2023
Noise-Robust DSP-Assisted Neural Pitch Estimation with Very Low
  Complexity
Noise-Robust DSP-Assisted Neural Pitch Estimation with Very Low Complexity
Krishna Subramani
J. Valin
Jan Büthe
Paris Smaragdis
Mike Goodwin
75
3
0
25 Sep 2023
1234
Next