Dynamic HumTrans: Humming Transcription Using CNNs and Dynamic
Programming
IAPR International Workshop on Artificial Neural Networks in Pattern Recognition (ANNPR), 2024
Main:8 Pages
4 Figures
Bibliography:1 Pages
2 Tables
Appendix:3 Pages
Abstract
We propose a novel approach for humming transcription that combines a CNN-based architecture with a dynamic programming-based post-processing algorithm, utilizing the recently introduced HumTrans dataset. We identify and address inherent problems with the offset and onset ground truth provided by the dataset, offering heuristics to improve these annotations, resulting in a dataset with precise annotations that will aid future research. Additionally, we compare the transcription accuracy of our method against several others, demonstrating state-of-the-art (SOTA) results. All our code and corrected dataset is available at https://github.com/shubham-gupta-30/humming_transcription
View on arXivComments on this paper
