10

Timed text extraction from Taiwanese Kua-á-hì TV series

Tzu-Hung Huang
Yun-En Tsai
Yun-Ning Hung
Chih-Wei Wu
I-Chieh Wei
Li Su
Main:2 Pages
2 Figures
Bibliography:1 Pages
Abstract

Taiwanese opera (Kua-á-hì), a major form of local theatrical tradition, underwent extensive television adaptation notably by pioneers like Iûnn Lē-hua. These videos, while potentially valuable for in-depth studies of Taiwanese opera, often have low quality and require substantial manual effort during data preparation. To streamline this process, we developed an interactive system for real-time OCR correction and a two-step approach integrating OCR-driven segmentation with Speech and Music Activity Detection (SMAD) to efficiently identify vocal segments from archival episodes with high precision. The resulting dataset, consisting of vocal segments and corresponding lyrics, can potentially supports various MIR tasks such as lyrics identification and tune retrieval. Code is available atthis https URL.

View on arXiv
Comments on this paper