Timed text extraction from Taiwanese Kua-á-hì TV series

1 January 2026

Tzu-Hung Huang

Yun-En Tsai

Yun-Ning Hung

Chih-Wei Wu

I-Chieh Wei

Li Su

ArXiv (abs)PDF HTML Github (28676★)

Main:2 Pages

2 Figures

Bibliography:1 Pages

Abstract

Taiwanese opera (Kua-á-hì), a major form of local theatrical tradition, underwent extensive television adaptation notably by pioneers like Iûnn Lē-hua. These videos, while potentially valuable for in-depth studies of Taiwanese opera, often have low quality and require substantial manual effort during data preparation. To streamline this process, we developed an interactive system for real-time OCR correction and a two-step approach integrating OCR-driven segmentation with Speech and Music Activity Detection (SMAD) to efficiently identify vocal segments from archival episodes with high precision. The resulting dataset, consisting of vocal segments and corresponding lyrics, can potentially supports various MIR tasks such as lyrics identification and tune retrieval. Code is available atthis https URL.

View on arXiv

Comments on this paper