Voice timbre refers to the unique quality or character of a person's voice that distinguishes it from others as perceived by human hearing. The Voice Timbre Attribute Detection (VtaD) 2025 challenge focuses on explaining the voice timbre attribute in a comparative manner. In this challenge, the human impression of voice timbre is verbalized with a set of sensory descriptors, including bright, coarse, soft, magnetic, and so on. The timbre is explained from the comparison between two voices in their intensity within a specific descriptor dimension. The VtaD 2025 challenge starts in May and culminates in a special proposal at the NCMMSC2025 conference in October 2025 in Zhenjiang, China.
View on arXiv@article{sheng2025_2505.09382, title={ The Voice Timbre Attribute Detection 2025 Challenge Evaluation Plan }, author={ Zhengyan Sheng and Jinghao He and Liping Chen and Kong Aik Lee and Zhen-Hua Ling }, journal={arXiv preprint arXiv:2505.09382}, year={ 2025 } }