Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2410.07168
Cited By
v1
v2 (latest)
Sylber: Syllabic Embedding Representation of Speech from Raw Audio
International Conference on Learning Representations (ICLR), 2024
9 October 2024
Cheol Jun Cho
Nicholas Lee
Akshat Gupta
Dhruv Agarwal
Ethan Chen
Alan W Black
Gopala K. Anumanchipalli
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Sylber: Syllabic Embedding Representation of Speech from Raw Audio"
8 / 8 papers shown
Title
Towards Unsupervised Speech Recognition at the Syllable-Level
Liming Wang
Junrui Ni
Kai-Wei Chang
Saurabhchand Bhati
David Harwath
Mark Hasegawa-Johnson
James Glass
94
0
0
04 Oct 2025
Scaling Spoken Language Models with Syllabic Speech Tokenization
Nicholas Lee
Cheol Jun Cho
Alan W. Black
Gopala K. Anumanchipalli
84
0
0
30 Sep 2025
MaskVCT: Masked Voice Codec Transformer for Zero-Shot Voice Conversion With Increased Controllability via Multiple Guidances
Junhyeok Lee
Helin Wang
Yaohan Guan
Thomas Thebaud
Laureano Moro-Velazquez
Jesus Villalba
Najim Dehak
68
0
0
21 Sep 2025
Towards Accurate Phonetic Error Detection Through Phoneme Similarity Modeling
Xuanru Zhou
Jiachen Lian
Cheol Jun Cho
Tejas S. Prabhune
Shuhe Li
...
Rian Bogley
Lisa Wauters
Zachary Miller
M. G. Tempini
Gopala Anumanchipalli
74
4
0
18 Jul 2025
Articulatory modeling of the S-shaped F2 trajectories observed in Öhman's spectrographic analysis of VCV syllables
Frédéric Berthommier
38
0
0
28 May 2025
Unlocking Temporal Flexibility: Neural Speech Codec with Variable Frame Rate
Hanglei Zhang
Yiwei Guo
Zhihan Li
Xiang Hao
Xie Chen
Kai Yu
155
3
0
22 May 2025
DC-Spin: A Speaker-invariant Speech Tokenizer for Spoken Language Models
Heng-Jui Chang
Hongyu Gong
Changhan Wang
James R. Glass
Yu-An Chung
289
4
0
31 Oct 2024
SD-HuBERT: Sentence-Level Self-Distillation Induces Syllabic Organization in HuBERT
Cheol Jun Cho
Abdelrahman Mohamed
Shang-Wen Li
Alan W. Black
Gopala K. Anumanchipalli
198
13
0
16 Oct 2023
1