Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2410.15316
Cited By
Ichigo: Mixed-Modal Early-Fusion Realtime Voice Assistant
20 October 2024
Alan Dao
Dinh Bach Vu
Huy Hoang Ha
AuLLM
VLM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Ichigo: Mixed-Modal Early-Fusion Realtime Voice Assistant"
2 / 2 papers shown
Title
LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM
S.
Mohammed Irfan Kurpath
Sahal Shaji Mullappilly
Jean Lahoud
Fahad A Khan
Rao Muhammad Anwer
Salman Khan
Hisham Cholakkal
AuLLM
54
0
0
06 Mar 2025
Continuous Speech Tokens Makes LLMs Robust Multi-Modality Learners
Ze Yuan
Yanqing Liu
Shujie Liu
Sheng Zhao
AuLLM
74
0
0
06 Dec 2024
1