Ichigo: Mixed-Modal Early-Fusion Realtime Voice Assistant

20 October 2024

Papers citing "Ichigo: Mixed-Modal Early-Fusion Realtime Voice Assistant"

Title: LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM
Authors: S., Mohammed Irfan Kurpath, Sahal Shaji Mullappilly, Jean Lahoud, Fahad A Khan, Rao Muhammad Anwer, Salman Khan, Hisham Cholakkal
Tag: AuLLM
Date: 06 Mar 2025

Title: Continuous Speech Tokens Makes LLMs Robust Multi-Modality Learners
Authors: Ze Yuan, Yanqing Liu, Shujie Liu, Sheng Zhao
Tag: AuLLM
Date: 06 Dec 2024