Ichigo: Mixed-Modal Early-Fusion Realtime Voice Assistant
Paper
•
2410.15316
•
Published
•
12
WhisperVQ
WhisperVQ is a quantizer model and a key component of WhisperSpeech. It compresses speech into discrete tokens, enabling seamless compatibility with large language models (LLMs) for real-time speech understanding and various downstream tasks.
| No | Variant | Cortex CLI command |
|---|---|---|
| 1 | gguf | cortex run whispervq |
cortexso/whispervq
cortex run whispervq