Generate audio from text using voice selection
Expressive Zero-shot TTS
Chat with AI using text, audio, images, and video
End of Utterance (EOU) with LiveKit Turn Detector