Small reasoning model that runs locally in your browser
Transcribe audio to text
ML-powered speech synthesis directly in your browser
Transcribe audio to text with timestamps
Generate code from text prompts
Generate speech from text using a reference voice
Manipulate images by dragging points
Segment images using text prompts, points, or everything mode
Generate images from text prompts