Skip to content

Settings: Voice

The Voice panel configures speech-to-text and text-to-speech for the chat box. Each section toggles on to enable voice input or output. A provider hint banner points you to setup steps when a provider needs configuration.

The Voice settings panel covering speech-to-text and text-to-speech

A banner at the top surfaces guidance when the selected text-to-speech provider needs attention, with a control to adjust the configuration inline.

Turn on to enable voice input. The available providers:

ProviderNotes
Whisper (local)Runs locally. Pick the Whisper model (base or small); the model and binary download on demand and show an installed state.
OpenAIUses the whisper-1 model through your OpenAI key.
DeepgramUses the nova-2 model through a Deepgram key.

A microphone check lets you confirm your input device is working before relying on it.

Turn on to enable voice output. Providers include a local Kokoro option whose model downloads on demand, alongside cloud providers selected from the provider list. Voice and related options appear once a provider is chosen.