Create your first AI voice pipeline in minutes.

| Setting | Description | Options / Notes |
|---|---|---|
| Provider | The transcription provider used for speech-to-text. | See available transcription providers. |
| Model | The transcription model that best fits your needs. | See available transcription models. |
| Turn Taking Mode | Determines how the system detects when the user is speaking. | Automatic: the AI detects when the user has finished speaking (using silence detection). Push to Talk: the user controls when they’re speaking by holding down a button. Read more about turn taking. |
| Can Interrupt | Applies when Turn Taking Mode is ‘Automatic’. Toggles whether the user can interrupt the AI while it is speaking. | Enable or disable interruption. When disabled, the user can only respond once the AI has finished speaking. |
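
To make the turn-taking settings more concrete, here is a minimal sketch of how Automatic mode with silence detection and the Can Interrupt toggle could behave. It is an illustration only, not the product's implementation: the names `TurnTakingConfig`, `user_may_speak`, and `end_of_turn_index`, the energy threshold, and the frame counts are all assumptions, as is the choice that Push to Talk accepts audio whenever the button is held.

```python
from dataclasses import dataclass
from typing import Iterable, Optional


@dataclass
class TurnTakingConfig:
    # Hypothetical field names mirroring the settings table above.
    mode: str = "automatic"      # "automatic" or "push_to_talk"
    can_interrupt: bool = True   # only consulted in automatic mode


def user_may_speak(cfg: TurnTakingConfig, ai_speaking: bool, button_held: bool) -> bool:
    """Decide whether user audio should be accepted right now."""
    if cfg.mode == "push_to_talk":
        # Assumption: in Push to Talk the button alone decides.
        return button_held
    if cfg.mode != "automatic":
        raise ValueError(f"unknown turn taking mode: {cfg.mode!r}")
    # Automatic: with interruption disabled, wait until the AI has finished.
    return cfg.can_interrupt or not ai_speaking


def end_of_turn_index(
    energies: Iterable[float],
    threshold: float = 0.01,
    min_silent_frames: int = 25,
) -> Optional[int]:
    """Return the frame index at which the user's turn is considered finished.

    A turn ends once speech has been heard and is followed by
    `min_silent_frames` consecutive frames below `threshold`.
    Returns None if no end of turn is found.
    """
    heard_speech = False
    silent_run = 0
    for i, energy in enumerate(energies):
        if energy >= threshold:
            heard_speech = True
            silent_run = 0
        elif heard_speech:
            silent_run += 1
            if silent_run >= min_silent_frames:
                return i
    return None
```

In this sketch, a longer `min_silent_frames` makes the AI wait longer before responding, which reduces cut-offs at the cost of slower replies.
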

| Setting | Description | Options / Notes |
|---|---|---|
| Provider | The text-to-speech (TTS) provider used to generate AI speech. | See available text-to-speech providers. |
| Model | The TTS model that matches your quality and speed needs. The default model is often best for English-language use cases. | See available text-to-speech models. |
| Voice | The voice that best represents your AI. | Select from the voices available for the chosen provider and model. We recommend experimenting with different voices to find the right one for your use case. |

| Setting | Description |
|---|---|
| LLM Prompt | Configure the personality and behavior of your AI assistant. |
| Welcome Message | Configure the message your AI will speak when the conversation first starts. If disabled, the user starts the conversation. |
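
The settings in the tables above can be pictured as a single configuration object. The sketch below is hypothetical: the class names, field names, and placeholder provider, model, and voice values are assumptions made for illustration and do not describe the product's actual API or schema.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class TranscriptionSettings:
    provider: str
    model: str
    turn_taking_mode: str = "automatic"  # "automatic" or "push_to_talk"
    can_interrupt: bool = True           # applies in automatic mode only


@dataclass
class SpeechSettings:
    provider: str
    model: str
    voice: str


@dataclass
class LLMSettings:
    prompt: str
    welcome_message: Optional[str] = None  # None means the setting is disabled


@dataclass
class VoicePipelineConfig:
    transcription: TranscriptionSettings
    speech: SpeechSettings
    llm: LLMSettings

    def validate(self) -> List[str]:
        """Return a list of human-readable problems (empty if none)."""
        problems = []
        if self.transcription.turn_taking_mode not in ("automatic", "push_to_talk"):
            problems.append("turn_taking_mode must be 'automatic' or 'push_to_talk'")
        if not self.llm.prompt.strip():
            problems.append("LLM prompt must not be empty")
        return problems

    def first_speaker(self) -> str:
        """With a welcome message the AI opens; otherwise the user does."""
        return "ai" if self.llm.welcome_message else "user"


# Placeholder values only; substitute real provider, model, and voice names.
config = VoicePipelineConfig(
    transcription=TranscriptionSettings("example-stt-provider", "example-stt-model"),
    speech=SpeechSettings("example-tts-provider", "example-tts-model", "example-voice"),
    llm=LLMSettings(prompt="You are a friendly booking assistant."),
)
```

Here `config.first_speaker()` reflects the Welcome Message rule: with no welcome message set, the user starts the conversation.
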