1) Enable push-to-talk in the dashboard
In your agent panel on https://dash.layercode.com/ → Transcriber → Settings → set Turn Taking to Push to Talk → Save your changes.
2) Use the React SDK callbacks
When using push-to-talk, calltriggerUserTurnStarted()
when the user begins speaking (pressing the button), and triggerUserTurnFinished()
when they stop (releasing the button).
app/ui/VoiceAgentPushToTalk.tsx
Turn taking is explained conceptually in our Turn taking guide.
What gets sent when you press the button
The React SDK continuously captures microphone audio into a short rolling buffer even while the button is idle. When you calltriggerUserTurnStarted()
, we immediately flush roughly one second of pre-roll audio along with anything you speak while the button stays down. This keeps the start of the utterance intact, so agents hear the full word instead of a clipped syllable.
You can fine-tune the pre-roll length with the vad.buffer_frames
agent setting. Each frame represents about 100 ms of audio, so lowering the value shortens the buffer and raising it adds more context before the press.