Provider | Price per minute |
---|---|
Platform Fee | $0.04 |
Hosted Backend Fee | $0.01 |
Provider | Model | Languages | Price per minute |
---|---|---|---|
Deepgram | nova-3 (English) | English | $0.0065 |
Deepgram | nova-3 (Multilingual) | English | $0.0078 |
Provider | Model | Languages | Price per minute |
---|---|---|---|
Rime | mistv2 | English, Spanish | $0.02 |
Cartesia | sonic-2 | English (American/British/Australian/Southern), Spanish (Latin/Peninsula), French, Portuguese (Brazilian/European), Hindi, Chinese, Russian, Dutch, Japanese, Turkish, Korean, German, Swedish, Italian, Polish | $0.03 |
Cartesia | sonic-turbo | English (American/British/Australian/Southern), Spanish (Latin/Peninsula), French, Portuguese (Brazilian/European), Hindi, Chinese, Russian, Dutch, Japanese, Turkish, Korean, German, Swedish, Italian, Polish | $0.03 |
ElevenLabs | eleven_v2_5_flash | English, Hindi, Portuguese, Chinese, Spanish, French, German, Japanese, Arabic, Russian, Korean, Indonesian, Italian, Dutch, Turkish, Polish, Swedish, Norwegian, Filipino, Malay, Romanian, Hungarian, Ukrainian, Greek, Czech, Danish, Finnish, Bulgarian, Croatian, Slovak, Tamil, Vietnamese, Korean, Japanese, Arabic, Russian, Portuguese, Spanish, French, German, Italian, Dutch, Turkish, Polish, Swedish, Norwegian, Filipino, Malay, Romanian, Hungarian, Ukrainian, Greek, Czech, Danish, Finnish, Bulgarian, Croatian, Slovak, Tamil, Vietnamese | $0.05 |
Low-latency voice pipelines | Production-ready, real-time voice processing with minimal delay |
Global infrastructure | 330+ locations worldwide for reliable, fast connections |
Multi-platform support | Web, mobile, and phone (coming soon) voice agents |
Speech-to-text transcription | Convert user speech to text using leading providers |
Text-to-speech synthesis | Convert AI responses to natural speech |
Real-time audio streaming | Continuous audio capture, processing, and playback |
Smart turn-taking | Automatic conversation flow with interrupt capability |
Hosted Backend | Managed backend option |
Custom backend support | Connect your own backend with a simple webhook |
Any framework support | Works with Next.js, Express, FastAPI, and more |
32+ languages supported | Multi-language transcription and speech synthesis |
100+ voices available | Wide selection across multiple TTS providers |
Provider flexibility | Easy switching between voice model providers |
No vendor lock-in | Switch providers and models without code changes |
Per-second billing | Pay only for actual speech time, not silence |
Transparent pricing | Usage-based costs with consolidated billing |