Krisp Voice Translation API
Real-time speech-to-speech translation API built for accuracy-critical calls, with built-in background voice cancellation.

Real-time speech-to-speech translation
Converts spoken audio between languages live, covering transcription, translation and speech synthesis in one pipeline.
60+ languages any-to-any
Translates across more than 60 languages including locale variants such as US Spanish, French Canadian and Egyptian Arabic.
Background Voice Cancellation
Removes background noise, competing voices and reverberation to keep translation accurate on noisy calls.
Accent-robust recognition
Maintains accuracy with minimal degradation when speakers have strong accents.
Custom vocabulary and dictionary
Lets you add domain-specific terms and per-language-pair translation rules.
WebSocket API and SDKs
Streams audio over a WebSocket endpoint with Python and JavaScript/Node.js SDKs, and a C++ SDK listed as coming soon.
Session configuration via JSON
Controls languages, voice, custom vocabulary, Background Voice Cancellation (BVC) and transcripts through a single JSON object per session.
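As a rough illustration of the per-session JSON object described above, the sketch below builds one in Python and serializes it. Every field name here (source_language, target_language, voice, custom_vocabulary, bvc_enabled, transcripts_enabled), the locale codes and the default values are hypothetical placeholders chosen for illustration. They are not taken from the Krisp API reference, so check the official documentation for the actual schema.

```python
import json
from dataclasses import dataclass, field, asdict
from typing import List


@dataclass
class SessionConfig:
    """Hypothetical session settings; field names are illustrative assumptions."""
    source_language: str                      # assumed locale tag, e.g. "es-US"
    target_language: str                      # assumed locale tag, e.g. "fr-CA"
    voice: str = "default"                    # placeholder voice identifier
    custom_vocabulary: List[str] = field(default_factory=list)
    bvc_enabled: bool = True                  # Background Voice Cancellation toggle
    transcripts_enabled: bool = False         # whether transcripts are requested


def build_session_message(config: SessionConfig) -> str:
    """Serialize the whole configuration as a single JSON object."""
    return json.dumps(asdict(config))


config = SessionConfig(
    source_language="es-US",
    target_language="fr-CA",
    custom_vocabulary=["copay", "deductible"],
)
message = build_session_message(config)
print(message)
```

Keeping all settings in one object means a session could be configured with a single message at the start of a connection, which is presumably why the API takes this approach.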
Contact centres adding live translation so agents and customers can speak in their own languages on the same call.
Voice AI and conversational agent builders embedding real-time translation into their applications.
Telephony and headset-based products that need accurate translation despite noisy audio.
Multilingual customer support teams handling calls across many regional language variants.
Businesses needing compliant voice translation under SOC 2, HIPAA or PCI-DSS requirements.
Developers prototyping speech-to-speech translation using the free tier before committing to a paid plan.