
What is KugelAudio?
KugelAudio is a state-of-the-art text-to-speech (TTS) platform designed for real-time applications. Whether you’re building voice agents, interactive applications, or content creation tools, KugelAudio provides the speed and quality you need.Quick Start
Get up and running with KugelAudio in under 5 minutes
Generate Speech
Generate high-quality audio from text
Streaming
Real-time audio streaming for low latency
Voices
Browse voices and create custom clones
Key Features
Natural prosody with break tags
Natural prosody with break tags
Lowest-latency tier
Lowest-latency tier
Legacy IDs such as
kugel-2.5 and kugel-2-turbo remain accepted for backwards compatibility. Use kugel-3 for new integrations.WebSocket Streaming
WebSocket Streaming
Stream audio chunks as they’re generated for the lowest possible latency. Perfect for LLM integrations where text arrives token by token.
Voice Cloning
Voice Cloning
Create custom voices from audio samples. Clone any voice with just a few seconds of reference audio.
Multi-Language Support
Multi-Language Support
Multilingual single model — 39 languages including DE, EN, FR, ES, IT, NL, PT, PL, RU, ZH, JA, KO, AR. See the TTS endpoint reference for the full list.
Available Models
Usekugel-3 for new integrations. See Models for the current model reference.
| Model | Best for |
|---|---|
kugel-3 | Voice agents, narration, brand voices, streaming, multilingual TTS, and break tags |
Getting Started
Get Your API Key
Sign up at kugelaudio.com and get your API key from the dashboard.
Need Help?
API Reference
Detailed API documentation with examples
