Welcome to KugelAudio

What is KugelAudio?

KugelAudio is a state-of-the-art text-to-speech (TTS) platform designed for real-time applications. Whether you’re building voice agents, interactive applications, or content creation tools, KugelAudio provides the speed and quality you need.

Quick Start

Get up and running with KugelAudio in under 5 minutes

Generate Speech

Generate high-quality audio from text

Streaming

Real-time audio streaming for low latency

Voices

Browse voices and create custom clones

Key Features

Natural prosody with break tags

kugel-3 is the canonical production model for natural prosody, clean brand voices, multilingual TTS, streaming, and break tags.

Lowest-latency tier

Legacy IDs such as kugel-2.5 and kugel-2-turbo remain accepted for backwards compatibility. Use kugel-3 for new integrations.

WebSocket Streaming

Stream audio chunks as they’re generated for the lowest possible latency. Perfect for LLM integrations where text arrives token by token.

Voice Cloning

Create custom voices from audio samples. Clone any voice with just a few seconds of reference audio.

Multi-Language Support

Multilingual single model — 39 languages including DE, EN, FR, ES, IT, NL, PT, PL, RU, ZH, JA, KO, AR. See the TTS endpoint reference for the full list.

Available Models

Use kugel-3 for new integrations. See Models for the current model reference.

Model	Best for
`kugel-3`	Voice agents, narration, brand voices, streaming, multilingual TTS, and `break` tags

Getting Started

Get Your API Key

Install the SDK

Choose your preferred SDK and install it:

pip install kugelaudio

Generate Your First Audio

from kugelaudio import KugelAudio

client = KugelAudio(api_key="your_api_key")
client.connect()  # Pre-connect at startup (one-time handshake cost)

audio = client.tts.generate(
    text="Hello, world!",
    model_id="kugel-3",
)
audio.save("output.wav")

Need Help?

API Reference

Detailed API documentation with examples

Quick Start

​What is KugelAudio?

Quick Start

Generate Speech

Streaming

Voices

​Key Features

​Available Models

​Getting Started

​Need Help?

API Reference

What is KugelAudio?

Key Features

Available Models

Getting Started

Need Help?