Voice Cloning

Voice cloning allows you to create a synthetic voice that sounds like a specific person from just a few seconds of reference audio.

Voice cloning is available on Business and Enterprise plans.

How It Works

Upload reference audio - Provide 10-30 seconds of clean speech
Processing - Our AI analyzes the voice characteristics
Voice created - Use your new voice in any TTS request

Requirements

Audio Quality

For best results, your reference audio should be:

Duration: 10-30 seconds of speech
Format: WAV, MP3, or FLAC
Sample rate: 16kHz or higher
Channels: Mono preferred
Quality: Clean, no background noise

Content Guidelines

✅ Good audio:

Clear speech with natural pacing
Single speaker only
Minimal background noise
Natural emotional range

❌ Avoid:

Multiple speakers
Background music
Heavy reverb or echo
Whispered or shouted speech
Heavily compressed audio

Creating a Voice Clone

Via Dashboard

Go to Dashboard → Voices → Create Voice
Upload your reference audio
Enter a name and description
Click Create Voice
Wait for processing (usually 2-5 minutes)

Via API

import requests

# Upload audio file
with open("reference.wav", "rb") as f:
    response = requests.post(
        "https://api.kugelaudio.com/v1/voices/clone",
        headers={"Authorization": "Bearer YOUR_API_KEY"},
        files={"audio": f},
        data={
            "name": "My Custom Voice",
            "description": "Cloned from reference audio",
        }
    )

voice = response.json()["data"]
print(f"Created voice: {voice['id']}")

Using Cloned Voices

Once created, use your cloned voice like any other:

Python
JavaScript

from kugelaudio import KugelAudio

client = KugelAudio(api_key="YOUR_API_KEY")

# Use your cloned voice
audio = client.tts.generate(
    text="Hello, this is my cloned voice speaking!",
    model="kugel-1-turbo",
    voice_id=YOUR_CLONED_VOICE_ID,
)

audio.save("cloned_output.wav")

import { KugelAudio } from 'kugelaudio';

const client = new KugelAudio({ apiKey: 'YOUR_API_KEY' });

const audio = await client.tts.generate({
  text: 'Hello, this is my cloned voice speaking!',
  model: 'kugel-1-turbo',
  voiceId: YOUR_CLONED_VOICE_ID,
});

Best Practices

Optimizing Voice Quality

Use high-quality source audio

The quality of your cloned voice depends heavily on the source audio. Use professional recordings when possible.

Provide diverse samples

Include a range of intonations, emotions, and sentence types in your reference audio for a more natural clone.

Adjust CFG scale

Experiment with different cfg_scale values. Cloned voices often benefit from slightly lower values (1.5-2.0) for more natural output.

Use the right model

The kugel-1 model generally produces better results for voice cloning due to its larger capacity.

Troubleshooting

Issue	Solution
Voice sounds robotic	Use higher quality source audio, try lower CFG scale
Voice sounds different	Ensure source audio is clean, try different text samples
Accent not preserved	Include more diverse samples, use longer reference audio
Inconsistent output	Use `speaker_prefix=True`, try different CFG values

Managing Cloned Voices

List Your Voices

# Get all your cloned voices
voices = client.voices.list(category="cloned")

for voice in voices:
    print(f"{voice.id}: {voice.name}")

Update Voice

# Update voice metadata
response = requests.patch(
    f"https://api.kugelaudio.com/v1/voices/{voice_id}",
    headers={"Authorization": "Bearer YOUR_API_KEY"},
    json={
        "name": "Updated Name",
        "description": "Updated description",
    }
)

Delete Voice

# Delete a cloned voice
response = requests.delete(
    f"https://api.kugelaudio.com/v1/voices/{voice_id}",
    headers={"Authorization": "Bearer YOUR_API_KEY"},
)

Privacy & Ethics

Only clone voices you have permission to use. Misuse of voice cloning technology may violate laws and our Terms of Service.

Guidelines

Get consent - Always obtain permission before cloning someone’s voice
Disclose synthetic speech - Be transparent when using cloned voices
No impersonation - Don’t use cloned voices to deceive or defraud
Respect rights - Don’t clone voices of public figures without authorization

Verification

For Business and Enterprise plans, we offer voice verification to ensure ethical use:

Upload proof of consent
Our team reviews the submission
Voice is marked as “verified”
Verified voices have no usage restrictions

Limits

Plan	Cloned Voices	Storage
Free	0	-
Starter	0	-
Business	10	100MB
Enterprise	Unlimited	Unlimited

Getting Started

SDKs

Integrations

Guides

How It Works

Requirements

Audio Quality

Content Guidelines

Creating a Voice Clone

Via Dashboard

Via API

Using Cloned Voices

Best Practices

Optimizing Voice Quality

Troubleshooting

Managing Cloned Voices

List Your Voices

Update Voice

Delete Voice

Privacy & Ethics

Guidelines

Verification

Limits

Next Steps

Python SDK

Models

Getting Started

SDKs

Integrations

Guides

​How It Works

​Requirements

​Audio Quality

​Content Guidelines

​Creating a Voice Clone

​Via Dashboard

​Via API

​Using Cloned Voices

​Best Practices

​Optimizing Voice Quality

​Troubleshooting

​Managing Cloned Voices

​List Your Voices

​Update Voice

​Delete Voice

​Privacy & Ethics

​Guidelines

​Verification

​Limits

​Next Steps

Python SDK

Models

How It Works

Requirements

Audio Quality

Content Guidelines

Creating a Voice Clone

Via Dashboard

Via API

Using Cloned Voices

Best Practices

Optimizing Voice Quality

Troubleshooting

Managing Cloned Voices

List Your Voices

Update Voice

Delete Voice

Privacy & Ethics

Guidelines

Verification

Limits

Next Steps