Voice cloning lets you create a synthetic voice that sounds like a specific person from a short sample (10-30 seconds) of reference audio.
Voice cloning is available on Business and Enterprise plans.

How It Works

  1. Upload reference audio - Provide 10-30 seconds of clean speech
  2. Processing - Our AI analyzes the voice characteristics
  3. Voice created - Use your new voice in any TTS request

Requirements

Audio Quality

For best results, your reference audio should be:
  • Duration: 10-30 seconds of speech
  • Format: WAV, MP3, or FLAC
  • Sample rate: 16kHz or higher
  • Channels: Mono preferred
  • Quality: Clean, no background noise
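
The measurable items in this list (duration, sample rate, channel count) can be checked locally before uploading. The following is a minimal sketch for WAV files only, using Python's standard `wave` module. The function name `check_reference_wav` and the threshold constants are illustrative and not part of the KugelAudio SDK, and noise or reverb cannot be detected this way.

```python
import wave

# Thresholds taken from the requirements list above
MIN_SECONDS = 10.0
MAX_SECONDS = 30.0
MIN_SAMPLE_RATE = 16_000


def check_reference_wav(path):
    """Return a list of human-readable problems; an empty list means the file passes."""
    problems = []
    with wave.open(path, "rb") as wav:
        sample_rate = wav.getframerate()
        channels = wav.getnchannels()
        duration = wav.getnframes() / sample_rate

    if not MIN_SECONDS <= duration <= MAX_SECONDS:
        problems.append(f"duration is {duration:.1f}s, expected 10-30s")
    if sample_rate < MIN_SAMPLE_RATE:
        problems.append(f"sample rate is {sample_rate} Hz, expected 16000 Hz or higher")
    if channels != 1:
        problems.append(f"{channels} channels found, mono is preferred")
    return problems
```

Calling `check_reference_wav("reference.wav")` before the upload gives a quick list of issues to fix. MP3 and FLAC files would need a third-party decoder and are not covered here.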

Content Guidelines

Good audio:
  • Clear speech with natural pacing
  • Single speaker only
  • Minimal background noise
  • Natural emotional range
Avoid:
  • Multiple speakers
  • Background music
  • Heavy reverb or echo
  • Whispered or shouted speech
  • Heavily compressed audio

Creating a Voice Clone

Via Dashboard

  1. Go to Dashboard > Voices > Create Voice
  2. Upload your reference audio
  3. Enter a name and description
  4. Click Create Voice
  5. Wait for processing (usually 2-5 minutes)

Via API

import requests

# Upload the reference audio as multipart form data
with open("reference.wav", "rb") as f:
    response = requests.post(
        "https://api.kugelaudio.com/v1/voices/clone",
        headers={"Authorization": "Bearer YOUR_API_KEY"},
        files={"audio": f},
        data={
            "name": "My Custom Voice",
            "description": "Cloned from reference audio",
        },
    )

# Stop here on an HTTP error instead of failing later on a missing "data" key
response.raise_for_status()

voice = response.json()["data"]
print(f"Created voice: {voice['id']}")
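
The dashboard flow mentions a processing wait of a few minutes. This page does not say how the API reports processing state, so the sketch below is a generic polling helper and not a documented KugelAudio call. It assumes you supply a `fetch_status` function that returns the current state of the voice. The status strings `"ready"` and `"failed"` are hypothetical.

```python
import time


def wait_until_ready(fetch_status, timeout=600.0, interval=10.0, sleep=time.sleep):
    """Call fetch_status() until it returns "ready", or raise on failure or timeout.

    fetch_status is a caller-supplied function. The status strings
    "ready" and "failed" are hypothetical, not documented API values.
    """
    deadline = time.monotonic() + timeout
    while True:
        status = fetch_status()
        if status == "ready":
            return status
        if status == "failed":
            raise RuntimeError("voice processing failed")
        if time.monotonic() >= deadline:
            raise TimeoutError(
                f"voice not ready after {timeout:.0f}s (last status: {status!r})"
            )
        sleep(interval)
```

The default 10-minute timeout leaves headroom over the usual 2-5 minute processing time. The `sleep` parameter exists only so the helper can be exercised without real delays.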

Using Cloned Voices

Once created, use your cloned voice like any other voice:

from kugelaudio import KugelAudio

client = KugelAudio(api_key="YOUR_API_KEY")

# Use your cloned voice
audio = client.tts.generate(
    text="Hello, this is my cloned voice speaking!",
    model="kugel-1-turbo",
    voice_id=YOUR_CLONED_VOICE_ID,  # placeholder: the id returned when the voice was created
)

audio.save("cloned_output.wav")

Best Practices

Optimizing Voice Quality

  • The quality of your cloned voice depends heavily on the source audio. Use professional recordings when possible.
  • Include a range of intonations, emotions, and sentence types in your reference audio for a more natural clone.
  • Experiment with different cfg_scale values. Cloned voices often benefit from slightly lower values (1.5-2.0) for more natural output.
  • The kugel-1 model generally produces better results for voice cloning due to its larger capacity.
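
One way to experiment with cfg_scale is to render the same sentence at several values and compare the results by ear. The helper below is a sketch. `sweep_cfg_scale` is an illustrative name, and it assumes the generation function accepts `cfg_scale` as a keyword argument, which this page implies but does not show.

```python
def sweep_cfg_scale(generate, text, voice_id, values=(1.5, 1.75, 2.0)):
    """Generate the same text once per cfg_scale value and return {value: result}.

    generate is any callable that accepts text, voice_id and cfg_scale as
    keyword arguments (an assumption about the TTS call, not confirmed here).
    """
    results = {}
    for cfg_scale in values:
        results[cfg_scale] = generate(text=text, voice_id=voice_id, cfg_scale=cfg_scale)
    return results
```

With the SDK client shown earlier, `generate` could be something like `functools.partial(client.tts.generate, model="kugel-1")`, again assuming that call takes a `cfg_scale` keyword. Saving each result under a file name that includes the value makes side-by-side listening easier.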

Troubleshooting

Issue                  | Solution
Voice sounds robotic   | Use higher quality source audio, try lower CFG scale
Voice sounds different | Ensure source audio is clean, try different text samples
Accent not preserved   | Include more diverse samples, use longer reference audio
Inconsistent output    | Use speaker_prefix=True, try different CFG values

Managing Cloned Voices

List Your Voices

# Get all your cloned voices (reuses the `client` created in the earlier example)
voices = client.voices.list(category="cloned")

for voice in voices:
    print(f"{voice.id}: {voice.name}")

Update Voice

# Update voice metadata (voice_id is a placeholder for the id of the voice to change)
response = requests.patch(
    f"https://api.kugelaudio.com/v1/voices/{voice_id}",
    headers={"Authorization": "Bearer YOUR_API_KEY"},
    json={
        "name": "Updated Name",
        "description": "Updated description",
    }
)

Delete Voice

# Delete a cloned voice (voice_id is a placeholder for the id of the voice to remove)
response = requests.delete(
    f"https://api.kugelaudio.com/v1/voices/{voice_id}",
    headers={"Authorization": "Bearer YOUR_API_KEY"},
)

Privacy & Ethics

Only clone voices you have permission to use. Misuse of voice cloning technology may violate laws and our Terms of Service.

Guidelines

  1. Get consent - Always obtain permission before cloning someone’s voice
  2. Disclose synthetic speech - Be transparent when using cloned voices
  3. No impersonation - Don’t use cloned voices to deceive or defraud
  4. Respect rights - Don’t clone voices of public figures without authorization

Verification

For Business and Enterprise plans, we offer voice verification to ensure ethical use:
  1. Upload proof of consent
  2. Our team reviews the submission
  3. Voice is marked as “verified”
Verified voices have no usage restrictions.

Limits

Plan       | Cloned Voices | Storage
Free       | 0             | -
Starter    | 0             | -
Business   | 10            | 100MB
Enterprise | Unlimited     | Unlimited
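
The limits in this table can be checked client-side before an upload. The sketch below encodes the table as a dictionary. It assumes that storage counts uploaded reference audio and that 1 MB means 1,000,000 bytes, neither of which this page confirms, and `can_add_voice` is an illustrative helper, not an SDK function.

```python
MB = 1_000_000  # assumption: decimal megabytes; the page does not say MB vs MiB

# plan -> (max cloned voices, max storage in bytes); None means unlimited
PLAN_LIMITS = {
    "free": (0, 0),
    "starter": (0, 0),
    "business": (10, 100 * MB),
    "enterprise": (None, None),
}


def can_add_voice(plan, current_voices, used_bytes, new_file_bytes):
    """Return True if one more voice of new_file_bytes fits within the plan limits."""
    max_voices, max_bytes = PLAN_LIMITS[plan.lower()]
    if max_voices is not None and current_voices + 1 > max_voices:
        return False
    if max_bytes is not None and used_bytes + new_file_bytes > max_bytes:
        return False
    return True
```

For example, `can_add_voice("Business", 9, 90 * MB, 5 * MB)` returns `True`, while a Business account that already has 10 voices gets `False` regardless of storage.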
