Voice cloning is available on Business and Enterprise plans.
How It Works
- Upload reference audio - Provide 10-30 seconds of clean speech
- Processing - Our AI analyzes the voice characteristics
- Voice created - Use your new voice in any TTS request
Requirements
Audio Quality
For best results, your reference audio should be:- Duration: 10-30 seconds of speech
- Format: WAV, MP3, or FLAC
- Sample rate: 16kHz or higher
- Channels: Mono preferred
- Quality: Clean, no background noise
Content Guidelines
✅ Good audio:- Clear speech with natural pacing
- Single speaker only
- Minimal background noise
- Natural emotional range
- Multiple speakers
- Background music
- Heavy reverb or echo
- Whispered or shouted speech
- Heavily compressed audio
Creating a Voice Clone
Via Dashboard
- Go to Dashboard → Voices → Create Voice
- Upload your reference audio
- Enter a name and description
- Click Create Voice
- Wait for processing (usually 2-5 minutes)
Via API
Using Cloned Voices
Once created, use your cloned voice like any other:- Python
- JavaScript
Best Practices
Optimizing Voice Quality
Use high-quality source audio
Use high-quality source audio
The quality of your cloned voice depends heavily on the source audio. Use professional recordings when possible.
Provide diverse samples
Provide diverse samples
Include a range of intonations, emotions, and sentence types in your reference audio for a more natural clone.
Adjust CFG scale
Adjust CFG scale
Experiment with different
cfg_scale values. Cloned voices often benefit from slightly lower values (1.5-2.0) for more natural output.Use the right model
Use the right model
The
kugel-1 model generally produces better results for voice cloning due to its larger capacity.Troubleshooting
| Issue | Solution |
|---|---|
| Voice sounds robotic | Use higher quality source audio, try lower CFG scale |
| Voice sounds different | Ensure source audio is clean, try different text samples |
| Accent not preserved | Include more diverse samples, use longer reference audio |
| Inconsistent output | Use speaker_prefix=True, try different CFG values |
Managing Cloned Voices
List Your Voices
Update Voice
Delete Voice
Privacy & Ethics
Guidelines
- Get consent - Always obtain permission before cloning someone’s voice
- Disclose synthetic speech - Be transparent when using cloned voices
- No impersonation - Don’t use cloned voices to deceive or defraud
- Respect rights - Don’t clone voices of public figures without authorization
Verification
For Business and Enterprise plans, we offer voice verification to ensure ethical use:- Upload proof of consent
- Our team reviews the submission
- Voice is marked as “verified”
- Verified voices have no usage restrictions
Limits
| Plan | Cloned Voices | Storage |
|---|---|---|
| Free | 0 | - |
| Starter | 0 | - |
| Business | 10 | 100MB |
| Enterprise | Unlimited | Unlimited |