Skip to main content

Voice Cloning

In this section, you’ll learn how to clone your voice in EchoKit. The process has three main steps:

  • Process your voice data to reate high-quality audio samples, which will take 60 minutes - 90 minutes, depending on the voice quality.
  • Fine tune the GPT-SoVits model using the processed audio from Step 1, which will take 20 minutes
  • Deploy the fine-tuned model, which will take 20 minutes.