This is a DVAE fine-tune for XTTS v2, based on the scripts published here: https://github.com/daswer123/xtts-finetune-tests/tree/main/dvae-finetune

It was trained on 100 hours of high-quality Russian speech and may improve the quality of subsequent fine-tuning of the GPT-2 and Perceiver models.

You can try it in xtts-finetune-webui as a custom DVAE.

wandb: Run summary:
wandb: commit_loss 0.04019
wandb:    cur_step 2571
wandb:       epoch 19
wandb:        loss 0.10499
wandb:  recon_loss 0.06481
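
As a quick sanity check on the summary above, the logged values look consistent with the total `loss` being the sum of `commit_loss` and `recon_loss`, up to rounding in the last digit. This is an inference from the reported numbers only, not something confirmed from the training script. A minimal sketch, where the helper name `components_match_total` and the tolerance are hypothetical:

```python
# Values copied verbatim from the wandb run summary.
summary = {"commit_loss": 0.04019, "recon_loss": 0.06481, "loss": 0.10499}


def components_match_total(s, tol=1e-4):
    """Return True if commit_loss + recon_loss is within tol of loss.

    Assumption: the total is a plain sum of the two components;
    tol absorbs the 5-decimal rounding in the wandb output.
    """
    return abs(s["commit_loss"] + s["recon_loss"] - s["loss"]) <= tol


component_sum = summary["commit_loss"] + summary["recon_loss"]
print(f"sum of components: {component_sum:.5f}")
print(f"matches logged loss: {components_match_total(summary)}")
```

If the check holds, roughly 62% of the final loss comes from reconstruction and 38% from the commitment term.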

