Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Question about Early Overfitting in GeDEX-TTS Training with VCTK #5

Open
thunn opened this issue Sep 3, 2024 · 1 comment
Open

Comments

@thunn
Copy link

thunn commented Sep 3, 2024

Hello,

I recently ran the GeDEX-TTS training using the VCTK dataset and followed the instructions provided in the GeDEX-TTS/config/VCTK/base.yaml configuration file. The only modifications I made were:

  • Changed the vocoder tobigvgan
  • Updated the vocoder path to bigvgan/bigvgan_22khz_80band

While plotting the training logs, I observed that the model started to overfit very early in the training process. This is somewhat unexpected to me, and I wanted to inquire whether this behaviour is expected or if there might be an issue with my setup.

Here is a basic plot of the training losses.

GeDex-TTS on VCTK

Could you please provide insights into whether early overfitting is a known issue with this configuration, or if there are any recommended adjustments to prevent this?

Thank you for your help!

@winddori2002
Copy link
Owner

Hi,

Have you listened to the synthesized samples?

The duration loss can increase at the validation phase.

Although the diffusion loss does not improve a lot in training phase, more steps usually guarantee better sample quality as in Grad-TTS.

Thanks.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants