blurred validation output #43

JohnHerry · 2024-11-29T09:03:35Z

cards: 4 , batch size: 10 , lr: 1e-4
sample audio : 1s - 20s each, 16K audios, hop_size=200
mel-extractor: same as HiFiGAN
text tokenizer: phoneme tokens

I have trained to 250,000 steps but still can not get clear mel-spectrogram output. the validation output is blurred and diluteed artifacts,

has anybody run into such a situation? what is the reason? is this problem on training data preprocessing? or need some training parameter adjustment? wating for suggestions. thanks.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

blurred validation output #43

blurred validation output #43

JohnHerry commented Nov 29, 2024

blurred validation output #43

blurred validation output #43

Comments

JohnHerry commented Nov 29, 2024