NaN during training when using own dataset #4

cjay42 · 2022-11-08T13:46:29Z

While fine-tuning works as expected, doing regular training with a dataset that isn't LJSpeech would eventually cause a NaN loss at some point.
The culprit appears to be the following line, which causes a division by zero if wav happens to contain perfect silence:

hifigan/hifigan/dataset.py

Line 106 in 374a456

wav = flip * gain * wav / wav.abs().max()

I'm not sure what the best solution for this would be, as a quick fix I simply clipped the divisor so it can't reach zero:

wav = flip * gain * wav / max([wav.abs().max(), 0.001])

The text was updated successfully, but these errors were encountered:

joan126 · 2023-01-08T00:48:42Z

met same issue with you!!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

NaN during training when using own dataset #4

NaN during training when using own dataset #4

cjay42 commented Nov 8, 2022 •

edited

Loading

joan126 commented Jan 8, 2023

NaN during training when using own dataset #4

NaN during training when using own dataset #4

Comments

cjay42 commented Nov 8, 2022 • edited Loading

joan126 commented Jan 8, 2023

cjay42 commented Nov 8, 2022 •

edited

Loading