You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I'm fooling around with this project and I'm getting throughput I think is too slow, which leads me to believe I may have mis-configured something or there are other issues.
I'm reusing the pre-trained models with my own custom audio of ~750 audio clips ranging from 4-10 seconds.
I'm using:
PyTorch 1.7.1 with Python3.7 (CUDA 11.0 and Intel MKL)
In order to get the code to run properly I had to apply the fix from this bug (not sure if this is relevant just want to give all details): #201
It may be constrained by the disk read. Move your dataset to a faster storage, like copying it to RAM in /dev/shm.
Tesla V100 is an old GPU. For the default model size, you're going to top out around 2-3 step/sec at r=7 and 1 step/sec at r=2. It will be faster if you discard your longer utterances.
I'm fooling around with this project and I'm getting throughput I think is too slow, which leads me to believe I may have mis-configured something or there are other issues.
I'm reusing the pre-trained models with my own custom audio of ~750 audio clips ranging from 4-10 seconds.
I'm using:
PyTorch 1.7.1 with Python3.7 (CUDA 11.0 and Intel MKL)
In order to get the code to run properly I had to apply the fix from this bug (not sure if this is relevant just want to give all details):
#201
and I applied this pull request:
521179e
The only changes I've made to hyperparams is changing peak_norm from false to true:
and setting my paths.
I can confirm that it is using the GPU (at least GPU memory), but I've never seen nvidia-smi show utilization above 38%:
Things I've tried:
upping the batch size in hyperparams, also the learning rate, up to 64, which didn't help.
here is nvidia-smi output:
what it's up to:
If I change some of the learning rate parameters:
and smi:
Let me know what other information I can provide to help debug this.
Thank you,
Scott
The text was updated successfully, but these errors were encountered: