-
Notifications
You must be signed in to change notification settings - Fork 223
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
How to configure the dataset or modify the code if I want to do the one class binary classification #91
Comments
Please folllow the ESC-50 recipe (50 class classification with AudioSet pretrained model) and just change Line 69 in 9e3bd99
-Yuan |
I am feeding the model and constructing the dataset json file with this dictionary {'/m/spcmd00' : 0 , '/m/spcmd01':1}. and setting n_class equals to 2. 0 and 1 are the binary status of the only one class. Does this make sense? Thank you so much!! Great work by the way. |
This makes sense. But you also need to take care of the hyper-parameters, in particular, |
I want to test esc-50 in windows, is it possible? |
It might be possible if you have torch environment setup in Windows, though many things need to be changed in https://github.com/YuanGongND/ast/blob/master/egs/esc50/prep_esc50.py and https://github.com/YuanGongND/ast/blob/master/egs/esc50/run_esc.sh, and maybe somewhere else. An easier way might be use the Google Colab environment, I think it is OK for ESC-50 as it is small. -Yuan |
ok, thanks for your answer. |
Have you completed the binary classifications? I have some questions about modifying parameters. How did you modify --freqm, --timem, --tstride, --fstride, --audio_length? |
I usually suggest to first reproduce the ESC-50 recipe and then start modifying hyper-parameters, this helps you rule out other factors could impact the performance. The hyper-parameter you listed are not related to number of classes:
You do need to modify -Yuan |
Thank you very much, I have modified it. |
I calculated dataset_mean=-6.6268077 and dataset_std=5.358466 of esc-50 are different from those in run_esc.sh. I don’t know where I went wrong. Could you please answer? |
What's your mean and std? |
mean=0.000238, std=0.000841. I feel that this is wrong. I see run.py help is the dataset spectrogram mean, so I converted it to fft calculation. So I would like to ask how to calculate this? |
Is this your own dataset? This is certainly not correct as the std should not be 0. You can check the issues to find how to cal the mean and std. |
Hi yuan, in my own two-category data set training, the Avg precision in each epoch is 0.5, Recall is always 1, can you answer my question, the following is my result. |
thanks
The text was updated successfully, but these errors were encountered: