Replies: 1 comment 1 reply
-
This task seems like a fine-grained classification task. You may consider try some masked image modeling method(MAE, SimMIM, CAE, etc) as pre-training, which may model the texture and local pattern better. |
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hello, community! I have been surrounded by this for quite a while now and experimented with several different ways to train.
So, a sample of the dataset is as follows.
This is very specific, with little difference between each class. It works well with normal supervised learning way but not contrastively. The first issue arises is when it has high training linear accuracy but low testing linear accuracy. This is explainable because the dataset is split according to drivers. Specifically, if I do a random split again, mixing all images and then split into test and train dataset, I get superb accuracy. (duh)
The current best config I have is:
The linear evaluation I obtained is has Top-1 of around 45, and Top-5 of around 90. However during training linear layer, Top-1 can reach 90.
I have gone thru many works on custom dataset, but I couldn't found any way to boost the accuracy further. I am not sure if this is the main obstacle of contrastive learning, but I am glad to hear from the community on this.
Beta Was this translation helpful? Give feedback.
All reactions