Thank you for sharing the good results.
I have a few questions. What data were the provided pretrained models (slim, RFB) trained on? Also, I am fine-tuning from this model. It is originally labeled with two classes (face, background), but I am training it with the label set expanded to about 5 classes. The resulting performance is poor. My guess is that the model is too lightweight. Is there a good way to improve this? I started training from scratch, with about 20,000 VOC-format samples of face data and about 20,000 samples of other data. Thank you.