question about pre-trained model

Thank you for sharing the good results.

I have a question. What data are the trained models (slim, RFB) provided? Also, I am fine-tuning based on this model. Here, it is labeled (face, background), but I am learning by increasing it to about 5 classes. In conclusion, the performance is poor. I'm guessing it's because the model is too light. Is there a good way? I started learning from Scratch, and there are about 20,000 vocs for face data, and about 20,000 other data. Thank you.

Linzaer / Ultra-Light-Fast-Generic-Face-Detector-1MB

question about pre-trained model #246