developmentseed / label-maker

Data Preparation for Satellite Machine Learning
http://devseed.com/label-maker/
MIT License
461 stars 110 forks source link

add tile ratio warning when there is class imbalance #102

Open Geoyi opened 6 years ago

Geoyi commented 6 years ago

@drewbo, should we add class imbalance warning?

When I create a bounding box for an image classification task, a building classifier. I set the background_ratio to 1 and assumed Label Maker will create a balance classes ratio. But in this case, the bounding box only contained building tiles, and I ended up only have 9 tiles are the background tiles out of 340 tiles. If we can add the class imbalance warning that will help to evaluate the training dataset.

drewbo commented 6 years ago

Good catch. I think there are two underlying pieces here to better handle this: