Open Sunting78 opened 2 years ago
For 1% to 10%, we use the prepare_coco_data.py to generate data. Besides this setting, we also use the 35k subset of the COCO 2014 validation set as labeled images. This set the widely used COCO 2014 valminusminival. The 35k is about 30% of the COCO 2017 so we simply denote it as 30%. Since COCO 2014 valminusminival is a predefined set, you do not need to generate it with this script.
Thanks. But i want to know, is this the data division firstly proposed in this paper? Why this data division diffierent from semi-supervised detection for comparing the results of Boxes AP?
As I know, this dividion is firstly proposed in "Data Distillation: Towards Omni-Supervised Learning".
As I know, this dividion is firstly proposed in "Data Distillation: Towards Omni-Supervised Learning".
Got it. Thanks.
1) There are 5 folds in semi-supervised detection methods on COCO. Do you the same ? 2) The paper shows: For the 30% setting,we simply use the 35k subset of the COCO 2014 validationset as labeled images. But,in you code 'prepare_coco_data.py' , there is only used instances_train2017.json. https://github.com/zhenyuw16/noisyboundaries/blob/0c83823fe3634edb1274f4b20829822e49a25d65/scripts/coco/prepare_coco_data.py#L35
Look forward to your reply. Thanks.