Closed by ozendelait 4 years ago
I'd vote for COCO too to make sure things are consistent.
Sounds good, but for OID the image-level labels should be included in the annotations. OID annotations are not exhaustive at the image-label level: if an image has neither a positive nor a negative 'Cat' label, it may still contain a 'Cat' instance that has no box.
Similarly, for instance segmentation data, boxes and image-level labels should be included in the annotations.
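To make the consequence concrete, here is a minimal sketch of how an evaluator might use such labels, assuming that detections of a class with no image-level label on an image are simply ignored. The function name and the detection dict layout are hypothetical and only for illustration.

```python
def filter_by_verified_labels(detections, positive, negative):
    """Keep only detections whose class was verified on this image.

    positive: classes verified as present (boxes should exist).
    negative: classes verified as absent (any detection is a false positive).
    Classes in neither set are unverified, so their detections are dropped
    instead of being counted as errors.
    """
    verified = set(positive) | set(negative)
    return [d for d in detections if d["label"] in verified]


# Hypothetical detections for one image.
dets = [
    {"label": "Cat", "score": 0.9},   # no image-level label: ignored
    {"label": "Dog", "score": 0.8},   # verified present: matched against boxes
    {"label": "Bird", "score": 0.7},  # verified absent: counts as false positive
]
kept = filter_by_verified_labels(dets, positive={"Dog"}, negative={"Bird"})
```

Without the negative labels, the 'Bird' detection could not be told apart from the unverified 'Cat' detection.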
Which datasets from RVC's object detection task support image-level labels? COCO, OID, Obj365, MVS?
OID; I am not sure about the rest.
Echoing @akuznetso's comment: encoding the negative image-level labels is particularly important.
Obj365 can support image-level multi-labels.
The COCO format does not have image-level labels, but they seem like an easy addition.
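A rough sketch of what such an addition could look like, assuming two extra per-image keys. The names `pos_category_ids` and `neg_category_ids` are hypothetical and are not part of the official COCO format; the remaining keys follow the usual COCO detection layout.

```python
import json

# COCO-style ground truth with two extra per-image keys (hypothetical names).
gt = {
    "images": [
        {
            "id": 1,
            "file_name": "000001.jpg",
            "width": 640,
            "height": 480,
            "pos_category_ids": [1],  # classes verified as present
            "neg_category_ids": [2],  # classes verified as absent
        }
    ],
    "annotations": [
        {
            "id": 10,
            "image_id": 1,
            "category_id": 1,
            "bbox": [100.0, 120.0, 50.0, 80.0],  # [x, y, width, height]
            "area": 4000.0,
            "iscrowd": 0,
        }
    ],
    "categories": [{"id": 1, "name": "Cat"}, {"id": 2, "name": "Dog"}],
}


def image_labels(gt, image_id):
    """Return (positive, negative) category-id sets for one image."""
    for image in gt["images"]:
        if image["id"] == image_id:
            # Missing keys mean "no image-level labels", not "all negative".
            return (set(image.get("pos_category_ids", [])),
                    set(image.get("neg_category_ids", [])))
    raise KeyError(image_id)


serialized = json.dumps(gt)  # the file stays plain JSON
```

Tools that do not know about the extra keys would simply not read them, so existing COCO loaders should be unaffected.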
@tylin / @wilson891226: Are your datasets exhaustively annotated? Otherwise I don't understand where you are getting the information for the negative labels from.
Obj365 is not exhaustively annotated, but it can supply multi-labels (i.e., the tags of the corresponding bounding boxes) associated with an image.
COCO exhaustively labels images with instance annotations (a box/mask per object instance) and crowd annotations (a segmentation mask covering multiple objects). For images with 10+ objects, 10 of the objects are labeled with instance annotations and a segmentation mask is used to cover the rest of the objects. During inference, predictions that overlap with the crowd annotations are ignored.
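As a simplified sketch of the crowd handling: the code below uses boxes instead of masks and assumes that overlap with a crowd region is measured relative to the detection's own area, with a hypothetical threshold of 0.5. The actual COCO evaluation differs in its details (for example, detections are matched to regular instances before crowd regions are considered).

```python
def box_area(box):
    # box is [x, y, width, height], as in COCO
    return max(box[2], 0.0) * max(box[3], 0.0)


def intersection_area(a, b):
    x1 = max(a[0], b[0])
    y1 = max(a[1], b[1])
    x2 = min(a[0] + a[2], b[0] + b[2])
    y2 = min(a[1] + a[3], b[1] + b[3])
    return max(x2 - x1, 0.0) * max(y2 - y1, 0.0)


def overlaps_crowd(det_box, crowd_box, thresh=0.5):
    """True if at least `thresh` of the detection lies inside the crowd region."""
    area = box_area(det_box)
    return area > 0 and intersection_area(det_box, crowd_box) / area >= thresh


def drop_crowd_detections(det_boxes, crowd_boxes, thresh=0.5):
    """Remove detections that fall mostly inside any crowd region."""
    return [d for d in det_boxes
            if not any(overlaps_crowd(d, c, thresh) for c in crowd_boxes)]


dets = [[0.0, 0.0, 10.0, 10.0], [100.0, 100.0, 10.0, 10.0]]
crowds = [[0.0, 0.0, 50.0, 50.0]]
remaining = drop_crowd_detections(dets, crowds)
```

The first detection lies entirely inside the crowd region and is dropped; the second is kept and would be scored normally.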
OK, maybe I don't see it, but what's the point of having those positive labels without the negative ones? It's trivial to create the positive list on the fly in your framework from the boxable annotations if you need it.
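For illustration, a minimal sketch of deriving the positive labels on the fly, assuming COCO-style annotation dicts with `image_id` and `category_id` keys (the function name is hypothetical):

```python
from collections import defaultdict


def positive_labels_from_boxes(annotations):
    """Map each image id to the set of category ids that have at least one box."""
    labels = defaultdict(set)
    for ann in annotations:
        labels[ann["image_id"]].add(ann["category_id"])
    return dict(labels)


annotations = [
    {"image_id": 1, "category_id": 1},
    {"image_id": 1, "category_id": 2},
    {"image_id": 1, "category_id": 1},  # second instance of the same class
    {"image_id": 2, "category_id": 3},
]
positives = positive_labels_from_boxes(annotations)
```

Negative labels cannot be recovered this way unless the dataset is exhaustively annotated, which is why they would need to be stored explicitly.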
Additional image labels may be added in the future; closing the issue for now.
Potential candidates:
1) COCO format
2) OID format
3) MaskRCNN/detectron2 format
4) More??
As we use a COCO-style format for the other tasks, I suggest using it for boxable GT as well.