bytedance / coconut_cvpr2024

Apache License 2.0
150 stars 6 forks source link

COCONut-L are less images than expected #32

Open theo2021 opened 2 weeks ago

theo2021 commented 2 weeks ago

Hi, I tried creating the COCONut-L dataset and noticed the number of images were different than expected.

I got the COCONut-S -> 118K images. Then the COCONut-B -> 123K images. Then for COCONut-L I get from gdrive -> 105835 images. In total 347526 images. Different from the 358K stated in the paper.

image

feivellau commented 2 weeks ago

+1

xdeng7 commented 2 weeks ago

Sorry for the confusion. We remove around 10k noisy labels for the coconut-large object365 subset as they may affect the model performance. Now you should check the huggingface, this will give a collection of object365 subset of panoptic masks, both the panseg mask and infor has been updated. The converted instance masks will be updated soon.

image image
xdeng7 commented 2 weeks ago

Hi, I tried creating the COCONut-L dataset and noticed the number of images were different than expected.

I got the COCONut-S -> 118K images. Then the COCONut-B -> 123K images. Then for COCONut-L I get from gdrive -> 105835 images. In total 347526 images. Different from the 358K stated in the paper.

image

please noted that, the img_id could be customized into continuous number as we don't use the filenames in object365, you can find the details in the json files.