Closed shelhamer closed 5 years ago
man, can you run his codes successfully?
Thank you for raising this issue! We have updated our code for this and re-generated the dataset's "general" labels (the link provided in the README now points to the new version). Please note that the "general" labels were not used in our work (so this did not affect our results in any way) but we have addressed this in order to encourage future work to leverage this hierarchy.
Closing as solved by #9 and 1e9fe081be9560596fb3e5c4de01f0162cd65d3b. Thanks!
The general labels in each split seem to have an extra, corrupted(?) value that many data points take on. For instance in the training set there are 20 labels {0, ..., 19} but there is a label value 20 used by 341546 data points.
Should all of these data points be excluded, or is there a way to generate correct labels for these?
Apologies if I am confused in my understanding of your dataset. Thank you for working on a hierarchical few-shot dataset.
Here is some illustrative code from my exploration of the data: