Closed bkj closed 4 years ago
A bit of background: the annotated parcel database which we used to produce our labels provides a very fine grained nomenclature of ~400 classes. Yet, superclass nomenclatures that group similar classes together in label sets with fewer elements are also provided. The two label files provided in our dataset correspond to two such superclass systems (the classes in the 19 class system being more high level than the 44 one).
Not all classes of the nomenclature are observed in the S2 tile (hence 17 and 35 instead of 19 and 44)
In the paper we use the 44 class system, but we also select those classes for which more than 100 parcels are available in the dataset, reducing the number of classes to 20.
And I am not sure about the -1 class, I'll have a closer look!
Oops -- the -1 thing was my mistake -- apologies!
Good news ;)
Last thing about the classes: we use the sub_class
argument of PixelSetData
to only consider these 20 classes with enough samples.
Hi, Thank you very much for providing the implementation of your paper. I am trying to understand your code. But, I am not able to find the names of the 20 or 44 classes in the order used in the implementation. Can you direct me towards the name of the classes.
Kind regards, Priti
Here's the list of class names for each label. Was that what you needed ?
Thank you very much @VSainteuf for your quick response. This is what I was looking for. Thank you for that. I had one more question. The pre-trained weights provided with the repo, is it trained on this label_44class nomenclature? Or you are using label_20class nomenclature for it. Also, is it possible to get access to the actual Sentinel-2 dataset with the timestamps?
Kind regards, Priti
Yes, as said earlier in this issue:
- Not all classes of the nomenclature are observed in the S2 tile (hence 17 and 35 instead of 19 and 44)
- In the paper we use the 44 class system, but we also select those classes for which more than 100 parcels are available in the dataset, reducing the number of classes to 20.
So we use the 44 label set, reduced to the 20 most frequent classes. And I don't have the original Sentinel-2 dataset anymore. You would need to re-download it using the timestamps of dates.json.
Thanks @VSainteuf for the information. It was very helpful of you.
Hi, One quick question: I can see you have a class named Beetroot, does this class also encompass sugar beet? Also, the link to the pretrained weights does not work. Is it possible to have a working link? Thanks, Nathan
In the paper, the results are reported for a 20-class classification task. However, it looks like the number of classes in the dataset are either 17 or 35:
Are you able to give more details on the setup of the experiments in the paper?
Thanks!
Edit: Also -- there's a
-1
class inlabel_19class
-- what does that mean?