Open ArseniuML opened 1 month ago
Hi @ArseniuML,
The reason for the discrepancy between train/val classes can be explained by the fact that the training set was generated by an automatic annotation method with limited output classes while the val set was created by manual annotators with a broader classification set.
You are right that classes included in the validation set but not in the training set cannot be detected if a supervised model is trained on the dataset. However, in this way, open vocabulary methods can also be benchmarked using the aiMotive dataset.
I tried to investigate, object of what types are annotated in train and val splits:
Why are there classes in the validation split that are not in the training split (VAN, TRAILER, SHOPPING-CART, PICKUP)? Why are there classes in the train split that are not in the val split (TRAIN, RIDER)?