I generated the test set result with the same pretrained model that scored best on the validation set. I did not retrain the network on the combined train and val data, and I did not use test-time augmentation (TTA). Many other methods use both in their test set submissions, and either could improve the test set result considerably.
The data distributions of the SemanticKITTI val and test sets differ considerably. You will find large result differences in most classes, especially rare ones such as motorcyclist.
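
For anyone who wants to try TTA themselves, here is a minimal sketch of the general idea, not what this repo or any particular submission does. It assumes a hypothetical `predict_proba` callable that maps an (N, 3) array of point coordinates to an (N, C) array of class probabilities, and it uses axis flips as the augmentation set, which is only one possible choice.

```python
import numpy as np


def predict_with_tta(predict_proba, points):
    """Average per-point class probabilities over flipped copies of a scan.

    predict_proba is a hypothetical callable (not part of any real library):
    it takes an (N, 3) array of xyz coordinates and returns an (N, C) array
    of class probabilities.
    """
    # Assumed augmentation set: identity plus flips of the x and y axes.
    # Flipping does not change which point carries which label.
    flips = [(1.0, 1.0), (-1.0, 1.0), (1.0, -1.0), (-1.0, -1.0)]

    summed = None
    for sx, sy in flips:
        augmented = points * np.array([sx, sy, 1.0])
        probs = predict_proba(augmented)
        summed = probs if summed is None else summed + probs

    # Mean probability per point and class, then the most likely class.
    mean_probs = summed / len(flips)
    return mean_probs.argmax(axis=1)
```

The averaged prediction is usually a bit more stable than a single forward pass, at the cost of one extra inference per augmentation.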