cocodataset / panopticapi

COCO 2018 Panoptic Segmentation Task API (Beta version)

Panoptic quality not symmetric? #16

Open tambetm opened 5 years ago

tambetm commented 5 years ago

In section 6 of the "Panoptic Quality" paper, under "Human annotations", it is stated that PQ is symmetric, i.e., the order of ground truth and predictions does not matter. In practice this does not seem to be the case. For example:
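For context, the paper defines PQ as the average IoU over matched segment pairs, penalized by unmatched segments:

```
PQ = ( sum of IoU(p, g) over matched pairs (p, g) ) / ( |TP| + |FP|/2 + |FN|/2 )
```

Since IoU(p, g) = IoU(g, p) and the IoU > 0.5 matching criterion treats the two segmentations identically, the formula itself is symmetric; the asymmetry has to come from steps that treat ground truth and predictions differently.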

python panopticapi/evaluation.py --gt_json_file gt.json --pred_json_file pred.json
...
          |    PQ     SQ     RQ     N
--------------------------------------
All       |  30.1   45.2   33.3     2
Things    |  30.1   45.2   33.3     2
Stuff     |   0.0    0.0    0.0     0
...
python panopticapi/evaluation.py --gt_json_file pred.json --pred_json_file gt.json
...
          |    PQ     SQ     RQ     N
--------------------------------------
All       |  81.3   81.3  100.0     1
Things    |  81.3   81.3  100.0     1
Stuff     |   0.0    0.0    0.0     0
...

One of the reasons seems to be the special treatment of the VOID class: for example, a prediction is not counted as a false positive when more than half of its area overlaps ground-truth VOID (section 4.2 in the paper, under "Void labels"). I think this enforces one interpretation of VOID, "not labeled", under which ignoring such areas makes sense. The alternative interpretation is "class not known", in which case marking such an area with a known class should count as a false positive. The second interpretation is more common IMHO, and it would retain the symmetry of the metric.
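A minimal sketch of why this rule breaks symmetry (simplified 1-D pixel labels, class-agnostic matching, and void handling reduced to the essentials; the function and variable names are hypothetical and this is not the actual `panopticapi` implementation):

```python
from collections import defaultdict

VOID = 0  # assumed void label for this sketch

def segments(pixels):
    """Map each non-void segment id to its set of pixel indices."""
    segs = defaultdict(set)
    for i, lab in enumerate(pixels):
        if lab != VOID:
            segs[lab].add(i)
    return segs

def pq(gt, pred):
    """Simplified PQ over 1-D label lists, with the void rule from the
    paper: an unmatched prediction whose area overlaps ground-truth VOID
    by more than 0.5 is dropped from the false-positive count."""
    gt_segs, pred_segs = segments(gt), segments(pred)
    void_px = {i for i, lab in enumerate(gt) if lab == VOID}
    iou_sum, tp, fn = 0.0, 0, 0
    matched = set()
    for g in gt_segs.values():
        for pid, p in pred_segs.items():
            if pid in matched:
                continue
            iou = len(g & p) / len(g | p)
            if iou > 0.5:          # IoU > 0.5 makes the match unique
                matched.add(pid)
                iou_sum += iou
                tp += 1
                break
        else:
            fn += 1
    fp = sum(1 for pid, p in pred_segs.items()
             if pid not in matched
             and len(p & void_px) / len(p) <= 0.5)  # the void rule
    denom = tp + 0.5 * fp + 0.5 * fn
    return iou_sum / denom if denom else 0.0

gt_px   = [0, 0, 0, 0, 0, 1, 1, 1, 1, 1]  # pixels 0-4 are VOID in gt
pred_px = [2, 2, 2, 2, 2, 1, 1, 1, 1, 1]  # prediction fills the void area
print(pq(gt_px, pred_px))  # 1.0   -- segment 2 is excused by the void rule
print(pq(pred_px, gt_px))  # ~0.667 -- roles swapped, segment 2 becomes a FN
```

In the forward direction, segment 2 overlaps ground-truth VOID completely, so it is not a false positive and PQ is 1.0. With the arguments swapped, there is no VOID in the (new) ground truth, segment 2 goes unmatched as a false negative, and PQ drops to 1 / 1.5.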

I suggest changing the interpretation of VOID to make the metric symmetric, or adding it as a command-line option.