Open RenShuhuai-Andy opened 2 months ago
Hi @universome, thanks for your great work!
I followed your instructions to evaluate fvd2048_16f with calc_metrics_for_dataset.py, but I found a potential bug.
Specifically, I prepared the ground truth (gt) and predicted datasets according to the following dataset structure:
    dataset/
        video1/
            - frame1.jpg
            - frame2.jpg
            - ...
        video2/
            - frame1.jpg
            - frame2.jpg
            - ...
        ...
However, the code at https://github.com/universome/stylegan-v/blob/master/src/training/dataset.py#L319 sorts the frames in lexicographic order of their file names (frame1, frame10, frame11, frame12, ...) instead of numerical order (frame1, frame2, frame3, frame4, ...).
I wonder whether these out-of-order frame sequences lead to inaccurate evaluation results, since the I3D features would then be computed on frames in the wrong temporal order?
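To illustrate the difference, here is a minimal sketch of a numerical ("natural") sort next to the default lexicographic one. The helper name `natural_key` and the sample file names are my own for illustration; this is not code from the repository, just one possible way to order the frames.

```python
import re


def natural_key(name: str):
    """Split a file name into text and integer chunks so digit runs compare as numbers.

    re.split with a capturing group always alternates text, digits, text, ...,
    so odd positions hold the digit runs and can be converted to int safely.
    """
    parts = re.split(r"(\d+)", name)
    return [int(p) if i % 2 else p.lower() for i, p in enumerate(parts)]


# Hypothetical frame names following the dataset structure described above.
frames = ["frame1.jpg", "frame10.jpg", "frame11.jpg", "frame2.jpg", "frame3.jpg"]

# Default string sort: frame10 and frame11 land before frame2.
lexicographic = sorted(frames)

# Natural sort: frames come out in their temporal order.
numerical = sorted(frames, key=natural_key)

print(lexicographic)
print(numerical)
```

Alternatively, zero-padding the file names (frame0001.jpg, frame0002.jpg, ...) makes the lexicographic and numerical orders coincide, so that would be a workaround on the dataset side without touching the code.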