Closed sebastianlutter closed 3 years ago
Hmm, very strange.
... --text {your_model} --ten_crop
when testing. Usually for performance: 10-crop > 5-crop > center-crop. Thus I will expect your model have about 74%-77% accuracy with five-crop or ten-crop. --schedule 300 400
in the training command. My bad, I didn't set it as default. Thank you for the clarifications. I'll check all that, run again and will report what my findings are
OK. Re-open if have more questions.
Thank you for the clarifications. I'll check all that, run again and will report what my findings are Hi Sebastian, I meet the same issue. I got 0.45 on ten-crop result. Did you fix the problem? What did you do?
No, I gave up at some point
Got it. That is strange. I used the same environment to run DPC and it works. Not know what is wrong.
I used the pretrained RGB stream kinetics400 base model you kindly provided and did a fine-tune training and think my results are not as expected. The project code runs without issues. The only change I did to your code is the numpy flattening issue from https://github.com/TengdaHan/MemDPC/issues/11.
Would be great if you can provide your opinion on what I did and the results I got.
Used python3.7:
process_data/src/extract_ff.py
to extract frames (commented-out flow features code, just RGB frames)process_data/src/write_csv.py
to generate csv files (clip path and frames count)process_data/data/ucf101/ClassInd.txt
with sorted list of the 101 labelsFrom the results in your paper I expected something like 0.70 Acc@1, but I got 0.41. I'm unsure if the Top1 accuracy mentioned in Table 2 in your paper (MemDPC, K400 (28d), Res. 224 , Arch. R-2D3D, depth 33, Modality V --> UCF 78.1%) has been trained with RGB or flow features. But results of my training should be at least as good as the C2 variant in Table 1 (full training with UCF101 on 128 img dims using RGB input and memory size 1024 --> Top1 of 68.2).
What is the expected accuracy for the training I did?