About the accuracy of 94.5% on UCF101 using the Resnext101?

Wei2Huang commented 6 years ago

Hello, can you tell me how can achieve the the accuracy of 94.5% on UCF101 using the Resnext101? I use your code, the same network architecture(Resnext101) and your pretrained parameters(resnext-101-64f-kinetics-UCF101_split1.pth) by the manner of sliding windows over the sequeence of videos. But I can only got the accuracy of 90.35%. I set somthing wrong?

kenshohara commented 6 years ago

How do you evaluate the accuracy? If you get the accuracy in val.log, the accuracy is the clip-level accuracy whereas the accuracy reported in my paper is the video-level accuracy, which can be evaluated using val.json and eval_ucf101.py.

huangwei1995 commented 6 years ago

Oh, sorry, I used the 16-frames input so got the low accuracy. But if use the 64-frames input, how to deal with the video with the lower number of frames than 64? zero padding?

josueortc commented 6 years ago

Hello @Wei2Huang could you send the config you use (python main.py ...) for running testing on UCF101?

huangwei1995 commented 6 years ago

Hello @Wei2Huang could you send the config you use (python main.py ...) for running testing on UCF101? I have read your paper in details later and I found your method, so now I got the accuracy of 94.7%, thank you very much!

KT27-A commented 5 years ago

@kenshohara appreciate your work indeed! And @Wei2Huang would you please tell me the details of your training? I just can get 85% for val.log based on the setting on the paper. What is your training epochs? What is your lr_patience? What is your batch size? Thank you very much.

huangwei1995 commented 5 years ago

I also tried model resnext-101-64f-kinetics-UCF101_split1.pth, but got 85% val accuracy. I commented the fc reinitialized code and that was all I changed the code. Is my command wrong? My command is as follows: @kenshohara @huangwei1995 python main.py --root_path . --video_path UCF-101-1 --annotation_path ucf_list/ucf101_01.json --result_path results --dataset ucf101 --n_classes 101 --n_finetune_classes 101 --pretrain_path ../pretrained_model/resnext-101-64f-kinetics-ucf101_split1.pth --ft_begin_index 4 --model resnext --model_depth 101 --resnet_shortcut B --batch_size 128 --n_threads 4 --checkpoint 5 --n_epochs 200 --test --no_train

I don't achieve the results, and I just use the pre-trained model provided by the authors. When I try to realize it, I just got the 86%..

KT27-A commented 5 years ago

@huangwei1995 Nowadays I trained the network and reached the accuracy as the paper said. @kenshohara Thank you very much for your sharing, I really learned a lot. And I found one place inappropriate, for your information. In ucf101.py, row 82, the function make_dataset will generate more than 3 samples when 64f with validation. I added

if (j + sample_duration) >= n_frames:
      break

at the end of the function. The val_acc will increase 2% for its getting rid of recurrent samples.

sumeetssaurav commented 5 years ago

@Katou2 Could you please share the details of your config file which you used for fine-tuning using resnext-100 and resnext-101-64f version of the models on UCF-101. I want to know the value of the hyper parameters used in the training. With the help of the resenext-101 64f fine tuned model provided by the author I could achieve 93 percent accuracy of the Split1 of UCF-101. However, using my own fine-tuned model the accuracy hardly reaches 90 %.

sumeetssaurav commented 5 years ago

There is another issue in the code. Once you generate the val.json, the file contains prediction results of 1 video less than what contained in the val list of videos. I mean if I have 100 validation videos, the the generated val.json will have prediction results of only 99 videos. I still wonder, what could be the possible reason for such behaviour.

KT27-A commented 5 years ago

@sumeetssaurav I just utilized the default hyperparameters. Are you sure you tested at video level accuracy? For the second problem, I think you are right. The reason is that the last sample would not be tested in test.py. To settle this problem, you can add the following code at the end of dataloader function in test.py.

if i == len(data_loader) - 1:
    test_results = calculate_video_results(output_buffer, previous_video_id,
                                        test_results, class_names)

sumeetssaurav commented 5 years ago

@Katou2 Thanks for the quick reply. Still wondering where I have to put the loop suggested by you. Could you please highlight it in the test.py code.