Problems with test_one_video.py using CSL-Daily dataset

lightandfuture commented 3 months ago

I tried with the test_one_video.py with CSL-Daily and some issues occurred. I tested it with the following 4 datasets from CSL-Daily, and all of them are not giving out the correct answers.

Datasets:

S000000_P0000_T00
S000003_P0000_T00
S000000_P0008_T00
S000009_P0008_T00

Errors:
(S000003_P0000_T00) (corrNet) ubuntu@Washington:home/CorrNet$ python test_one_video.py test_one_video.py:54: DeprecationWarning: an integer is required (got type numpy.float64). Implicit conversion to integers using int is deprecated, and may be removed in a future version of Python. video_length = torch.LongTensor([np.ceil(vid.size(1) / total_stride) total_stride + 2left_pad ]) output glosses : [[('有', 0)]] ##################################################################
(S000000_P0008_T00) (corrNet) ubuntu@Washington:home/CorrNet$ python test_one_video.py test_one_video.py:54: DeprecationWarning: an integer is required (got type numpy.float64). Implicit conversion to integers using int is deprecated, and may be removed in a future version of Python. video_length = torch.LongTensor([np.ceil(vid.size(1) / total_stride) total_stride + 2left_pad ]) output glosses : [[('2', 0)]] ##################################################################
(S000009_P0008_T00) (corrNet) ubuntu@Washington:~/CorrNet$ python test_one_video.py test_one_video.py:54: DeprecationWarning: an integer is required (got type numpy.float64). Implicit conversion to integers using int is deprecated, and may be removed in a future version of Python. video_length = torch.LongTensor([np.ceil(vid.size(1) / total_stride) total_stride + 2left_pad ]) output glosses : [[('山', 0), ('雪', 1)]]

Video Maps:

000000|S000000_P0000_T00|52|你们好|你们好！|你们好！|r a w
000003|S000001_P0000_T00|37|对不起|对不起！|对不起！|v w
000000|S000000_P0000_T00|52|你们好|你们好！|你们好！|r a w
000009|S000003_P0000_T00|30|谢谢|谢谢！|谢谢！|v w

PS: Although dataset_1 and dataset_3 have the same meanings, they are performed by different person and the result seems to be different. For dataset_1, we got [[('5', 0)]], and for dataset_3 we got [[('2', 0)]].

Can you help me with this problem? I am new to this area and I can't really figure out the problem myself. Thank you!

hulianyuyy commented 3 months ago

I have tested the file and can get correct results. May i ask if you have loaded the pretrained weights for CSL-daily dataset? Or maybe you can first load the weights for inference and see if it could reach 30.6 WER. A similar issue has been raised before.

lightandfuture commented 3 months ago

I have tested the file and can get correct results. May i ask if you have loaded the pretrained weights for CSL-daily dataset? Or maybe you can first load the weights for inference and see if it could reach 30.6 WER. A similar issue has been raised before.

Yes, I have loaded the pretrained weights provided, and I also checked the resnet.py and adjusted it according to the readme file.

Here are the files : resnet.txt test_one_video.txt

I'm sorry but I didn't seem to find the solution to issue#27.

hulianyuyy commented 2 months ago

I have checked your files and don't see any errors. I still can't figure out the issue. Maybe this issue only happens on the CSL-Daily dataset. Have you tested the test_one_video.py upon the phoenix14 dataset? Could it provide the correct outputs?

lightandfuture commented 2 months ago

I have checked your files and don't see any errors. I still can't figure out the issue. Maybe this issue only happens on the CSL-Daily dataset. Have you tested the test_one_video.py upon the phoenix14 dataset? Could it provide the correct outputs?

Sorry for the late reply. Yes, test_one_video.py worked fine upon the phoenix14 dataset on our device.

hulianyuyy commented 2 months ago

I concurrently don't have an exact answer for this question. Can you get 30.1 WER on the CSL-Daily dataset with the pretrained weight? It's strange that the test_one_video.py performs normally on my device.

hulianyuyy / CorrNet

Problems with test_one_video.py using CSL-Daily dataset #45

S000009_P0008_T00