Assistance Requested with Discrepancies in EGTEA Dataset Testing Results

Greetings,

First of all, congratulations on your recent work on "Listen to Look into the Future: Audio-Visual Egocentric Gaze Anticipation". I have been trying to replicate the results from your study ("In the Eye of Transformer: Global-Local Correlation for Egocentric Gaze Estimation") using the provided repository and readme instructions on the EGTEA dataset. Unfortunately, I encountered discrepancies in the metric results compared to those reported in your paper. After fixing a couple of initial errors in the setup, the script executed successfully, but the results were still inconsistent with those expected. I would appreciate it if you could guide me on where I might have made a mistake or error during testing.

When I ran the script I countered two errors: The first error was: ModuleNotFoundError for torch._six:

Error: ModuleNotFoundError: No module named 'torch._six'
Fix: Commented out torch._six and set _int_classes = int. The second error was: FileNotFoundError for Gaze Data Files:
Error: FileNotFoundError: [Errno 2] No such file or directory: '/path/to/gaze_data/P01-R01-PastaSalad-GazeData.txt'
Fix: Modified line 115 in egtea_gaze.py to "label_name = video_name + '.txt' #if video_name[0] == 'O' else video_name+'-GazeData.txt' " since in the gaze_data directory in the dataset (EGTEA dataset) directory there is no file that has -GazeData at the end of the file name.

After fixing these two errors the script worked but the results were different than the ones in the paper.

The Execution Command I ran: CUDA_VISIBLE_DEVICES=0 python tools/run_net.py \ --cfg /scratch/users/theed/GLC2/GLC/configs/Egtea/MVIT_B_16x4_CONV.yaml \ TRAIN.ENABLE False \ TEST.BATCH_SIZE 32 \ NUM_GPUS 1 \ OUTPUT_DIR checkpoints/GLC \ TEST.CHECKPOINT_FILE_PATH /scratch/users/theed/GLC2/GLC/MViT_Egtea_ckpt.pyth \ DATA.PATH_PREFIX /scratch/users/theed/GLC2/GLC/egtea

The EGTEA directory structure was:

egtea	_ cropped_clips		_ OP01-R01-PastaSalad			_ OP01-R01-PastaSalad-1002316-1004005-F024051-F024101.mp4			_ OP01-R01-PastaSalad-1004110-1021110-F024057-F024548.mp4			_ ...		_ OP01-R02-TurkeySandwich			_ OP01-R02-TurkeySandwich-102320-105110-F002449-F002529.mp4			_ OP01-R02-TurkeySandwich-105440-106460-F002528-F002558.mp4			_ ...		_ ...
_ gaze_data
_ OP01-R01-PastaSalad.txt
_ OP01-R02-TurkeySandwich.txt
_ OP01-R03-BaconAndEggs.txt
_ ...
_ P25-R06-GreekSalad.txt
_ P26-R05-Cheeseburger.txt

I have attached the logging file with the configurations and the results I obtained. Could you please guide me on where I might have gone wrong or what additional steps I should take to align my results with those reported in your paper?

Thank you in advance for your time and assistance

stdout.log

BolinLai / GLC

Assistance Requested with Discrepancies in EGTEA Dataset Testing Results #7