I train the 256x704 config with 8 L20 and finds that temporal training (after 6 epochs) takes much longer time than single-frame warmup epochs (before 6 epochs). I attach my training log here. Is that normal for this training speed?
20241002_223820.log
I train the 256x704 config with 8 L20 and finds that temporal training (after 6 epochs) takes much longer time than single-frame warmup epochs (before 6 epochs). I attach my training log here. Is that normal for this training speed? 20241002_223820.log