Hi, Recently, when I test with model you provided on culane dataset, I found that the model has trained at least 13 epochs, but according cfg file from experiments/exp10, it will only training maximum 11 epochs (L212), so can you tell me what cfgs you used in training on culane dataset? Besides, the lr (16e-2) is also a little bit large and will cause loss to be Nan.
The number of epoch doesn't really matter that much. I remember around 10k iterations would be fine. Also, I trained with 8 GPUs. So maybe you can set lr to 1e-2 or so.
Hi, Recently, when I test with model you provided on culane dataset, I found that the model has trained at least 13 epochs, but according cfg file from experiments/exp10, it will only training maximum 11 epochs (L212), so can you tell me what cfgs you used in training on culane dataset? Besides, the lr (16e-2) is also a little bit large and will cause loss to be Nan.