Val loss increase when training

li-ronghui / LODGE

The code the CVPR2024 paper Lodge: A Coarse to Fine Diffusion Network for Long Dance Generation Guided by the Characteristic Dance Primitives

110 stars 6 forks source link

Thanks for your impressive work! I'm trying to train LODGE model from scratch to reproduce the results. However, when training the global diffusion and finetuning the local diffusion, the train loss gradually decreased，while the val loss started to increase after a certain number of iterations. This suggests the model may be overfitting. I followed the same config files as you provided here, but only modified the batch size to prevent OOM on my GPU. Could you provide more detailed instruction on training to solve this problem and how to select the final checkpoints of two diffusion models?

Global Diffusion Loss global loss Local Diffusion Loss (finetune) local loss

li-ronghui / LODGE

Val loss increase when training #20