hpcaitech / Open-Sora

Open-Sora: Democratizing Efficient Video Production for All
https://hpcaitech.github.io/Open-Sora/
Apache License 2.0
20.77k stars, 1.97k forks

The loss curve fluctuates greatly #413

COST-97 closed 1 month ago

COST-97 commented 1 month ago

Hello, I found that the loss curve fluctuates greatly when fine-tuning the Open-Sora 1.1 model. I tried batch sizes of 32 and 64, learning rates of 2e-5 and 1e-6, optimizer beta values of (0.95, 0.9995), and other settings, but the problem persists in every case. Do you have any suggestions on how to mitigate it?

github-actions[bot] commented 1 month ago

This issue is stale because it has been open for 7 days with no activity.

github-actions[bot] commented 1 month ago

This issue was closed because it has been inactive for 7 days since being marked as stale.

SkylerZheng commented 3 weeks ago

I'm wondering what learning rates were used for each stage of training the video generation model?