microsoft / LMOps

General technology for enabling AI capabilities w/ LLMs and MLLMs
https://aka.ms/GeneralAI
MIT License

【MiniLLM】Is it normal to get a negative loss at some steps? #183

Closed: lllyyyqqq closed this issue 2 months ago

lllyyyqqq commented 7 months ago

First, excellent work! I am trying to reproduce the results using my own data, and I have changed some of your code. During training, at some steps, I got negative values for rl_loss, reg_loss, and pg_loss. Is this normal behavior?

t1101675 commented 7 months ago

It seems abnormal to get negative losses.
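
One way to narrow down where the negative values come from is to flag, at each step, which loss terms are negative or non-finite, and then compare against the code that was modified. The following is only a minimal sketch: it assumes the three values (rl_loss, reg_loss, pg_loss) are available as plain numbers at each step, and the helper name `negative_loss_terms` is hypothetical, not part of the MiniLLM code.

```python
import math


def negative_loss_terms(step_losses):
    """Return the names of loss terms that are negative or non-finite.

    `step_losses` maps a loss name to its scalar value for one step.
    This is a hypothetical debugging helper, not a MiniLLM function.
    """
    flagged = []
    for name, value in step_losses.items():
        value = float(value)
        # NaN and inf are flagged too, since they also point to a bug.
        if not math.isfinite(value) or value < 0.0:
            flagged.append(name)
    return flagged


# Illustrative values only, not taken from a real training run.
example = {"rl_loss": -0.12, "reg_loss": 0.40, "pg_loss": -0.52}
print(negative_loss_terms(example))  # ['rl_loss', 'pg_loss']
```

Calling this once per logged step and printing the step index alongside the flagged names would show whether the negative values appear in all three terms together or in one term first.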