Closed mainpyp closed 1 year ago
Hi, We split the metrics according to different timesteps, and thus it is convenient for us to trace the metrics in different timesteps. More specifically, for example, the 4 splits for loss_q stand for the average loss for timesteps [0, 500), [500, 1000), [1000, 1500), [1500, 2000] respectively.
When it comes to training metrics, I cannot find the difference between the different q splits. (loss_q1, nll_q0, etc.) Could you shortly explain or reference the corresponding paper / paragraph?