Logging Inconsistent For LongNet

DRAGNLabs / 301r_retnet

2 stars 1 forks source link

Open nprisbrey opened 2 months ago

nprisbrey commented 2 months ago

We need to be consistently logging the same values for all types of models. For the LongNet architecture, for example, we only log Validation loss whereas we record Validation loss and perplexity for the Transformer models and RetNet models.

Also, why are we only recording perplexity for Validation? I'd like to see loss and perplexity recorded for all models for all datasets.

nprisbrey commented 2 months ago