kheyer / Genomic-ULMFiT

ULMFiT for Genomic Sequence Data

loss values #5

Open jsephine opened 4 years ago

jsephine commented 4 years ago

Hello, thanks a lot for the great repo!

I'm trying to train a language model on different k-mers with stride=1. As you already discussed, the loss for this LM should be around ln(4^1) ≈ 1.39.

However, when I train on 6-mers with stride 1, the loss is around 8.3, close to ln(4^6) ≈ 8.32, and when I train on 4-mers with stride 1, the loss is around 5.2, close to ln(4^4) ≈ 5.55. Do you have any idea why this happens?
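
For reference, here is a minimal sketch of the baselines being compared, assuming a model that guesses uniformly over all 4^k possible k-mers. The helpers `kmer_tokens` and `uniform_loss` are hypothetical names for illustration, not functions from this repo, and the tokenizer here is only a stand-in for whatever the repo actually does.

```python
import math

def kmer_tokens(seq, k, stride=1):
    """Split a sequence into k-mers taken every `stride` bases (hypothetical helper)."""
    return [seq[i:i + k] for i in range(0, len(seq) - k + 1, stride)]

def uniform_loss(k, alphabet_size=4):
    """Cross-entropy in nats of a uniform guess over all alphabet_size**k k-mers."""
    return math.log(alphabet_size ** k)

# With stride 1, consecutive tokens overlap in k-1 bases.
tokens = kmer_tokens("ACGTAC", k=4, stride=1)

# Uniform-guess baselines for 1-mers, 4-mers, and 6-mers.
baselines = {k: round(uniform_loss(k), 2) for k in (1, 4, 6)}

print(tokens)     # ['ACGT', 'CGTA', 'GTAC']
print(baselines)  # {1: 1.39, 4: 5.55, 6: 8.32}
```

A loss near `uniform_loss(k)` would mean the model is doing no better than chance over the full k-mer vocabulary, whereas a loss near `uniform_loss(1)` would mean it has learned that, at stride 1, only the last base of each new k-mer is unknown.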

Thanks!