AnswerDotAI / bert24

Apache License 2.0

Add Parameter Count #67

Closed warner-benjamin closed 1 week ago

warner-benjamin commented 1 week ago

Add get_number_parameters, which by default follows Karpathy's nanoGPT and counts the token embeddings, but excludes non-trainable parameters, absolute positional embeddings, and the MLM head.
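For readers who want to see the counting rule spelled out, here is a minimal sketch of the behavior described above. It is not the code from this PR: the function name `count_parameters_sketch`, its keyword arguments, and the substring keys (`token_embeddings`, `position_embeddings`, `mlm_head`) are all hypothetical, and `FakeParam` is a stand-in so the example runs without PyTorch. The function only relies on `numel()` and `requires_grad`, so it should also accept the output of a real `model.named_parameters()`.

```python
class FakeParam:
    """Stand-in for torch.nn.Parameter; only numel() and requires_grad are used."""

    def __init__(self, n, requires_grad=True):
        self._n = n
        self.requires_grad = requires_grad

    def numel(self):
        return self._n


def count_parameters_sketch(
    named_params,
    count_token_embedding=True,          # nanoGPT-style default: keep token embeddings
    trainable_only=True,                 # skip parameters with requires_grad=False
    token_key="token_embeddings",        # hypothetical name fragment
    position_key="position_embeddings",  # hypothetical name fragment
    head_key="mlm_head",                 # hypothetical name fragment
):
    """Sum numel() over (name, param) pairs, applying the exclusions above."""
    total = 0
    for name, param in named_params:
        if trainable_only and not param.requires_grad:
            continue  # non-trainable parameters are excluded
        if position_key in name or head_key in name:
            continue  # absolute positional embeddings and MLM head are excluded
        if not count_token_embedding and token_key in name:
            continue  # optionally drop token embeddings as well
        total += param.numel()
    return total


# Toy parameter list with made-up names and sizes.
toy = [
    ("embeddings.token_embeddings.weight", FakeParam(1000 * 16)),
    ("embeddings.position_embeddings.weight", FakeParam(128 * 16)),
    ("encoder.layer0.weight", FakeParam(16 * 16)),
    ("encoder.frozen.weight", FakeParam(16, requires_grad=False)),
    ("mlm_head.dense.weight", FakeParam(16 * 16)),
]

print(count_parameters_sketch(toy))                               # 16256
print(count_parameters_sketch(toy, count_token_embedding=False))  # 256
```

With a real PyTorch model, the same sketch could be called as `count_parameters_sketch(model.named_parameters())`, provided the name fragments are adjusted to match that model's actual module names.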