helpmefindaname / transformer-smaller-training-vocab

Temporary remove unused tokens during training to save ram and speed.
https://helpmefindaname.github.io/transformer-smaller-training-vocab/
MIT License
20 stars 2 forks source link

set the vocab size correctly when recreating the full embedding #11

Closed helpmefindaname closed 7 months ago