Closed robertostling closed 7 years ago
Currently the whole training data set is loaded to RAM, but this obviously does not scale. At some point we need to fix this, if we want to train with huge corpora.
I've written a module for this now, will integrate into HNMT later. Will come in handy now that CSC is charging for RAM usage.
Implemented in 6ebca3ba
Currently the whole training data set is loaded to RAM, but this obviously does not scale. At some point we need to fix this, if we want to train with huge corpora.