curiosity-ai / catalyst

🚀 Catalyst is a C# Natural Language Processing library built for speed. Inspired by spaCy's design, it brings pre-trained models, out-of-the-box support for training word and document embeddings, and flexible entity recognition models.
MIT License
699 stars, 71 forks

Can I train in portions, batchwise? #78

Open goroggy opened 1 year ago

goroggy commented 1 year ago

Hi there, I have some 160,000 documents to train a FastText model on, and I run out of memory if I build a single Document[] from them. Can I train in batches, continuing the training of the same model with each new portion? I suspect GetPreviousData is involved, but it's unclear how to use it. Thanks!