n-waves / multifit

The code to reproduce results from paper "MultiFiT: Efficient Multi-lingual Language Model Fine-tuning" https://arxiv.org/abs/1909.04761
MIT License
282 stars 56 forks source link

Is it possible to load several "learn" models at a time? #48

Closed iuria21 closed 4 years ago

iuria21 commented 4 years ago

I don't understand very well how the model works, but I was wondering if each classification model is loading the language model independently and then doing the evaluation. I want to use it for production with different classification datasets, but each of them need a lot of CPU memory.

PiotrCzapla commented 4 years ago

LM is being updated for each classification task so it is slightly different. You could implement multitask learning to get the optimisation you are looking for, but I would start directly from fastai to do so it will be easier than using this repo.