argosopentech / argos-train

Training scripts for Argos Translate
https://www.argosopentech.com
MIT License
110 stars, 21 forks

Adding to an existing language model #34

Open mbachtell opened 9 months ago

mbachtell commented 9 months ago

If I wanted to add to an existing model, how would I do that? I have topic-specific language from scientific domains that I would like to add.

I didn't see anything in the open or closed tickets.

Thanks!

mayeulk commented 8 months ago

I have the same question. I do not know whether this is currently achievable, but the question seems to be a duplicate of https://github.com/argosopentech/argos-train/issues/12 (which says it is not straightforward). From a mathematical point of view, this is doable, since it is a transformer model. From a software point of view, it is already done in some fields; see for instance:

I believe there is at least one strategy:

  1. taking the initial aligned bilingual corpus of the existing model
  2. adding your aligned documents to that corpus
  3. retraining the model on the new corpus.

This would require adding local data; see https://github.com/argosopentech/argos-train/issues/24
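
Steps 1 and 2 of the strategy above could look roughly like the following. This is only a minimal sketch, assuming both corpora are plain-text files with one sentence per line and the source and target files aligned line by line. The function names and the `oversample` parameter are hypothetical and are not part of argos-train. Step 3 (retraining) is not shown, and neither is how argos-train packages its data.

```python
def read_pairs(source_path, target_path):
    """Read a line-aligned parallel corpus as a list of (source, target) pairs."""
    with open(source_path, encoding="utf-8") as f:
        source_lines = [line.rstrip("\n") for line in f]
    with open(target_path, encoding="utf-8") as f:
        target_lines = [line.rstrip("\n") for line in f]
    if len(source_lines) != len(target_lines):
        raise ValueError(
            f"misaligned corpus: {len(source_lines)} source lines "
            f"vs {len(target_lines)} target lines"
        )
    return list(zip(source_lines, target_lines))


def merge_corpora(base_pairs, domain_pairs, oversample=1):
    """Append domain pairs to the base corpus.

    Pairs with an empty side and exact duplicates are dropped.
    `oversample` (a hypothetical knob for this sketch) repeats the domain
    pairs, since domain data is usually much smaller than the base corpus.
    """
    seen = set()

    def unique(pairs):
        kept = []
        for source, target in pairs:
            pair = (source.strip(), target.strip())
            if all(pair) and pair not in seen:
                seen.add(pair)
                kept.append(pair)
        return kept

    merged = unique(base_pairs)
    merged += unique(domain_pairs) * oversample
    return merged


def write_pairs(pairs, source_path, target_path):
    """Write pairs back out as two line-aligned files."""
    with open(source_path, "w", encoding="utf-8") as f:
        f.writelines(source + "\n" for source, _ in pairs)
    with open(target_path, "w", encoding="utf-8") as f:
        f.writelines(target + "\n" for _, target in pairs)
```

For example, `merge_corpora(read_pairs("base.en", "base.fr"), read_pairs("science.en", "science.fr"), oversample=2)` would produce the combined corpus to retrain on (the file names here are placeholders).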

mayeulk commented 8 months ago

Duplicate of #12