Closed luffycodes closed 2 years ago
Unfortunately, we don't have distilled LUKE models.
Also, what is the procedure to train the lite models? Does one just not the stage 1 training and skip directly to stage 2?
The lite models come from the original models, so they were not newly pretrained. We just took the word encoder and special entity embeddings from the original models and make the lite models. They are just for convenience, having smaller memory footprints when you don't need so many entity embeddings.
Hello,
Just curious to know if there are any distilled Luke models as well?
Also, what is the procedure to train the lite models? Does one just not the stage 1 training and skip directly to stage 2?
Thanks in advance,