Is there a guide to Train eagle heads on custom models?

SafeAILab / EAGLE

Official Implementation of EAGLE-1 (ICML'24) and EAGLE-2 (EMNLP'24)

https://arxiv.org/pdf/2406.16858

Apache License 2.0

780 stars 79 forks source link

Is there a guide to Train eagle heads on custom models? #78

Closed cryoco closed 2 months ago

cryoco commented 3 months ago

Appreciate the guide on Inferencing on custom models. Is there a guide on how I can train my own eagle heads on a custom auto-regressive model?

Aharrypotter commented 3 months ago

插眼

Liyuhui-12 commented 3 months ago

You can use the script here for training. The code for the data generation phase needs to be modified. You need to modify the preprocess_function to ensure that the conversation matches the template and the loss_mask is in the correct position. Of course, you also need to pay attention to modifying the template during inference.

cryoco commented 3 months ago

You can use the script here for training. The code for the data generation phase needs to be modified. You need to modify the preprocess_function to ensure that the conversation matches the template and the loss_mask is in the correct position. Of course, you also need to pay attention to modifying the template during inference.

Thanks for the reply! Do I need to modify the modeling too? It seems a bit weird if my current model and the additional transformer layer have different structure.

Liyuhui-12 commented 3 months ago

It can run without modifying the structure of the draft model, but it is still unclear whether the consistency of the additional transformer layer with the base model will affect the final performance.