I use Chinese and English mixed dataset and a another custom 7B model to train eagle，but the performance in Chinese is not good enough

SafeAILab / EAGLE

Official Implementation of EAGLE-1 (ICML'24) and EAGLE-2 (EMNLP'24)

Apache License 2.0

780 stars 79 forks source link

I noticed that your top-3 accuracy on the training set is only about 0.8, which is relatively low. What is your training accuracy on the English dataset? If it is close to the accuracy on the Chinese dataset, it could be that the structure or size of the draft model is not suitable. If the English accuracy is significantly higher than the Chinese accuracy, it is possible that your base model is not sufficiently trained on Chinese, and its features cannot effectively capture the semantic information of Chinese.

SafeAILab / EAGLE

I use Chinese and English mixed dataset and a another custom 7B model to train eagle，but the performance in Chinese is not good enough #81