songmzhang / DSKD

Repo for Paper "Dual-Space Knowledge Distillation for Large Language Models".
25 stars 3 forks source link