frankaging / Causal-Distill

The Codebase for Causal Distillation for Language Models (NAACL '22)
MIT License
25 stars, 3 forks