JetRunner / MetaDistil

Code for the ACL 2022 paper "BERT Learns to Teach: Knowledge Distillation with Meta Learning".
MIT License
80 stars, 16 forks