issues
search
GeneZC
/
MiniMA
Code for paper titled "Towards the Law of Capacity Gap in Distilling Language Models"
Apache License 2.0
91
stars
5
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Distill granite-7b-base?
#7
linux-leo
opened
1 week ago
0
The inference latency of model MiniMA-3B
#6
qxpBlog
closed
2 months ago
23
Inconsistent response from interactive MiniChat-3B
#5
rsong0606
closed
2 months ago
2
Code for Training MiniMoE
#4
ojus1
opened
6 months ago
1
Distill Mistral 7B?
#3
ojus1
opened
6 months ago
1
Getting errors when trying to replicate the distilling operation
#2
l3utterfly
closed
6 months ago
27
Update README.md
#1
eltociear
closed
2 months ago
0