issues
search
GeneZC
/
MiniMA
Code for paper titled "Towards the Law of Capacity Gap in Distilling Language Models"
Apache License 2.0
96
stars
5
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Distill granite-7b-base?
#7
linux-leo
opened
4 months ago
0
The inference latency of model MiniMA-3B
#6
qxpBlog
closed
7 months ago
23
Inconsistent response from interactive MiniChat-3B
#5
rsong0606
closed
7 months ago
2
Code for Training MiniMoE
#4
ojus1
opened
10 months ago
1
Distill Mistral 7B?
#3
ojus1
opened
10 months ago
1
Getting errors when trying to replicate the distilling operation
#2
l3utterfly
closed
10 months ago
27
Update README.md
#1
eltociear
closed
7 months ago
0