issues
search
shaochenze
/
PatchTrain
Code for paper "Patch-Level Training for Large Language Models"
Apache License 2.0
71
stars
3
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
关于第二阶段的Token级别训练细节的问题
#2
jyweky
closed
1 month ago
2
关于交叉熵的具体计算细节
#1
JizhanFang
closed
3 months ago
4