issues
search
OpenThaiGPT
/
openthaigpt-pretraining
Apache License 2.0
21
stars
10
forks
source link
feat(model): Resume trainable lightning fabric pipeline
#235
Closed
MoosaTae
closed
1 year ago
MoosaTae
commented
1 year ago
Why this PR
because training fabric pipeline not available for now
Changes
Resume Training is possible
change .to("cuda") to with fabric.device for adamw fuse mode
use fabric.print instead print
solve wrong parameters name
rename llama to 100M size params
Related Issues
Close #
Checklist
[ ] PR should be in the
Naming convention
[ ] Assign yourself in to Assigneees
[ ] Tag related issues
[ ] Constants name should be ALL_CAPITAL, function name should be snake_case, and class name should be CamelCase
[ ] complex function/algorithm should have
Docstring
[ ] 1 PR should not have more than 200 lines changes (Exception for test files). If more than that please open multiple PRs
[ ] At least PR reviewer must come from the task's team (model, eval, data)
Why this PR
because training fabric pipeline not available for now
Changes
Related Issues
Close #
Checklist