OpenThaiGPT / openthaigpt-pretraining

Apache License 2.0

feat(model): add gradient checkpointing falcon #267

Closed MoosaTae closed 1 year ago

MoosaTae commented 1 year ago

Why this PR

The Falcon model needs gradient checkpointing support.
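For context, gradient checkpointing skips storing a layer's intermediate activations during the forward pass and recomputes them during the backward pass, which lowers peak memory at the cost of extra compute. The sketch below is not the code from this PR; it only illustrates the general pattern with PyTorch's `torch.utils.checkpoint.checkpoint`. `Block`, `TinyStack`, and the `gradient_checkpointing` flag are hypothetical stand-ins, not Falcon's actual classes or attributes.

```python
import torch
import torch.nn as nn
from torch.utils.checkpoint import checkpoint


class Block(nn.Module):
    """Hypothetical stand-in for one transformer layer (not Falcon's real layer)."""

    def __init__(self, dim: int) -> None:
        super().__init__()
        self.ff = nn.Sequential(
            nn.Linear(dim, 4 * dim),
            nn.GELU(),
            nn.Linear(4 * dim, dim),
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # Residual connection around a small feed-forward network.
        return x + self.ff(x)


class TinyStack(nn.Module):
    """Hypothetical layer stack with an opt-in checkpointing flag."""

    def __init__(self, dim: int = 16, n_layers: int = 4,
                 gradient_checkpointing: bool = False) -> None:
        super().__init__()
        self.layers = nn.ModuleList(Block(dim) for _ in range(n_layers))
        self.gradient_checkpointing = gradient_checkpointing

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        for layer in self.layers:
            if self.gradient_checkpointing and self.training:
                # Activations inside `layer` are not kept; they are
                # recomputed when backward() reaches this layer.
                x = checkpoint(layer, x, use_reentrant=False)
            else:
                x = layer(x)
        return x


if __name__ == "__main__":
    model = TinyStack(gradient_checkpointing=True).train()
    loss = model(torch.randn(2, 16)).sum()
    loss.backward()  # triggers recomputation of each checkpointed layer
```

Checkpointing is applied only in training mode here, since there is no backward pass to save memory for during inference. Gradients should match the non-checkpointed run, because the same computation is simply replayed.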

Changes

Related Issues

Close #

Checklist