OpenThaiGPT / openthaigpt-pretraining

Apache License 2.0
21 stars 10 forks source link

(model): Fix data preprocessing #243

Closed boat1603 closed 1 year ago

boat1603 commented 1 year ago

Why this PR

Why we need this PR? This PR is about hot fix data preprocessing script in current main version.

Changes

Related Issues

Close #

Checklist