sambanova / generative_data_prep

Apache License 2.0
58 stars 7 forks source link

Fail out with error if the OS kills any of the tokenization processes. #27

Closed snova-zoltanc closed 1 year ago

snova-zoltanc commented 1 year ago

Summary

Right now, if one of the processes are killed because of OOM or for some various reason killed by the OS, then the software will silently hang and not fail out with an error. This PR works to fail out with an error to make sure the user can run with fewer multiprocessing workers to avoid OOM issues.

PR Checklist