togethercomputer / OpenChatKit

Apache License 2.0

Add generalized support for preparing data and event reporting #134

Closed justusc closed 1 year ago

justusc commented 1 year ago

This change is intended to improve support for using OCK in automated training environments.

  1. It generalizes support for downloading training data from different sources.
  2. It generalizes support for downloading model pretraining data.
  3. It adds event reporting support.
  4. It integrates event reporting into dist_clm_train.py.
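
To illustrate the shape of these changes, here is a minimal sketch of how scheme-based data preparation and event reporting might fit together. It is not the code from this PR: `register_fetcher`, `prepare_data`, and `EventReporter` are hypothetical names chosen for illustration, and only a local `file` source is implemented.

```python
import json
import time
from typing import Callable, Dict
from urllib.parse import urlparse

# Hypothetical registry: URL scheme -> function that makes the data available
# locally and returns its path. Not the PR's actual API.
FETCHERS: Dict[str, Callable[[str, str], str]] = {}


def register_fetcher(scheme: str):
    """Decorator that registers a fetch function for one URL scheme."""
    def decorator(fn: Callable[[str, str], str]):
        FETCHERS[scheme] = fn
        return fn
    return decorator


@register_fetcher("file")
def fetch_local(url: str, dest_dir: str) -> str:
    # Local files need no download; return the path component unchanged.
    return urlparse(url).path


class EventReporter:
    """Hypothetical reporter that serializes events as JSON to a sink."""

    def __init__(self, sink: Callable[[str], None] = print):
        self.sink = sink

    def report(self, event_type: str, message: str, **fields) -> dict:
        event = {"type": event_type, "message": message,
                 "time": time.time(), **fields}
        self.sink(json.dumps(event))
        return event


def prepare_data(url: str, dest_dir: str, reporter: EventReporter) -> str:
    """Dispatch on the URL scheme and report progress as events."""
    scheme = urlparse(url).scheme or "file"  # bare paths are treated as local
    if scheme not in FETCHERS:
        reporter.report("error", f"unsupported data source: {scheme}")
        raise ValueError(f"unsupported data source: {scheme}")
    reporter.report("data_prepare_start", "preparing data", url=url)
    path = FETCHERS[scheme](url, dest_dir)
    reporter.report("data_prepare_done", "data ready", path=path)
    return path


if __name__ == "__main__":
    prepare_data("file:///tmp/train.jsonl", "/tmp", EventReporter())
```

Under this design, supporting a new source means registering one more fetch function, and a training script can pass the same reporter through each stage so an external orchestrator can follow progress.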