Open sooraj1002 opened 4 months ago
@sooraj1002 Let's simply manage training sets as their own entities and have proper APIs for interactions with training sets.
/training-dataset
APIs
PATCH
: Updating Specific RowsPUT
: Updating the Whole Dataset (would be useful for correcting divergent datasets as well)POST
: Add a new training DatasetGET
: GET a training dataset by ID, Name, or TaskIDtraining-dataset/sync
syncs the current cached dataset to HF.Checks:
in addition to updating the HF dataset when training, there should be a dedicated API which updates the dataset to the HF repo without triggering the training