Open ChakshuGautam opened 4 months ago
@suresh12 to review the Doc
[ ] when train button is clicked ,it'll hit model registry API to get the:
[ ] Admin Panel will hit dataset registry to get dataset id for the given model-botid
[ ] Admin Panel will hit /train API with the following parameters:
{
"model": Base Model Branch on HF (from model registry)
"epochs": (from model registry)
"task_type": (from model registry)
"dataset": (from dataset registry)
"versioning": {
"owner": botid
"environment": bot environment
“model_name ': (from model registry)
},
“args”: (from model registry)
}
## Dataset service:
- To create dataset for models with the following for each model-botid t least :
- Base Model Branch on HF - the base model which will be used to train the dataset with
- task_type: classfication/NER etc
- model_format: onnx/pytorch - safetensors
- model_name (purpose for which model is getting trained ) like agri_classification in AKAI/KMAI {can be same as service_name}
- epochs (number of epochs the model is getting trained for)
- args : training arguements used to fine tune the model
- quantization: None mostly unless specified)
- to create dataset for datasets with :
datasetid for each model for each bot
Scoping Model Registry from ML Flow us lift and use directly.
onnx
,pt
,bin
) through a CDNClicking train button on Admi Panel
ML Pod:
Admin panel :
[ ] when train button is clicked ,it'll hit model registry API to get the:
[ ] Admin Panel will hit dataset registry to get dataset id for the given model-botid
[ ] Admin Panel will hit /train API with the following parameters: