Closed vprashar2929 closed 3 months ago
Found the point... The latest kepler gave us this power source... It is supposed to be intel_rapl or acpi.
@sthaha Also share that for his case, source is "rapl-sysfs" which is not supported yet in model-server side. I will check the latest source label and update the train_types
closing as #325 is now merged.
What happened?
When trying to train the model locally in case of Kepler with
release-0.7.11
is deployed it fails with the below traceCommand used to run train:
Verified the same with Kepler
release-0.7.7
and it works fine.What did you expect to happen?
Model training should work fine irrespective of the Kepler version unless there is a specific change between
0.7.7
and0.7.11
How can we reproduce it (as minimally and precisely as possible)?
script.sh
locallyAnything else we need to know?
No response
Kepler image tag
Deployment
Kepler model server image tag if deployed
Kepler estimator image tag if deployed
Kepler online trainer image tag if deployed
Kepler offline trainer image tag if deployed
Kepler profiler image tag if deployed
Kubernetes version
Install tools
Kepler deployment config