-
[Enter feedback here]
Why do we need to use "_prepare_to_copy" with the SDK. Could you expand on a it a bit ? What happens if don't use an MFLow model but sklearn or onnx ?
---
#### Document Det…
-
Hi,
I've used this for demoing to my customer and I think it would great to show how the azure-pipelines can be used to deploy to higher environments using the recommended approach of "compile once…
-
### 问题描述:
nvidia v100单卡32G显存 lora训练Baichuan2-7B-Base报OOM异常:torch.cuda.OutOfMemoryError: CUDA out of memory. Tried to allocate 1.92 GiB (GPU 0; 31.74 GiB total capacity; 30.00 GiB already allocated; 3…
-
## Open Source Contributors Welcomed!
Please comment below if you would like to work on this issue!
### Contact Details [Optional]
support@zenml.io
## What happened?
A user encountered an i…
-
## Describe the bug
Can't copy a custom execution environment file to the Triton server when trying to deploy a model via a PVC.
How to set the EXECUTION_ENV_PATH parameter in the config.p…
-
## Describe the bug
I have a model that failed deployment. Kubectl delete is successful but the model is still visible under `Models` CRDs in k8s.
## To reproduce
1. Train a model and …
-
## Open Source Contributors Welcomed!
Please comment below if you would like to work on this issue!
### Contact Details [Optional]
support@zenml.io
## What happened?
After a discussion abou…
-
I have used W&B local for years. Recently, I sometime get stuck when finish training run.
There is no error or warning messages at all, the last terminal message I got on client side is:
### Ter…
-
/kind bug
**What steps did you take and what happened:**
Every time I try to run an experiment (in this case using Bayesian Optimization) after 18-25 trials, the pod that schedules the trials wi…
-
# Deprecating model registry stages
Starting in **MLflow 2.9**, we plan to mark model registry stages as deprecated in favor of new tools we’ve introduced for managing and deploying models in the M…