-
## 🐛 Bug
When I am running Pytorch Lightning with DeepSpeed on an Azure ML Compute Cluster (with a max of 7 nodes and Tesla-M60 GPU) I am getting different error messages in the driver logs:
```…
-
# FINOS Q4 2022 All Community Call
Register to join the **FINOS 2022 Q4 All Community Call** on **Wednesday October 26th at 10am EST / 3pm BST**, where Gabriele Columbro, FINOS Executive Director, …
-
Hi, I'm trying to do the tutorial [here](https://github.com/mlcommons/ck/blob/master/docs/tutorials/sc22-scc-mlperf.md) on my laptop using an arch linux based distro. And from what I understand, the s…
-
Nói về 2 điểm đặc biệt của khoá học
- Là nơi tổng hợp các best practices trong MLOps từ các công ty lớn, và từ nhiều kĩ sư có kinh nghiệm từ khắp thế giới
- Trình bày theo project-oriented, chứ ko p…
-
I have training Cross-Silo Horizontal distributed training mode with following configuration settings:
```
common_args:
training_type: "cross_silo"
random_seed: 0
scenario: "horizontal"
…
-
I suggest to create a catalog of [CM automations](https://github.com/mlcommons/ck/tree/master/cm-mlops/script) for MLPerf/ML/AI artifacts and tools that can be reused across projects.
We can add it…
-
**Describe the bug**
I'm looking into the sagemaker experiment tracking functionality to run my project based on my own docker image. But I cannot see the metrics in the sagemaker pipeline UI and Exp…
-
Hello,
I am a bit new to Helm, and Kubernetes even. But I want to give a presentation in a few days surrounding how to use BentoML in a MLOps scenario, and want to include Yatai, as i think it's a…
-
UDPATE: Jump to https://github.com/iterative/dvc.org/issues/2496#issuecomment-1255510563.
## Status
* We have 5 repositories: [example-get-started][egs], [dvc-checkpoints-mnist][dcm], [get-start…
-
### Search before asking
- [X] I had searched in the [issues](https://github.com/apache/dolphinscheduler/issues?q=is%3Aissue) and found no similar feature requirement.
### Description
Remov…