-
magatron AL/ML training hangs up with error messages as following. ReduceScatter failed to be finished within the timeout (30mins). It is tricky that no error log reported from NCCL. I have no idea ho…
-
In a discussion in the AI/ML meeting on 08/05/2024, @VenkatTechnologist proposed
that the AI team take a deep look into security aspects and parameters of AI systems.
Karen suggested that Venkat lea…
-
**System Information (please complete the following information):**
- Model Builder or CLI Version: 16.18.2.2415501
- Visual Studio Version (if applicable): Visual Studio Professional 2019 (versio…
-
**What would you like to be added**:
MVP support for LeaderWorkerSet in Kueue. It does not need to be ideal, but we want to have some support to unblock users and collect users' feedback.
The i…
-
## SHARK Studio Roadmap
This project establishes and tracks a plan for phased releases of the SHARK Studio WebUI.
There are three objectives of this roadmap:
- Define product features, support…
-
- **Package Name**: azure.ai.ml
- **Package Version**: 1.11.1
- **Operating System**: MacOS Ventura 13.6
- **Python Version**: 3.11.6
**Describe the bug**
Invoking a batch endpoint and providin…
AtleH updated
3 months ago
-
### How to reproduce
1. Open https://dev.datagrok.ai/f/Demo.TestJobs.Files.DemoFiles/demog.csv
2. Select ML | Train Model...
3. Select any columns for Predict & Features
4. Press TRAIN
### Expe…
-
I have a pipeline component written in yaml file. I load this using load_component. I set all the inputs in python. And then I call ml_client.jobs.create_or_update() and this fails. I traced the issue…
-
Good day!
Can my AI GAN art collection from November 6th 2021, TESTGALLERY, be added to the timeline? Some info about the project below:
# **CounterParty XCP Timeline - Adding Test Gallery**
**…
-
### Describe the feature
Hey @vansh-codes, I want to add a Next/Submit button on this page to move forward to the next page. I would also like to make the background consistent with the first landing…