-
Hey Maca,
I would like to execute a restart batch file from schedule.json. I tried the BEC method, but it doesn't seem to work. What would be the best way to restart the server with the restart messages …
-
### 🔍 Before submitting the issue
- [X] I have searched among the existing issues
- [X] I am using a Python virtual environment
### 🐞 Description of the bug
I can use the code below to start flue…
-
Task Scheduler successfully completed task "\FilmCab\file maintenance\_start_new_batch_run_session" , instance "{947ed502-b957-477b-8783-acfceb628353}" , action "C:\Program Files\PowerShell\7-preview\…
-
**Description**
The individual models work as expected, but the ensemble pipeline built from them raises `[StatusCode.INTERNAL] in ensemble 'depthcomp_pipeline', onnx runtime error 2: not enough space: e…
-
On `azure_mc`, the ESPResSo test was failing with an OOM error. I checked, and the test _does_ call the hook that requests memory [here](https://github.com/EESSI/test-suite/blob/7354fcd547891ed631ae3e…
-
### What happened?
Tripping on this line: https://github.com/ggerganov/llama.cpp/blob/a07c32ea54850c989f0ef6989da5b955b77b7172/src/llama.cpp#L18773C1-L18774C1
~~Meaning, the allocated slot space…
-
When training with batch size 4 on one H100, the speed is 1.27 seconds/it.
When training with batch size 4 on 2x H100, the speed is 2.05 seconds/it.
So basically we got almost no speed boost from multiple GPU t…
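
For reference, here is a rough sketch of how the two timings compare once they are converted to samples per second. The post does not say whether batch size 4 is per GPU or the global batch, so both readings are computed as labeled assumptions; the helper name `samples_per_second` is only illustrative.

```python
def samples_per_second(samples_per_step: int, seconds_per_step: float) -> float:
    """Convert a seconds-per-iteration timing into samples per second."""
    return samples_per_step / seconds_per_step


BATCH_SIZE = 4  # batch size quoted in the post

# Baseline: one H100 at 1.27 seconds/it.
single_gpu = samples_per_second(BATCH_SIZE, 1.27)

# Assumption A: batch size 4 is per GPU, so 2 GPUs see 8 samples per step.
two_gpu_per_device = samples_per_second(BATCH_SIZE * 2, 2.05)

# Assumption B: batch size 4 is the global batch, split across both GPUs.
two_gpu_global = samples_per_second(BATCH_SIZE, 2.05)

speedup_per_device = two_gpu_per_device / single_gpu
speedup_global = two_gpu_global / single_gpu

print(f"1x H100: {single_gpu:.2f} samples/s")
print(f"2x H100, per-GPU batch: {two_gpu_per_device:.2f} samples/s ({speedup_per_device:.2f}x)")
print(f"2x H100, global batch:  {two_gpu_global:.2f} samples/s ({speedup_global:.2f}x)")
```

Under assumption A the second GPU gives roughly a 1.24x throughput gain. Under assumption B it is a slowdown to about 0.62x of the single-GPU rate. Either way it is far from the ideal 2x.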
-
Sometimes, when training with the SimCLR method, the loss diverges (see attached screenshot). I wonder if anyone has ever experienced this kind of issue when training with SimCLR. Thi…
-
### Describe the bug
I was trying to test different schedulers under DDPMPipeline, and an error occurred when I used PNDMScheduler. Beforehand, I had found that PNDMScheduler should be compatible with DD…
-
### Related Problems?
As mentioned in item 7 in #1968, the current batch processor requires an async runtime (see [here](https://github.com/open-telemetry/opentelemetry-rust/blob/29fd682203cd6c677d0e9…