-
**Describe the bug**
Any use of `shfl_sync` throws an error saying `shfl_recurse` is a dynamic function.
**To reproduce**
The Minimal Working Example (MWE) for this bug:
Attempting to do a stream…
-
[Job](https://mihubot.xyz/runtime-utils/EhAUAhI) completed in 19 minutes 21 seconds.
https://github.com/dotnet/runtime/pull/108824
### Diffs
Diffs
```
Found 261 files with textual diffs.
Summary …
-
### Steps to reproduce the bug
1. set all columns minWidth: 100,
2. set all columns titlePadding: EdgeInsets.zero
3. autoFit all columns when onLoaded is called
4. horizontally scroll the grid
…
-
Hi I have a script that runs with the DataParralell trainer on a machine with 8 H100 GPUs (aws p5 VM) with deepspeed. When we run the script it starts to randomly get stuck forever at some iteration r…
-
I find that mypy seems to invalidate its caches in a way that's very opaque to me. In my large company codebase below, I see runtimes like the following:
```
% time mypy --no-incremental
file/wi…
-
Hello everyone, I want to learn the stage3 part of zerooffload when learning the source code of deepspeed, but I can't find the scheduling process code of the gradient between cpu and gpu, please help…
-
it's often "const" of "call_indirect" test failing it seems. but it doesn't seem very consistent.
failure logs i looked at were all ILP32F.
https://github.com/bytecodealliance/wasm-micro-runtime/a…
yamt updated
2 months ago
-
Hello,
I have Undefined Behaviour Sanitizer switched on for debug builds in my project which uses FFTW3. I am compiling with Apple Clang. Since the latest upgrade to version 12.0.5, I now get the f…
-
I am following the [IPEX Ollama docker guide](https://github.com/intel-analytics/ipex-llm/blob/main/docs/mddocs/DockerGuides/docker_cpp_xpu_quickstart.md). I am stuck on the `sycl-ls` step. The contai…
-
I was running the example script: `examples/scripts/train_ppo_llama.sh`.
Basically, it's ppo on llama3-8b with 8*H100, flash_attn, zero3, gradient_checkpointing, adam_offload, but it's OOM after some…