-
In function `verify_bacward_data_lstm::gpu()` we seemingly inadvertently rely on the workspace being `zeroed out`. We create a `std::vector` for workspace just to create a gpu buffer `workspace_dev` w…
-
**Describe the bug**
Running a script from the algebra docs causes this error to appear:
```
Unlocking __python_binary_op_single_value
Error occurred in handler:
Traceback (most recent call las…
-
I have tried to run your code, the problem is i am getting this error, i am using the same dataset i used with the ai-toolkit tool:
Starting 1 job(s)
Starting job train_lora_sdxl_24gb_1.0.yaml
Us…
AFMSB updated
1 month ago
-
similar to [23192](https://github.com/microsoft/vscode-python/issues/23192) I'm using python 3.10.10 and below are the logs:
```
2024-11-11 12:17:09.161 [debug] Testing: Manually triggered test refre…
-
**Describe the bug**
When attempting to render a spiral using `ns-render spiral` it fails with the following error:
```
✅ Done loading checkpoint from outputs/nerfacto-big/nerfacto/2024-03-26_22410…
-
# Bug Report
## Issue name
dvc push -v -j 4: Doesn't update Pushing %, B/s transferred, or transfer times.
## Description
When I run dvc push to an S3 bucket the % always reports 0%. I th…
-
### Describe the bug
The persistency of AWS CSI wrapper doesn't seem to work with kata-remote.
### How to reproduce
1. Deploy CAA
```
pushd $CAA/src/cloud-api-adaptor
oc get nodes
kubec…
-
Currently, the permission system is functional, but has some rough edges.
## Current situation
### Permissions
- Nodes
- Each node has associated permissions for that given node
- Each …
-
### Before You File a Bug Report Please Confirm You Have Done The Following...
- [X] I have tried restarting my IDE and the issue persists.
- [X] I have updated to the latest version of the packag…
-
**Describe the bug**
I try to finetune `llama3-8B` model with multi nodes but get an AtrributeError when finishing loading mcore format checkpoint and starting to build datasets, the error is below:
…