-
### System Info
```Shell
# Machine Learning and Deep Learning Libraries
torch==2.4.1 --index-url https://download.pytorch.org/whl/cu121 # Deep learning framework
torchvision==0.17.1 # Co…
-
We are using DeepSpeed; transformer, accelerate to fine tune Qwen llm, and hit the below issue.
[rank2]: pydantic_core._pydantic_core.ValidationError: 1 validation error for DeepSpeedZeroConfig
[ran…
-
Currently, we flatten the CLI config for training so it can be merged with the CLI flag options, only to then be unflattened again and sent into the training interface data structures.
To get around …
-
As someone who used this library for a while in prod, then gave up, I'd honestly recommend just dropping it to simplify the code. There are several issues:
- it isn't being very actively maintaine…
-
Seeing this on the main branch:
```
Traceback (most recent call last):
File "/home/deli/images/sd-scripts/tools/cache_text_encoder_outputs.py", line 194, in
cache_to_disk(args)
File "/…
-
When the instructlab training library is imported, it seems to import a lot of packages
throughout the project such as deepspeed, pytorch, and others which all slow everything down
before anything has…
-
It seems like the colab is not working anymore, possibly due to some updated python packages?
-
下面是报错信息,可以帮我看看吗?
ninja: build stopped: subcommand failed.
Traceback (most recent call last):
File "/dockerdata/graceqwang/videollava/lib/python3.10/site-packages/torch/utils/cpp_extension.py", …
-
I encountered an issue while finetune with the officially released code using the DeepSpeed. Here is the detailed error message:
```
File "/lib/python3.11/site-packages/deepspeed/runtime/zero/linear…
-
### Describe the issue
Issue:
We are trying to finetune the model on our dataset.
Currently, we are able to successfully finetune model `lmsys/vicuna-13b-v1.5` using projector weights `llava-v…