deepspeed-library Search Results

huggingface/accelerate #3104

Accelerate Distributed Randomly Hangs

### System Info ```Shell # Machine Learning and Deep Learning Libraries torch==2.4.1 --index-url https://download.pytorch.org/whl/cu121 # Deep learning framework torchvision==0.17.1 # Co…

raeudigerRaeffi updated 18 hours ago

microsoft/DeepSpeed #6525

[BUG] pydantic_core._pydantic_core.ValidationError: 1 valida…

We are using DeepSpeed, transformer, and accelerate to fine-tune a Qwen LLM, and hit the issue below. [rank2]: pydantic_core._pydantic_core.ValidationError: 1 validation error for DeepSpeedZeroConfig [ran…

jagadish-amd updated 2 days ago

instructlab/instructlab #1523

Create a cleaner mapping between the CLI flags and the train…

Currently, we flatten the CLI config for training so it can be merged with the CLI flag options, only to then be unflattened again and sent into the training interface data structures. To get around …

RobotSail updated 2 weeks ago

huggingface/accelerate #3065

Recommend dropping MS-AMP support

As someone who used this library for a while in prod, then gave up, I'd honestly recommend just dropping it to simplify the code. There are several issues: - it isn't being very actively maintaine…

rationalism updated 1 week ago

kohya-ss/sd-scripts #1288

cache_text_encoder_outputs.py raises AttributeError: 'Namesp…

Seeing this on the main branch: ``` Traceback (most recent call last): File "/home/deli/images/sd-scripts/tools/cache_text_encoder_outputs.py", line 194, in cache_to_disk(args) File "/…

deepdelirious updated 3 months ago

instructlab/training #115

Speed up training library loads

When the instructlab training library is imported, it seems to import a lot of packages throughout the project such as deepspeed, pytorch, and others which all slow everything down before anything has…

RobotSail updated 3 weeks ago

daswer123/xtts-api-server #78

The colab is not working anymore

It seems like the colab is not working anymore, possibly due to some updated python packages?

LostRuins updated 3 days ago

PKU-YuanGroup/Video-LLaVA #144

Error during training: AttributeError: 'DeepSpeedCPUAdam' object has no attrib…

Here is the error message; could you take a look? ninja: build stopped: subcommand failed. Traceback (most recent call last): File "/dockerdata/graceqwang/videollava/lib/python3.10/site-packages/torch/utils/cpp_extension.py", …

Qinger27 updated 1 month ago

deepseek-ai/DeepSeek-MoE #35

Finetune with deepspeed: type mismatch

I encountered an issue while fine-tuning with the officially released code using DeepSpeed. Here is the detailed error message: ``` File "/lib/python3.11/site-packages/deepspeed/runtime/zero/linear…

YeZiyi1998 updated 1 month ago

haotian-liu/LLaVA #1630

[Usage] Visual instruction tuning for LLaVa 1.6

### Describe the issue Issue: We are trying to finetune the model on our dataset. Currently, we are able to successfully finetune model `lmsys/vicuna-13b-v1.5` using projector weights `llava-v…

mattia-re-learn updated 6 days ago

1000+ results for deepspeed-library
