-
I get this error when importing deepspeed. I am currently using torch 1.8.0 and installed the packages in requirements.txt as directed. I am also unable to install apex from the link provided.
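
Because the import itself is what fails, the installed versions are easier to report if they are read from package metadata rather than from the imported modules. A minimal sketch, assuming the distributions are registered under the names `torch`, `deepspeed`, and `apex` (the names may differ depending on how they were installed); `installed_versions` is a hypothetical helper, not part of any of these libraries:

```python
from importlib import metadata


def installed_versions(packages):
    """Return {name: version or None} without importing the packages themselves."""
    versions = {}
    for name in packages:
        try:
            versions[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            # The distribution is not installed under this name.
            versions[name] = None
    return versions


if __name__ == "__main__":
    # Assumed distribution names; adjust to match the local environment.
    for name, version in installed_versions(["torch", "deepspeed", "apex"]).items():
        print(f"{name}: {version or 'not installed'}")
```

A `None` entry for apex would be consistent with the failed install mentioned above, though it does not explain why the install failed.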
-
**LocalAI version:**
1.23.2-cuda-12
**Environment, CPU architecture, OS, and Version:**
Linux lxdocker 6.2.16-4-pve #1 SMP PREEMPT_DYNAMIC PVE 6.2.16-5 (2023-07-14T17:53Z) x86_64 x86_64 x…
-
### Description
```shell
branch: fastertransformer_backend-release-v1.2.1_tag/
triton with ft container version: 22.07
gpu:v100
model:huggingface t5-base
```
### Reproduced Steps
```shell
After…
-
**Describe the bug**
When using an image based on the provided `Dockerfile` and running the quick start steps (download enron data, run `deep.py`), execution crashes before training begins.
**To R…
-
### Who can help
@patil-suraj
## Information
Model I am using: GPT-J
The problem arises when using:
* [x] the official example scripts: (give details below)
* [x] my own modified scripts:…
-
**Describe the bug**
When training with sparse attention, Triton throws `IndexError: map::at`.
This is the full traceback:
Traceback (most recent call last):
File "train.py", line 27, in …
-
It's not clear to me how to train GPT3XL on a GPU or in Colab.
Could you add more details?
Thank you.