-
### System Info
- `transformers` version: 4.45.2
- Platform: Linux-5.15.0-72-generic-x86_64-with-glibc2.35
- Python version: 3.9.0
- Huggingface_hub version: 0.25.1
- Safetensors version: 0.4.5
…
-
Is it possible to train YOLOv3 on a CPU with only a few classes, maybe 5 or up to 10? I have a dataset of 7000 images of 'HELMET'.
I want to train it to detect helmets.
I have Intel® Core™ i5-7400 CPU…
-
I trained a LoRA, stopped training, and I want to continue training from the same one. But using --lora-ckpt-path just errors with this traceback:
```
File "D:\tts\ok\stable-audio-tools-main\trai…
-
I want to use pix2pix for a medical image generation task, where the control condition is different label images. My hyperparameter settings are as follows:
accelerate launch src/train_pix2pix_turbo.…
-
## 🐛 Bug
I'm getting a CUDA OOM error when passing the `--cpu` option, which makes no sense.
It works when I disable all GPUs:
```sh
env CUDA_VISIBLE_DEVICES= python train.py ....
```
…
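The workaround above can also be applied from a Python launcher instead of the shell. The following is only a minimal sketch: `cpu_only_env` is a hypothetical helper name, not part of any library, and it relies on nothing beyond the fact that an empty `CUDA_VISIBLE_DEVICES` hides all GPUs from the CUDA runtime in the child process.

```python
import os


def cpu_only_env(base_env=None):
    """Return a copy of the environment with all CUDA devices hidden.

    `cpu_only_env` is a hypothetical helper for illustration only.
    An empty CUDA_VISIBLE_DEVICES makes the CUDA runtime report no GPUs.
    """
    env = dict(os.environ if base_env is None else base_env)
    env["CUDA_VISIBLE_DEVICES"] = ""
    return env
```

The returned mapping could be passed as `env=cpu_only_env()` to `subprocess.run` when launching `train.py`. The variable has to be set before the training process initializes CUDA, which is why it is passed to the child process rather than changed after startup.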
-
**Background:**
Hello, I am a user of this repository and I am interested in the AI model training sample code provided here. I have noticed that the current sample codes in the repository mainly foc…
-
### 🚀 The feature, motivation and pitch
Hi Pytorch maintainers,
I am currently engaged in training multiple large language models (LLMs) sequentially on a single GPU machine, utilizing FullShard…
-
It trains fine for a while, and then often I get a CPU OOM, which looks like:
```
[2024-01-04 11:41:05,662] INFO: Start Job: Job Task: run
...
RETURNN starting up, version 1.20240104.103023+git.a0…
-
### System Info
```Shell
'Accelerate version: 0.31.0
Platform: Linux-5.4.0-1131-aws-fips-x86_64-with-glibc2.35
'accelerate bash location: /databricks/python3/bin/accelerate
Python version: 3…
-
### 🐛 Describe the bug
```
import torch
import time
import torch.nn as nn
import torch.utils.data as Data
torch.manual_seed(10)
class ATT(nn.Module):
    def __init__(self, ):
        super…