-
不知道大佬有没有遇到ValueError: paged_adamw_32bit is not a valid OptimizerNames这个错误
-
In my training script, I set the **per_device_train_batch_size = 4** in the TrainingArguments.
But the **train_batch_size** in the **trainer_state.json** of each checkpoint is **2**.
When I tried …
-
Dear Developers, I can perform inference using the script you provided, but I encounter an object type mismatch during training. Specifically, I checked the data type, and the image input is already a…
-
### Describe the bug
I use lora to fintune the model of 'stabilityai/stable-diffusion-xl-base-1.0',get fellow error:
OSError: stabilityai/stable-diffusion-xl-base-1.0 does not appear to have a file …
-
```
2024-11-09 21:39:44.994636: E external/local_xla/xla/stream_executor/cuda/cuda_fft.cc:485] Unable to register cuFFT factory: Attempting to register factory for plugin cuFFT when one has already b…
-
## tl;dr
- We have consolidated KerasNLP and KerasCV into a new **KerasHub** package.
- We have renamed the `keras-nlp` GitHub repository to `keras-hub`.
- **All existing usages will continue to …
-
Hi
Trying to install repo, but falling into error on applying LLM patch. Help me out pls
I ran the script with the command:
"python caption.py D:\DATASETS\equals --model_site modelscope --downloa…
-
I am facing a very weird issue here.
### Issue
- The training speed slows down with time for batch sizes 64 and 128. For batch size 32 it seems to be staying fairly constant.
- The tensorboard g…
-
### 🐛 Describe the bug
I've tried to train LLama model using the reward model. I created such a config
```python
config = TRLConfig(
train=TrainConfig(
seq_length=4096,
…
-
Traceback (most recent call last):
File "/content/zero_nlp/chatglm_v2_6b_lora/main.py", line 470, in
main()
File "/content/zero_nlp/chatglm_v2_6b_lora/main.py", line 133, in main
mode…