-
I am facing a very weird issue here.
### Issue
- Training speed slows down over time for batch sizes 64 and 128. For batch size 32 it seems to stay fairly constant.
- The tensorboard g…
-
### System Info
peft==0.13.2
### Who can help?
@BenjaminBossan
### Information
- [X] The official example scripts
- [ ] My own modified scripts
### Tasks
- [X] An officially supported task in t…
-
### System Info
torch==2.4.0
transformers==4.43.4
trl==0.9.6
tokenizers==0.19.1
accelerate==0.32.0
peft==0.12.0
datasets==2.20.0
deepspeed==0.15.0
bitsandbytes==0.43.3
sentencepiece==0.2.0
…
-
code
```python
'''
conda activate beyond_scale_2_unsloth
'''
import torch
from datasets import load_dataset
from trl import SFTConfig, SFTTrainer
from unsloth import FastLanguageModel
from tr…
-
### 🐛 Describe the bug
''' checkpoint_path = './llama_relevance_results'
training_args = transformers.TrainingArguments(
#remove_unused_columns=False, # Whether or not to automatically r…
-
Hi,
Thanks for providing a public implementation for the experimental results of your paper.
I am trying to reproduce the results. Regarding hyperparameters, the paper states (quote):
`…
-
Would it work natively, or do we need to train new adapters?
-
## tl;dr
- We have consolidated KerasNLP and KerasCV into a new **KerasHub** package.
- We have renamed the `keras-nlp` GitHub repository to `keras-hub`.
- **All existing usages will continue to …
-
I get the following error when fine-tuning DocOwl1.5-Omni. It is always raised when the index is 10. Please help!
```
File "/opt/conda/envs/mplug_owl2/lib/python3.10/site-packages/deepspeed/run…
-
The `transformer_engine` package is always missing after I run `pip install -e ".[train]"` and then try to run the demo:
```
bash scripts/video/demo/video_demo.sh lmms-lab/LLaVA-NeXT-Video-32B-Qwen…