-
I am encountering an out-of-memory (OOM) issue while using the DPOTrainer even though I am running it on an A100 GPU. The model I am using is mistralai/Mistral-7B-Instruct-v0.2. This issue is similar …
-
pytorch:2.3.0
cuda:11.8
flash-attn:2.5.9.post1
python 3.10
unsloth是pip install git+https://github.com/yangjianxin1/unsloth.git 这样下的
不开unsloth可以跑,开了之后max_length改到512,per device_train_bat…
-
Hi, I'm following the instructions on this notebook https://colab.research.google.com/drive/1XamvWYinY6FOSX9GLvnqSjjsNflxdhNc?usp=sharing Apologies for being a TRL newbie.
I've gotten it to work on…
-
### System Info
peft: 0.10.1.dev0
accelerate: 0.30.0
bitsandbytes: 0.43.1
transformers: 4.39.3
GPU: A6000 * 2 ( 96GB )
nvidia-driver version: 535.171.04
cuda: 11.8
### Who can help?
_No…
-
RuntimeError: CUDA error: invalid device ordinal
CUDA kernel errors might be asynchronously reported at some other API call, so the stacktrace below might be incorrect.
For debugging consider passin…
-
### 先决条件
- [X] 我已经搜索过 [问题](https://github.com/open-compass/opencompass/issues/) 和 [讨论](https://github.com/open-compass/opencompass/discussions) 但未得到预期的帮助。
- [X] 错误在 [最新版本](https://github.com/open-…
-
Hi @phineas-pta,
Recently, I experimented with fine-tuning Whisper using QLoRA. We tried using the Large v3 model, fine-tuning it with four datasets: CMV-17, VIVOS, Fleurs, and 100 hours of VinAI d…
-
### System Info
Python 3.11.9
Transformers 4.38.2
torch 2.3.0
### Who can help?
_No response_
### Information
- [ ] The official example scripts
- [X] My own modified scripts
### Tasks
- [ ] …
-
### System Info / 系統信息
GPU: a100-80g CUDA Version: 12.1 python:3.8 pytorch:2.2.1
### Who can help? / 谁可以帮助到您?
@1049451037
### Information / 问题信息
- [x] The official example scripts / 官…
-
### Reminder
- [X] I have read the README and searched the existing issues.
### System Info
CUDA_VISIBLE_DEVICES=0,1,2,3 FORCE_TORCHRUN=1 NNODES=1 RANK=0 MASTER_ADDR=172.21.255.2 MASTER_PORT=29500 …
mfxss updated
3 months ago