qlora Search Results - Githubissues

1000+ results
for qlora

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

huggingface/trl #1667

OOM with DPO Trainer on A100 GPU

I am encountering an out-of-memory (OOM) issue while using the DPOTrainer even though I am running it on an A100 GPU. The model I am using is mistralai/Mistral-7B-Instruct-v0.2. This issue is similar …

JhonDan1999 updated 1 month ago
3
yangjianxin1/Firefly #266

使用Unsloth反而OOM了是为什么呢？

pytorch:2.3.0 cuda:11.8 flash-attn:2.5.9.post1 python 3.10 unsloth是pip install git+https://github.com/yangjianxin1/unsloth.git 这样下的不开unsloth可以跑，开了之后max_length改到512，per device_train_bat…

yuyu990116 updated 3 months ago
2
unslothai/unsloth #591

Is it possible to resume a unsloth QLora Fine tune? If so ho…

Hi, I'm following the instructions on this notebook https://colab.research.google.com/drive/1XamvWYinY6FOSX9GLvnqSjjsNflxdhNc?usp=sharing Apologies for being a TRL newbie. I've gotten it to work on…

devzzzero updated 3 months ago
4
huggingface/peft #1716

GPU Allocation Issue (QLoRa + Llama3-8B-IT)

### System Info peft: 0.10.1.dev0 accelerate: 0.30.0 bitsandbytes: 0.43.1 transformers: 4.39.3 GPU: A6000 * 2 ( 96GB ) nvidia-driver version: 535.171.04 cuda: 11.8 ### Who can help? _No…

DONGRYEOLLEE1 updated 4 months ago
1
codefuse-ai/MFTCoder #57

RuntimeError: CUDA error: invalid device ordinal

RuntimeError: CUDA error: invalid device ordinal CUDA kernel errors might be asynchronously reported at some other API call, so the stacktrace below might be incorrect. For debugging consider passin…

lwh8915 updated 3 months ago
1
open-compass/opencompass #792

UnboundLocalError: local variable 'prompt_token_num' referen…

### 先决条件 - [X] 我已经搜索过 [问题](https://github.com/open-compass/opencompass/issues/) 和 [讨论](https://github.com/open-compass/opencompass/discussions) 但未得到预期的帮助。 - [X] 错误在 [最新版本](https://github.com/open-…

qy1026 updated 8 months ago
2
phineas-pta/fine-tune-whisper-vi #3

Experimental Result?

Hi @phineas-pta, Recently, I experimented with fine-tuning Whisper using QLoRA. We tried using the Large v3 model, fine-tuning it with four datasets: CMV-17, VIVOS, Fleurs, and 100 hours of VinAI d…

QuangDiy updated 4 months ago
5
huggingface/transformers #31234

PEFT + ZeRO Phase 2 + Transformers doesn't output pytorch_mo…

### System Info Python 3.11.9 Transformers 4.38.2 torch 2.3.0 ### Who can help? _No response_ ### Information - [ ] The official example scripts - [X] My own modified scripts ### Tasks - [ ] …

cdoern updated 2 months ago
5
THUDM/CogVLM #443

CUDA error: an illegal memory access was encountered

### System Info / 系統信息 GPU: a100-80g CUDA Version: 12.1 python:3.8 pytorch:2.2.1 ### Who can help? / 谁可以帮助到您？ @1049451037 ### Information / 问题信息 - [x] The official example scripts / 官…

zzh-www updated 5 months ago
3
hiyouga/LLaMA-Factory #4371

Qlora的默认rank值为多少，想把超参rank改为64，怎么修改

### Reminder - [X] I have read the README and searched the existing issues. ### System Info CUDA_VISIBLE_DEVICES=0,1,2,3 FORCE_TORCHRUN=1 NNODES=1 RANK=0 MASTER_ADDR=172.21.255.2 MASTER_PORT=29500 …

mfxss updated 3 months ago
1

上一页 1...94 95 96 97 98 99 100...100 下一页

1000+ results for qlora

1000+ results
for qlora