-
When I tried
```
!python qlora.py --learning_rate 0.0001 --model_name_or_path EleutherAI/gpt-neox-20b --trust_remote_code
```
in Colab, I got the following errors:
```
2023-06-03 13:54:17.113623: W t…
```
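For context, the 4-bit load that qlora.py performs for this model looks roughly like the sketch below. This is a sketch against the public transformers/bitsandbytes API, not qlora.py's actual code; the script's own defaults and flag handling may differ.
```python
# Minimal 4-bit load sketch (assumes recent transformers and bitsandbytes
# are installed; qlora.py wires this up itself, shown here for illustration).
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig

bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,                      # NF4 quantization as in the QLoRA paper
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
)

model = AutoModelForCausalLM.from_pretrained(
    "EleutherAI/gpt-neox-20b",
    quantization_config=bnb_config,
    device_map="auto",
    trust_remote_code=True,
)
tokenizer = AutoTokenizer.from_pretrained("EleutherAI/gpt-neox-20b")
```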
-
When training qwen2 + qlora + unsloth (use_unsloth=true) with
torchrun --nproc_per_node=4 train.py --train_args_file train_args/sft/qlora/qwen2-7b-sft-qlora.json
I got the following error:
ValueError: You can't train a model that has bee…
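The full ValueError is truncated above. Independent of the specific check it trips, a standard QLoRA preparation path with plain peft (no unsloth) looks roughly like the sketch below; this is not Firefly's actual train.py, and the base model name is an assumption for illustration.
```python
# Generic QLoRA preparation with plain peft (a sketch; Firefly's train.py and
# the unsloth path differ, and use_unsloth=true swaps in unsloth's own patching).
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig
from peft import LoraConfig, get_peft_model, prepare_model_for_kbit_training

model = AutoModelForCausalLM.from_pretrained(
    "Qwen/Qwen2-7B",                       # assumed base model for illustration
    quantization_config=BitsAndBytesConfig(load_in_4bit=True),
    torch_dtype=torch.bfloat16,
)
model = prepare_model_for_kbit_training(model)  # re-enable grads on a k-bit model

lora_config = LoraConfig(
    r=64,
    lora_alpha=16,
    lora_dropout=0.05,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)
```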
-
Hi team, great work!
QDoRA seems to perform better than QLoRA; see [Efficient finetuning of Llama 3 with FSDP QDoRA](https://www.answer.ai/posts/2024-04-26-fsdp-qdora-llama3.html).
I wonder w…
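For reference, peft already exposes DoRA through a flag on LoraConfig; combined with a 4-bit base model this gives a QDoRA-style setup. A sketch, assuming peft>=0.9.0; the FSDP wiring the answer.ai post describes is a separate concern and not shown here.
```python
# QDoRA-style config sketch: DoRA (use_dora=True) on top of a 4-bit base model.
# Assumes peft>=0.9.0, where LoraConfig gained the use_dora flag.
from peft import LoraConfig

qdora_config = LoraConfig(
    r=16,
    lora_alpha=16,
    use_dora=True,                      # weight-decomposed low-rank adaptation
    target_modules=["q_proj", "v_proj"],
    task_type="CAUSAL_LM",
)
```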
-
# Understanding LoRA and QLoRA - The Powerhouses of Efficient Finetuning in Large Language Models - Musings of Murali
Delving into the math behind LoRA and QLoRA
[http://gitlostmurali.com/machine-le…
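For reference, the low-rank reparameterization at the heart of LoRA, in the standard notation (W_0 is the frozen pretrained weight, only A and B are trained, α is the scaling hyperparameter, r the adapter rank):
```latex
% LoRA reparameterization: the frozen weight W_0 gets a trainable
% low-rank update BA, scaled by alpha/r.
h = W_0 x + \Delta W x = W_0 x + \frac{\alpha}{r} B A x,
\qquad B \in \mathbb{R}^{d \times r},\; A \in \mathbb{R}^{r \times k},\; r \ll \min(d, k)
```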
-
### System Info
```
bitsandbytes==0.43.1
sentencepiece==0.1.97
huggingface_hub==0.23.2
accelerate==0.30.1
tokenizers==0.19.1
transformers==4.41.1
trl==0.8.6
peft==0.11.1
datasets==2.14.6
```
-
![image](https://github.com/gauss5930/AlpaGasus2-QLoRA/assets/76432120/7f40e304-e7db-4f47-8ef4-700b8a86eaac)
I found these two models on the Open LLM Leaderboard, and they exhibit significant differences in perf…
-
Do we have a general sense of this? Has LoRA/QLoRA fine-tuning been attempted on this, and if so, is there any guidance?
-
Hello,
I was trying to see whether the Opacus library can be used with Hugging Face's Trainer module. The readme shows a code snippet that registers the callback ```dp_transformers.PrivacyEngine…
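For contrast with the dp_transformers callback the readme mentions (truncated above), plain Opacus wraps the model, optimizer, and data loader directly. A minimal sketch of the Opacus >=1.0 API; the model, optimizer, and loader here are toy stand-ins, not the HF Trainer internals.
```python
# Plain-Opacus sketch: PrivacyEngine.make_private wraps the three training
# objects so per-sample gradients are clipped and noised during training.
import torch
from torch.utils.data import DataLoader, TensorDataset
from opacus import PrivacyEngine

model = torch.nn.Linear(16, 2)                      # toy model for illustration
optimizer = torch.optim.SGD(model.parameters(), lr=0.05)
train_loader = DataLoader(
    TensorDataset(torch.randn(64, 16), torch.randint(0, 2, (64,))), batch_size=8
)

privacy_engine = PrivacyEngine()
model, optimizer, train_loader = privacy_engine.make_private(
    module=model,
    optimizer=optimizer,
    data_loader=train_loader,
    noise_multiplier=1.0,   # scale of Gaussian noise added to clipped grads
    max_grad_norm=1.0,      # per-sample gradient clipping bound
)
```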
-
### Please check that this issue hasn't been reported before.
- [X] I searched previous [Bug Reports](https://github.com/OpenAccess-AI-Collective/axolotl/labels/bug) and didn't find any similar reports.
…
-
### System Info
- `transformers` version: 4.41.2
- Platform: Linux-5.15.0-1044-nvidia-x86_64-with-glibc2.35
- Python version: 3.10.0
- Huggingface_hub version: 0.23.0
- Safetensors version: 0.4.2…