-
The training process is quite slow, whereas using 8-bit HQQ speeds it up by more than tenfold. Is this normal, or have I missed something in my code?
```python
import torch
from transformers import EetqConfi…
```
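For reference, loading a model with 8-bit HQQ through transformers looks roughly like the sketch below; the model name and group size are illustrative assumptions, not taken from the original post.
```python
import torch
from transformers import AutoModelForCausalLM, HqqConfig

# 8-bit HQQ quantization; group_size=64 is an illustrative choice
quant_config = HqqConfig(nbits=8, group_size=64)

model = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Llama-2-7b-hf",  # hypothetical model choice
    torch_dtype=torch.float16,
    device_map="auto",
    quantization_config=quant_config,
)
```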
-
Is it possible to do the fine-tuning with the models quantized, using QLoRA?
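For context, the standard QLoRA pattern with transformers, bitsandbytes, and peft is sketched below; the model name, target modules, and LoRA hyperparameters are illustrative assumptions.
```python
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig
from peft import LoraConfig, get_peft_model, prepare_model_for_kbit_training

# Load the base model in 4-bit NF4 (the "Q" in QLoRA)
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
)
model = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Llama-2-7b-hf",  # hypothetical base model
    quantization_config=bnb_config,
    device_map="auto",
)
model = prepare_model_for_kbit_training(model)

# Attach trainable LoRA adapters on top of the frozen 4-bit weights
lora_config = LoraConfig(
    r=16,
    lora_alpha=32,
    lora_dropout=0.05,
    target_modules=["q_proj", "v_proj"],  # illustrative target modules
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)
```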
-
(Q)DoRA, an alternative to (Q)LoRA, is quickly proving to be a superior technique for closing the gap between full fine-tuning (FFT) and PEFT.
Known existing implementations:
- https://github.com/huggingface/…
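In peft, DoRA is exposed as a flag on the existing LoRA config, so a minimal sketch differs from plain LoRA by one argument (target modules are illustrative):
```python
from peft import LoraConfig

# Same config as LoRA, with weight decomposition into magnitude and direction
dora_config = LoraConfig(
    r=16,
    lora_alpha=32,
    target_modules=["q_proj", "v_proj"],  # illustrative
    use_dora=True,
    task_type="CAUSAL_LM",
)
```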
-
```
Loading checkpoint shards: 0%| | 0/2 [00:00
```
-
Hello,
I have a question regarding GPU memory consumption during inference.
Before fine-tuning a model with QLoRA, the torchtune.LoRALinear modules will convert the original LLM weights to NF4, a…
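As a general aid for questions like this, peak GPU memory during inference can be measured directly; the sketch below uses a stand-in linear layer rather than torchtune, purely for illustration.
```python
import torch

model = torch.nn.Linear(4096, 4096).cuda().half()  # stand-in for the real model
x = torch.randn(1, 4096, device="cuda", dtype=torch.float16)

torch.cuda.reset_peak_memory_stats()
with torch.no_grad():
    model(x)

print(f"peak GPU memory: {torch.cuda.max_memory_allocated() / 1e9:.2f} GB")
```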
-
For workloads such as QLoRA, we can save and upload (or reuse existing) pre-quantized model weights, which would have a couple of benefits:
- Allow users to save disk space by only working with 4-…
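A sketch of what this could look like with transformers and bitsandbytes, assuming the installed versions support 4-bit serialization (model and directory names are illustrative):
```python
from transformers import AutoModelForCausalLM, BitsAndBytesConfig

# Quantize once to 4-bit NF4...
bnb_config = BitsAndBytesConfig(load_in_4bit=True, bnb_4bit_quant_type="nf4")
model = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Llama-2-7b-hf",  # hypothetical base model
    quantization_config=bnb_config,
    device_map="auto",
)

# ...then save the already-quantized weights so later runs skip the
# full-precision download and on-the-fly quantization step.
model.save_pretrained("llama-2-7b-nf4")
reloaded = AutoModelForCausalLM.from_pretrained("llama-2-7b-nf4", device_map="auto")
```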
-
Hi Intel team,
I met an issue when I ran the script "qlora_finetune_llama2_70b_pvc_1550_4_card.sh" with DeepSpeed parameters.
When running the code, errors occur whenever a checkpoint step …
-
I cannot train Qwen2 7B on a 4090 GPU as it would result in out-of-memory (OOM) errors due to the loading of the embedding layer. This process is anticipated to demand over 27 GB of VRAM, exceeding the…
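For context, a back-of-envelope estimate shows why full fine-tuning of a ~7B model cannot fit in 24 GB; the parameter count and optimizer layout below are assumptions, not figures from the original post.
```python
# Rough VRAM arithmetic for full fine-tuning of a ~7.6B-parameter model
params = 7.6e9

weights = params * 2 / 1e9  # bf16 weights:   ~15.2 GB
grads = params * 2 / 1e9    # bf16 gradients: ~15.2 GB
adam = params * 8 / 1e9     # fp32 exp_avg + exp_avg_sq: ~60.8 GB

print(f"weights {weights:.1f} GB, grads {grads:.1f} GB, optimizer {adam:.1f} GB")
# The total far exceeds a 24 GB RTX 4090 before activations are counted,
# which is why 4-bit quantization plus LoRA (QLoRA) is the usual workaround.
```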
-
Changing only one line in the config file, namely
```yaml
quantize: bnb.nf4
```
increased the memory usage from 14 GB to 18 GB.
```
Epoch 5 | iter 965 step 965 | loss train: 1.182, val: 1.0…
```
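One possible explanation, assuming this is a litgpt-style config: quantization only reduces memory when paired with a true half-precision setting, because mixed precision keeps a full-precision copy of the weights alongside the quantized ones. A hypothetical pairing:
```yaml
quantize: bnb.nf4
precision: bf16-true  # assumed setting; mixed precision would keep fp32 copies
```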
-
**base-model: Weyaxi/Dolphin2.1-OpenOrca-7B**
**Scenario:**
- followed the guidelines at https://github.com/intel-analytics/ipex-llm/tree/main/python/llm/example/GPU/LLM-Finetuning/QLoRA…