-
Is it possible to do fine-tuning with quantized models using QLoRA?
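A minimal sketch of how this is typically done with the transformers + peft + bitsandbytes stack (the model name and LoRA hyperparameters below are illustrative assumptions, not from the question): the base model is loaded with its weights quantized to 4-bit, and only small LoRA adapters are trained on top.
```
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig
from peft import LoraConfig, get_peft_model, prepare_model_for_kbit_training

# Quantize the base weights to 4-bit NF4; matmuls run in bfloat16.
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
)

model = AutoModelForCausalLM.from_pretrained(
    "mistralai/Mistral-7B-v0.1",  # placeholder model
    quantization_config=bnb_config,
    device_map="auto",
)
model = prepare_model_for_kbit_training(model)  # gradient checkpointing, norms kept in fp32

# Frozen quantized base + small trainable LoRA adapters = QLoRA fine-tuning.
lora_config = LoraConfig(
    r=16,
    lora_alpha=32,
    lora_dropout=0.05,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)
```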
-
### Feature request
In the quantization procedure for QLoRA, there is the NF4 storage datatype and the compute datatype (bfloat16 in the paper, which is the original) (please refer to the image). The…
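For reference, a short sketch of where those two dtypes are set in `BitsAndBytesConfig` (my reading of the request; the referenced image is not reproduced here):
```
import torch
from transformers import BitsAndBytesConfig

bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",              # storage dtype: 4-bit NormalFloat
    bnb_4bit_compute_dtype=torch.bfloat16,  # compute dtype used for the matmuls
    bnb_4bit_use_double_quant=True,         # optional double quantization from the paper
)
```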
-
```
(qloravenv) C:\deepdream-test\llm_qlora-main>python train.py C:\deepdream-test\llm_qlora-main\configs\mistralaiMistral-7B-v0.1.yaml
Load base model
Traceback (most recent call last):
  File "C:\dee…
```
-
Tried fine-tuning the [InstructCodeT5+](https://huggingface.co/Salesforce/instructcodet5p-16b) model using QLoRA, and the loss gets stuck at a particular value. Code for the experiment:
```
import pand…
```
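One common cause of a flat loss is that the LoRA adapters never attach to any modules (e.g. `target_modules` not matching the model's actual layer names), leaving nothing trainable. A hedged diagnostic sketch, assuming `model` is the loaded InstructCodeT5+ model; the module names are placeholders to adapt via `model.named_modules()`:
```
from peft import LoraConfig, get_peft_model

lora_config = LoraConfig(
    r=16,
    lora_alpha=32,
    target_modules=["q", "k", "v", "o"],  # assumption: adjust to the real attention projection names
    task_type="SEQ_2_SEQ_LM",
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()  # should report a small but non-zero trainable fraction
```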
-
### Is there an existing issue / discussion for this?
- [X] I have searched the existing issues / discussions
### Is there an existing answer for this in the FAQ? …
-
When fine-tuning with QLoRA, GPU memory overflows because the fine-tuning data is too long. Can multi-GPU QLoRA fine-tuning be configured?
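A hedged sketch of one common setup: `device_map="auto"` shards the 4-bit base model across all visible GPUs (naive model parallelism), and gradient checkpointing further cuts activation memory for long sequences. The model name is a placeholder.
```
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig

model = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Llama-2-7b-hf",  # placeholder
    quantization_config=BitsAndBytesConfig(
        load_in_4bit=True,
        bnb_4bit_quant_type="nf4",
        bnb_4bit_compute_dtype=torch.bfloat16,
    ),
    device_map="auto",  # spread layers over every available GPU
)
model.gradient_checkpointing_enable()  # trades compute for activation memory on long inputs
```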
-
ValueError: FP16 Mixed precision training with AMP or APEX (`--fp16`) and FP16 half precision evaluation (`--fp16_full_eval`) can only be used on CUDA devices.
How can this error be resolved?
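The message itself says the `--fp16` flags require CUDA, so a typical workaround is to drop them (or switch to bf16 where the backend supports it). A hedged sketch using standard `TrainingArguments` fields:
```
from transformers import TrainingArguments

args = TrainingArguments(
    output_dir="out",
    fp16=False,            # --fp16 only works on CUDA devices
    fp16_full_eval=False,  # likewise for --fp16_full_eval
    bf16=True,             # assumption: the device/backend supports bfloat16
)
```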
-
**base-model: Weyaxi/Dolphin2.1-OpenOrca-7B**
**Scenario:**
- Followed the guidelines at https://github.com/intel-analytics/ipex-llm/tree/main/python/llm/example/GPU/LLM-Finetuning/QLoRA…
-
For workloads such as QLoRA, we could save and upload pre-quantized model weights (or use existing ones), which would have a couple of benefits (see the sketch after this list):
- Allow users to save disk space by only working with 4-…
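A hedged sketch of the proposed flow, assuming a transformers/bitsandbytes version recent enough to serialize 4-bit weights (repo and directory names are placeholders):
```
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig

# One-time: quantize on load, then save the already-quantized checkpoint.
model = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Llama-2-7b-hf",  # placeholder
    quantization_config=BitsAndBytesConfig(
        load_in_4bit=True,
        bnb_4bit_quant_type="nf4",
        bnb_4bit_compute_dtype=torch.bfloat16,
    ),
)
model.save_pretrained("llama-2-7b-nf4")  # roughly 4-bit footprint on disk

# Later, or for other users: load the pre-quantized weights directly,
# skipping the full-precision download and on-the-fly quantization.
model = AutoModelForCausalLM.from_pretrained("llama-2-7b-nf4")
```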
-
Using llama2_7b_qlora_alpaca_enzh_e3.py as the template for QLoRA fine-tuning on gsm8k, after changing PROMPT_TEMPLATE.llama2_chat to PROMPT_TEMPLATE.llama3_chat the accuracy dropped from 62 to 28. What could be causing this?
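For context, a hedged sketch of the change being described, assuming the usual xtuner config layout:
```
# In the xtuner config derived from llama2_7b_qlora_alpaca_enzh_e3.py:
from xtuner.utils import PROMPT_TEMPLATE

prompt_template = PROMPT_TEMPLATE.llama2_chat    # original setting
# prompt_template = PROMPT_TEMPLATE.llama3_chat  # the change after which acc fell 62 -> 28
```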