-
Hello!
The compute precision of QLoRA training is float16; what is the compute precision of qa-lora training?
My fine-tuning of TechGPT-7b was successful with QLoRA, but using qa-lora always repo…
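Not speaking for the qa-lora authors, but in the standard QLoRA setup the compute precision is whatever you pass as the bitsandbytes compute dtype. A minimal sketch, assuming the Hugging Face transformers + bitsandbytes stack; the model path is a placeholder:

```python
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig

# NF4 4-bit weights, with dequantized matmuls computed in float16
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.float16,  # the fp16 compute precision in question
)
model = AutoModelForCausalLM.from_pretrained(
    "path/to/TechGPT-7b",  # placeholder path
    quantization_config=bnb_config,
)
```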
-
Hello @tdrussell,
First of all, thank you very much for your great repo! Pulling all of these optimization techniques together is genuinely impressive work.
When using the repo, I try to use the bf16 e…
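For reference, a minimal sketch of switching such a setup to bf16 with the transformers Trainer; this is the generic pattern, not code from the repo itself:

```python
import torch
from transformers import BitsAndBytesConfig, TrainingArguments

bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_compute_dtype=torch.bfloat16,  # run dequantized matmuls in bf16
)
training_args = TrainingArguments(
    output_dir="out",  # placeholder
    bf16=True,         # enable bf16 mixed-precision training
)
```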
-
Hi,
I am trying to use the QLoRA code provided in the repo on a Sapphire Rapids machine with a Flex GPU.
I was able to run the [qlora_finetuning.py](https://github.com/intel-analytics/BigDL/blob/m…
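For anyone comparing notes, the load path in that example boils down to something like the sketch below; this is a rough reconstruction from the BigDL-LLM examples, and the exact argument names may differ between versions:

```python
import torch
import intel_extension_for_pytorch as ipex  # registers the 'xpu' device
from bigdl.llm.transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained(
    "huggyllama/llama-7b",   # placeholder model id
    load_in_low_bit="nf4",   # NF4 4-bit quantization used by QLoRA
    torch_dtype=torch.float16,
)
model = model.to("xpu")      # move to the Intel Flex GPU
```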
-
Hi folks,
I'm running into an issue fine-tuning the 70B Llama 2 model with 4-bit QLoRA using the FastChat package, and I'm wondering if anyone else has encountered similar issues or has suggestions f…
-
I tried your example `Kaggle Mistral 7b Unsloth notebook`. The only thing I changed was flipping False to True in this line:
`if True: model.save_pretrained_merged("model", tokenizer, save_method = "mer…
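For reference, the full line in the Unsloth notebooks is of this form; the `save_method` value is assumed to be `"merged_16bit"` here, since the quoted line is cut off:

```python
# Assumed completion of the truncated line; "merged_16bit" merges the LoRA
# weights into the base model and saves in 16-bit, per the Unsloth examples.
if True:
    model.save_pretrained_merged("model", tokenizer, save_method="merged_16bit")
```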
-
I have problems resuming from a checkpoint. What I did:
1) `python qlora.py --model_name_or_path huggyllama/llama-7b`
2) abort when a checkpoint has been written
3) `python qlora.py --model_name_or_path…
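For comparison, the generic Hugging Face Trainer resume pattern looks like the sketch below; `trainer` stands in for the Trainer instance qlora.py builds, so this is illustrative rather than the script's exact logic:

```python
from transformers.trainer_utils import get_last_checkpoint

# Find the newest checkpoint-* directory and resume from it, restoring
# optimizer and scheduler state along with the adapter weights.
last_ckpt = get_last_checkpoint("./output")  # e.g. ./output/checkpoint-500
trainer.train(resume_from_checkpoint=last_ckpt)
```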
-
### Feature request
Is there any chance we could get this 4-bit Adam optimizer added to transformers?
It has nearly the same performance as 32-bit Adam with a significant drop in VRAM overhead.
[repo…
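In the meantime, the closest optimizer already wired into transformers is bitsandbytes' 8-bit AdamW; this sketch uses that as a stand-in for the requested 4-bit optimizer:

```python
from transformers import TrainingArguments

training_args = TrainingArguments(
    output_dir="out",        # placeholder
    optim="adamw_bnb_8bit",  # bitsandbytes 8-bit AdamW: ~4x smaller optimizer state than fp32
)
```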
-
```python
# Load the model
print("Loading model----")
model = AutoModelForCausalLM.from_pretrained(
    args.model_name_or_path,
    device_map="auto",
    # device_map=device_map,
    load_in_4bit=T…
```
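For context, the step that usually follows a 4-bit load like this is preparing the quantized model for training and attaching LoRA adapters with peft. This is a minimal sketch, assuming a standard QLoRA setup; the LoRA hyperparameters and target modules are placeholders, not taken from the issue:

```python
from peft import LoraConfig, get_peft_model, prepare_model_for_kbit_training

# Cast norm layers appropriately and enable input grads for k-bit training
model = prepare_model_for_kbit_training(model)

lora_config = LoraConfig(
    r=16,                                 # placeholder rank
    lora_alpha=32,
    lora_dropout=0.05,
    target_modules=["q_proj", "v_proj"],  # placeholder target modules
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()
```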
-
During GLoRA training I am facing the error "The size of tensor a (8388608) must match the size of tensor b (4096) at non-singleton dimension 0".
I am using GLoRA from PR https://github.com/Arnav…
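The shapes in the message are consistent with a flattened weight being broadcast against a per-channel vector, since 8388608 = 4096 × 2048. A minimal repro of the same failure, assuming those shapes:

```python
import torch

a = torch.zeros(4096 * 2048)  # 8388608 elements, e.g. a flattened weight
b = torch.zeros(4096)         # per-channel vector
try:
    _ = a + b
except RuntimeError as e:
    print(e)  # The size of tensor a (8388608) must match the size of tensor b (4096)...
```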
-
Hi, dear author:
LLaVA-NeXT seems like really insightful exploratory work. Please kindly release the training and inference code as soon as possible; thank you very much.