-
We have implemented 4-bit QLoRA. Thanks to an optimized kernel implementation of back-propagation, fine-tuning speed is currently comparable to 8-bit LoRA. Feedback and issues are welcome: https://github.c…
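To illustrate where the 4-bit memory savings come from, here is a minimal sketch of blockwise absmax quantization in plain Python. This is a deliberate simplification: QLoRA's actual NF4 scheme uses a non-uniform code book and fused CUDA kernels, and none of the names below come from the linked repository.

```python
# Sketch of blockwise 4-bit absmax quantization (simplified; real QLoRA
# uses the non-uniform NF4 code book rather than uniform levels).

def quantize_block(values, levels=15):
    """Quantize a block of floats to signed 4-bit ints plus one fp scale."""
    scale = max(abs(v) for v in values) or 1.0
    half = levels // 2  # 7 for 4-bit signed
    q = [round(v / scale * half) for v in values]  # ints in [-7, 7]
    return q, scale

def dequantize_block(q, scale, levels=15):
    """Recover approximate floats from the 4-bit codes and the block scale."""
    half = levels // 2
    return [x / half * scale for x in q]

block = [0.31, -1.2, 0.05, 0.9]
q, s = quantize_block(block)
approx = dequantize_block(q, s)
```

Each block stores one full-precision scale plus 4 bits per weight, which is the rough 4x memory reduction over fp16 that makes QLoRA fine-tuning fit on small GPUs.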
-
-
After producing the correct result, my QLoRA fine-tuned model still outputs some irrelevant content, for example:
![image](https://github.com/yangjianxin1/Firefly/assets/59114904/e3b50b77-165b-4757-b4eb-0a6349ec1f12)
I am using the sentence-splitting feature, but after the sentence ends the model still outputs irrelevant content. At first I thought the training set was too small, so I increased the training set size from 2…
-
When I use the command below, I get an error:
```shell
python3 qlora.py --learning_rate 0.0001 --model_name_or_path
```
╭─────────────────────────────── Traceback (most recent call last) ─…
-
Hi,
I am trying to use the QLoRA code as provided in the repo on a Sapphire Rapids machine with a Flex GPU.
I was able to run the [qlora_finetuning.py](https://github.com/intel-analytics/BigDL/blob/m…
-
In the given axolotl examples [examples/medusa](https://github.com/ctlllll/axolotl/tree/main/examples/medusa),
I followed `vicuna_7b_qlora_stage1.yml` and `vicuna_7b_qlora_stage2.yml` to write my …
-
> Today we’re releasing the next step: QDoRA. This is just as memory efficient and scalable as FSDP/QLoRA, and critically is also as accurate for continued pre-training as full weight training. We thi…
-
Right now, [`requirements.txt`](https://github.com/artidoro/qlora/blob/main/requirements.txt) has `accelerate @ git+https://github.com/huggingface/accelerate.git`, but as of now this breaks QLoRA func…
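One workaround, sketched below, is to pin `accelerate` to a released version instead of tracking git HEAD; the version string here is a placeholder, since the last known-good release would need to be confirmed against the repo's issue tracker.

```text
# requirements.txt — pin to a release instead of git HEAD
# (replace <known-good-version> with the last release that works)
accelerate==<known-good-version>
```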
-
QLoRA LLaMA 13B
```
File "/home/hysz/anaconda3/envs/qlora/lib/python3.10/site-packages/torch/optim/lr_scheduler.py", line 69, in wrapper
return wrapped(*args, **kwargs)
File "/home/hysz/…
-
Hi Team,
I have successfully fine-tuned a QLoRA adapter on a custom dataset. When I try to load it in full precision, it loads and works well.
But this takes too much time and GPU memory to …
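A common way to cut both load time and GPU memory is to load the base model 4-bit-quantized and attach the adapter on top. The following is a sketch assuming the standard `transformers`/`peft` APIs; the model name and adapter path are placeholders, not values from this issue.

```python
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig
from peft import PeftModel

# Quantize the frozen base weights to NF4 at load time instead of
# loading them in full precision.
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_use_double_quant=True,
    bnb_4bit_compute_dtype=torch.float16,
)

base = AutoModelForCausalLM.from_pretrained(
    "base-model-name",  # placeholder: the base checkpoint you fine-tuned from
    quantization_config=bnb_config,
    device_map="auto",
)

# Attach the fine-tuned QLoRA adapter on top of the quantized base.
model = PeftModel.from_pretrained(base, "path/to/qlora-adapter")  # placeholder path
model.eval()
```

Since only the small LoRA adapter stays in higher precision, this matches the memory footprint used during QLoRA training rather than the full-precision footprint.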