-
### System Info
- `transformers` version: 4.40.0.dev0
- Platform: Linux-5.15.0-101-generic-x86_64-with-glibc2.17
- Python version: 3.8.2
- Huggingface_hub version: 0.20.2
- Safetensors version: 0…
-
Trying to train on a 12GB card: python qlora.py --model_name="chinese_alpaca" --model_name_or_path="./model_hub/chinese-alpaca-7b" --trust_remote_code=False --dataset="msra" --source_max_len=128 --target_max_len=64 --do_t…
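As a back-of-envelope sanity check for the 12GB constraint, the 4-bit quantized base weights of a 7B model alone take roughly 3.3 GiB, which is why QLoRA can fit on such a card (this is a rough estimate ignoring activations, the LoRA optimizer state, and CUDA overhead; the helper name is hypothetical):

```python
def qlora_weight_memory_gb(n_params_billion, bits=4):
    """Rough GiB needed for the quantized base weights alone.

    Excludes activations, gradients/optimizer state for the LoRA
    adapters, and CUDA context overhead, so treat it as a lower bound.
    """
    bytes_total = n_params_billion * 1e9 * bits / 8
    return bytes_total / 1024**3

# 7B parameters at 4 bits per weight:
print(round(qlora_weight_memory_gb(7), 2))  # ~3.26 GiB
```

The remaining headroom on a 12GB card is consumed by activations (hence the small `--source_max_len`/`--target_max_len` values) and the LoRA adapter training state.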
-
After QLoRA fine-tuning, the model produces the correct result but then continues to output unrelated content, for example:
![image](https://github.com/yangjianxin1/Firefly/assets/59114904/e3b50b77-165b-4757-b4eb-0a6349ec1f12)
I am using sentence segmentation, but after the sentence ends the model still outputs irrelevant content. At first I thought the training set was too small, so I increased the training-set size from 2…
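Two common mitigations for this over-generation symptom are (a) appending the tokenizer's EOS token to every training target so the model learns where to stop, and (b) truncating the decoded output at the first stop marker at inference time. A minimal post-processing sketch of (b), with hypothetical stop strings:

```python
def truncate_at_stop(text, stop_strings=("</s>", "\n\n")):
    """Cut generated text at the first occurrence of any stop marker.

    stop_strings is a placeholder; use whatever EOS/separator your
    tokenizer and prompt template actually emit.
    """
    cut = len(text)
    for s in stop_strings:
        idx = text.find(s)
        if idx != -1:
            cut = min(cut, idx)
    return text[:cut]

print(truncate_at_stop("正确结果</s>无关内容"))  # keeps only the part before EOS
```

Fixing the training targets (option a) is the more principled solution; truncation only hides the symptom.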
-
I notice that there are some differences compared to the `artido/qlora` repo. Why was the following code left out of this repo?
```py
def find_all_linear_names(args, model):
    cls = bnb.nn.Lin…
```
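The snippet is cut off, but helpers of this shape conventionally walk `model.named_modules()` and collect the leaf names of every (quantized) linear layer so they can be used as LoRA target modules. A standalone sketch of that idea, using hypothetical stand-in classes so it runs without torch or bitsandbytes:

```python
def find_all_linear_names(named_modules, linear_cls, skip=("lm_head",)):
    """Collect leaf names of all modules that are instances of linear_cls.

    named_modules: iterable of (dotted_name, module) pairs, in the shape
    torch's model.named_modules() returns. linear_cls would typically be
    bnb.nn.Linear4bit for 4-bit training.
    """
    lora_module_names = set()
    for name, module in named_modules:
        if isinstance(module, linear_cls):
            parts = name.split(".")
            # keep only the leaf name, e.g. "q_proj" from "layers.0.q_proj"
            lora_module_names.add(parts[0] if len(parts) == 1 else parts[-1])
    # the output head is usually excluded from LoRA targets
    return sorted(n for n in lora_module_names if n not in skip)


class FakeLinear:      # stand-in for bnb.nn.Linear4bit / torch.nn.Linear
    pass

class FakeEmbedding:   # non-linear module, should be ignored
    pass

modules = [
    ("model.layers.0.self_attn.q_proj", FakeLinear()),
    ("model.layers.0.self_attn.v_proj", FakeLinear()),
    ("model.embed_tokens", FakeEmbedding()),
    ("lm_head", FakeLinear()),
]
print(find_all_linear_names(modules, FakeLinear))  # -> ['q_proj', 'v_proj']
```

Repos that hard-code `target_modules` in their LoRA config (e.g. a fixed list of attention projections) don't need this discovery helper, which may be why it was dropped.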
-
Currently, Unsloth only supports single-GPU training; how can it be extended to 8-GPU training? Thanks.
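For context on what multi-GPU data parallelism would add: each of the 8 GPUs holds a model replica, processes its own micro-batch, and the per-replica gradients are averaged with an all-reduce before every optimizer step. A conceptual sketch of that averaging (plain Python, not an Unsloth or DDP API):

```python
def allreduce_mean(per_gpu_grads):
    """Average gradients element-wise across replicas, as a data-parallel
    all-reduce does each step. per_gpu_grads is a list (one entry per GPU)
    of equally shaped flat gradient lists."""
    n = len(per_gpu_grads)
    return [sum(g) / n for g in zip(*per_gpu_grads)]

# two replicas, two parameters each:
print(allreduce_mean([[1.0, 2.0], [3.0, 4.0]]))  # [2.0, 3.0]
```

In practice this is what `torch.nn.parallel.DistributedDataParallel` does under the hood; whether Unsloth's custom kernels compose with it is exactly the open question in this issue.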
-
In the given axolotl examples [examples/medusa](https://github.com/ctlllll/axolotl/tree/main/examples/medusa),
I followed `vicuna_7b_qlora_stage1.yml` and `vicuna_7b_qlora_stage2.yml` to write my …
-
#### The inference code in `inference.ipynb` takes 3 minutes to run on a Colab L4 GPU. Is there any way to speed up inference?
@swastikmaiti
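One generic throughput lever, independent of the notebook's specifics, is generating for several prompts per forward pass instead of one at a time. A small helper for chunking a prompt list into batches (hypothetical names; batch size must fit the L4's memory):

```python
def batched(items, batch_size):
    """Yield fixed-size chunks of a prompt list for batched generation."""
    for i in range(0, len(items), batch_size):
        yield items[i:i + batch_size]

prompts = [f"question {i}" for i in range(10)]
print([len(b) for b in batched(prompts, 4)])  # [4, 4, 2]
```

Each chunk would then be tokenized with padding and passed to `model.generate` together; fp16/bf16 weights and capping `max_new_tokens` are the other usual low-effort wins.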
-
I fine-tuned FLAN-T5 XL and FLAN-T5 XXL with QLoRA and ran into a problem: the learning rate and loss are both logged as 0.0. Can anyone help resolve this? Thanks.
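Two separate effects can produce these symptoms. A logged learning rate of 0.0 at the very first steps is often just a warmup schedule reporting its pre-warmup value, while a loss of exactly 0.0 (or NaN) with T5-family models is a known fp16 overflow symptom, usually fixed by training in bf16 or fp32. A minimal sketch of linear warmup showing the first effect (hypothetical helper, not from any repo):

```python
def linear_warmup_lr(step, base_lr=3e-4, warmup_steps=100):
    """Linearly ramp the learning rate from 0 to base_lr over warmup_steps."""
    if step < warmup_steps:
        return base_lr * step / warmup_steps
    return base_lr

print(linear_warmup_lr(0))    # 0.0 -- shows up as "lr: 0.0" in the log
print(linear_warmup_lr(100))  # 0.0003
```

If the learning rate stays at 0.0 well past warmup, the scheduler configuration itself is suspect; if only the loss is stuck at 0.0, check the compute dtype first.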
-
I added 4-bit loading to the LoRA-with-ZeRO-3 training command on two or more GPUs, aiming for a mix of QLoRA and ZeRO-3. But the program encountered the following error:
RuntimeError: expected ther…
-
To enable efficient training on GPUs and scale our repository for models with millions to billions of parameters—essential for working with large visual language models—we must implement optimization …
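One such optimization technique is gradient accumulation, which simulates a large effective batch on memory-limited GPUs by averaging gradients over several micro-batches before each optimizer step. A toy simulation of the bookkeeping (scalar "gradients" stand in for tensors):

```python
def train_with_accumulation(micro_batch_grads, accum_steps):
    """Simulate gradient accumulation: scale each micro-batch gradient by
    1/accum_steps, accumulate, and emit one optimizer update per
    accum_steps micro-batches. Returns the applied updates."""
    updates = []
    buf = 0.0
    for i, g in enumerate(micro_batch_grads, start=1):
        buf += g / accum_steps          # loss scaling happens here
        if i % accum_steps == 0:
            updates.append(buf)         # optimizer.step() equivalent
            buf = 0.0                   # optimizer.zero_grad() equivalent
    return updates

# four micro-batches, stepping every 2 -> two averaged updates
print(train_with_accumulation([1.0, 2.0, 3.0, 4.0], 2))  # [1.5, 3.5]
```

Mixed precision, gradient checkpointing, and ZeRO-style sharding are the other standard entries on this list; all trade a little compute or communication for a large reduction in per-GPU memory.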