-
Take /root/share/new_models/datasets/CLoT-cn as the dataset.
To finetune on /root/share/new_models/datasets/CLoT-cn with LoRA:
The config settings for LoRA are like the following:
```
###########################…
```
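The config above is cut off, so here is a minimal sketch of the LoRA-specific settings such a config usually carries, written with the PEFT library; the base model name and all hyperparameter values are assumptions for illustration, not the original settings:
```
from peft import LoraConfig, get_peft_model
from transformers import AutoModelForCausalLM

# Hypothetical base model; substitute the checkpoint the original config targets.
base_model = AutoModelForCausalLM.from_pretrained("Qwen/Qwen2-0.5B")

lora_config = LoraConfig(
    r=64,                    # rank of the LoRA update matrices
    lora_alpha=16,           # scaling applied to the LoRA updates
    lora_dropout=0.1,        # dropout on the LoRA layers
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],  # attention projections
    task_type="CAUSAL_LM",
)

model = get_peft_model(base_model, lora_config)
model.print_trainable_parameters()  # confirm only the adapter weights are trainable
```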
-
Hi, I'm looking over the optimizations in the trainer here and trying to port them to `transformers.trainer.Trainer` for use with Llama 2.
I put together this simple script to view the differenc…
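For comparison, a minimal sketch (assumed, not the poster's script) of a plain `transformers.Trainer` setup for a Llama 2 style model, i.e. the baseline such optimizations would be ported into; the model id and every hyperparameter are placeholders:
```
from datasets import Dataset
from transformers import (AutoModelForCausalLM, AutoTokenizer,
                          DataCollatorForLanguageModeling, Trainer, TrainingArguments)

model_name = "meta-llama/Llama-2-7b-hf"  # assumed (gated) model id; any causal LM works
tokenizer = AutoTokenizer.from_pretrained(model_name)
tokenizer.pad_token = tokenizer.eos_token
model = AutoModelForCausalLM.from_pretrained(model_name)

# Tiny toy dataset so the script runs end to end.
ds = Dataset.from_dict({"text": ["Hello world.", "LoRA makes finetuning cheaper."]}).map(
    lambda ex: tokenizer(ex["text"], truncation=True, max_length=64),
    remove_columns=["text"],
)

args = TrainingArguments(
    output_dir="out",
    per_device_train_batch_size=1,
    gradient_accumulation_steps=8,
    gradient_checkpointing=True,  # one of the memory optimizations worth porting
    bf16=True,
    optim="adamw_torch_fused",    # optimizer choice is another place the trainers differ
    max_steps=1,
)

trainer = Trainer(
    model=model,
    args=args,
    train_dataset=ds,
    data_collator=DataCollatorForLanguageModeling(tokenizer, mlm=False),
)
trainer.train()
```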
-
### Please check that this issue hasn't been reported before.
- [X] I searched previous [Bug Reports](https://github.com/OpenAccess-AI-Collective/axolotl/labels/bug) and didn't find any similar reports…
-
Is it possible to run `QLoRA` finetuning on more than a single device? I don't see any [configs](https://github.com/pytorch/torchtune/tree/main/recipes/configs/) for `QLoRA` other than for `single_de…
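Setting torchtune's recipes aside for a moment, a common way to run QLoRA across several GPUs today is the transformers + peft + bitsandbytes stack launched under `torchrun`; the sketch below only illustrates that pattern (model id and settings are placeholders), it is not a torchtune config:
```
# Launch with e.g.: torchrun --nproc_per_node 2 train_qlora.py
import os
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig

local_rank = int(os.environ.get("LOCAL_RANK", 0))

bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
)

model = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Llama-2-7b-hf",   # placeholder model id
    quantization_config=bnb_config,
    device_map={"": local_rank},  # keep the whole 4-bit model on this rank's GPU
)
# Wrap with a peft LoRA adapter and a Trainer as usual; when launched under
# torchrun, the Trainer handles DDP across the processes.
```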
-
When following the initial setup steps from https://github.com/tenstorrent/tt-inference-server/tree/main/vllm-tt-metal-llama3-70b#vllm-tt-metalium-llama-31-70b-inference-api, the setup fails due to a missing HF toke…
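Assuming the failure is simply an unauthenticated Hugging Face download of the gated Llama 3.1 weights, a minimal sketch of supplying a token before running the setup (the token value is a placeholder):
```
import os
from huggingface_hub import login

# Placeholder token; never commit a real one.
os.environ["HF_TOKEN"] = "hf_xxx"

# Logging in caches the credential for subsequent hub downloads.
login(token=os.environ["HF_TOKEN"])
```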
-
I fine-tuned a Qwen2 model with LoRA and saved it locally. I want to use Qwen Agent with it, but I can't figure out how to load the fine-tuned model; even loading the original Qwen model saved locally doesn't work.
```
def init_agent_service():
    llm_cfg = {'model': r'./model/Qwen2-0.5B',
               'model_server': 'http://127.0.0.1:…
```
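One common approach (a sketch under the assumption that the adapter was trained with PEFT; the adapter and output paths are placeholders) is to merge the LoRA weights into the base model first and then serve the merged checkpoint at the `model_server` URL that `llm_cfg` points to:
```
from peft import PeftModel
from transformers import AutoModelForCausalLM, AutoTokenizer

base_path = "./model/Qwen2-0.5B"           # local base model from the snippet above
adapter_path = "./output/qwen2-lora"       # placeholder path to the LoRA adapter
merged_path = "./model/Qwen2-0.5B-merged"  # placeholder output directory

base = AutoModelForCausalLM.from_pretrained(base_path)
model = PeftModel.from_pretrained(base, adapter_path)
model = model.merge_and_unload()           # fold the LoRA weights into the base weights

model.save_pretrained(merged_path)
AutoTokenizer.from_pretrained(base_path).save_pretrained(merged_path)
```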
-
### System Info
I use the SFTTrainer for QLoRA fine-tuning of the Mistral Instruct 2 model. I use unsloth to make my training faster. I have run the code multiple times before, but today I got the …
-
So I have a GPTQ llama model I downloaded (from TheBloke), and it's already 4-bit quantized. I have to pass in False for the load_in_4bit parameter of:
```
model, tokenizer = FastLlamaModel.from_pr…
```
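A minimal sketch of where that flag goes, using unsloth's loader (the newer `FastLanguageModel` name and the model repo are assumptions, and this only shows the parameter; whether unsloth accepts GPTQ checkpoints at all is the open question here):
```
from unsloth import FastLanguageModel  # FastLlamaModel in older unsloth releases

model, tokenizer = FastLanguageModel.from_pretrained(
    model_name="TheBloke/Llama-2-7B-GPTQ",  # placeholder GPTQ repo
    max_seq_length=2048,
    load_in_4bit=False,  # the weights are already 4-bit (GPTQ), so skip bnb quantization
)
```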
-
> Please provide us with the following information:
> ---------------------------------------------------------------
### This issue is for a: (mark with an `x`)
```
- [x] bug report -> please…
```
-
Hi team,
I tried to do QLoRA for a 30B Llama model with unsloth. I found that there is not much improvement in speed and memory usage. The details are as follows.
seq_length=8192
batch size=1
use flash a…
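For reference, a minimal sketch of the kind of unsloth QLoRA setup described above (the 30B checkpoint name, LoRA rank, and target modules are assumptions; only seq_length comes from the report):
```
from unsloth import FastLanguageModel

model, tokenizer = FastLanguageModel.from_pretrained(
    model_name="huggyllama/llama-30b",  # placeholder 30B checkpoint
    max_seq_length=8192,                # seq_length from the report
    load_in_4bit=True,                  # QLoRA: 4-bit base weights
)

model = FastLanguageModel.get_peft_model(
    model,
    r=16,                               # assumed LoRA rank
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
    use_gradient_checkpointing=True,
)
```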