issues
search
artidoro
/
qlora
QLoRA: Efficient Finetuning of Quantized LLMs
https://arxiv.org/abs/2305.14314
MIT License
9.7k
stars
796
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
llama 3 -support?
#296
LuoyaoChen
opened
3 weeks ago
0
additional load_in_4bit removed
#295
shirinyamani
opened
4 weeks ago
0
Qlora with flan-t5 issue - ValueError: Trying to set a tensor of shape torch.Size([4096, 4096])
#294
JhonDan1999
opened
1 month ago
0
Paged optimizer vs gradient checkpointing?
#293
LeoPerelli
opened
1 month ago
0
Error when loading model
#292
m000lie
opened
2 months ago
0
Llama 1 7b MMLU results largely diverges from reported
#291
Edenzzzz
opened
2 months ago
0
a critical loss drop happen after each epoch ending
#290
Coco58323
opened
2 months ago
0
Question about deployment of fine tuned model
#289
Brandon371
opened
4 months ago
0
Fuyu-8B qLora
#288
SinanAkkoyun
opened
5 months ago
0
RuntimeError: expected mat1 and mat2 to have the same dtype, but got: float != c10::BFloat16
#287
andeyeluguo
closed
5 months ago
1
[Questions]: How to implement NF4/NF2 matmul kernel function?
#286
llCurious
opened
5 months ago
1
Table 4 and Table 5 have different results
#285
dekoponTree
opened
5 months ago
0
[Bug] large CUDA memory usage in the evaluation phase
#284
ChenMnZ
opened
6 months ago
1
How to support FLAN v2 dataset.
#283
ChenMnZ
opened
6 months ago
0
How do you use oasst1 dataset in qlora.py - why only the 'text' field is used?
#282
Huxwell
opened
6 months ago
0
Using QLORA for Multi Modal Vison Foundation Models Optimization - google/owlv2-base-patch16-ensemble
#281
solomonmanuelraj
opened
6 months ago
0
RuntimeError: CUDA error: CUBLAS_STATUS_NOT_INITIALIZED when calling `cublasCreate(handle)`
#280
Juanerx
opened
6 months ago
1
Issue with Yi 34B Training EOS token not working
#279
mrmuke
closed
7 months ago
1
Saving/Loading qlora adapters
#278
chrisi2045
opened
7 months ago
0
Merge issue
#277
qburst-fidha
opened
7 months ago
0
Multi-GPU Training Giving Different Loss
#276
nikhil-ghosh-berkeley
opened
7 months ago
1
Unable to generate predictions
#275
SamarthMM
opened
8 months ago
0
Garbage output of Llama-2-13B-chat model after qlora finetuning
#274
cywsg
opened
8 months ago
0
Quantization aware finetuning?
#273
SinanAkkoyun
opened
8 months ago
0
Qlora Read me fix
#272
Vezora-Corp
opened
9 months ago
0
Training on logits rather than tokens?
#271
SinanAkkoyun
opened
9 months ago
0
adding lba support for qlora
#270
itayhubara
closed
9 months ago
0
extra memory usage for loading the model
#269
XintianHan
opened
9 months ago
0
TypeError: 'NoneType' object is not iterable
#268
reilgun
opened
9 months ago
3
DDP Training fails
#267
AntoineBlanot
closed
9 months ago
1
[XPU] CUDA error when running on arc770 with Intel extension for pytorch
#266
delock
opened
9 months ago
1
can we only evaluate the mmlu_dataset when sh sh scripts/finetune_guanaco_7b.sh?
#265
LiZhangMing
opened
9 months ago
1
Could not reproduce the results listed in your paper using a single 3090 card.
#264
LiZhangMing
opened
9 months ago
6
[Bug Fix] Add importing `warnings`
#263
tongyx361
opened
10 months ago
0
uneven distribution of GPU workload
#262
liatamax
opened
10 months ago
1
Question: CUDA memory usage in the evaluation phase
#261
LimboWK
opened
10 months ago
2
Why do we print just half of `trainable_params" when using 4-bits?
#260
HanGuo97
opened
10 months ago
0
[Question] Why can we set `model_parallel` and `is_parallelizable` to `True` for whichever `model`?
#259
tongyx361
opened
10 months ago
0
Why do we need the Dequantization process?
#258
nthehai01
opened
10 months ago
0
Fix outdated description of HF arguments in README.md
#257
tongyx361
opened
10 months ago
0
Error invalid device ordinal at line 393
#255
matt-seb-ho
opened
10 months ago
0
Should base model be dequantized when merging LoRA weights with base model?
#254
jinyongyoo
opened
10 months ago
6
Getting error dataclasses.FrozenInstanceError: cannot assign to field generation_config when executing any of the scripts in the scripts folder with default parameters.
#253
vasuems
opened
10 months ago
2
epoch presented does not match the calculation
#252
lijierui
opened
10 months ago
0
[Bug] Test set is taken from training set
#251
Peter-Devine
opened
10 months ago
1
can be used in stable diffusion?
#250
henbucuoshanghai
opened
10 months ago
0
curious about the train speed
#249
JustQJ
opened
10 months ago
0
Can you help implement MMLU evals into Axolotl?
#248
teknium1
opened
10 months ago
0
warnings.warn is not defined
#247
wangkuiyi
opened
10 months ago
0
Configure flake8, black, and pre-commit
#246
wangkuiyi
closed
8 months ago
0
Next