issues
search
unslothai
/
unsloth
Finetune Llama 3.2, Mistral, Phi, Qwen 2.5 & Gemma LLMs 2-5x faster with 80% less memory
https://unsloth.ai
Apache License 2.0
18.36k
stars
1.28k
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
How to train only last few layers using FastLanguageModel
#1320
gneeraj97
opened
1 hour ago
1
How to fine-tune LLaMA 3.2 11B Vision using LoRA with the recent update?
#1319
yukiarimo
opened
6 hours ago
1
Vision
#1318
danielhanchen
closed
6 hours ago
0
TypeError: expected string or bytes-like object
#1317
AndersonPedrosa35
closed
6 hours ago
1
Feat/kto
#1316
Erland366
opened
9 hours ago
0
Vision support
#1315
danielhanchen
closed
12 hours ago
0
failed finetune qwen32b_awq_int4 using lora with llama-factory
#1314
Daya-Jin
opened
13 hours ago
5
ImportError when importing FastLanguageModel from unsloth
#1313
MurphyJUAN
opened
20 hours ago
3
The tokenizer does not have a {% if add_generation_prompt %}
#1312
Galaxy-Husky
opened
22 hours ago
1
Not able to load model from huggingface repo with correct path (FileNotFoundError: invalid repository id)
#1311
ygl1020
opened
1 day ago
1
what was the quantisation algorithm used in unsloth/Llama-3.2-1B-bnb-4bit?
#1310
jayakommuru
opened
1 day ago
1
Does tensorRT-LLM support serving 4bit quantised unsloth Llama model
#1309
jayakommuru
opened
1 day ago
1
Unable to get the output due to bitsandbytes library error
#1308
iamnikhildogra
opened
1 day ago
1
Feat/kaggle-gguf-on-tmp
#1307
Erland366
opened
2 days ago
0
Pythia Models Unsupported
#1306
sert121
opened
3 days ago
1
Is it possible for Unsloth to support naive model parallelism?
#1305
Songjw133
opened
3 days ago
0
About alpha/rank in lora
#1304
Vital1162
opened
3 days ago
3
Dataset for train to translate language
#1303
nichellehouston
opened
3 days ago
1
AttributeError: module 'google.protobuf.descriptor' has no attribute '_internal_create_key'
#1302
Sherlock-shy
opened
4 days ago
2
Epoch number 3 disappears during evaluation
#1301
daegonYu
opened
4 days ago
1
Training Setting
#1300
nichellehouston
opened
4 days ago
2
Apple's cross entropy computation
#1299
fzyzcjy
closed
4 days ago
2
Can you add support for apple/ml-cross-entropy?
#1298
zfflxx
opened
5 days ago
6
`AttributeError: 'LlamaForCausalLM' object has no attribute 'update'`
#1297
scigeek72
opened
6 days ago
2
Error while importing "from unsloth import FastLanguageModel"
#1296
thesillystudent
opened
6 days ago
2
Fix too sensitive "Unsloth currently does not support multi GPU setups" when training with a single GPU in a multi-GPU environment.
#1295
giuliabaldini
opened
6 days ago
8
Error while importing unsloth in databricks
#1294
BurakaKrishna
opened
6 days ago
2
fix/sfttrainer-compatibility
#1293
Erland366
closed
1 week ago
1
Low GPU utilization when running Unsloth-finetuned Qwen2.5-Coder-14B-Instruct-128K-GGUF
#1292
e1ijah1
opened
1 week ago
3
Extremely long context finetuning
#1291
GianlucaDeStefano
opened
1 week ago
2
Train on responses only does not seem to work for Mistral format
#1290
LostRuins
opened
1 week ago
3
Added Support for Apple Silicon
#1289
shashikanth-a
opened
1 week ago
4
Bug fixes
#1288
danielhanchen
closed
1 week ago
0
fix indentation error in models/_utils.py:209
#1287
grpathak22
opened
1 week ago
0
Fix orpo/dpo trainer
#1286
dame-cell
opened
1 week ago
0
`unexpected keyword argument tokenizer` [FIXED]
#1285
avemio-digital
opened
1 week ago
4
`{% if add_generation_prompt %}` [FIXED]
#1284
giuliabaldini
opened
1 week ago
6
Error loading model: Unsloth: unsloth/Meta-Llama-3.1-8B-bnb-4bit not supported yet! Make an issue to https://github.com/unslothai/unsloth!
#1283
sree-tejis
opened
1 week ago
3
Gradient norm is zero for training Qwen2.5-0.5B-Instruct in unsloth=="2024.11.6"
#1282
joe32140
opened
1 week ago
1
Fix/export mistral
#1281
Erland366
closed
1 week ago
3
Qwen 2.5
#1280
danielhanchen
closed
1 week ago
0
FileExistsError: [WinError 183]
#1279
rogersohandsome
opened
1 week ago
1
Will the open source version support multiple Gpus later?
#1278
first-li
opened
1 week ago
3
Can you support the fine-tuning of the MiniCPM3-4B model ?
#1277
faceair
opened
1 week ago
2
fix/sft-trainer
#1276
Erland366
closed
1 week ago
5
Finetuned Llama 3.1 8B (base) gets stuck in a loop
#1275
skerit
opened
1 week ago
2
Add Support for Pre-Training
#1274
dame-cell
opened
1 week ago
2
Resizing tokenizer leads to missing end token and garbage response?
#1273
Mark-DelGrande
opened
1 week ago
1
Jupyter notebook: No module named 'unsloth'
#1272
iwouldratherbeatthebeach
opened
1 week ago
3
dataset for train model to translate language
#1271
nichellehouston
closed
1 week ago
1
Next