-
### System Info
8xH100
### Who can help?
_No response_
### Information
- [ ] The official example scripts
- [ ] My own modified scripts
### Tasks
- [ ] An officially supported ta…
-
Hello Expert,
An error occurred when exporting GGUF with F16:
>>> **model.save_pretrained_gguf("model", tokenizer, quantization_method = "f16")**
Unsloth: Merging 4bit and LoRA weights to 16bit…
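For context, the log line shows Unsloth folding the LoRA adapter into the 16-bit base weights before the GGUF conversion. Conceptually this is the standard LoRA merge, which adds the scaled low-rank product to the frozen weight; a minimal NumPy sketch of the idea (names are illustrative, not Unsloth's internals):

```python
import numpy as np

def merge_lora(W, A, B, alpha, r):
    """Fold a LoRA adapter into the base weight matrix.

    W: (out, in) frozen base weight
    A: (r, in)  LoRA down-projection
    B: (out, r) LoRA up-projection
    alpha, r:   LoRA scaling hyperparameters
    """
    # merged = W + (alpha / r) * B @ A  -- the standard LoRA merge rule
    return W + (alpha / r) * (B @ A)

W = np.zeros((4, 4), dtype=np.float32)
A = np.ones((2, 4), dtype=np.float32)
B = np.ones((4, 2), dtype=np.float32)
merged = merge_lora(W, A, B, alpha=4, r=2)
print(merged[0, 0])  # (alpha/r) * sum over rank = 2 * 2 = 4.0
```

After the merge, the adapter is no longer needed and the combined weights can be quantized or converted as a single model.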
-
Hi!
I am using mistralai/Mistral-7B-v0.1, and when I set CUDA_VISIBLE_DEVICES=1 (or anything other than 0) it gives me the error below. After checking, I think the culprit is this [line](https://…
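This kind of error typically appears when code hard-codes a physical device index: CUDA_VISIBLE_DEVICES renumbers the visible GPUs, so with `CUDA_VISIBLE_DEVICES=1` the only valid device inside the process is `cuda:0` (which is physical GPU 1), and any reference to `cuda:1` fails. A pure-Python sketch of the remapping (`physical_gpu` is a hypothetical helper for illustration, not a torch API):

```python
import os

def physical_gpu(logical_index: int) -> int:
    """Map a logical CUDA device index (what the process sees) to the
    physical GPU id, honoring CUDA_VISIBLE_DEVICES."""
    visible = os.environ.get("CUDA_VISIBLE_DEVICES")
    if not visible:
        return logical_index  # no masking: logical == physical
    ids = [int(x) for x in visible.split(",") if x.strip()]
    if logical_index >= len(ids):
        # e.g. cuda:1 when only one GPU is visible -> the reported error
        raise ValueError(f"cuda:{logical_index} is not visible")
    return ids[logical_index]

os.environ["CUDA_VISIBLE_DEVICES"] = "1"
print(physical_gpu(0))  # → 1 (torch's cuda:0 is physical GPU 1)
```

The usual fix is to always address `cuda:0` (or `torch.cuda.current_device()`) inside the process and let the environment variable do the selection.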
-
Hello there,
I am trying to fine-tune with the SFT Trainer, but when the training process starts via `trainer.train()`, my system throws a segmentation fault.
My system is equipp…
-
### Reminder
- [X] I have read the README and searched the existing issues.
### System Info
python 3.10
vllm:latest
### Reproduction
Do you have any guide or documentation on how to incorporate…
-
I use this code to fine-tune Llama-3-8B-Instruct (with unsloth):
```python
from unsloth import FastLanguageModel
import torch
from trl import SFTTrainer
from transformers import TrainingArguments…
```
-
I was training Gemma-2B and Gemma-7B using SFTTrainer with `packing=True` set. Gemma-2B's loss was quite normal, but Gemma-7B's was abnormally high. I'm not sure why this would happen, since bot…
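For context, `packing=True` concatenates multiple short training examples (separated by an EOS token) into fixed-length sequences so fewer pad tokens are wasted. A minimal greedy sketch of the idea (plain Python, not TRL's actual implementation; `eos_id=2` is an illustrative default):

```python
def pack_sequences(seqs, max_len, eos_id=2):
    """Greedily concatenate token sequences (each followed by an EOS
    separator) into chunks of at most max_len tokens."""
    packed, current = [], []
    for seq in seqs:
        item = seq + [eos_id]
        # start a new chunk when the next example would overflow
        if current and len(current) + len(item) > max_len:
            packed.append(current)
            current = []
        current = current + item
    if current:
        packed.append(current)
    return packed

examples = [[5, 6], [7], [8, 9, 10]]
print(pack_sequences(examples, max_len=8))
# → [[5, 6, 2, 7, 2], [8, 9, 10, 2]]
```

Note this sketch never splits an example across chunks; real packers may split or drop over-length examples, which is one place model-specific loss differences can creep in.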
-
Hi there, I fine-tuned Mistral Nemo 12B using Unsloth. However, when downloading my LoRA adapters from Hugging Face to use the model to generate text, I get this error:
"ValueError: The following …
-
I am trying to generate batch data (more than 1,000,000 pairs), but I find that model inference is slow. Any advice?
```python
import torch
from unsloth import FastLanguageModel
model, tokenizer =…
```
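For throughput, the usual advice is to run `model.generate` on padded batches of prompts rather than one prompt at a time. A generic chunking helper you could feed batches from (plain Python sketch; the batch size is an assumption to tune against your GPU memory):

```python
def batched(items, batch_size):
    """Yield consecutive slices of `items` of length batch_size;
    the final batch may be shorter."""
    for start in range(0, len(items), batch_size):
        yield items[start:start + batch_size]

prompts = [f"prompt {i}" for i in range(10)]
batch_sizes = [len(b) for b in batched(prompts, batch_size=4)]
print(batch_sizes)  # → [4, 4, 2]
```

Each yielded batch would then be tokenized with padding and passed to the model in one forward call, which amortizes per-call overhead across the batch.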
-
### Reminder
- [X] I have read the README and searched the existing issues.
### System Info
- `llamafactory` version: 0.7.2.dev0
- Platform: Linux-4.18.0-517.el8.x86_64-x86_64-with-glibc2.28
- Py…