unsloth Search Results - Githubissues

1000+ results
for unsloth

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

stanfordnlp/dspy #1479

Average Metric: 0 / n for Multilingual RAG

Hi, I've seen a lot of tutorial of Dspy already for english,I wanted to give it a try for bangla,The code below gives me wrong answer for all the questions that i ask to my Dspy based RAG system,I ne…

mobassir94 updated 2 months ago
1
HydPy/HydPy-meetups #66

The Guide to Building Open Indic LLMs Today

**Title of the talk/workshop** The Guide to Building Open Indic LLMs Today **Abstract of the talk/workshop** - Steps in training modern LLMs - Challenges specific to Indic Models (Tokeni…

ramsrigouthamg updated 1 month ago
2
QwenLM/Qwen2.5 #339

有计划适配unsloth吗？

这个训练上比FA2更快，而且vram占用更少，在llama2的测试上非常有效。然而还不支持原生的qwen2，尽管有些三方脚本支持llamafy qwen，但是因为潜在的实现错误风险，让人不太有尝试欲望。unsloth已经被集成到llamafactory中。 benchmark：https://unsloth.ai/blog/mistral-benchmark#Benchmark%20tabl…

rangehow updated 5 months ago
1
PMahern/StableDiffusionTagManager #14

Error message when running built-in natural language interro…

When using the natural language interrogator, joycaption, it's bringing up an error message: ![Capture](https://github.com/user-attachments/assets/d9e07f22-31ad-4584-9529-c962b7c3639c) 'Failed t…

Fylifa updated 2 months ago
3
unslothai/unsloth #571

Gemma 7B IT prompt formatting query

In the [notebook](https://colab.research.google.com/drive/1fxDWAfPIbC-bHwDSVj5SBmEJ6KG3bUu5?usp=sharing#scrollTo=LjY75GoYUCB8) where you mentioned about how absence of `` token affects the training lo…

AvisP updated 3 weeks ago
3
linkedin/Liger-Kernel #51

Which GPUs does this work on?

I'm assuming it only works on Ampere, Hopper, Lovelace. Is that correct? It might be nice to specify in the readme, if it is limited to certain GPU types.

nbroad1881 updated 2 months ago
10
hiyouga/LLaMA-Factory #4705

Using LLamaPro and LORA gives error: KeyError: 'train.num_la…

### Reminder - [X] I have read the README and searched the existing issues. ### System Info Latest version, Ubuntu 24.04 ### Reproduction Run Llama Pro and Lora together for finetuning on a model…

MarlNox updated 4 months ago
8
unslothai/unsloth #808

Runing out of disk space on colab while finetuning nemo

RuntimeError: [enforce fail at inline_container.cc:595] . unexpected pos 3072467392 vs 3072467280 setup: from trl import SFTTrainer from transformers import TrainingArguments from unsloth impor…

ammary25 updated 3 months ago
3
unslothai/unsloth #828

Unsloth 2024.8 patched 32 layers with 0 QKV layers, 0 O laye…

Hi, I've finetuned llama 3.1 8b with Unsloth, but I get an unhandled exception when running inference. This seems related to the bugfix I saw in 2024.8, perhaps there's more missing there? Here'…

ztick updated 3 months ago
2
yangjianxin1/Firefly #266

使用Unsloth反而OOM了是为什么呢？

pytorch:2.3.0 cuda:11.8 flash-attn:2.5.9.post1 python 3.10 unsloth是pip install git+https://github.com/yangjianxin1/unsloth.git 这样下的不开unsloth可以跑，开了之后max_length改到512，per device_train_bat…

yuyu990116 updated 5 months ago
2

上一页 1...92 93 94 95 96 97 98...100 下一页

1000+ results for unsloth

1000+ results
for unsloth