-
Hi, I'm training a model (essentially copied from https://huggingface.co/blog/unsloth-trl#unsloth--trl-integration):
```python
import torch
from trl import SFTTrainer
from transformers import Tr…
-
I see the listed finetunes as the bottom but I cannot find out where to upload them to. How do we use them?
-
Hi,
I have been trying doing a DP based finetuning on a dataset using Pythia 1B model. I receive the following error at epoch 5 when I Increase the dataset size to around 1000.
> TypeError: zer…
-
### Description & Motivation
LLMs are trained on growing size of corpora, only resume by epochs is not enough, as models may only be trained on a few epochs and one epoch may take a few days to tra…
-
I wanted to test SD3 training results with a grid made with Efficient Loader with other custom nodes that use Efficient Loader.
Unfortunately, there are a few issues with the SD3 Efficient Loader. I…
-
[ ] I have checked the [documentation](https://docs.ragas.io/) and related resources and couldn't resolve my bug.
**Describe the bug**
When I try to use generator.adapt(language='chinese'), I stil…
-
### Prerequisite
- [X] I have searched [Issues](https://github.com/open-mmlab/mmcv/issues) and [Discussions](https://github.com/open-mmlab/mmcv/discussions) but cannot get the expected help.
- [X] Th…
-
https://github.com/FlagOpen/FlagEmbedding/tree/master/Long_LLM/activation_beacon
https://huggingface.co/namespace-Pt/activation-beacon-llama2-7b-chat/tree/main
https://arxiv.org/abs/2401.03462
Cu…
-
### What is the issue?
My setup is a 4x A100 80GB, 2TB ram, dual intel cpu. Ubuntu server 22.04.
On a previous version of ollama, the model llama3.1:405b was loaded in a reasonable amount of second…
-
This bug popped up post `October-2024` tag. A glob file pattern makes it into HF's repo ID validator which causes it to fail.
Example:
```
model, tokenizer = FastLanguageModel.from_pretrain…