-
Currently we have no good way to train the `vocab.json` and `merges.txt` files for the `BytePairTokenizer`. This is the vocabulary format used by GPT-2, RoBERTa, and DeBERTa v1. It would be nice to allow…
-
Hi. Something (possibly not unsloth) changed between July and now.
I am getting an unexpected OOM error when trying to do a LoRA fine-tune. This worked before, but is now barfing.
Looked at #338, but not…
-
Hi, I carefully followed the tutorial for the post-editing task.
Although the tutorial has not been updated yet, I got as far as the learning step thanks to your help.
However, I faced another…
-
When running `run.sh` in bash without changing anything, I got the following error:
```
File "msgpack/_unpacker.pyx", line 205, in msgpack._cmsgpack.unpackb
ValueError: Unpack failed: incomplete …
```
-
**System information (version)**
- OpenCV => 3.1.0
- Operating System / Platform => Windows 10 64 Bit
- Compiler => python 3.5
**Detailed description**
The variable that I pass to the function i…
-
```
export LNG=de && python3 lm/preprocessing.py with configs/${LNG}_word_pp.json >configs/${LNG}_word_pp.json.output
export LNG=es && CUDA_VISIBLE_DEVICES=3 nohup python3 lm/trainer.py with con…
```
-
This plugin looks great. I just wanted to know what it does. I know it's a vocabulary trainer, but is it like Babbel or something else? Perhaps I can help with coding.
-
I was trying to fine-tune Meta-Llama-3-8B-Instruct on 4 GPUs with the following command:
`torchrun --nproc_per_node 4 -m training.run --output_dir llama3test --model_name_or_path meta-llama/Met…
-
Hello!
Please help me start training.
Here are all the steps I've taken:
1) Created a new project (ocr_service) in PyCharm with a virtual environment
2) Cloned the project (git clone https://github.com/…
-
Since the world tokenizer training code is not available, as far as I know, those of us who need a custom tokenizer train an HF tokenizer (the pip rwkv package, the RWKV-LM trainer and json2binidx_tool all support it…