Closed 50516017 closed 11 months ago
cc @younesbelkada
Hi @50516017 Thanks a lot for raising this up, There are a couple of issues in your script
1- You are performing pure fine-tuning with the 8-bit model, which is not supported. If you want to train models with 8-bit weights, you need attach adapters on it using peft
package. Please have a look at few examples here: https://github.com/huggingface/peft/tree/main/examples/int8_training
2- You are using bitsandbytes compiled on windows, I am not sure how the interaction of that package + transformers will behave. In our case we only support this bitsandbytes package: https://github.com/TimDettmers/bitsandbytes so you might encounter some issues we cannot catch
Can you print the model and share the result here? Thanks!
I set the LoRa parameters based on the link and executed the learning, and it worked! thank you very much!
Awesome, @50516017 , glad that it worked!
System Info
Hi I want to create fine tuning using "rinna/japanese-gpt-neox-3.6b-instruction-ppo" on windows os
However, when I ran training and tried to save, the following error occurred and the model was not saved to output_dir. How should I solve it? I am building an environment using WSL2 and installing bitsandytes using the following. Could that be the cause?
https://github.com/jllllll/bitsandbytes-windows-webui
If this repository is causing problems, shouldn't I be using bitsandbytes in a windows environment?
enviroment
pip list
Who can help?
@pacman100 : @muellerz
Information
Tasks
examples
folder (such as GLUE/SQuAD, ...)Reproduction
execute training code
Expected behavior
error message