DLLXW / baby-llama2-chinese

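A repository for pretraining from scratch and then SFT-ing a small-parameter Chinese LLaMa2; a single 24 GB GPU is enough to run it and obtain a chat-llama2 with basic Chinese question-answering ability.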
MIT License
2.47k stars, 305 forks

Windows support modifications in pretrain script #32

Closed. jh01231230 closed this 1 year ago

jh01231230 commented 1 year ago

Modified pretrain.py: made a couple of changes for Windows compatibility. They do not affect the integrity or functionality of the original code on Linux/Unix-based OSes.

Modified requirement.txt: added scikit-learn as a dependency.

Modified README.md: added instructions on how to run PyTorch distributed training and how to set the required environment variables automatically.
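
For readers who want a feel for what Windows-friendly distributed setup can look like, here is a minimal sketch. It is not the diff from this PR: the helper names `pick_backend` and `set_default_dist_env` and the default values (address, port, single-process rank and world size) are assumptions made for illustration. It relies on two facts: PyTorch's NCCL backend is not available on Windows while Gloo is, and the default `env://` initialization reads `MASTER_ADDR`, `MASTER_PORT`, `RANK`, and `WORLD_SIZE` from the environment.

```python
import os
import platform


def pick_backend(system=None):
    """Return a torch.distributed backend name for the given OS.

    Hypothetical helper, not taken from pretrain.py.
    """
    system = system or platform.system()
    # NCCL is not available on Windows builds of PyTorch; Gloo is.
    return "gloo" if system == "Windows" else "nccl"


def set_default_dist_env(env=None):
    """Fill in single-process defaults for variables that are not set yet.

    Hypothetical helper; the default values below are assumptions.
    Values already present in the environment are left untouched.
    """
    env = os.environ if env is None else env
    defaults = {
        "MASTER_ADDR": "127.0.0.1",
        "MASTER_PORT": "29500",
        "RANK": "0",
        "LOCAL_RANK": "0",
        "WORLD_SIZE": "1",
    }
    for key, value in defaults.items():
        env.setdefault(key, value)
    return {key: env[key] for key in defaults}


if __name__ == "__main__":
    print(pick_backend(), set_default_dist_env())
```

With PyTorch installed, the result could be passed on as `torch.distributed.init_process_group(backend=pick_backend())` after calling `set_default_dist_env()`. Whether the script in this repository does it this way is not stated in the PR description.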