Open ArthurMinovsky opened 1 year ago
new repository: https://github.com/mikeybellissimo/LoRA-MPT/tree/main
https://colab.research.google.com/drive/1rPwsy659VxOMyx7yUiYo6tsfsg_SyAuf?usp=sharing
still fail: loss == 0
@pitikorn32 (@boss-chanon บอกให้ tag) repo: https://github.com/mikeybellissimo/LoRA-MPT/tree/main
error message after upgrading peft lib: /tmp/tmpfgiig51b/main.c:2:10: fatal error: cuda.h: No such file or directory
Description:
Add MPT with Gradient Checkpointing and LoRa support into OpenThaiGPT pertaining code. We will use MPT with Lora for continue pertaining to task #179
To Do:
Outcomes:
References:
MPT with Lora code: https://github.com/iwalton3/mpt-lora-patch
MPT Weight: https://huggingface.co/mosaicml/mpt-1b-redpajama-200b
PEFT to implement Lora Finetuning: https://github.com/huggingface/peft
OpenThaiGPT Github: https://github.com/OpenThaiGPT/openthaigpt-pretraining/tree/main/src/model/openthaigpt_pretraining_model/models