Hello,
I recently trained a model with GPT-NeoX and converted the global_step1000 checkpoint into a pytorch_model.bin file using the zero_to_fp32.py script. However, I can't figure out how to load this file back into the model for text generation.
I have reviewed the GPT-NeoX documentation and code, but I still don't understand the specific steps required. Any example code or detailed instructions would be greatly appreciated.
Thank you very much!