Open sksq96 opened 2 years ago
This issue has been automatically marked as stale. If this issue is still affecting you, please leave any comment (for example, "bump"), and we'll keep it open. We are sorry that we haven't been able to prioritize it yet. If you have any new additional information, please include it with your comment!
Hello, have you solved this problem? What is the solution to it?
I still have the same problem. Has it been solved?
❓ Questions and Help
What is your question?
I pretrained a 13B GPT-3 model with FSDP following this guide. However, whenever I try to fine-tune it using that checkpoint as the starting point, the job is killed with the following message.
Code
What's your environment?
Installation method (pip, source): source