Closed cyc00518 closed 6 months ago
I also meet the problem can't pull nvcr.io/ea-bignlp/ga-participants/nemofw-training too.
@thangdt277 Can you start to train with other images?
This issue is stale because it has been open for 30 days with no activity. Remove stale label or comment or this will be closed in 7 days.
Describe the bug
Follow NeMo Framework PEFT with Mistral-7B
First error , I can't pull image although I logged in nvcr.io successfully:
So I change docker image to
nvcr.io/nvidia/nemo:23.10
which is mentioned in READMEWhen I start to Step 2: Run PEFT training
Here is the error :
So I commented out lines below in
megatron_gpt_peft_tuning.py
:Then it starts to work , Here is process:
But there is another error:
Additional context GPU : H100*1