microsoft / DeepSpeedExamples

Example models using DeepSpeed
Apache License 2.0

Suggested GPU to run the demo code of step2_reward_model_finetuning (DeepSpeed-Chat) #885

Open wenbozhangjs opened 3 months ago

wenbozhangjs commented 3 months ago

I followed the instructions on this page to run step2_reward_model_finetuning with the demo code.

On Google Cloud, I created one instance with a single V100 (16 GB) and another instance with two V100s (16 GB each). On both, I ran the command bash training_scripts/opt/single_gpu/run_350m.sh without any changes, but it always fails with an out-of-memory error.

Are there any modifications I could make to run this demo code on two V100s (16 GB each)? Or is there a recommended GPU type for running this demo code successfully?

Appreciate the help!
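
For context on the out-of-memory question, here is a rough back-of-envelope sketch of how much GPU memory the model states alone could need. It assumes mixed-precision Adam at about 16 bytes per parameter (fp16 weights and gradients plus fp32 master weights, momentum, and variance) and the per-stage partitioning described for ZeRO. The function name and the nominal 350M parameter count are illustrative assumptions, not values taken from the demo script. Activations, temporary buffers, and the CUDA context are excluded, and they depend on batch size and sequence length, so actual usage will be higher than these figures.

```python
# Rough estimate of per-GPU memory for model states only (weights, gradients,
# optimizer states) under mixed-precision Adam. Activations are NOT included.
# Assumption: 2 bytes fp16 weights + 2 bytes fp16 grads + 12 bytes fp32
# optimizer state (master weights, momentum, variance) per parameter.

GIB = 2**30


def per_gpu_model_state_gib(num_params, num_gpus=1, zero_stage=0):
    """Hypothetical helper: estimate model-state memory per GPU in GiB."""
    weights = 2 * num_params      # fp16 parameters
    grads = 2 * num_params        # fp16 gradients
    optimizer = 12 * num_params   # fp32 master weights + Adam moments

    if zero_stage >= 1:           # stage 1 shards optimizer states
        optimizer /= num_gpus
    if zero_stage >= 2:           # stage 2 also shards gradients
        grads /= num_gpus
    if zero_stage >= 3:           # stage 3 also shards parameters
        weights /= num_gpus

    return (weights + grads + optimizer) / GIB


if __name__ == "__main__":
    nominal_params = 350e6  # assumed nominal size for a "350m" model
    for gpus, stage in [(1, 0), (2, 1), (2, 2), (2, 3)]:
        gib = per_gpu_model_state_gib(nominal_params, gpus, stage)
        print(f"{gpus} GPU(s), ZeRO stage {stage}: ~{gib:.2f} GiB of model states per GPU")
```

Under these assumptions the model states fit within 16 GB even on a single GPU, which suggests that activation memory (driven by batch size and sequence length) may be the larger factor in the out-of-memory error. This is only an estimate and would need to be checked against real memory measurements.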