FreedomIntelligence / OVM

Compatibility with 13B model #4

Closed hbin0701 closed 10 months ago

hbin0701 commented 10 months ago

Dear authors, have you tried running the scripts with a 13B model? I'm using 4x 80GB A100 GPUs, and train_generator.sh runs out of memory (OOM). Reducing the batch size didn't help 😢

If you have succeeded, could you kindly share the code for 13B?

OakYU commented 10 months ago

You can try zero2_offload as the config.
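
For anyone landing here later, a rough sketch of why this suggestion can help where a smaller batch size does not. It assumes mixed-precision Adam with the usual 16 bytes per parameter (2 for half-precision weights, 2 for gradients, 12 for fp32 optimizer state) and ignores activations entirely. The function name `per_gpu_model_state_gb` and the config values below are illustrative assumptions, not the contents of the repo's actual zero2_offload file; the keys are standard DeepSpeed config keys.

```python
import json

GB = 1e9  # decimal gigabytes, to keep the arithmetic easy to follow


def per_gpu_model_state_gb(n_params, n_gpus, zero_stage=0, offload_optimizer=False):
    """Rough per-GPU memory for model states only (hypothetical helper).

    Counts 2 bytes/param for half-precision weights, 2 for gradients and
    12 for fp32 optimizer state (master weights, momentum, variance).
    Activations, buffers and fragmentation are not counted.
    """
    weights = 2 * n_params
    grads = 2 * n_params
    optim = 12 * n_params
    if zero_stage >= 1:
        optim /= n_gpus    # ZeRO-1 shards optimizer state
    if zero_stage >= 2:
        grads /= n_gpus    # ZeRO-2 also shards gradients
    if zero_stage >= 3:
        weights /= n_gpus  # ZeRO-3 also shards weights
    if offload_optimizer:
        optim = 0          # optimizer state is kept in CPU RAM instead
    return (weights + grads + optim) / GB


n_params, n_gpus = 13_000_000_000, 4
print("ZeRO-2:", per_gpu_model_state_gb(n_params, n_gpus, zero_stage=2), "GB")
print("ZeRO-2 + offload:",
      per_gpu_model_state_gb(n_params, n_gpus, zero_stage=2, offload_optimizer=True), "GB")

# Illustrative DeepSpeed config for ZeRO stage 2 with optimizer offload.
# The values are assumptions; tune them for your own setup.
zero2_offload = {
    "bf16": {"enabled": True},
    "zero_optimization": {
        "stage": 2,
        "offload_optimizer": {"device": "cpu", "pin_memory": True},
        "overlap_comm": True,
        "contiguous_gradients": True,
    },
    "gradient_accumulation_steps": 1,
    "train_micro_batch_size_per_gpu": 1,
}
print(json.dumps(zero2_offload, indent=2))
```

Under these assumptions, model states alone take about 71.5 GB per GPU with plain ZeRO-2 on 4 GPUs, which leaves very little of an 80GB card for activations regardless of batch size. Offloading the optimizer state brings the estimate down to about 32.5 GB, at the cost of extra CPU RAM and slower optimizer steps.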

hbin0701 commented 10 months ago

Thank you! :)