beam-cloud / beta9

Run serverless GPU workloads with fast cold starts on bare-metal servers, anywhere in the world
https://docs.beam.cloud
GNU Affero General Public License v3.0
488 stars 27 forks source link

Cedana: Working GPU checkpoint #527

Closed yashanand1910 closed 2 months ago

yashanand1910 commented 2 months ago

GPU checkpoint should work now. I have added an example workload sdk/examples/demo/cr.py, which loads a GPT2 model using on_start.

As per the previous PR (#509), you can try either serve or deploy.

Changes made to make this possible, for now:

Existing bugs:

Future improvements: