GoogleCloudPlatform / ai-infra-cluster-provisioning

Apache License 2.0

Update lit_gpt commit to PyTorch 2.2 #364

Closed: Chris113113 closed this 6 months ago

Chris113113 commented 6 months ago

This PR's primary purpose is to update the pinned lit-gpt commit to one that targets PyTorch 2.2. It also includes a few other changes:

Logs from new image: http://shortn/_klGl5LKuQm

Chris113113 commented 6 months ago

Mostly LGTM. In addition to my comments, two things:

  1. From this PR, it looks like we are changing this example to run Llama-2-13B instead of Llama-2-70B. I just want to double-check that this is intentional.
  2. Could you attach a link (using short-gen) showing a run of this workload?
  1. Good catch on 13B; I was using it to experiment.
  2. http://shortn/_klGl5LKuQm (added to the description)