pytorch / torchchat

Run PyTorch LLMs locally on servers, desktop and mobile
BSD 3-Clause "New" or "Revised" License
3.4k stars 224 forks source link

[WIP] Initial PR for generating and loading state dict #1329

Open Jack-Khuu opened 1 month ago

pytorch-bot[bot] commented 1 month ago

:link: Helpful Links

:test_tube: See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/torchchat/1329

Note: Links to docs will display an error until the docs builds have been completed.

:x: 9 New Failures, 3 Cancelled Jobs

As of commit 8ade3c43a9b6d745360702bb085781056aacbd08 with merge base 70260eb4963b332ca6970ccc7cf74f3ec888efc1 (image):

NEW FAILURES - The following jobs have failed:

* [pull / runner-aoti (macos-14-xlarge)](https://hud.pytorch.org/pr/pytorch/torchchat/1329#32083659355) ([gh](https://github.com/pytorch/torchchat/actions/runs/11524143385/job/32083659355)) `torch._dynamo.exc.InternalTorchDynamoError: AttributeError: freqs_cis` * [pull / runner-et (16-core-ubuntu)](https://hud.pytorch.org/pr/pytorch/torchchat/1329#32083660083) ([gh](https://github.com/pytorch/torchchat/actions/runs/11524143385/job/32083660083)) `TypeError: 'NoneType' object is not subscriptable` * [pull / test-cpu-aoti (aarch64, stories15M)](https://hud.pytorch.org/pr/pytorch/torchchat/1329#32083670177) ([gh](https://github.com/pytorch/torchchat/actions/runs/11524143385/job/32083670177)) `torch._dynamo.exc.InternalTorchDynamoError: AttributeError: freqs_cis` * [pull / test-cpu-aoti (x86_64, stories15M)](https://hud.pytorch.org/pr/pytorch/torchchat/1329#32083669728) ([gh](https://github.com/pytorch/torchchat/actions/runs/11524143385/job/32083669728)) `torch._dynamo.exc.InternalTorchDynamoError: AttributeError: freqs_cis` * [pull / test-gpu-aoti-bfloat16 (cuda, stories15M) / linux-job](https://hud.pytorch.org/pr/pytorch/torchchat/1329#32083672096) ([gh](https://github.com/pytorch/torchchat/actions/runs/11524143385/job/32083672096)) `RuntimeError: Command docker exec -t ec5173b29d322828c0094f18add992729dbdf8d7c20c00ee0f9f495b56a207fa /exec failed with exit code 1` * [pull / test-gpu-aoti-float16 (cuda, stories15M) / linux-job](https://hud.pytorch.org/pr/pytorch/torchchat/1329#32083672675) ([gh](https://github.com/pytorch/torchchat/actions/runs/11524143385/job/32083672675)) `RuntimeError: Command docker exec -t 9d8fd50dab38b0e48c72d36fe17351c34000ac010b0b351407b1e8f23a618d49 /exec failed with exit code 1` * [pull / test-gpu-aoti-float32 (cuda, stories15M) / linux-job](https://hud.pytorch.org/pr/pytorch/torchchat/1329#32083672961) ([gh](https://github.com/pytorch/torchchat/actions/runs/11524143385/job/32083672961)) `RuntimeError: Command docker exec -t e8bd9ac3615250f3f60d724f6c53d4233f2057d2d08bb2ca1babd093938cae51 /exec failed with exit code 1` * [pull / test-tinystories-executorch (macos-14-xlarge)](https://hud.pytorch.org/pr/pytorch/torchchat/1329#32083661944) ([gh](https://github.com/pytorch/torchchat/actions/runs/11524143385/job/32083661944)) `TypeError: 'NoneType' object is not subscriptable` * [Run the aoti runner with CUDA using stories / test-runner-aot-cuda / linux-job](https://hud.pytorch.org/pr/pytorch/torchchat/1329#32083654924) ([gh](https://github.com/pytorch/torchchat/actions/runs/11524143362/job/32083654924)) `RuntimeError: Command docker exec -t 9027be63efb5d9ec980fb7d08d1c765efb245165e12f49c01703837473ffcf5f /exec failed with exit code 1`

CANCELLED JOBS - The following jobs were cancelled. Please retry:

* [pull / runner-aoti (16-core-ubuntu)](https://hud.pytorch.org/pr/pytorch/torchchat/1329#32083658850) ([gh](https://github.com/pytorch/torchchat/actions/runs/11524143385/job/32083658850)) `##[error]The operation was canceled.` * [pull / runner-et (macos-14-xlarge)](https://hud.pytorch.org/pr/pytorch/torchchat/1329#32083660536) ([gh](https://github.com/pytorch/torchchat/actions/runs/11524143385/job/32083660536)) `##[error]The operation was canceled.` * [pull / test-tinystories-executorch (16-core-ubuntu)](https://hud.pytorch.org/pr/pytorch/torchchat/1329#32083661727) ([gh](https://github.com/pytorch/torchchat/actions/runs/11524143385/job/32083661727)) `##[error]The operation was canceled.`

This comment was automatically generated by Dr. CI and updates every 15 minutes.