Jack-Khuu opened 2 weeks ago
Note: Links to docs will display an error until the docs builds have been completed.
There is 1 currently active SEV. If your PR is affected, please view it below:
As of commit 5b91d46657368cbd12ef8604bade7b4fe7480170 with merge base b809b69e03f8f4b75a4b27b0778f0d3695ce94c2:

* [pull / compile-gguf (macos-14)](https://hud.pytorch.org/pr/pytorch/torchchat/1367#33365065718) ([gh](https://github.com/pytorch/torchchat/actions/runs/11967594108/job/33365065718)) `NotImplementedError: Could not run 'aten::_convert_weight_to_int4pack' with arguments from the 'CPU' backend. This could be because the operator doesn't exist for this backend, or was omitted during the selective/custom build process (if using custom build). If you are a Facebook employee using PyTorch on mobile, please visit https://fburl.com/ptmfixes for possible resolutions. 'aten::_convert_weight_to_int4pack' is only available for these backends: [MPS, Meta, BackendSelect, Python, FuncTorchDynamicLayerBackMode, Functionalize, Named, Conjugate, Negative, ZeroTensor, ADInplaceOrView, AutogradOther, AutogradCPU, AutogradCUDA, AutogradHIP, AutogradXLA, AutogradMPS, AutogradIPU, AutogradXPU, AutogradHPU, AutogradVE, AutogradLazy, AutogradMTIA, AutogradPrivateUse1, AutogradPrivateUse2, AutogradPrivateUse3, AutogradMeta, AutogradNestedTensor, Tracer, AutocastCPU, AutocastXPU, AutocastMPS, AutocastCUDA, FuncTorchBatched, BatchedNestedTensor, FuncTorchVmapMode, Batched, VmapMode, FuncTorchGradWrapper, PythonTLSSnapshot, FuncTorchDynamicLayerFrontMode, PreDispatch, PythonDispatcher].`
* [pull / runner-aoti (macos-14-xlarge)](https://hud.pytorch.org/pr/pytorch/torchchat/1367#33365068668) ([gh](https://github.com/pytorch/torchchat/actions/runs/11967594108/job/33365068668)) `torch._inductor.exc.CppCompileError: C++ compile error`
* [pull / test-build-runner-et-android / linux-job](https://hud.pytorch.org/pr/pytorch/torchchat/1367#33365069470) ([gh](https://github.com/pytorch/torchchat/actions/runs/11967594108/job/33365069470)) `RuntimeError: Command docker exec -t 5fe5264e2bb12c67eb6007a01a9abd59cb97c02184cdac2d71e4c468cb098000 /exec failed with exit code 1`
* [pull / test-cpu-aoti (aarch64, stories15M)](https://hud.pytorch.org/pr/pytorch/torchchat/1367#33365076405) ([gh](https://github.com/pytorch/torchchat/actions/runs/11967594108/job/33365076405)) `torch._inductor.exc.CppCompileError: C++ compile error`
* [pull / test-cpu-aoti (x86_64, stories15M)](https://hud.pytorch.org/pr/pytorch/torchchat/1367#33365075871) ([gh](https://github.com/pytorch/torchchat/actions/runs/11967594108/job/33365075871)) `NotImplementedError: Could not run 'aten::_convert_weight_to_int4pack' with arguments from the 'CPU' backend.`
* [pull / test-cpu-compile (aarch64, stories15M)](https://hud.pytorch.org/pr/pytorch/torchchat/1367#33365077618) ([gh](https://github.com/pytorch/torchchat/actions/runs/11967594108/job/33365077618)) `CppCompileError: C++ compile error`
* [pull / test-cpu-compile (x86_64, stories15M)](https://hud.pytorch.org/pr/pytorch/torchchat/1367#33365076921) ([gh](https://github.com/pytorch/torchchat/actions/runs/11967594108/job/33365076921)) `NotImplementedError: Could not run 'aten::_convert_weight_to_int4pack' with arguments from the 'CPU' backend.`
* [pull / test-cpu-eval-sanity-check (aarch64, stories15M)](https://hud.pytorch.org/pr/pytorch/torchchat/1367#33365077126) ([gh](https://github.com/pytorch/torchchat/actions/runs/11967594108/job/33365077126)) `CppCompileError: C++ compile error`
* [pull / test-cpu-eval-sanity-check (x86_64, stories15M)](https://hud.pytorch.org/pr/pytorch/torchchat/1367#33365076179) ([gh](https://github.com/pytorch/torchchat/actions/runs/11967594108/job/33365076179)) `NotImplementedError: Could not run 'aten::_convert_weight_to_int4pack' with arguments from the 'CPU' backend.`
* [pull / test-cpu-eval-sanity-check-float16 (aarch64, stories15M)](https://hud.pytorch.org/pr/pytorch/torchchat/1367#33365077373) ([gh](https://github.com/pytorch/torchchat/actions/runs/11967594108/job/33365077373)) `Process completed with exit code 1.`
* [pull / test-cpu-eval-sanity-check-float16 (x86_64, stories15M)](https://hud.pytorch.org/pr/pytorch/torchchat/1367#33365076605) ([gh](https://github.com/pytorch/torchchat/actions/runs/11967594108/job/33365076605)) `NotImplementedError: Could not run 'aten::_convert_weight_to_int4pack' with arguments from the 'CPU' backend.`
* [pull / test-cpu-eval-sanity-check-float32 (aarch64, stories15M)](https://hud.pytorch.org/pr/pytorch/torchchat/1367#33365077990) ([gh](https://github.com/pytorch/torchchat/actions/runs/11967594108/job/33365077990)) `Process completed with exit code 1.`
* [pull / test-cpu-eval-sanity-check-float32 (x86_64, stories15M)](https://hud.pytorch.org/pr/pytorch/torchchat/1367#33365077799) ([gh](https://github.com/pytorch/torchchat/actions/runs/11967594108/job/33365077799)) `NotImplementedError: Could not run 'aten::_convert_weight_to_int4pack' with arguments from the 'CPU' backend.`
* [pull / test-gpu-aoti-bfloat16 (cuda, stories15M) / linux-job](https://hud.pytorch.org/pr/pytorch/torchchat/1367#33365078648) ([gh](https://github.com/pytorch/torchchat/actions/runs/11967594108/job/33365078648)) `RuntimeError: Command docker exec -t dbd5f139e8f32cc1cda94796f44861d0d8d79a25301f51db1faecefdf770625d /exec failed with exit code 1`
* [pull / test-gpu-aoti-float16 (cuda, stories15M) / linux-job](https://hud.pytorch.org/pr/pytorch/torchchat/1367#33365078246) ([gh](https://github.com/pytorch/torchchat/actions/runs/11967594108/job/33365078246)) `RuntimeError: Command docker exec -t 3cf85dd23196fff6109be8949df7e81694400aa4908310d0f9b83bae7d89a1c0 /exec failed with exit code 1`
* [pull / test-gpu-aoti-float32 (cuda, stories15M) / linux-job](https://hud.pytorch.org/pr/pytorch/torchchat/1367#33365078451) ([gh](https://github.com/pytorch/torchchat/actions/runs/11967594108/job/33365078451)) `RuntimeError: Command docker exec -t 3c3789616283c48728d70b9bee8dd708a20fd4be6884b65bd3330041067f8f3f /exec failed with exit code 1`
* [pull / test-gpu-compile (cuda, stories15M) / linux-job](https://hud.pytorch.org/pr/pytorch/torchchat/1367#33365078825) ([gh](https://github.com/pytorch/torchchat/actions/runs/11967594108/job/33365078825)) `RuntimeError: Command docker exec -t f7ddcd2315031a765e2621a48995a301cdc8853662fe033c4f769114cda4b7d5 /exec failed with exit code 1`
* [pull / test-gpu-eval-sanity-check (cuda, stories15M) / linux-job](https://hud.pytorch.org/pr/pytorch/torchchat/1367#33365079000) ([gh](https://github.com/pytorch/torchchat/actions/runs/11967594108/job/33365079000)) `RuntimeError: Command docker exec -t d525897ce7b387275750040e0e9c21e13c0e5793bab6d6ce016bc69ea38a09bb /exec failed with exit code 1`
* [pull / test-tinystories-executorch (macos-14-xlarge)](https://hud.pytorch.org/pr/pytorch/torchchat/1367#33365069125) ([gh](https://github.com/pytorch/torchchat/actions/runs/11967594108/job/33365069125)) `fatal: unable to access 'https://review.mlplatform.org/ml/ethos-u/ethos-u-core-driver/': Failed to connect to review.mlplatform.org port 443 after 88 ms: Couldn't connect to server`
* [pull / test-torchao-experimental (macos-14-xlarge)](https://hud.pytorch.org/pr/pytorch/torchchat/1367#33365069310) ([gh](https://github.com/pytorch/torchchat/actions/runs/11967594108/job/33365069310)) `ninja: error: '/Library/Frameworks/Python.framework/Versions/3.10/lib/python3.10/site-packages/torch/lib/libomp.dylib', needed by 'libtorchao_ops_aten.dylib', missing and no known rule to make it`
* [Run parallel prefill / test-cuda / linux-job](https://hud.pytorch.org/pr/pytorch/torchchat/1367#33365065024) ([gh](https://github.com/pytorch/torchchat/actions/runs/11967594086/job/33365065024)) `RuntimeError: Command docker exec -t 9f5f891ef29a961fd1a8f5a3dd3885f09828032f925e1b6c8a47783a91d96b4b /exec failed with exit code 1`
* [Run the aoti runner with CUDA using stories / test-runner-aot-cuda / linux-job](https://hud.pytorch.org/pr/pytorch/torchchat/1367#33365065012) ([gh](https://github.com/pytorch/torchchat/actions/runs/11967594107/job/33365065012)) `RuntimeError: Command docker exec -t 93a34b4464330f1a020d28d0833d77c4407bcba4e6399c40c30e1e037661b0e3 /exec failed with exit code 1`
* [pull / runner-aoti (16-core-ubuntu)](https://hud.pytorch.org/pr/pytorch/torchchat/1367#33365068029) ([gh](https://github.com/pytorch/torchchat/actions/runs/11967594108/job/33365068029)) `##[error]The operation was canceled.`
* [pull / test-tinystories-executorch (16-core-ubuntu)](https://hud.pytorch.org/pr/pytorch/torchchat/1367#33365068813) ([gh](https://github.com/pytorch/torchchat/actions/runs/11967594108/job/33365068813))
This comment was automatically generated by Dr. CI and updates every 15 minutes.
`Could not find a version that satisfies the requirement torchvision==0.20.0.dev20241111`
This looks accurate; according to https://download.pytorch.org/whl/nightly/torchvision/ there are only Windows builds for that day. 20241112 appears to have both Linux and Windows builds.
Initial debugging shows the test-cpu-aoti segfault is within `aoti_torch_cpu_cat`, which is automatically generated by https://github.com/pytorch/pytorch/blob/7e86a7c0155295539996e0cf422883571126073e/torchgen/gen_aoti_c_shim.py. Digging up the generated source now.
Generated source looks OK. Here's what doesn't look OK in the generated inductor .cpp file:
AtenTensorHandle buf0_handle;
AOTI_TORCH_ERROR_CODE_CHECK(aoti_torch_empty_strided(2, int_array_12, int_array_13, cached_torch_dtype_uint8, cached_torch_device_type_cpu, this->device_idx_, &buf0_handle));
RAIIAtenTensorHandle buf0(buf0_handle);
AtenTensorHandle buf1_handle;
AOTI_TORCH_ERROR_CODE_CHECK(aoti_torch_empty_strided(2, int_array_12, int_array_13, cached_torch_dtype_uint8, cached_torch_device_type_cpu, this->device_idx_, &buf1_handle));
RAIIAtenTensorHandle buf1(buf1_handle);
cpp_fused_div_remainder_0((const uint8_t*)(self___model_tok_embeddings__buffers__weight.data_ptr()), (uint8_t*)(buf0.data_ptr()), (uint8_t*)(buf1.data_ptr()));
// Topologically Sorted Source Nodes: [weight_unpacked], Original ATen: [aten.stack]
static constexpr int64_t int_array_0[] = {32000LL, 144LL, 1LL};
static constexpr int64_t int_array_1[] = {144LL, 1LL, 0LL};
auto tmp_tensor_handle_0 = reinterpret_tensor_wrapper(buf0, 3, int_array_0, int_array_1, 0LL);
auto tmp_tensor_handle_1 = reinterpret_tensor_wrapper(buf1, 3, int_array_0, int_array_1, 0LL);
const AtenTensorHandle var_array_0[] = {wrap_with_raii_handle_if_needed(tmp_tensor_handle_0), wrap_with_raii_handle_if_needed(tmp_tensor_handle_1)};
AtenTensorHandle buf3_handle;
AOTI_TORCH_ERROR_CODE_CHECK(aoti_torch_cpu_cat(var_array_0, 2, -1LL, &buf3_handle));
The problem seems to be `const AtenTensorHandle var_array_0[] = {wrap_with_raii_handle_if_needed(tmp_tensor_handle_0), wrap_with_raii_handle_if_needed(tmp_tensor_handle_1)};` -- this creates temporary `RAIIAtenTensorHandle`s, whose `operator AtenTensorHandle()` is immediately called, and then the temporaries are destroyed (which decrements the refcount), so the net effect is (I think) to fill the array with dangling `AtenTensorHandle`s.
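To make the lifetime issue concrete, here is a minimal Python analogy (not the AOTI code; `RAIIHandle`, `LIVE_RESOURCES`, and `alloc` are made-up names): a temporary RAII wrapper hands out its raw handle inside an initializer expression and is then destroyed, so the collected raw handles outlive the resources they point to.

```python
# Illustrative analogy only. CPython's refcounting destroys each temporary
# wrapper as soon as the enclosing expression is done with it, mirroring the
# C++ temporaries produced by wrap_with_raii_handle_if_needed above.
LIVE_RESOURCES: set[int] = set()

def alloc(resource_id: int) -> int:
    """Pretend to allocate a resource and track that it is alive."""
    LIVE_RESOURCES.add(resource_id)
    return resource_id

class RAIIHandle:
    """Owns a resource; frees it when the wrapper is destroyed."""

    def __init__(self, resource_id: int) -> None:
        self.resource_id = alloc(resource_id)

    def raw(self) -> int:
        # Analogous to operator AtenTensorHandle(): hands out the raw handle
        # without transferring ownership.
        return self.resource_id

    def __del__(self) -> None:
        # Analogous to the refcount decrement when the RAII wrapper dies.
        LIVE_RESOURCES.discard(self.resource_id)

# Build an array of raw handles from temporaries -- the same shape as
# var_array_0 above. Each RAIIHandle temporary dies right after raw() returns.
raw_handles = [RAIIHandle(i).raw() for i in range(2)]

print(raw_handles)      # [0, 1]
print(LIVE_RESOURCES)   # set() -- resources already freed; the handles dangle
```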
@desertfire any chance the above is a quick fix for you?
actually we might just need https://github.com/pytorch/pytorch/pull/139411
no torchvision nightly again today. I'm guessing we could probably use torchvision from yesterday with torch from today?
I had issues with Vision nightlies requiring the corresponding PT nightly a few weeks back; I'll give it another go.
Update: yup, vision is strict; will need to wait again
`_convert_weight_to_int4pack` breakage appears to be from https://github.com/pytorch/pytorch/pull/139611; I guess it's now called `_convert_weight_to_int4pack_for_cpu`.
Beat me to it; luckily AO has a fix, so we'll need a bump there too: https://github.com/pytorch/ao/pull/1278
https://github.com/pytorch/pytorch/pull/139411 also got reverted on pt/pt, so that's fun.
> pytorch/pytorch#139411 also got reverted on pt/pt, so that's fun.
pytorch/pytorch#139411 has been relanded.
Need to bump everything CUDA-related: https://github.com/pytorch/pytorch/issues/140885
> Beat me to it; luckily AO has a fix, so we'll need a bump there too: pytorch/ao#1278
Also need to manually edit `torchchat/utils/gguf_loader.py`.
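For reference, a hedged sketch of the kind of change that manual edit presumably amounts to, assuming the gguf loader's int4 path calls the ATen op directly and that the renamed CPU op keeps the `(weight, inner_k_tiles)` shape; the helper name and argument names below are hypothetical, and the exact dtype/layout expectations should be checked against pytorch/pytorch#139611 and pytorch/ao#1278.

```python
import torch

def convert_weight_to_int4pack(weight: torch.Tensor, inner_k_tiles: int) -> torch.Tensor:
    """Hypothetical helper: route to whichever int4 packing op this build exposes.

    Assumption: nightlies after pytorch/pytorch#139611 expose the CPU path as
    aten._convert_weight_to_int4pack_for_cpu, while CUDA/MPS keep the original
    aten._convert_weight_to_int4pack.
    """
    if weight.device.type == "cpu" and hasattr(
        torch.ops.aten, "_convert_weight_to_int4pack_for_cpu"
    ):
        return torch.ops.aten._convert_weight_to_int4pack_for_cpu(weight, inner_k_tiles)
    return torch.ops.aten._convert_weight_to_int4pack(weight, inner_k_tiles)
```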
Looks like the gguf_loader edit and the spurious complaints about missing OMP on Mac are the two blockers left.
Accounts for:

* `weight_only` default from `False` to `True` (https://github.com/pytorch/torchchat/issues/1356)
* `export` to `export_for_training` (https://github.com/pytorch/torchchat/pull/1319); see the sketch below
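For the second bullet, a minimal sketch of the `torch.export.export_for_training` entry point that the export path reportedly moves to; `TinyModel` and the example inputs are made up for illustration.

```python
import torch

class TinyModel(torch.nn.Module):
    """Placeholder module standing in for the real torchchat model."""

    def __init__(self) -> None:
        super().__init__()
        self.linear = torch.nn.Linear(8, 8)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.linear(x)

model = TinyModel().eval()
example_inputs = (torch.randn(1, 8),)

# Previously: exported_program = torch.export.export(model, example_inputs)
# After pytorch/torchchat#1319 the training-IR entry point is used instead:
exported_program = torch.export.export_for_training(model, example_inputs)
print(exported_program)
```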