pytorch / executorch

On-device AI across mobile, embedded and edge for PyTorch
https://pytorch.org/executorch/

Bump runner memory for llama3_2 torchtune test_model #6890

Open dvorjackz opened 1 week ago

dvorjackz commented 1 week ago

Summary

Bump memory and disk space for the TorchTune llama3_2 text decoder test runner, since the test may previously have been running out of memory (OOM).

Test plan

Ran the test locally with:

rm -rf cmake-out
cmake -DCMAKE_BUILD_TYPE=Debug -DEXECUTORCH_BUILD_KERNELS_OPTIMIZED=ON -DPYTHON_EXECUTABLE=python -Bcmake-out .
cmake --build cmake-out -j4 --config Debug
./cmake-out/executor_runner --model_path ./llama3_2_text_decoder.pte

This passes locally, but my machine is very powerful, so I am temporarily enabling this test on pull to see whether it also passes on CI.
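Because a local pass on a large machine says little about whether a smaller CI runner will run out of memory, it can help to record the runner's peak memory locally and compare it with the CI runner's limit. The following is a minimal sketch, not part of this PR: the helper name `peak_child_rss_mib` is hypothetical, and it relies only on Python's standard `resource` and `subprocess` modules (Unix only).

```python
import resource
import subprocess
import sys


def peak_child_rss_mib(cmd):
    """Run cmd to completion; return (exit_code, peak_rss_mib).

    peak_rss_mib is the largest resident set size among the child
    processes this Python process has waited for so far.
    """
    result = subprocess.run(cmd, check=False)
    usage = resource.getrusage(resource.RUSAGE_CHILDREN)
    # ru_maxrss is reported in kilobytes on Linux and in bytes on macOS.
    divisor = 1024 * 1024 if sys.platform == "darwin" else 1024
    return result.returncode, usage.ru_maxrss / divisor
```

For example, calling `peak_child_rss_mib(["./cmake-out/executor_runner", "--model_path", "./llama3_2_text_decoder.pte"])` after the build steps above would return the runner's exit code and its approximate peak resident memory in MiB. Note that the figure is a high-water mark across all children waited for so far, so measure the runner in a fresh Python process.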

pytorch-bot[bot] commented 1 week ago

:link: Helpful Links

:test_tube: See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/6890

Note: Links to docs will display an error until the docs builds have been completed.

:heavy_exclamation_mark: 1 Active SEVs

There is 1 currently active SEV. If your PR is affected, please view it below:

:white_check_mark: You can merge normally! (1 Unrelated Failure)

As of commit 6f75c3ea3c5d1e28702f23402f2ed61c40f4b4d0 with merge base 54899feb10667d30a667c672c7d8c4faf546c55e:

BROKEN TRUNK - The following job failed but was also failing on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

* [trunk / test-models-macos (cmake, llama3_2_text_decoder, portable, macos-m1-stable, 90) / macos-job](https://hud.pytorch.org/pr/pytorch/executorch/6890#33055807314) ([gh](https://github.com/pytorch/executorch/actions/runs/11860465568/job/33055807314)) ([trunk failure](https://hud.pytorch.org/pytorch/executorch/commit/54899feb10667d30a667c672c7d8c4faf546c55e#33037670799)) `##[error]The operation was canceled.`

This comment was automatically generated by Dr. CI and updates every 15 minutes.