pytorch / executorch

On-device AI across mobile, embedded and edge for PyTorch
https://pytorch.org/executorch/
Other
2.21k stars 368 forks source link

Fix Cuda out of memory issue for eager runner #6866

Closed helunwencser closed 4 days ago

helunwencser commented 1 week ago

Stack from ghstack (oldest at bottom):

This PR updates the eager runner to disable grad and save memory usage.

It also update the prompt format to not include bos.

Differential Revision: D65962743

pytorch-bot[bot] commented 1 week ago

:link: Helpful Links

:test_tube: See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/6866

Note: Links to docs will display an error until the docs builds have been completed.

:heavy_exclamation_mark: 2 Active SEVs

There are 2 currently active SEVs. If your PR is affected, please view them below:

:white_check_mark: No Failures

As of commit 5c9dbfb6f4365c49f35b80eaec70bbb116c95f76 with merge base e95f171316421ccca5583ce0e4a31743dc9a58c1 (image): :green_heart: Looks good so far! There are no failures yet. :green_heart:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

facebook-github-bot commented 1 week ago

This pull request was exported from Phabricator. Differential Revision: D65962743

facebook-github-bot commented 1 week ago

This pull request was exported from Phabricator. Differential Revision: D65962743

facebook-github-bot commented 6 days ago

This pull request was exported from Phabricator. Differential Revision: D65962743

facebook-github-bot commented 6 days ago

This pull request was exported from Phabricator. Differential Revision: D65962743

facebook-github-bot commented 4 days ago

This pull request was exported from Phabricator. Differential Revision: D65962743