pytorch / serve

Serve, optimize and scale PyTorch models in production
https://pytorch.org/serve/
Apache License 2.0
4.22k stars 863 forks source link

Clear up neuron cache #3326

Closed chen3933 closed 2 months ago

chen3933 commented 2 months ago

Description

Please read our CONTRIBUTING.md prior to creating your first pull request.

Please include a summary of the feature or issue being fixed. Please also include relevant motivation and context. List any dependencies that are required for this change.

Fixes #(issue)

Type of change

Please delete options that are not relevant.

Feature/Issue validation/testing

Please describe the Unit or Integration tests that you ran to verify your changes and relevant result summary. Provide instructions so it can be reproduced. Please also list any relevant details for your test configuration.

Preparing local execution... Terminating any existing Torchserve instance ... torchserve --stop TorchServe has stopped. Setting up model store... Directory /var/tmp/neuron-compile-cache/ exists. Clearing contents... Cache cleared: /var/tmp/neuron-compile-cache/ Starting local Torchserve instance... Running: torchserve --start --model-store /tmp/model_store --enable-model-api --disable-token-auth --workflow-store /tmp/wf_store --ts-config /tmp/benchmark/conf/config.properties > /tmp/benchmark/logs/model_metrics.log torchserve --start --model-store /tmp/model_store --enable-model-api --disable-token-auth --workflow-store /tmp/wf_store --ts-config /tmp/benchmark/conf/config.properties > /tmp/benchmark/logs/model_metrics.log Testing system health... { "status": "Healthy" }



## Checklist:

- [ ] Did you have fun?
- [ ] Have you added tests that prove your fix is effective or that this feature works?
- [ ] Has code been commented, particularly in hard-to-understand areas?
- [ ] Have you made corresponding changes to the documentation?