Closed by zJuuu 12 months ago
@zJuuu what provider version are you running?
@troian v0.3.1-rc0
Updated to v0.3.1-rc1, but the provider restarted again after deploying.
Uploaded logs taken after the update to v0.3.1-rc1.
@zJuuu would you mind dumping the manifest object for this lease, please?
Confirmed that this issue only occurs when using Cloudmos to create deployments. Akash CLI and Console deployments do not encounter the issue.
Example problem deployment via Cloudmos:
DSEQ - 278612
Provider - akash12ayncdl3lmln06rcs6falgkh2y4c5l7lp5eftn
Deployer account - akash1f53fp8kk470f7k26yr5gztd9npzpczqv4ufud7
Provider version - 0.3.1-rc1
Deployed via Cloudmos
Provider logs - https://transfer.sh/okq6q9eo3c/provider-logs.tar.gz
The manifest is not available on the provider because the provider pod crashes:
root@node1:~/provider# kubectl -n lease get manifests --show-labels
No resources found in lease namespace.
https://gist.github.com/chainzero/9bc3c108b02987e53330a6ff94ce9ec7
Describe the bug
An SDL with a GPU + non-GPU deployment restarts akash-provider and all deployments.
SDL:
To Reproduce
Steps to reproduce the behavior:
Expected behavior
Deployment of SDL
Additional context
Provider logs: https://transfer.sh/f7828tQGGg/provider-logs.tar.gz