Open ghost opened 2 years ago
Let's first confirm if this is even still an issue, as this was opened in 2021. Make sure they are using the latest version of ae5-tools, and that everything is configured correctly.
I tested AE 5.6.1-8
and ae5-tools:0.6.0
and did not run into any issues
(base) [ec2-user@ip-172-31-57-113 ~]$ ae5 deployment list --k8s
Connecting to user account anaconda-enterprise@aaron-561-rhel87.svc.anaconda.com.
Password for anaconda-enterprise@aaron-561-rhel87.svc.anaconda.com:
--- usage --- resource
endpoint name owner mem cpu gpu node rst public command revision _profile id created updated state phase since rst project...
---------- ---------- ------------------- ------- --- --- ------------- --- ------ --------- -------- -------- ----------------------------------- ------------------- ------------------- ------- ------- ------------------- --- -----------
attractors Attractors anaconda-enterprise 297.8Mi 0 0 172.31.57.113 0 False dashboard 0.0.1 default a2-676b4155cf644124b02161aebd0b8563 04-03-2023 20:10:09 04-03-2023 20:10:26 started Running 04-03-2023 20:10:25 0 a0-9bede...
k8s k8s anaconda-enterprise 123.9Mi 0 0 172.31.57.113 0 False default latest default a2-4ae00b6d9fd9417796c880d830438789 04-12-2023 13:43:41 04-12-2023 13:44:30 started Running 04-12-2023 13:44:26 0 a0-2149f...
(base) [ec2-user@ip-172-31-57-113 ~]$ ae5 pod list
Connecting to user account anaconda-enterprise@aaron-561-rhel87.svc.anaconda.com.
--- usage --- resource
name owner mem cpu gpu node rst modified phase since _profile id project_id url
---------- ------------------- ------- --- --- ------------- --- -------- ------- ------------------- -------- ----------------------------------- ----------------------------------- -------------------------------------------------------------
Attractors anaconda-enterprise 329.3Mi 0 0 172.31.57.113 0 False Running 04-03-2023 20:10:11 default a1-b4cfe2fbc6cb42c987154a621047fd19 a0-9bede0e4a7184caea913b215a2aeeac9 http://anaconda-enterprise-ap-workspace/sessions/b4cfe2fbc...
Attractors anaconda-enterprise 297.8Mi 0 0 172.31.57.113 0 n/a Running 04-03-2023 20:10:25 default a2-676b4155cf644124b02161aebd0b8563 a0-9bede0e4a7184caea913b215a2aeeac9 https://attractors.aaron-561-rhel87.svc.anaconda.com/
k8s anaconda-enterprise 123.9Mi 0 0 172.31.57.113 0 n/a Running 04-12-2023 13:44:26 default a2-4ae00b6d9fd9417796c880d830438789 a0-2149f24a22fd4008b8dad3e157d0d0c7 https://k8s.aaron-561-rhel87.svc.anaconda.com/
(base) [ec2-user@ip-172-31-57-113 ~]$ ae5 node list
Connecting to user account anaconda-enterprise@aaron-561-rhel87.svc.anaconda.com.
--- capacity --- --- usage --- --- sessions --- --- deployments --- -- middleware -- --- system ---
name role ready pod mem cpu gpu pod mem cpu gpu pod mem cpu gpu pod mem cpu gpu pod mem cpu pod mem cpu
------------- --------- ----- --- ------- ----- --- --- ------- ---- --- --- ------- --- --- --- ------- --- --- --- ------- --- --- ------- ----
172.31.57.113 ae-master True 110 32.72Gi 5200m 0 38 4.265Gi 231m 0 1 330.0Mi 0 0 2 422.7Mi 0 0 17 2.484Gi 59m 18 1.028Gi 172m
Team,
I wanted to report a new issue we have found when using the api to list the pods or nodes using ae5-tools. I have attached the error message and the full deployment log that indicates a runtime timeout.
Best Regards, Moises Mejia
k8s.log
headers: {'Server': 'nginx', 'Date': 'Thu, 16 Dec 2021 15:23:12 GMT', 'Content-Type': 'text/plain; charset=utf-8', 'Content-Length': '55', 'Connection': 'keep-alive', 'Strict-Transport-Security': 'max-age=2592000; preload'} text: 500 Internal Server Error