anaconda / ae5-tools

A command-line tool for scripting AE5 actions
https://www.anaconda.com/enterprise/
BSD 3-Clause "New" or "Revised" License
9 stars 8 forks source link

AE5 : ae5 not able to list pod and node #138

Open ghost opened 2 years ago

ghost commented 2 years ago

Team,

I wanted to report a new issue we have found when using the api to list the pods or nodes using ae5-tools. I have attached the error message and the full deployment log that indicates a runtime timeout.

Best Regards, Moises Mejia

k8s.log

headers: {'Server': 'nginx', 'Date': 'Thu, 16 Dec 2021 15:23:12 GMT', 'Content-Type': 'text/plain; charset=utf-8', 'Content-Length': '55', 'Connection': 'keep-alive', 'Strict-Transport-Security': 'max-age=2592000; preload'} text: 500 Internal Server Error

A-Aron-T commented 1 year ago

Let's first confirm if this is even still an issue, as this was opened in 2021. Make sure they are using the latest version of ae5-tools, and that everything is configured correctly.

I tested AE 5.6.1-8 and ae5-tools:0.6.0 and did not run into any issues

(base) [ec2-user@ip-172-31-57-113 ~]$ ae5 deployment list --k8s
Connecting to user account anaconda-enterprise@aaron-561-rhel87.svc.anaconda.com.
Password for anaconda-enterprise@aaron-561-rhel87.svc.anaconda.com: 
                                               --- usage ---                                                     resource
 endpoint      name            owner           mem    cpu  gpu      node       rst  public   command   revision  _profile                  id                         created              updated         state    phase          since         rst   project...
----------  ----------  -------------------  -------  ---  ---  -------------  ---  ------  ---------  --------  --------  -----------------------------------  -------------------  -------------------  -------  -------  -------------------  ---  -----------
attractors  Attractors  anaconda-enterprise  297.8Mi  0    0    172.31.57.113  0    False   dashboard  0.0.1     default   a2-676b4155cf644124b02161aebd0b8563  04-03-2023 20:10:09  04-03-2023 20:10:26  started  Running  04-03-2023 20:10:25  0    a0-9bede...
k8s         k8s         anaconda-enterprise  123.9Mi  0    0    172.31.57.113  0    False   default    latest    default   a2-4ae00b6d9fd9417796c880d830438789  04-12-2023 13:43:41  04-12-2023 13:44:30  started  Running  04-12-2023 13:44:26  0    a0-2149f...
(base) [ec2-user@ip-172-31-57-113 ~]$ ae5 pod list
Connecting to user account anaconda-enterprise@aaron-561-rhel87.svc.anaconda.com.
                                   --- usage ---                                                                resource
   name            owner           mem    cpu  gpu      node       rst  modified   phase          since         _profile                  id                               project_id                                             url
----------  -------------------  -------  ---  ---  -------------  ---  --------  -------  -------------------  --------  -----------------------------------  -----------------------------------  -------------------------------------------------------------
Attractors  anaconda-enterprise  329.3Mi  0    0    172.31.57.113  0    False     Running  04-03-2023 20:10:11  default   a1-b4cfe2fbc6cb42c987154a621047fd19  a0-9bede0e4a7184caea913b215a2aeeac9  http://anaconda-enterprise-ap-workspace/sessions/b4cfe2fbc...
Attractors  anaconda-enterprise  297.8Mi  0    0    172.31.57.113  0    n/a       Running  04-03-2023 20:10:25  default   a2-676b4155cf644124b02161aebd0b8563  a0-9bede0e4a7184caea913b215a2aeeac9  https://attractors.aaron-561-rhel87.svc.anaconda.com/
k8s         anaconda-enterprise  123.9Mi  0    0    172.31.57.113  0    n/a       Running  04-12-2023 13:44:26  default   a2-4ae00b6d9fd9417796c880d830438789  a0-2149f24a22fd4008b8dad3e157d0d0c7  https://k8s.aaron-561-rhel87.svc.anaconda.com/
(base) [ec2-user@ip-172-31-57-113 ~]$ ae5 node list
Connecting to user account anaconda-enterprise@aaron-561-rhel87.svc.anaconda.com.
                                     --- capacity ---           --- usage ---          --- sessions ---      --- deployments ---    -- middleware --     --- system ---
    name         role     ready  pod    mem     cpu   gpu  pod    mem    cpu   gpu  pod    mem    cpu  gpu  pod    mem    cpu  gpu  pod    mem    cpu  pod    mem    cpu
-------------  ---------  -----  ---  -------  -----  ---  ---  -------  ----  ---  ---  -------  ---  ---  ---  -------  ---  ---  ---  -------  ---  ---  -------  ----
172.31.57.113  ae-master  True   110  32.72Gi  5200m  0    38   4.265Gi  231m  0    1    330.0Mi  0    0    2    422.7Mi  0    0    17   2.484Gi  59m  18   1.028Gi  172m