clusterinthecloud / ansible

Ansible config for Cluster in the Cloud
https://cluster-in-the-cloud.readthedocs.io
MIT License
10 stars 27 forks source link

AWS Installation problems #143

Open javlonsodikov opened 1 year ago

javlonsodikov commented 1 year ago

Ubuntu 20.04 running on AWS EC2 instance

./install-citc.sh aws

TASK [ldap : Ensure people OU exists] ******************************************
Thursday 13 July 2023  00:54:38 +0000 (0:00:02.267)       0:01:03.355 ********* 
An exception occurred during task execution. To see the full traceback, use -vvv. The error was: ModuleNotFoundError: No module named 'ldap'
fatal: [mgmt.pleasant-halibut.citc.local]: FAILED! => changed=false 
  msg: Failed to import the required Python library (python-ldap) on mgmt.pleasant-halibut.citc.local's Python /usr/bin/python3. Please read the module documentation and install it in the appropriate location. If the required library is installed, but Ansible is using the wrong Python interpreter, please consult the documentation on ansible_python_interpreter
javlonsodikov commented 1 year ago

Update: After installing python3-devel started to work Why was initially python3-devel not automatically installed? https://github.com/clusterinthecloud/ansible/blob/6/group_vars/management.yml#L19

the only missing package was:


yum install python3-devel
Last metadata expiration check: 0:04:04 ago on Thu 13 Jul 2023 04:38:26 AM UTC.
Dependencies resolved.
==============================================================================================================================================================
 Package                                  Architecture             Version                                                  Repository                   Size
==============================================================================================================================================================
Installing:
 python36-devel                           x86_64                   3.6.8-38.module+el8.5.0+671+195e4563                     appstream                    15 k
Installing dependencies:
 platform-python-devel                    x86_64                   3.6.8-51.el8.rocky.0                                     appstream                   240 k
 python-rpm-macros                        noarch                   3-45.el8                                                 appstream                    15 k
 python-srpm-macros                       noarch                   3-45.el8                                                 appstream                    15 k
 python3-pip                              noarch                   9.0.3-22.el8.rocky.0                                     appstream                    19 k
 python3-rpm-generators                   noarch                   5-8.el8                                                  appstream                    24 k
 python3-rpm-macros                       noarch                   3-45.el8                                                 appstream                    14 k
 python36                                 x86_64                   3.6.8-38.module+el8.5.0+671+195e4563                     appstream                    18 k

Transaction Summary
==============================================================================================================================================================
Install  8 Packages

Total download size: 361 k
Installed size: 756 k
Is this ok [y/N]: y
Downloading Packages:
(1/8): python-rpm-macros-3-45.el8.noarch.rpm                                                                                   50 kB/s |  15 kB     00:00    
(2/8): python-srpm-macros-3-45.el8.noarch.rpm                                                                                  49 kB/s |  15 kB     00:00    
(3/8): python3-pip-9.0.3-22.el8.rocky.0.noarch.rpm                                                                            370 kB/s |  19 kB     00:00    
(4/8): python3-rpm-generators-5-8.el8.noarch.rpm                                                                              309 kB/s |  24 kB     00:00    
(5/8): python3-rpm-macros-3-45.el8.noarch.rpm                                                                                 275 kB/s |  14 kB     00:00    
(6/8): python36-3.6.8-38.module+el8.5.0+671+195e4563.x86_64.rpm                                                               342 kB/s |  18 kB     00:00    
(7/8): platform-python-devel-3.6.8-51.el8.rocky.0.x86_64.rpm                                                                  519 kB/s | 240 kB     00:00    
(8/8): python36-devel-3.6.8-38.module+el8.5.0+671+195e4563.x86_64.rpm                                                         222 kB/s |  15 kB     00:00    
--------------------------------------------------------------------------------------------------------------------------------------------------------------
Total                                                                                                                         537 kB/s | 361 kB     00:00     
Running transaction check
Transaction check succeeded.
Running transaction test
Transaction test succeeded.
Running transaction
  Preparing        :                                                                                                                                      1/1 
  Installing       : python-srpm-macros-3-45.el8.noarch                                                                                                   1/8 
  Installing       : python-rpm-macros-3-45.el8.noarch                                                                                                    2/8 
  Installing       : python3-rpm-macros-3-45.el8.noarch                                                                                                   3/8 
  Installing       : python36-3.6.8-38.module+el8.5.0+671+195e4563.x86_64                                                                                 4/8 
  Running scriptlet: python36-3.6.8-38.module+el8.5.0+671+195e4563.x86_64                                                                                 4/8 
  Installing       : python3-pip-9.0.3-22.el8.rocky.0.noarch                                                                                              5/8 
  Installing       : python3-rpm-generators-5-8.el8.noarch                                                                                                6/8 
  Installing       : platform-python-devel-3.6.8-51.el8.rocky.0.x86_64                                                                                    7/8 
  Installing       : python36-devel-3.6.8-38.module+el8.5.0+671+195e4563.x86_64                                                                           8/8 
  Running scriptlet: python36-devel-3.6.8-38.module+el8.5.0+671+195e4563.x86_64                                                                           8/8 
  Verifying        : platform-python-devel-3.6.8-51.el8.rocky.0.x86_64                                                                                    1/8 
  Verifying        : python-rpm-macros-3-45.el8.noarch                                                                                                    2/8 
  Verifying        : python-srpm-macros-3-45.el8.noarch                                                                                                   3/8 
  Verifying        : python3-pip-9.0.3-22.el8.rocky.0.noarch                                                                                              4/8 
  Verifying        : python3-rpm-generators-5-8.el8.noarch                                                                                                5/8 
  Verifying        : python3-rpm-macros-3-45.el8.noarch                                                                                                   6/8 
  Verifying        : python36-3.6.8-38.module+el8.5.0+671+195e4563.x86_64                                                                                 7/8 
  Verifying        : python36-devel-3.6.8-38.module+el8.5.0+671+195e4563.x86_64                                                                           8/8 

Installed:
  platform-python-devel-3.6.8-51.el8.rocky.0.x86_64       python-rpm-macros-3-45.el8.noarch                             python-srpm-macros-3-45.el8.noarch   
  python3-pip-9.0.3-22.el8.rocky.0.noarch                 python3-rpm-generators-5-8.el8.noarch                         python3-rpm-macros-3-45.el8.noarch   
  python36-3.6.8-38.module+el8.5.0+671+195e4563.x86_64    python36-devel-3.6.8-38.module+el8.5.0+671+195e4563.x86_64  
javlonsodikov commented 1 year ago

One more error appeared

TASK [monitoring : Check admin password] ***************************************
Thursday 13 July 2023  04:57:27 +0000 (0:00:00.602)       0:14:28.930 ********* 
fatal: [mgmt.alert-mammoth.citc.local]: FAILED! => changed=false 
  cache_control: no-store
  connection: close
  content_length: '56'
  content_type: application/json; charset=UTF-8
  date: Thu, 13 Jul 2023 04:57:28 GMT
  elapsed: 0
  json:
    message: invalid username or password
    traceID: ''
  msg: 'Status code was 401 and not [200]: HTTP Error 401: Unauthorized'
  redirected: false
  status: 401
  url: http://localhost:3000/api/admin/settings
  x_content_type_options: nosniff
  x_frame_options: deny
  x_xss_protection: 1; mode=block
...ignoring
javlonsodikov commented 1 year ago

attaching full log fulllog.txt

javlonsodikov commented 1 year ago

Next issue

[root@mgmt citc]# sinfo
slurm_load_partitions: Unable to contact slurm controller (connect failure)
$ finish

Error: Could not find limits.yaml in this directory
Please create the file and rerun this script.

See https://cluster-in-the-cloud.readthedocs.io/en/latest/finalise.html#setting-service-limits for details.

created limits.yaml manually

t3a.small: 3
t3a.medium: 3

$ finish no errors reported

$ sinfo
PARTITION AVAIL  TIMELIMIT  NODES  STATE NODELIST 
compute*     up   infinite      0    n/a  

running sbatch from manual https://cluster-in-the-cloud.readthedocs.io/en/latest/running.html $ sbatch test.slm

$ sinfo

sinfo
PARTITION AVAIL  TIMELIMIT  NODES  STATE NODELIST 
compute*     up   infinite      1   idle alert-mammoth-t3a-medium-0001 
javlonsodikov commented 1 year ago

next step what is the webUI login and password? figured out

/usr/sbin/get_secrets

webui xxxxxx
grafana yyyy

after logging into the WebUI

Server Error (500)