nuvla / job-engine

Job engine for handling asynchronous tasks on the Nuvla platform
https://sixsq.com
Apache License 2.0
5 stars 0 forks source link

kubeadm: command not found - when deploying k8s cluster #179

Open konstan opened 3 years ago

konstan commented 3 years ago

On Exoscale with Large VMs and on Ubuntu 18.04

RuntimeError-Traceback (most recent call last):
  File "/usr/local/lib/python3.9/site-packages/nuvla/job/executor.py", line 71, in _process_jobs
    return_code = action_instance.do_work()
  File "/usr/local/lib/python3.9/site-packages/nuvla/job/actions/coe_provision.py", line 114, in do_work
    return self.provision_coe()
  File "/usr/local/lib/python3.9/site-packages/nuvla/job/actions/coe_provision.py", line 105, in provision_coe
    self._provision_coe(infra_service)
  File "/usr/local/lib/python3.9/site-packages/nuvla/job/actions/coe_provision.py", line 45, in _provision_coe
    result = coe.provision(coe_custer_params)
  File "/usr/local/lib/python3.9/site-packages/nuvla/connector/connector.py", line 15, in wrapper
    raise e
  File "/usr/local/lib/python3.9/site-packages/nuvla/connector/connector.py", line 13, in wrapper
    result = f(self, *f_args, **f_kwargs)
  File "/usr/local/lib/python3.9/site-packages/nuvla/connector/docker_machine_connector.py", line 638, in provision
    raise ex
  File "/usr/local/lib/python3.9/site-packages/nuvla/connector/docker_machine_connector.py", line 621, in provision
    self._create_coe(inventory, nodes)
  File "/usr/local/lib/python3.9/site-packages/nuvla/connector/docker_machine_connector.py", line 645, in _create_coe
    join_tokens = self._deploy_k8s(inventory)
  File "/usr/local/lib/python3.9/site-packages/nuvla/connector/docker_machine_connector.py", line 397, in _deploy_k8s
    join_token_worker = self._k8s_join_token(inventory.manager, self.cmd_env)
  File "/usr/local/lib/python3.9/site-packages/nuvla/connector/docker_machine_connector.py", line 357, in _k8s_join_token
    res = machine.ssh(manager_name, command, env=env)
  File "/usr/local/lib/python3.9/site-packages/nuvla/connector/machine/machine.py", line 399, in ssh
    stdout, _, _ = self._run(ssh_cmd, env_extra=env)
  File "/usr/local/lib/python3.9/site-packages/nuvla/connector/machine/machine.py", line 51, in _run
    raise RuntimeError("cmd returned error %s: %s" % (error_code, stderr.decode('utf-8').strip()))
RuntimeError: cmd returned error 1: sudo: kubeadm: command not found
exit status 1
0xbase12 commented 3 years ago

Konstan: I started using your Terraform scripts to provision the cluster. We need to substitute docker-machine by Terraform

konstan commented 3 years ago

Created #187