Closed paravatha closed 4 years ago
It looks like you're deploying to the wrong cloud. aws
represents AWS itself, there will be another cloud probably called ckkf
that you'll need to deploy Kubeflow to. In other words, you'll need to fix the add-model command to this:
juju add-model kubeflow ckkf --config update-status-hook-interval=30s
@knkski , I commented out this line
juju('add-model', model, cloud, '--config', 'update-status-hook-interval=30s')
tried this
sudo python3 scripts/cli.py deploy-to ckkf --no-build --public-address {pub-ip}
no luck, so tried this way
sudo python3 scripts/cli.py deploy-to ckkf --no-build --public-address {pub-ip}
now, its complaining about tmp folders
> Building kubeflow-dashboard
> Error: SubcommandError("charm", "No such file or directory (os error 2)")
> Command '('juju', 'bundle', 'deploy', '--build', '--', '-m', 'kubeflow', '--overlay=/tmp', '--overlay=/tmp/tmp4sn_6kij')' returned non-zero exit status 1.
It seems like the loop is not able to create folders under /tmp
I do following commands. but even waiting 20 minutes. installation still keeps. What should I do? I shoud update ck.yaml config (edit memory and cpu)?
sudo python3 scripts/cli.py ck setup --controller ckkf
repeating message is follows.
DEBUG:root:aws-integrator/0 workload status is blocked since 2020-05-07 13:43:43+00:00
DEBUG:root:kubernetes-master/0 workload status is waiting since 2020-05-07 13:26:42+00:00
DEBUG:root:kubernetes-master/1 workload status is waiting since 2020-05-07 13:26:50+00:00
DEBUG:root:kubernetes-worker/0 workload status is waiting since 2020-05-07 13:18:30+00:00
DEBUG:root:kubernetes-worker/1 workload status is waiting since 2020-05-07 13:18:07+00:00
DEBUG:root:kubernetes-worker/2 workload status is waiting since 2020-05-07 13:16:55+00:00
By the way, I now understands controller uk8s (on localmachine) and ckkf (on aws).
@paravatha: You'll need to run sudo snap install charm --classic
to get the charm command. You'll also need that add-model line in there, as that creates a Kubeflow-specific model. You won't be able to deploy Kubeflow to the default AWS model.
@sakaia: Can you post the output from juju debug-log --replay --no-tail --include=kubernetes-master
, as well as the output from juju status --relations
?
@knkski When we run this command, how many ec2 instances it tries to provision? also, what kind of authorization the IAM user needs?
sudo python3 scripts/cli.py ck setup --controller ckkf
@paravatha: That by default will create a Kubernetes deployment with 2 masters and 3 workers, 3 etcd machines, and a handful of other machines such as a load balancer, for a total of 12 machines.
I don't have an exact list of IAM permissions needed, but the IAM role that I'm using has full access to EC2 and read access to IAM and STS.
I am trying to setup ckkf as per https://github.com/juju-solutions/bundle-kubeflow#setup-charmed-kubernetes
Error