Closed marrobi closed 5 years ago
@lawrencegripper suggests wrapping in python using functions similar to mount.py to retry apt installs. This will be an issue with all VMs requiring apt commands in cloud-init.
Added - "- while ( fuser /var/lib/dpkg/lock >/dev/null 2>&1 ); do sleep 5; done;", to cloud-init
This has enabled deployment, but maybe isn't a great long term solution.
Docker must be running before you install the Log Analytics agent for Linux on your container hosts. If you've already installed the agent before installing Docker, you need to reinstall the Log Analytics agent for Linux. For more information about Docker, see the Docker website.
So are we thinking about adding the extension as a final step of the deployment via an ACI container?
I guess. Set all monitoring up later, but sort of defeats point, you want the logs when cloud-init commands run! Even with custom script extensions can't do dependencies within a VMSS... Messy. Cough... Kubernetes... Cough...
That would still leave Solr, Zookeeper, Daemon VM and Mongo to monitor though. As solr runs in docker wouldn't it have the same problem?
Daemon would be easy to put on a cluster, Mongo - Atlas, or https://github.com/helm/charts/tree/master/stable/mongodb-replicaset, Zookeeper - https://github.com/helm/charts/tree/master/incubator/zookeeper, and can see people have done Solr on K8s. This just feels very painful and lots of custom scripts and hacks, that is going to be hard to maintain/understand for anyone who hasn't done the work with us. I didn't think (and don't really feel) Kubernetes is right, but it just solves many of the problems - I'm sure it would have its own - also presents potential marketplace challenges.
Feels dirty, but to get moving: wget https://raw.githubusercontent.com/Microsoft/OMS-Agent-for-Linux/master/installer/scripts/onboard_agent.sh && sh onboard_agent.sh -w <YOUR OMS WORKSPACE ID> -s <YOUR OMS WORKSPACE PRIMARY KEY>
I like it, we then have deterministic approach with cloud init, in terms of timing. We have a lot of other stuff currently pulling down and executing script.
One change would be to pick a commitSHA or tag for the URL to isolate against changes on the master branch
closing this, as will work round to not use vm extension
Cloud-init script fails due to:
This is as the Linux diagnostics extension has a lock.
Full logs: