nchammas / flintrock

A command-line tool for launching Apache Spark clusters.
Apache License 2.0
637 stars 116 forks source link

Flintrock gets terribly slow if known_hosts is big #189

Open douglaz opened 7 years ago

douglaz commented 7 years ago

So slow that it's unable to create a cluster with more than a couple of machines.

This seems to be a paramiko issue (https://github.com/paramiko/paramiko/issues/191), must it's something flintrock's users and developers should be aware.

Perhaps there is a workaround for that? At least a warning would be helpful to avoid lots of wasted hours trying to figure out why flintrock is behaving so weird in an old machine.

nchammas commented 7 years ago

Hmm, I wonder if Flintrock even needs to read from known_hosts. If not, we can presumably just turn that off via an option to Paramiko.