Closed jamessantiago closed 5 years ago
Well ok, looks like I just need to learn docker more.
Opening a console in the container with the "-u root" switch got me into the worker container with root privileges. I then ran a quick apt-get update
and then apt-get install python-numpy
to install the module I needed. However, that still doesn't get me past the numpy not found error, so I'm not sure where or how I should be installing that module to get this job working. Installing numpy for python3 via pip or apt-get doesn't seem to do the trick either.
Looks like I just needed to get the module installed specifically for Python 3.7, like so: python3.7 -m pip install numpy
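For anyone hitting the same thing: the underlying problem seems to be that numpy was installed for a different Python interpreter than the one actually running the job. Below is a minimal sketch for checking that from inside whichever interpreter you care about. The helper names module_report and pip_install_command are made up for illustration and are not part of HELK or Spark; the sketch only uses the standard library.

```python
import importlib.util
import sys


def module_report(name):
    """Describe whether `name` is importable by the interpreter running this code."""
    spec = importlib.util.find_spec(name)  # None when the module cannot be found
    return {
        "interpreter": sys.executable,
        "version": "{}.{}".format(*sys.version_info[:2]),
        "module": name,
        "available": spec is not None,
    }


def pip_install_command(name):
    """Build a pip command bound to this exact interpreter (hypothetical helper)."""
    # Going through `<interpreter> -m pip` avoids installing into another Python.
    return [sys.executable, "-m", "pip", "install", name]


if __name__ == "__main__":
    print(module_report("numpy"))
    print(" ".join(pip_install_command("numpy")))
```

Running this with the same interpreter the worker uses (python3.7 in my case) should show whether numpy is visible to it, and the printed command is the equivalent of the python3.7 -m pip install numpy line above.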
I've got a simple notebook set up with HELK that pulls in some data from Elasticsearch via PySpark SQL and puts it into an RDD vector. When trying to send this data over to an ML job I run into an error. I'm running:
I get the error:
I want to go into the spark worker and add the numpy module manually, but I don't know the sparkuser or root password... Info on this:
https://stackoverflow.com/questions/35214231/importerror-no-module-named-numpy-on-spark-workers#
So what are the credentials for the helk-spark-worker container?