Closed windowshopr closed 4 years ago
UPDATE:
Looks like I hit a winning combination with the below:
!pip install TPOT
!pip install dask==2.20.0 dask-glm==0.2.0 dask-ml==1.0.0
!pip install tornado==5.0
!pip install distributed==2.2.0
!pip install xgboost==0.90
This got my example up and running. I should have cross referenced my pip freeze
output with the pip freeze
output on Colab to check versions of the dependencies earlier, but this was able to get it going. Closing!
I should re-iterate, starting from a brand new "factory reset"ed session, I run those pip installs shown above, then when it gets to the training session, it errors out, but when you hit "Restart and Run All", it works. It's something to do with changing the version of Tornado, and you can only run the new pip installed version after you restart the runtime. Don't factory restart the session a second time, else you're just starting from scratch again. Seems finicky in Colab, but that's what worked for me.
I had to add fsspec to the end, but your last comment seems to have worked for me
!pip install TPOT
!pip install dask==2.20.0 dask-glm==0.2.0 dask-ml==1.0.0
!pip install tornado==5.0
!pip install distributed==2.2.0
!pip install xgboost==0.90
!pip install fsspec
Yes, we have updated installation guide with including fsspec.
I'm trying to run a TPOT session in a Google Colab notebook, but running into some issues regarding "dask", and I think it has to do with the way I'm pip installing the packages.
Context of the issue
Basically, using a Python 3.6 environment in Google Colab, I'd like to run a TPOT Classification session WITH Dask enabled, but I keep getting issues with this error:
ImportError: 'use_dask' requires the optional dask and dask-ml depedencies. cannot import name 'future_set_exc_info'
...even though I have confirmed that both dask and dask-ml are successfully installed and in the dist-packages folder from. I don't think this is a TPOT issue, but I'm opening it here in the hopes that this will leave some documentation that others can use for the successful install of the dependencies required (in the appropriate order, and what versions of each dependency need to be there).
Basically, after a few hours of trying different combinations of pip installs, I've landed on:
The Tornado upgrade and dask[complete] are things I was trying from some other similar issues I've found on GitHub in the Dask pages, but I am throwing in the towel. Can someone confirm a working install of TPOT Classification WITH the
use_dask=True
parameter, and what versions of each dependency are used in the environment?A simple reproducible example to run in a new Google Colab 3.6 environment would look like the following. If asked, I can paste the full traceback, but whoever runs this should be able to see it for themselves. Thanks!