mozilla / overscripted

Repository for the Mozilla Overscripted Data Mining Challenge
Mozilla Public License 2.0

Import error while using the provided environment in analyses folder #70

Closed. Aimaanhasan closed this issue 5 years ago

Aimaanhasan commented 5 years ago

System specs: Anaconda 2018.12, Windows 10 (64-bit)

I get an error when importing `Client` from `dask.distributed` in the overscripted environment. The error is:

    ImportError                               Traceback (most recent call last)
    C:\ProgramData\Anaconda3\envs\overscripted\lib\site-packages\dask\distributed.py in <module>
          4 try:
    ----> 5     from distributed import *
          6 except ImportError:

    C:\ProgramData\Anaconda3\envs\overscripted\lib\site-packages\distributed\__init__.py in <module>
          3 from .config import config
    ----> 4 from .core import connect, rpc
          5 from .deploy import LocalCluster

    C:\ProgramData\Anaconda3\envs\overscripted\lib\site-packages\distributed\core.py in <module>
         19
    ---> 20 from .comm import (connect, listen, CommClosedError,
         21                    normalize_address,

    C:\ProgramData\Anaconda3\envs\overscripted\lib\site-packages\distributed\comm\__init__.py in <module>
         16
    ---> 17 _register_transports()

    C:\ProgramData\Anaconda3\envs\overscripted\lib\site-packages\distributed\comm\__init__.py in _register_transports()
         12 def _register_transports():
    ---> 13     from . import inproc
         14     from . import tcp

    C:\ProgramData\Anaconda3\envs\overscripted\lib\site-packages\distributed\comm\inproc.py in <module>
         14 from ..compatibility import finalize
    ---> 15 from ..protocol import nested_deserialize
         16 from ..utils import get_ip

    C:\ProgramData\Anaconda3\envs\overscripted\lib\site-packages\distributed\protocol\__init__.py in <module>
          4
    ----> 5 from .compression import compressions, default_compression
          6 from .core import (dumps, loads, maybe_compress, decompress, msgpack)

    C:\ProgramData\Anaconda3\envs\overscripted\lib\site-packages\distributed\protocol\compression.py in <module>
         22 from ..config import config
    ---> 23 from ..utils import ignoring, ensure_bytes
         24

    C:\ProgramData\Anaconda3\envs\overscripted\lib\site-packages\distributed\utils.py in <module>
         37 from tornado import gen
    ---> 38 from tornado.ioloop import IOLoop, PollIOLoop
         39

    ImportError: cannot import name 'PollIOLoop'

    During handling of the above exception, another exception occurred:

    ImportError                               Traceback (most recent call last)
    in <module>
          1 import dask.dataframe as dd
    ----> 2 from dask.distributed import Client
          3
          4 Client()

    C:\ProgramData\Anaconda3\envs\overscripted\lib\site-packages\dask\distributed.py in <module>
          9            " conda install dask distributed # either conda install\n"
         10            " pip install dask distributed --upgrade # or pip install")
    ---> 11     raise ImportError(msg)

    ImportError: Dask's distributed scheduler is not installed.

    Please either conda or pip install dask distributed:

      conda install dask distributed           # either conda install
      pip install dask distributed --upgrade   # or pip install

I included distributed in environment.yaml:

    name: overscripted
    channels:
      - defaults
      - conda-forge
    dependencies:
      - python=3.6
      - jupyter=1.0.0
      - pyarrow=0.9.0
      - pandas=0.23.0
      - distributed=1.21.1
      - dask=0.17.5
      - findspark=1.2.0
      - tldextract=2.2.0

The import fails in the overscripted environment but works without problems in the base environment.

birdsarah commented 5 years ago

Anaconda's base environment includes every package you could conceivably need, so I'm not surprised that it works there.

To debug this, please include information on:

1. How you set up the overscripted environment
2. Output of `conda list` for your environment
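
Alongside the `conda list` output, a short script run inside the affected environment can report the versions most relevant to this traceback. The sketch below is only an illustration: the function names (`installed_version`, `tornado_has_poll_ioloop`, `environment_report`) are hypothetical helpers, not part of dask or this repository, and the choice of packages to report is an assumption based on the modules that appear in the traceback.

```python
import importlib
import sys

try:  # available on Python 3.8+
    from importlib import metadata as importlib_metadata
except ImportError:  # e.g. the Python 3.6 pinned in environment.yaml
    importlib_metadata = None


def installed_version(package):
    """Return the installed version of `package`, or None if it is not installed."""
    if importlib_metadata is not None:
        try:
            return importlib_metadata.version(package)
        except importlib_metadata.PackageNotFoundError:
            return None
    try:
        import pkg_resources  # setuptools fallback for older interpreters
        return pkg_resources.get_distribution(package).version
    except Exception:
        return None


def tornado_has_poll_ioloop():
    """True/False if tornado.ioloop does/doesn't expose PollIOLoop; None if tornado is missing."""
    try:
        ioloop = importlib.import_module("tornado.ioloop")
    except ImportError:
        return None
    return hasattr(ioloop, "PollIOLoop")


def environment_report(packages=("dask", "distributed", "tornado")):
    """Build a small plain-text report that can be pasted into the issue."""
    lines = ["python " + sys.version.split()[0]]
    for name in packages:
        lines.append("{} {}".format(name, installed_version(name) or "not installed"))
    lines.append("tornado.ioloop.PollIOLoop available: {}".format(tornado_has_poll_ioloop()))
    return "\n".join(lines)


if __name__ == "__main__":
    print(environment_report())
```

If the last line of the report says `False`, the installed tornado no longer provides the name that `distributed\utils.py` tries to import, which would match the first `ImportError` in the traceback.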

Finally, we should definitely update the environment.yaml to use the latest versions of the packages.
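
To see which pins in environment.yaml have drifted from what is actually installed, the pins can be compared with the installed versions. The following is a minimal sketch under stated assumptions: `parse_pins` and `mismatches` are hypothetical helper names, the parser only handles the simple `- name=version` entries shown in this issue (not the full conda environment file format), and the installed versions passed to `mismatches` are made-up placeholders.

```python
ENV_TEXT = """\
name: overscripted
channels:
  - defaults
  - conda-forge
dependencies:
  - python=3.6
  - jupyter=1.0.0
  - pyarrow=0.9.0
  - pandas=0.23.0
  - distributed=1.21.1
  - dask=0.17.5
  - findspark=1.2.0
  - tldextract=2.2.0
"""


def parse_pins(env_text):
    """Map package name -> pinned version for entries under `dependencies:`."""
    pins = {}
    in_dependencies = False
    for raw in env_text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#"):
            continue
        if not line.startswith("-"):
            # Any non-list line is a top-level key; track whether it is `dependencies:`.
            in_dependencies = line == "dependencies:"
            continue
        if not in_dependencies:
            continue  # e.g. entries under `channels:`
        spec = line[1:].strip()
        if spec.endswith(":"):
            continue  # nested sections such as `- pip:` are out of scope for this sketch
        name, _, version = spec.partition("=")
        pins[name] = version or None
    return pins


def mismatches(pins, installed):
    """Return {name: (pinned, installed)} for packages whose versions differ."""
    return {
        name: (pinned, installed.get(name))
        for name, pinned in pins.items()
        if name in installed and installed[name] != pinned
    }


if __name__ == "__main__":
    pins = parse_pins(ENV_TEXT)
    # Placeholder values for illustration only; real ones would come from `conda list`.
    print(mismatches(pins, {"dask": "0.17.5", "distributed": "0.0.0"}))
```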

birdsarah commented 5 years ago

I have just replicated the issue.

Running `conda update --all -c conda-forge` worked for me.

The `-c conda-forge` may not be necessary; I just use conda-forge for lots of things. I will update the environment.yaml.
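
After updating, a quick smoke test can confirm that the import now succeeds. In the original report, dask's wrapper replaced the underlying failure with a generic "not installed" message, so the sketch below walks the exception chain to surface the first error. `check_import` is a hypothetical helper written for this illustration, not something dask provides.

```python
import importlib


def check_import(module_name, attribute=None):
    """Try an import and return (ok, detail), reporting the root cause on failure."""
    try:
        module = importlib.import_module(module_name)
        if attribute is not None:
            getattr(module, attribute)
    except Exception as exc:
        # Follow the "During handling of the above exception..." chain back
        # to the first error, which is usually the informative one.
        root = exc
        while root.__context__ is not None:
            root = root.__context__
        return False, "{}: {}".format(type(root).__name__, root)
    return True, "ok"


if __name__ == "__main__":
    print(check_import("dask.distributed", "Client"))
```

In an environment with the problem described in this issue, the detail string would be expected to mention `PollIOLoop` rather than the generic installation hint.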

birdsarah commented 5 years ago

Thanks for the report @Aimaanhasan