activeloopai / deeplake

Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activeloop.ai
https://activeloop.ai
Mozilla Public License 2.0
8.09k stars 619 forks source link

[BUG] Installation error #1497

Closed davidbuniat closed 1 year ago

davidbuniat commented 2 years ago

🐛🐛 Bug Report

⚗️ Current Behavior

A user followed the instructions here for installation https://docs.activeloop.ai/quickstart

Input Code

>>> pip3 install Hub

Create file: altest.py
import hub

dataset_path = 'hub://activeloop/mnist-train'
ds = hub.load(dataset_path) # Returns a Hub Dataset but does not download data locally
tack dump:
Traceback (most recent call last):
  File "/home/user/activeloop/altest.py", line 2, in <module>
    import hub
  File "/home/user/.local/lib/python3.9/site-packages/hub/__init__.py", line 28, in <module>
    from .api.dataset import dataset
  File "/home/user/.local/lib/python3.9/site-packages/hub/api/dataset.py", line 6, in <module>
    from hub.auto.unstructured.image_classification import ImageClassification
  File "/home/user/.local/lib/python3.9/site-packages/hub/auto/unstructured/image_classification.py", line 14, in <module>
    from hub.core.dataset import Dataset
  File "/home/user/.local/lib/python3.9/site-packages/hub/core/dataset/__init__.py", line 1, in <module>
    from .dataset import Dataset  # type: ignore
  File "/home/user/.local/lib/python3.9/site-packages/hub/core/dataset/dataset.py", line 18, in <module>
    from hub.core.tensor import Tensor, create_tensor, delete_tensor
  File "/home/user/.local/lib/python3.9/site-packages/hub/core/tensor.py", line 2, in <module>
    from hub.core.chunk.base_chunk import InputSample
  File "/home/user/.local/lib/python3.9/site-packages/hub/core/chunk/base_chunk.py", line 12, in <module>
    from hub.core.meta.tensor_meta import TensorMeta
  File "/home/user/.local/lib/python3.9/site-packages/hub/core/meta/tensor_meta.py", line 14, in <module>
    from hub.util.json import validate_json_schema
  File "/home/user/.local/lib/python3.9/site-packages/hub/util/json.py", line 7, in <module>
    from hub.core.sample import Sample  # type: ignore
  File "/home/user/.local/lib/python3.9/site-packages/hub/core/sample.py", line 2, in <module>
    from hub.core.compression import (
  File "/home/user/.local/lib/python3.9/site-packages/hub/core/compression.py", line 24, in <module>
    import numcodecs.lz4  # type: ignore
  File "/home/user/.local/lib/python3.9/site-packages/numcodecs/__init__.py", line 32, in <module>
    from numcodecs.bz2 import BZ2
  File "/home/user/.local/lib/python3.9/site-packages/numcodecs/bz2.py", line 1, in <module>
    import bz2 as _bz2
  File "/usr/local/lib/python3.9/bz2.py", line 18, in <module>
    from _bz2 import BZ2Compressor, BZ2Decompressor
ModuleNotFoundError: No module named '_bz2'

Expected behavior/code During installation correct lib are installed or required to be additionally installed

⚙️ Environment

rajdeepdas2000 commented 2 years ago

I'm interested to work on this issue.

mikayelh commented 2 years ago

@rajdeepdas2000 hey there, thanks a lot for wanting to contribute. You can join the Activeloop community slack (slack.activeloop.ai) to ask questions. :)

rajdeepdas2000 commented 1 year ago

I installed deeplake on Ubuntu 22.04.1 LTS. Then I run the above mentioned lines of code on jupyter notebook. The mnist-trian dataset is displayed successfully by running the ds.visualize() command. done I suggest you to kindly re run/install all the required things (jupyter, deeplake, pip, etc) and re- running this code. Hopefully it will execute without errors.