What
This PR integrates Deep Lake with VectorFlow. The current integration only supports adding vectors to Deep Lake Storage. Subsequent PRs will integrate other storage options.
Why
It seeks to close #17
Usage
To use the current implementation you need an Activeloop (Deep Lake) account, and you need to create an API token.
Set vector_db_type to DEEPLAKE and place your API key in the X-VectorDB-Key header.
The index_name should have the format hub://<active_loop_user_name>/<dataset_name>. The dataset doesn't need to exist; Deep Lake will create it for you, but you can also use an existing dataset.
@EteimZ is there a command or api call you can run in a separate script to verify that the data made it into DeepLake? If so, can you screenshot that and add it as verification evidence? Thanks
Here's a sample request:
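A minimal sketch of the Deep Lake-specific parts of such a request, in Python. It is not the author's exact request. Only vector_db_type, index_name and the X-VectorDB-Key header come from the usage notes; the function name `build_embed_request`, the `VectorDBMetadata` form field name and the placeholder user, dataset and token values are assumptions made for illustration. Nothing is sent over the network.

```python
import json
import re

# Expected shape of a Deep Lake dataset path: hub://<user>/<dataset>
INDEX_NAME_PATTERN = re.compile(r"^hub://[^/\s]+/[^/\s]+$")


def build_embed_request(user, dataset, vectordb_key):
    """Assemble headers and form fields for the request without sending it."""
    index_name = f"hub://{user}/{dataset}"
    if not INDEX_NAME_PATTERN.match(index_name):
        raise ValueError(
            f"index_name must look like hub://<user>/<dataset>, got {index_name!r}"
        )

    # The Activeloop API token goes in the X-VectorDB-Key header.
    headers = {"X-VectorDB-Key": vectordb_key}

    vector_db_metadata = {
        "vector_db_type": "DEEPLAKE",
        "index_name": index_name,
    }
    # Assumption: the metadata is passed as a JSON-encoded form field
    # with this name.
    form_fields = {"VectorDBMetadata": json.dumps(vector_db_metadata)}
    return headers, form_fields


# Placeholder values, to be replaced with a real user, dataset and token.
headers, form_fields = build_embed_request(
    "my_user", "my_dataset", "<activeloop-token>"
)
print(headers)
print(form_fields["VectorDBMetadata"])
```

The returned headers and form fields would then be passed to an HTTP client together with the source file and embedding settings, which this sketch leaves out.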
Verification
To verify this PR works I created a file named test_vectorflow.py and added the following content to it:
The environment variable contains my Activeloop token. Running the script above yields this:
This basically says my specified dataset doesn't exist.
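A check of this kind could be sketched as follows. This is not the original test_vectorflow.py. It assumes the deeplake 3.x Python package (`deeplake.exists`, `deeplake.load`, `Dataset.summary`); the helper names, the ACTIVELOOP_TOKEN variable name and the user and dataset names are placeholders.

```python
import os


def dataset_path(user, dataset):
    """Build the hub:// path that was used as index_name in the request."""
    return f"hub://{user}/{dataset}"


def report_dataset(path, token):
    """Print whether the dataset exists and, if it does, what it contains."""
    # Imported lazily so dataset_path works without deeplake installed.
    import deeplake

    if not deeplake.exists(path, token=token):
        print(f"{path} does not exist")
        return False

    ds = deeplake.load(path, token=token, read_only=True)
    print(f"{path} holds {len(ds)} samples")
    ds.summary()  # prints tensor names, shapes and dtypes
    return True


if __name__ == "__main__":
    # Assumed variable name; the token is read from the environment.
    token = os.environ.get("ACTIVELOOP_TOKEN")
    if token:
        report_dataset(dataset_path("my_user", "my_dataset"), token)
    else:
        print("Set ACTIVELOOP_TOKEN to run the check")
```

Run once before and once after the VectorFlow request, the script should first report a missing dataset and then a populated one.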
Then I use vectorflow to create that dataset:
I run the script a second time: