koaning / bulk

A Simple Bulk Labelling Tool
MIT License

Add `bulk embed` command. #47

Closed: koaning closed this issue 2 months ago

koaning commented 1 year ago

With embetter as an optional dependency, it should be possible to add a `bulk embed` command to this project. Usage might look like:

# For text
python -m bulk embed text file-in.jsonl file-out.jsonl --pipeline sentence-tfm --model LaBSE

# For images
python -m bulk embed image file-in.jsonl file-out.jsonl --pipeline timm --model VGG16
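
For illustration only, here is a rough sketch of what the text variant of such a command might do internally: read a JSONL file, embed one field of each record, and write the records back out with the vector attached. Nothing here comes from the bulk codebase. The function name `embed_jsonl`, the `key` parameter, the `"embedding"` output field, and the `toy_encoder` stand-in are all hypothetical; a real encoder from embetter would presumably be passed in place of the toy one.

```python
import json
from pathlib import Path
from typing import Callable, List, Sequence


def embed_jsonl(
    path_in: str,
    path_out: str,
    encode: Callable[[List[str]], Sequence[Sequence[float]]],
    key: str = "text",
) -> int:
    """Embed record[key] for every JSONL record and write the result.

    Hypothetical helper: `encode` takes a list of strings and returns
    one vector per string. Returns the number of records written.
    """
    lines = Path(path_in).read_text(encoding="utf-8").splitlines()
    # Skip blank lines so a trailing newline does not break parsing.
    records = [json.loads(line) for line in lines if line.strip()]
    vectors = encode([rec[key] for rec in records])
    with Path(path_out).open("w", encoding="utf-8") as f:
        for rec, vec in zip(records, vectors):
            # Convert to plain floats so the vector is JSON serialisable.
            rec["embedding"] = [float(v) for v in vec]
            f.write(json.dumps(rec) + "\n")
    return len(records)


def toy_encoder(texts: List[str]) -> List[List[float]]:
    """Stand-in encoder (assumption): [character count, space count]."""
    return [[float(len(t)), float(t.count(" "))] for t in texts]
```

Taking the encoder as a plain callable keeps the file handling independent of which embedding backend ends up installed, which seems to fit the optional-dependency idea.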

Things to think about:

koaning commented 2 months ago

Closing this issue. The future of this project is Jupyter Widgets.