Closed by dcsan 2 months ago
I added a download-models tool that might help you get closer to your goal of pre-downloading/packaging the model files. https://github.com/jparkerweb/semantic-chunking?tab=readme-ov-file#-pre-downloading-models
@jparkerweb hey, could I run this on RunPod or Cloudflare Workers AI?
I'm trying to use this library in a Cloudflare Worker and am getting this error:
Here's an overview of my setup:
Not the author, but I would be surprised if it works in a Worker: it needs to download a lot of model data the first time it's run.
@dcsan Hey! What would you recommend to deploy this?
@dcsan This error occurs even after manually downloading the models locally. I have no clue how to solve this.
I'd like to include this technique in a larger project, but I'm worried about the requirement to download the model, as I'm using a containerized environment.
What would be involved in porting this code to use a hosted model, e.g. via HF? I assume it's not enough to have an embedding API?
Alternatively, could there be an installer that grabs the required files so they can be added to a Docker container?
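On the hosted-model question, here is a minimal sketch of what chunking on top of an embedding API could look like. It is a simplified stand-in, not this library's actual implementation: `EmbedFn`, `cosineSimilarity`, `chunkBySimilarity`, and the `threshold` default are all hypothetical names and values chosen for illustration. The embedding call is injected as a function, so it could wrap a hosted API, but no real endpoint is called here.

```typescript
// Hypothetical type: any function that turns texts into embedding vectors.
// It could wrap a hosted embedding API; this sketch does not assume one.
type EmbedFn = (texts: string[]) => Promise<number[][]>;

// Cosine similarity between two vectors of equal length.
function cosineSimilarity(a: number[], b: number[]): number {
  let dot = 0;
  let normA = 0;
  let normB = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    normA += a[i] * a[i];
    normB += b[i] * b[i];
  }
  const denom = Math.sqrt(normA) * Math.sqrt(normB);
  return denom === 0 ? 0 : dot / denom;
}

// Groups consecutive sentences into one chunk while each sentence stays
// similar enough to the one before it. The threshold is an assumed value.
async function chunkBySimilarity(
  sentences: string[],
  embed: EmbedFn,
  threshold = 0.5,
): Promise<string[]> {
  if (sentences.length === 0) return [];
  const vectors = await embed(sentences);
  const chunks: string[] = [];
  let current: string[] = [sentences[0]];
  for (let i = 1; i < sentences.length; i++) {
    if (cosineSimilarity(vectors[i - 1], vectors[i]) >= threshold) {
      current.push(sentences[i]);
    } else {
      chunks.push(current.join(" "));
      current = [sentences[i]];
    }
  }
  chunks.push(current.join(" "));
  return chunks;
}
```

With this shape, the only model-dependent piece is the `embed` function, so swapping a local model for a hosted one would mean replacing that single function. Whether the library's own pipeline can be reduced to this is something the author would need to confirm.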