Open maximedb opened 9 months ago
I think the issue here is TikToken needs to download some vocab files from OpenAI at runtime (the first time it runs). This SO thread has some suggestions to pre-fetch the cached data file that seem like they'd work, but FYI, I haven't tried them and can't vouch for them. If any of the proposed approaches work, it'd be great to share that with the community. Please report back!
I think the issue here is TikToken needs to download some vocab files from OpenAI at runtime (the first time it runs). This SO thread has some suggestions to pre-fetch the cached data file that seem like they'd work, but FYI, I haven't tried them and can't vouch for them. If any of the proposed approaches work, it'd be great to share that with the community. Please report back!
I can confirm this worked for me.
Hello,
Thank you for this nice library.
Is there a way to use the nlm-ingestor without an internet connection? It seem to download a tokenizer from openai. I get the following error:
Thank you, Maxime.