Open mrddter opened 7 months ago
Hi there 👋 Sorry for the late reply :)
Your usage looks correct, and adopts the advised "Singleton" pattern to avoid multiple reconstructions of the pipeline. Just a question about where you intend on running this: in-browser or server-side? If server-side, you can ignore what I will say, but if in-browser, it's usually advised to either use a web worker, or to use onnxruntime-web's proxy option.
@mrddter Did your implementation end up working?
Hi all, I'm writing a custom LLM to use transformer.js with langchain. Does a structure like this make sense? Any advice for optimizing it or best practices to apply?
Any suggestions or feedback would be greatly appreciated 😊 🚀